100+ AI Models

AI Models

Every leading AI model, one subscription. Browse, compare, and run 100+ models from the world's top labs.

282+ AI models
27+ providers
6 model types
1 subscription

One platform. Every model.

Stop juggling subscriptions. Kunya gives you access to every leading AI model (chat, image, video, audio, and code) with a single login.

Chat and reasoning

GPT, Claude, Gemini, Grok, Llama, Mistral, DeepSeek, and more. Switch models mid-conversation.

Image generation

GPT Image, FLUX, Stable Diffusion, Seedream, Grok Imagine: all the major image models in one studio.

Video creation

Sora, Veo, Kling, Luma, Runway. Text-to-video, image-to-video, face swap, lip sync.

Music and voice

Full song generation, voice cloning, TTS in 50+ languages, transcription, podcasts.

Chat

(74)

Claude Opus 4.8

Anthropic

Premium

Most capable Opus — enhanced coding, agentic workflows, and long-horizon reasoning with 1M context

Vision, Reasoning, Tools
1.0M tokens

Claude Opus 4.7

Anthropic

Premium

Previous Opus — enhanced SWE, vision, and long-horizon agentic reasoning with 1M context

Vision, Reasoning, Tools
1.0M tokens

Claude Opus 4.6

Anthropic

Premium

Hybrid reasoning model with 1M context, top-tier coding and agentic performance

Vision, Reasoning, Tools
1.0M tokens

Claude Opus 4.5

Anthropic

Premium

Previous premium model with maximum intelligence

Vision, Reasoning, Tools
200K tokens

Claude Sonnet 4.6

Anthropic

Premium

Best combination of speed and intelligence, near-flagship performance

Vision, Reasoning, Tools
1.0M tokens

Claude Sonnet 4.5

Anthropic

Premium

Previous smart model for complex agents and coding

Vision, Reasoning, Tools
200K tokens

Claude Haiku 4.5

Anthropic

Fast

Fastest model with near-frontier intelligence

Vision, Tools
200K tokens

GPT-5.5

OpenAI

Premium

Newest frontier model — highest reasoning for coding and professional work

Vision, Reasoning, Tools
1.1M tokens

GPT-5.4

OpenAI

Premium

Highly capable GPT model for coding and agentic tasks

Vision, Reasoning, Tools
128K tokens

GPT-5.4 Pro

OpenAI

Premium

Most powerful GPT model with maximum compute for complex reasoning

Vision, Reasoning, Tools
128K tokens

GPT-5.2

OpenAI

Premium

The best model for coding and agentic tasks across industries

Vision, Reasoning, Tools
128K tokens

GPT-5.1

OpenAI

Premium

Intelligent reasoning model with configurable reasoning effort

Vision, Reasoning, Tools
128K tokens

GPT-5

OpenAI

Premium

Previous intelligent reasoning model for coding and agentic tasks

Vision, Reasoning, Tools
128K tokens

GPT-5 mini

OpenAI

Fast

A faster, cost-efficient version of GPT-5

Vision, Tools
128K tokens

GPT-5 nano

OpenAI

Fast

Fastest, most cost-efficient version of GPT-5

Vision
128K tokens

GPT-5.2 Pro

OpenAI

Premium

Smarter and more precise responses

Vision, Reasoning, Tools
128K tokens

GPT-4.1

OpenAI

Premium

Smartest non-reasoning model

Vision, Tools
128K tokens

GPT-4.1 mini

OpenAI

Fast

Smaller, faster version of GPT-4.1

Vision, Tools
128K tokens

o3

OpenAI

Premium

Reasoning model for complex tasks

Reasoning
128K tokens

o3 Pro

OpenAI

Premium

Version of o3 with more compute for better responses

Reasoning
128K tokens

o4 mini

OpenAI

Fast

Fast, cost-efficient reasoning model

Reasoning
128K tokens

Gemini 3.1 Pro

Google

Premium

Most advanced reasoning model with complex problem-solving

Vision, Reasoning, Tools
1.0M tokens

Gemini 3.1 Flash-Lite

Google

Fast

Cheapest frontier-class model — half the cost of Gemini 3 Flash with strong tool calling

Vision, Tools
1.0M tokens

Gemini 3.1 Flash Live

Google

Fast

Low-latency Live API model for real-time dialogue and voice-first AI applications

Vision, Tools
131K tokens

Gemini 3.5 Flash

Google

Fast

Frontier intelligence optimized for agentic workflows, coding, and video at higher speed

Vision, Reasoning, Tools
1.0M tokens

Gemini 3 Flash

Google

Fast

Frontier intelligence with superior search and grounding

Vision, Reasoning, Tools
1.0M tokens

Gemini 2.5 Pro

Google

Premium

State-of-the-art thinking model for complex problems

Vision, Reasoning, Tools
1.0M tokens

Gemini 2.5 Flash

Google

Fast

Best price-performance for large scale processing

Vision, Reasoning, Tools
1.0M tokens

Gemini 2.5 Flash-Lite

Google

Fast

Fastest flash model for cost-efficiency

Vision, Tools
1.0M tokens

Grok 4.3

xAI

Premium

Fastest, most intelligent Grok — 1M context, 3 reasoning levels, top agentic tool calling

Vision, Reasoning, Tools
1.0M tokens

Grok 4.20 Multi-Agent

xAI

Premium

Latest Grok beta optimized for multi-agent orchestration

Vision, Reasoning, Tools
2.0M tokens

Grok 4.20 Reasoning

xAI

Premium

Latest Grok beta with extended reasoning

Vision, Reasoning, Tools
2.0M tokens

Grok 4.20

xAI

Premium

Fast Grok without reasoning overhead

Vision, Tools
2.0M tokens

Grok 3 Mini

xAI

Fast

Smaller, faster Grok with reasoning

Reasoning
131K tokens

DeepSeek V4 Flash

DeepSeek

Fast

1M context, thinking + non-thinking modes, tool calls

Reasoning, Tools
1.0M tokens

DeepSeek V4 Pro

DeepSeek

Premium

Flagship model — 1M context, thinking + non-thinking modes

Reasoning, Tools
1.0M tokens

Qwen3 Max (Direct)

Qwen

Premium

Alibaba's flagship general-purpose LLM via DashScope - top-tier reasoning and coding

Reasoning, Tools
131K tokens

Qwen Deep Research

Qwen

Premium

Automated deep research - plans research steps, performs web searches, generates structured reports

131K tokens

MiniMax M2.7

MiniMax

Premium

Recursive self-improvement — SOTA in software engineering, tool calling, and office productivity

Reasoning, Tools
205K tokens

MiniMax M2.7 Highspeed

MiniMax

Fast

M2.7 at ~100 tps — same performance, faster and more agile

Reasoning, Tools
205K tokens

MiniMax M2.5

MiniMax

Premium

Peak performance and ultimate value — master the complex

Reasoning, Tools
205K tokens

MiniMax M2.5 Highspeed

MiniMax

Fast

M2.5 at ~100 tps — same performance, faster and more agile

Reasoning, Tools
205K tokens

MiniMax M2.1

MiniMax

Premium

Polyglot programming mastery with precision code refactoring

Tools
205K tokens

MiniMax M2

MiniMax

Premium

Agentic capabilities with function calling and advanced reasoning

Tools
200K tokens

Seed 2.0 Pro

ByteDance

Premium

ByteDance flagship — 76.5% SWE-Bench, 98.3% AIME 2025, hour-long video understanding

Vision, Reasoning, Tools
262K tokens

Llama 4 Maverick

Meta

Premium

Meta's flagship Llama 4 model

Vision

Llama 4 Scout

Meta

Fast

Efficient Llama 4 model

Llama 3.3 70B

Meta

Premium

Meta's powerful open source model

Llama 3.3 70B

Meta

Fast

Meta's powerful open source model

Mistral Medium 3.1

Mistral

Premium

Balanced Mistral model

Tools

Mistral Large 2512

Mistral

Premium

Latest large Mistral model

Tools

Mistral Small Creative

Mistral

Fast

Creative writing focused model


Qwen3 235B

Qwen

Premium

Large Qwen model with 235B parameters

Tools

Qwen3 VL 235B

Qwen

Premium

Vision-language Qwen model

Vision

Qwen3 Max

Qwen

Premium

Most powerful Qwen model

Tools

GLM 5.1

Z-AI

Premium

Latest Z-AI flagship — enhanced long-horizon coding and autonomous agent tasks

Reasoning, Tools
203K tokens

GLM 5

Z-AI

Premium

Z-AI flagship model with strong reasoning and tool use

Tools
203K tokens

GLM 5 Turbo

Z-AI

Fast

Fast inference model optimized for agentic workflows and tool use

Tools
203K tokens

GLM 4.7

Z-AI

Premium

Latest GLM model

Tools

GLM 4.6

Z-AI

Premium

Powerful GLM model

Tools

GLM 4.5 Air

Z-AI

Fast

Lightweight GLM model


MiMo v2.5 Pro

Xiaomi

Premium

Xiaomi's 1T-parameter flagship — agentic workflows, tool calling, and advanced reasoning with 1M context

Reasoning, Tools
1.0M tokens

MiMo v2 Flash

Xiaomi

Fast

Xiaomi's fast AI model

Nemotron 3 Nano

NVIDIA

Fast

NVIDIA's compact model

Kimi K2.6

Moonshot

Premium

Long-horizon coding, UI/UX generation, and multi-agent orchestration with parallel sub-agents

Vision, Reasoning, Tools
262K tokens

Kimi K2.5

Moonshot

Premium

State-of-the-art visual coding and agentic tool-calling with multimodal reasoning

Vision, Reasoning, Tools
262K tokens

Step 3.5 Flash

StepFun

Fast

196B MoE reasoning model — activates 11B per token, extremely fast

Reasoning
256K tokens

Qwen 3.5 Plus

Qwen

Premium

Hybrid attention + MoE vision-language model with 1M context

Vision, Reasoning, Tools
1.0M tokens

Hunter Alpha

OpenRouter

Premium

1T parameter frontier model built for agentic multi-step reasoning

Tools
1.0M tokens

Healer Alpha

OpenRouter

Premium

Omni-modal frontier model with vision, hearing, reasoning, and action

Vision, Tools
262K tokens

Hermes 4 405B

Nous Research

Premium

Flagship uncensored reasoning model from Nous Research — hybrid think/respond mode, low refusal rates, strong at math, code, and structured output

Uncensored
131K tokens

Hermes 4 70B

Nous Research

Fast

Efficient uncensored reasoning model from Nous Research — hybrid think/respond mode, low refusal rates, strong at math, code, and structured output

Uncensored
131K tokens

Seed 2.0 Lite

ByteDance

Fast

Versatile multimodal model with low latency for agent and vision tasks

Vision, Tools
262K tokens

Kunya V1

Kunya

Premium

Intelligently routed model — Opus-level quality at budget cost. Routes to the best model for each request.

Vision, Reasoning, Tools
1.0M tokens

Image

(41)

Nano Banana 2

Google

High-efficiency image generation optimized for speed and volume, up to 4K with thinking

Reasoning

Nano Banana Pro

Google

Professional asset production with advanced reasoning and 4K output

Reasoning

Nano Banana

Google

Fast native image generation with editing — the original Gemini image model

GPT Image 2

OpenAI

Latest state-of-the-art image generation with fast, high-quality output and flexible sizes

GPT Image 1.5

OpenAI

Image generation with native editing

GPT Image 1

OpenAI

Image generation with native editing support

DALL·E 3

OpenAI

High quality image generation with text rendering

Grok Imagine

xAI

Fast and affordable image generation

FLUX.2 Max

Black Forest Labs

Top-tier image quality with editing and multi-reference support

FLUX.2 Pro

Black Forest Labs

High-end image generation with strong prompt adherence and editing

FLUX.2 Flex

Black Forest Labs

Complex text, typography, and multi-reference editing

FLUX.2 Klein 4B

Black Forest Labs

Fastest and most cost-effective FLUX model

Seedream 4.5

ByteDance

ByteDance model with strong editing consistency and text rendering


Riverflow V2 Pro

Sourceful

Most powerful Riverflow with perfect text rendering and 4K support


Riverflow V2 Fast

Sourceful

Fastest Riverflow for production and latency-critical workflows


Riverflow V2 Max Preview

Sourceful

Most powerful Riverflow V2 preview - unified text-to-image and image-to-image


Riverflow V2 Standard Preview

Sourceful

Standard Riverflow V2 preview with great quality


Riverflow V2 Fast Preview

Sourceful

Fastest Riverflow V2 preview model

FLUX.1 Schnell

Black Forest Labs

Ultra-fast image generation in ~1 second


Stable Diffusion 3.5 Large

Stability AI

Latest SD with improved quality, typography, and prompt understanding


Stable Diffusion 3.5 Large Turbo

Stability AI

Fast SD 3.5 Large with 4-step generation


Stable Diffusion 3.5 Medium

Stability AI

Balanced SD 3.5 with great quality/speed ratio


SDXL

Stability AI

Stable Diffusion XL - high quality 1024x1024 images


SDXL Lightning

Stability AI

Ultra-fast SDXL with 4-step generation


Stable Diffusion LoRA

Stability AI

SDXL with customizable LoRA weights for fine-tuned styles

Qwen Image 2512 LoRA

Alibaba

High-quality image generation with LoRA fine-tuning support

Qwen Image 2512

Alibaba

Qwen's native image generation model


Bria Fibo

Bria

Professional-grade image generation with clean licensing


Reve Edit

FAL AI

Advanced image editing with precise control


AuraFlow

FAL AI

Open-source flow-based image generation


Kolors

FAL AI

High-quality bilingual image generation (English/Chinese)

Seedream 5.0 Lite Edit

ByteDance

ByteDance Seedream 5.0 Lite image editing — intelligent multi-image editing with reasoning, style transfer, and beautification (2K-3K)


Qwen Image Max

Qwen

Alibaba's flagship image generation - high realism, fine detail, excellent text rendering


Qwen Image Edit Max

Qwen

Alibaba's image editing model - modify text, add/remove objects, style transfer, detail enhancement


Z-Image Turbo

Z-Image

Lightweight fast image generation with Chinese & English text rendering


Wan 2.6 Text-to-Image

Wan

Alibaba Wan 2.6 text-to-image generation - photorealistic to illustrative styles

Seedream 5.0 Lite

ByteDance

ByteDance Seedream 5.0 Lite — high-quality 2K/3K image generation with text-to-image and image editing

Midjourney V7

Midjourney

Midjourney V7 — industry-leading image generation with stunning aesthetics. 4 images per generation. Supports --ar, --s, --c and all V7 parameters.


Seedream 5.0 (Dreamina)

ByteDance (Dreamina)

ByteDance Seedream 5.0 via Dreamina/ModelArk — high-quality 2K image generation. Admin-only for comparison with Evolink provider.


Seedream 5.0 Lite (Dreamina)

ByteDance (Dreamina)

ByteDance Seedream 5.0 Lite via Dreamina/ModelArk — fast 2K image generation. Admin-only for comparison with Evolink provider.


Kunya V1 Image

Kunya

Intelligently routed image generation — Z-Image Turbo for fast/cheap, Seedream for quality, GPT Image for editing.

Video

(124)

Sora 2

OpenAI

OpenAI Sora 2 — physics-aware world simulation with audio (up to 12s, 720p)

Sora 2 Pro

OpenAI

OpenAI Sora 2 Pro — highest quality with audio (up to 12s, 1080p)

Sora 2 Image-to-Video

OpenAI

OpenAI Sora 2 — animate images with physics simulation (up to 12s, 720p)

Sora 2 Pro Image-to-Video

OpenAI

OpenAI Sora 2 Pro — highest quality image animation (up to 12s, 1080p)

Sora 2 Remix

OpenAI

OpenAI Sora 2 — transform existing videos with style changes

Google Veo 3.1 Fast

Google

Google Veo 3.1 — fast cinematic generation (up to 8s, 720p)

Google Veo 3.1

Google

Google Veo 3.1 — cinematic video (up to 8s, 1080p)

Google Veo 3.1 Image-to-Video

Google

Google Veo 3.1 — image-to-cinema (up to 8s, 1080p)

Google Veo 3.1 Extend

Google

Google Veo 3.1 Extend — continue an existing video up to ~30s total (720p/1080p)

Google Veo 3.1 Reference-to-Video

Google

Google Veo 3.1 — generate video from up to 3 reference images (up to 8s, 1080p)

Google Veo 3.1 First-Last-Frame

Google

Google Veo 3.1 — animate between a first and last keyframe (up to 8s, 1080p)

Grok Imagine Video

xAI

AI video generation from text, images, and video with native audio


Kling 2.5 Pro

Kling

High-quality video with excellent character consistency


Kling 2.5 Pro Image-to-Video

Kling

Transform images into videos with motion


Kling 1.6 Pro

Kling

Professional video generation


Kling 1.6 Pro Image-to-Video

Kling

Professional image-to-video generation


Luma Ray 2

Luma

Photorealistic video with incredible motion (5s or 9s)


Luma Ray 2 Flash

Luma

Fast version of Ray 2 for quicker generation (5s or 9s)


Luma Dream Machine

Luma

Realistic motion and physics-aware generation (5s)


Runway Gen-3 Turbo Image-to-Video

Runway

Fast cinematic video from images (5s or 10s, 768p)

MiniMax Video-01

MiniMax

Narrative-coherent video (fixed 6s clips; use scene chaining for longer)

MiniMax Video-01 Live

MiniMax

Real-time video generation (fixed 6s clips)

Hailuo 2.3

MiniMax

Latest MiniMax model — cinematic motion, expressive faces, anime & illustration styles, 15 camera commands

Hailuo 2.3 Fast

MiniMax

Fast & cost-effective image-to-video — same quality, optimized for speed


Hunyuan Video

Tencent

Tencent open-source video model


Vidu Q2

Vidu

High-quality text-to-video generation


Vidu Q2 Image-to-Video

Vidu

Transform images into dynamic videos


CogVideoX 5B

FAL AI

Open-source video generation model


LTX Video

FAL AI

Affordable high-quality video generation


Video Upscaler

FAL AI

Enhance video resolution and quality


Frame Interpolation

FAL AI

Increase video frame rate smoothly


LTX Video v2

Lightricks

Open-source model with 20s 4K support and improved quality


LTX Video v2 Image-to-Video

Lightricks

Animate images with LTX v2 - up to 20 seconds


Face Swap (Legacy)

FAL AI

Basic face swap in images and videos


Advanced Face Swap

Easel

Premium face swap with hair preservation, 2x upscale, and detail enhancement


GIF Face Swap

Easel

Swap faces on GIFs — fun for social sharing


LivePortrait

FAL AI

Make any portrait mimic your expressions - face puppeteering


LivePortrait Lightning

FAL AI

Fast face puppeteering - your expressions control any face

DreamActor M2.0

ByteDance

ByteDance motion transfer — full body, expressions, lip movement from driving video to any character (humans, animals, cartoons)

OmniHuman

ByteDance

ByteDance OmniHuman — audio-driven avatar animation with emotion and cognitive simulation

OmniHuman 1.5

ByteDance

ByteDance OmniHuman 1.5 — film-grade talking avatar from photo + audio with micro-expressions and cognitive simulation


Sync-3 Lipsync

Sync

Most powerful lipsync — native visual intelligence for professional-quality video-to-video


Sync Lipsync 2 Pro

Sync

High-quality realistic lipsync preserving natural teeth and unique facial features


Sync Lipsync 2

Sync

Realistic lipsync animations from audio with advanced synchronization


LatentSync

FAL AI

Budget-friendly video-to-video lip sync — $0.20 flat for up to 40s, then $0.005/s


Kling LipSync

Kling

Kling audio-to-video lip sync — realistic lip movements from audio (2-60s audio, 720p/1080p)


Hallo v2

FAL AI

Portrait animation with audio-driven lip sync


Sonic

FAL AI

Lip sync video generation from audio input — up to 60s


MuseTalk

FAL AI

Real-time lip sync for virtual presenters — up to 120s

Wan 2.2 Text-to-Video

Alibaba

Wan 2.2 A14B — high-quality anime/artistic video with improved motion and expressions (480p-720p)

Wan 2.2 Animate Move

Alibaba

Wan 2.2 motion transfer — replicate expressions and movements from a reference video onto a character image

Wan 2.2 Animate Replace

Alibaba

Wan 2.2 character replacement — replace the character in a video while preserving scene lighting and motion


AnimateDiff V2V

FAL AI

Transform videos with anime and artistic styles


AnimateDiff SparseCtrl

FAL AI

Anime-style video with motion control from sparse frames

Wan Video 2.1 (Legacy)

Alibaba

Anime and artistic video generation (superseded by Wan 2.2)

Wan Video 2.1 I2V (Legacy)

Alibaba

Image-to-anime animations (superseded by Wan 2.2)


ToonCrafter

FAL AI

Generate cartoon/anime interpolation between keyframes


Kling Motion Brush

Kling

Kling face puppeteering - drive faces with your video


Kling Lip Sync (v2.5 Legacy)

Kling

Kling v2.5 lip sync — superseded by Kling LipSync audio-to-video endpoint


Kling O3 Pro V2V Reference (FAL)

Kling

Kling O3 Pro — generate the next shot from a reference video, preserving motion & camera style (3-15s, 1080p)


Kling O3 Standard V2V Reference (FAL)

Kling

Kling O3 Standard — generate the next shot from a reference video (3-15s, 720p)


Kling O3 Pro V2V Edit (FAL)

Kling

Kling O3 Pro — edit existing videos with element injection and style transfer (3-15s, 1080p)


Kling 3.0 Pro Text-to-Video (FAL)

Kling

Kling V3 Pro — cinematic text-to-video with multi-shot and native audio (3-15s, 1080p)


Kling 3.0 Pro Image-to-Video (FAL)

Kling

Kling V3 Pro — animate images with multi-shot storyboarding (3-15s, 1080p)


Kling O3 Standard T2V (FAL)

Kling

Kling O3 Standard — text-to-video with multi-shot and audio (3-15s, 720p)


Kling O3 Standard I2V (FAL)

Kling

Kling O3 Standard — animate images with start/end frame control (3-15s, 720p)


Kling O3 Standard Ref2V (FAL)

Kling

Kling O3 Standard — reference-to-video with @Element character locking + @Image style refs (3-15s, 720p)


Kling O3 Pro Text-to-Video (FAL)

Kling

Kling O3 Pro — reference-driven text-to-video with character consistency (3-15s, 1080p)


Kling O3 Pro Image-to-Video (FAL)

Kling

Kling O3 Pro — best-in-class image-to-video with element referencing (3-15s, 1080p)


Kling O3 Pro Ref2V (FAL)

Kling

Kling O3 Pro — reference-to-video with @Element character locking (frontal+multi-angle refs) + @Image style refs (3-15s, 1080p)


Kling O3 4K Ref2V (FAL)

Kling 4K

Kling O3 4K — reference-to-video with @Element character locking at native 4K. Up to 7 refs (3-15s)


Kling 3.0 4K Text-to-Video (FAL)

Kling 4K

Kling V3 Native 4K — professional-grade 4K video from text (3-15s)


Kling 3.0 4K Image-to-Video (FAL)

Kling 4K

Kling V3 Native 4K — professional-grade 4K video from images (3-15s)


Kling O3 4K Text-to-Video (FAL)

Kling 4K

Kling O3 Native 4K — professional-grade 4K video with reference support (3-15s)


Kling O3 4K Image-to-Video (FAL)

Kling 4K

Kling O3 Native 4K — professional-grade 4K video from images with references (3-15s)


Kling 3.0 Standard (Direct)

Kling Direct

Kling V3 Standard via direct API — 720p text-to-video (5/10/15s)


Kling 3.0 Pro (Direct)

Kling Direct

Kling V3 Pro via direct API — 1080p text-to-video (5/10/15s)


Kling 3.0 Standard Image-to-Video (Direct)

Kling Direct

Kling V3 Standard via direct API — 720p image-to-video (5/10s)


Kling 3.0 Pro Image-to-Video (Direct)

Kling Direct

Kling V3 Pro via direct API — 1080p image-to-video (5/10s)


Kling O3 Standard (Direct)

Kling Direct

Kling O3 Standard via direct API — 720p text-to-video (3-15s)


Kling O3 Pro (Direct)

Kling Direct

Kling O3 Pro via direct API — 1080p text-to-video (3-15s)


Kling O3 Standard Image-to-Video (Direct)

Kling Direct

Kling O3 Standard via direct API — 720p image-to-video (3-15s)


Kling O3 Pro Image-to-Video (Direct)

Kling Direct

Kling O3 Pro via direct API — 1080p image-to-video (3-15s)


Kling 3.0 4K (Direct)

Kling Direct

Kling V3 native 4K text-to-video via direct API (3-15s)


Kling 3.0 4K Image-to-Video (Direct)

Kling Direct

Kling V3 native 4K image-to-video via direct API (3-10s)


Kling O3 4K (Direct)

Kling Direct

Kling O3 native 4K text-to-video via direct API (3-15s)


Kling O3 4K Image-to-Video (Direct)

Kling Direct

Kling O3 native 4K image-to-video via direct API (3-15s)

Seedance 2.0 Text-to-Video (FAL)

Seedance

ByteDance Seedance 2.0 via FAL — cinematic T2V with native audio, up to 15s at 1080p

Seedance 2.0 Image-to-Video (FAL)

Seedance

ByteDance Seedance 2.0 via FAL — animate images with native audio, start/end frame control, up to 15s

Seedance 2.0 Reference-to-Video (FAL)

Seedance

ByteDance Seedance 2.0 via FAL — multimodal ref system: up to 9 images + 3 videos + 3 audio, native audio

Seedance 2.0 Fast T2V (FAL)

Seedance

ByteDance Seedance 2.0 Fast via FAL — lower latency and cost, up to 15s

Seedance 2.0 Fast I2V (FAL)

Seedance

ByteDance Seedance 2.0 Fast via FAL — fast image-to-video with native audio

Seedance 2.0 Fast Ref2V (FAL)

Seedance

ByteDance Seedance 2.0 Fast via FAL — fast multimodal reference, up to 9 images + 3 videos + 3 audio


Happy Horse 1.0 Text-to-Video

Happy Horse

Alibaba Happy Horse 1.0 — #1 ranked AI video model, native audio + lip-sync, up to 15s at 1080p


Happy Horse 1.0 Image-to-Video

Happy Horse

Alibaba Happy Horse 1.0 — #1 ranked I2V with native audio, multilingual lip-sync, up to 15s at 1080p


Happy Horse 1.0 Reference-to-Video

Happy Horse

Alibaba Happy Horse 1.0 — reference-driven video with character consistency (1-9 images), native audio, 1080p


Happy Horse 1.0 Video Edit

Happy Horse

Alibaba Happy Horse 1.0 — natural language video editing with up to 5 reference images, 1080p


Wan 2.6 I2V Flash

Wan

Alibaba Wan 2.6 - image-to-video with audio, up to 15s at 1080p


Wan 2.6 I2V Standard

Wan

Alibaba Wan 2.6 - higher quality image-to-video, up to 15s at 1080p


Wan 2.6 Text-to-Video

Wan

Alibaba Wan 2.6 - cinematic multi-shot text-to-video with audio, up to 15s at 1080p


Wan 2.2 Keyframe-to-Video

Wan

Alibaba Wan 2.2 - generate video from first and last frame images, 5s at 1080p


Wan 2.6 Reference-to-Video

Wan

Alibaba Wan 2.6 - replicate character appearance from reference videos, multi-character support, up to 10s


Wan 2.1 Video Editing (VACE)

Wan

Alibaba Wan 2.1 - multi-image reference, video redraw, local editing, extension, frame expansion


Wan 2.2 Image-to-Animation

Wan

Alibaba Wan 2.2 - animate a person image using motion from a reference video, up to 30s


Wan 2.2 Video Character Swap

Wan

Alibaba Wan 2.2 - replace people in videos with people from images, keeping original background, up to 30s

Seedance 2.0 Text-to-Video

Seedance

ByteDance Seedance 2.0 — text-driven video with synchronized audio, lip-sync, web search, up to 15s

Seedance 2.0 Image-to-Video

Seedance

ByteDance Seedance 2.0 — first/last frame image-driven video with synchronized audio, up to 15s

Seedance 2.0 Reference-to-Video

Seedance

ByteDance Seedance 2.0 — multimodal @-reference system: up to 9 images + 3 videos + 3 audio tracks

Seedance 2.0 Fast Text-to-Video

Seedance

ByteDance Seedance 2.0 Fast — faster text-driven video at lower cost, synchronized audio, up to 15s

Seedance 2.0 Fast Image-to-Video

Seedance

ByteDance Seedance 2.0 Fast — faster image-driven video at lower cost, synchronized audio, up to 15s

Seedance 2.0 Fast Reference-to-Video

Seedance

ByteDance Seedance 2.0 Fast — faster multimodal @-reference at lower cost, up to 9 images + 3 videos + 3 audio

Seedance 1.5 Pro

Seedance

ByteDance Seedance 1.5 — synchronized audio+video generation with lip-sync and foley (up to 12s)


Kling 3.0 Text-to-Video

Kling

Kling V3 — standard text-to-video with multi-shot and sound effects (5s or 10s)


Kling 3.0 Image-to-Video

Kling

Kling V3 — image-to-video with first/last frame, multi-shot, and sound effects (5s or 10s)


Kling 3.0 Motion Control

Kling

Kling V3 — motion transfer from reference video to character in reference image (up to 10s per render)


Kling O3 Text-to-Video

Kling

Kling O3 (V3 Omni) — highest quality text-to-video with multi-shot and sound (3-15s)


Kling O3 Image-to-Video

Kling

Kling O3 (V3 Omni) — best-in-class image-to-video with reference images, elements, and multi-shot (3-15s)


Wan 2.7 Text-to-Video

Wan

Alibaba Wan 2.7 — multi-shot narrative, auto BGM/SFX or driving-audio lip-sync, 2-15s


Happy Horse 1.0 Text-to-Video

HappyHorse

Alibaba Happy Horse 1.0 — #1 ranked text-to-video, native audio + lip-sync, 3-15s


Happy Horse 1.0 Image-to-Video

HappyHorse

Alibaba Happy Horse 1.0 — image-to-video with native audio, 3-15s


Happy Horse 1.0 Reference-to-Video

HappyHorse

Alibaba Happy Horse 1.0 — reference-driven video with 1-9 images, native audio, 3-15s


Happy Horse 1.0 Video Edit

HappyHorse

Alibaba Happy Horse 1.0 — natural language video editing with up to 5 reference images


Kling O1 Image-to-Video

Kling

Kling O1 — style-focused image-to-video with first/last frame support (5s or 10s)


Kunya V1 Video

Kunya

Intelligently routed video generation — Kling for quality, Seedance for speed, resolution-aware selection.

Audio

(17)

Whisper

OpenAI

Speech-to-text transcription

TTS-1

OpenAI

Text-to-speech optimized for speed

TTS-1 HD

OpenAI

Text-to-speech optimized for quality

Gemini 3.1 Flash TTS

Google

Powerful, low-latency speech generation with expressive audio tags for precise narration control — 70+ languages

Google TTS Standard

Google

Google Cloud Text-to-Speech — standard voices, 40+ languages

Google TTS Neural2

Google

Google Neural2 voices — highly natural-sounding TTS using novel synthesis methods

Google Chirp3 HD

Google

Google's most expressive TTS — Chirp3 HD voices with studio-quality audio

Google TTS Studio

Google

Google Studio voices — highest quality, human-like expressiveness


ElevenLabs TTS

ElevenLabs

ElevenLabs Eleven v3 — ultra-realistic voice synthesis with 30+ languages and voice cloning


ElevenLabs Flash

ElevenLabs

ElevenLabs Flash v2.5 — lowest latency TTS for real-time applications, 32 languages


Qwen3 TTS Flash

Qwen

Alibaba's multilingual TTS with 49 voices, 10+ languages - ElevenLabs alternative


Qwen3 TTS Flash (Nov 2025)

Qwen

Snapshot version of Qwen3 TTS Flash with 49 voices


Qwen3 TTS Instruct Flash

Qwen

Instruction-controllable TTS - control speech style via text instructions, 10+ languages


Qwen3 TTS Voice Design

Qwen

Generate custom voices from text descriptions - design unique voices without audio samples


Qwen3 TTS Voice Clone

Qwen

Clone voices from 10-20 second audio samples - highly natural voice replication


CosyVoice V3 Plus

CosyVoice

Next-gen generative TTS model - high-quality real-time streaming synthesis


CosyVoice V3 Flash

CosyVoice

Fast CosyVoice TTS - cost-effective streaming synthesis

Music

(14)

Lyria RealTime

Google

Google DeepMind real-time streaming music generation with interactive steering

MiniMax Music

MiniMax

Generate music from text prompts with optional reference audio

MiniMax Music v2

MiniMax

Lyric-driven composition with synchronized vocals and structure tags


CassetteAI Music

CassetteAI

Ultra-fast professional music generation - 3 min track in under 10s


Sonauto V2

Sonauto

Full songs in any style with lyrics, tags, and BPM control


Beatoven

Beatoven

Royalty-free instrumental music with stem generation for remixing


Stable Audio

Stability AI

High-quality music and sound design generation

MusicGen Large

Meta

Meta's large music generation model


ElevenLabs Music

ElevenLabs

Studio-grade music with vocals or instrumentals, up to 10 min, multilingual lyrics


Suno V5

Suno (Kunya)

Latest Suno model — superior musical expression, fast generation, vocals + instrumentals


Suno V4.5

Suno (Kunya)

Recommended Suno model — smarter prompts, up to 8 min, great vocal quality


Suno V4.5+

Suno (Kunya)

Enhanced V4.5 with richer tones and new creative methods, up to 8 min


Suno V4.5 All

Suno (Kunya)

V4.5 full-featured — all capabilities unlocked, up to 8 min


Suno V4

Suno (Kunya)

Improved vocal quality, up to 4 min, lighter and faster generation

Code

(12)

GPT-5.3 Codex

OpenAI

Premium

Most capable agentic coding model with frontier reasoning

400K tokens

GPT-5.1 Codex

OpenAI

Premium

Optimized for agentic coding

128K tokens

GPT-5.1 Codex Max

OpenAI

Premium

Most intelligent coding model for long-horizon tasks

128K tokens

GPT-5 Codex

OpenAI

Premium

Optimized for agentic coding in Codex

128K tokens

Qwen3 Coder Plus (Direct)

Qwen

Premium

Alibaba's flagship code model via DashScope - code generation, completion, and debugging

Reasoning, Tools
131K tokens

Qwen3 Coder Flash (Direct)

Qwen

Fast

Fast, cost-effective code model via DashScope for rapid code tasks

Tools
131K tokens

Devstral 2512

Mistral

Premium

123B agentic coding model with 256K context

Tools
256K tokens

Codestral 2508

Mistral

Fast

Fast coding model for completion, correction, and test generation

256K tokens

Devstral 2512

Mistral

Fast

123B agentic coding model

256K tokens

Qwen3 Coder Plus

Qwen

Premium

Enhanced coding model


Qwen3 Coder Flash

Qwen

Fast

Fast coding model


Kat Coder Pro

Kwaipilot

Fast

Coding assistant from Kwaipilot

Ready to access every model?

Get started with Kunya and run any model right away. No API keys to manage, no separate subscriptions.

No credit card required. Cancel anytime. Instant access.