Model Registry

The world's most powerful models.
All in one premium catalogue.

Run, customize, and orchestrate 1000+ models across 25 core categories. Experience Hollywood-grade video, image synthesis, and language processing with zero coldstarts.

983 Models
25 Modalities
19 Global Providers

Showing 983 of 983 engines

bytedance/seedance-2.0/text-to-video
20% OFF
BytedanceText To Video

seedance-2.0

Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics.

Cost rate
$0.6000$0.4800/ sec
bytedance/seedance-2.0-fast/text-to-video
20% OFF
BytedanceText To Video

seedance-2.0-fast

Seedance 2.0 Fast (Text-to-Video) generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.

Cost rate
$0.5000$0.4000/ sec
vidu/q3/text-to-video
50% OFF
ViduText To Video

q3

Vidu Q3 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Cost rate
$0.3500$0.1750/ sec
bytedance/seedance-2.0/text-to-video-turbo
20% OFF
BytedanceText To Video

seedance-2.0

Seedance 2.0 (Text-to-Video Turbo) generates cinematic 720p/1080p videos from text prompts —delivering high-resolution output at near-480p speed with native audio-visual synchronization, director-level control, and exceptional motion stability.

Cost rate
$0.7000$0.5600/ sec
bytedance/seedance-2.0-fast/text-to-video-turbo
20% OFF
BytedanceText To Video

seedance-2.0-fast

Seedance 2.0 Fast (Text-to-Video Turbo) generates cinematic 720p/1080p videos from text prompts using speed-optimized inference —the fastest and most affordable Seedance option with native audio-visual synchronization and director-level control.

Cost rate
$0.6000$0.4800/ sec
vidu/q3-pro/text-to-video
50% OFF
ViduText To Video

q3-pro

Vidu Q3 Pro Text to Video is a fast AI video generation model that creates high-quality, audio-capable videos from text prompts with support for 1–16 second outputs. Ready-to-use REST inference API for cinematic clips, advertising creatives, social media videos, product visuals, storytelling, and professional text-to-video workflows with simple integration, no coldstarts, and affordable pricing.

Cost rate
$0.2500$0.1250/ sec
skywork-ai/skyreels-v4/text-to-video
Skywork AIText To Video

skyreels-v4

SkyReels V4 Text to Video is a fast AI video generation model that creates high-quality videos from text prompts using the SkyReels V4 text2video workflow. Ready-to-use REST inference API for cinematic clips, storytelling videos, social media content, advertising creatives, product visuals, concept videos, and professional text-to-video workflows with simple integration, no coldstarts, and affordable pricing.

Cost rate
$0.1000/ sec
pruna-ai/p-video/text-to-video
Pruna AIText To Video

p-video

Pruna AI P-Video Text to Video is a fast AI video generation model that creates high-quality videos from text prompts. Ready-to-use REST inference API for cinematic clips, social media videos, advertising creatives, product visuals, motion design, and AI video generation workflows with simple integration, no coldstarts, and affordable pricing.

Cost rate
$0.0200/ sec
pixverse/pixverse-c1/text-to-video
PixverseText To Video

pixverse-c1

PixVerse C1 generates film-grade videos from text prompts with flexible duration (1-15s), multiple resolutions up to 1080p, and optional native audio generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Cost rate
$0.1000/ sec
google/veo3.1/text-to-video
GoogleText To Video

veo3.1

Google Veo 3.1 converts text prompts into videos with synchronized audio at native 1080p for high-quality outputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Cost rate
$3.2000/ sec
alibaba/wan-2.7/text-to-video
AlibabaText To Video

wan-2.7

WAN 2.7 Text-to-Video turns plain prompts into coherent, cinematic clips with crisp detail, stable motion, and strong instruction-following—great for ads, explainers, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Cost rate
$0.5000/ sec
alibaba/wan-2.6/text-to-video
AlibabaText To Video

wan-2.6

WAN 2.6 Text-to-Video turns plain prompts into coherent, cinematic clips with crisp detail, stable motion, and strong instruction-following—great for ads, explainers, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Cost rate
$0.5000/ sec
alibaba/wan-2.5/text-to-video
AlibabaText To Video

wan-2.5

WAN 2.5 makes 480p-1080p text/image-to-video with synced audio and is faster, more affordable than Google Veo3. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Cost rate
$0.2500/ sec
google/veo3.1-fast/text-to-video
GoogleText To Video

veo3.1-fast

Google Veo 3.1 Fast creates text-to-video with native 1080p and synchronized audio, delivering high-quality videos for creators. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Cost rate
$1.2000/ sec
bytedance/seedance-v1.5-pro/text-to-video-fast
BytedanceText To Video

seedance-v1.5-pro

Seedance 1.5 Pro Fast (Text-to-Video) converts text prompts into cinematic, live-action-leaning videos with strong prompt adherence, expressive yet stable motion, and consistent aesthetics. It supports 4–12s duration control, multiple aspect ratios (9:16, 1:1, 16:9), and 720p/1080p output with seed-reproducible results—ideal for ads, trailers, and short-drama beats. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing.

Cost rate
$0.2000/ sec
bytedance/seedance-v1.5-pro/text-to-video
BytedanceText To Video

seedance-v1.5-pro

Seedance 1.5 Pro (Text-to-Video) generates cinematic, live-action–leaning clips from text with strong prompt adherence, expressive motion, and stable aesthetics. It supports 4–12s duration control (including Smart Duration), multiple aspect ratios (including adaptive), and reproducible generation via seeds—ideal for ads and short-drama workflows.

Cost rate
$0.2600/ sec
alibaba/happyhorse-1.0/text-to-video
AlibabaText To Video

happyhorse-1.0

Alibaba Happy Horse 1.0 (Text-to-Video) generates cinematic 720p / 1080p videos from text prompts with smooth camera movement, expressive motion, and strong prompt fidelity. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Cost rate
$0.7000/ sec
kwaivgi/kling-v2.5-turbo-pro/text-to-video
Kling AIText To Video

kling-v2.5-turbo-pro

Kling 2.5 Turbo Pro is a Text-to-Video model that delivers cinematic visuals, fluid motion, and precise prompt-to-motion responsiveness. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Cost rate
$0.3500/ sec
kwaivgi/kling-video-o3-4k/text-to-video
Kling AIText To Video

kling-video-o3-4k

Kling Video O3 4K generates cinematic 4K videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Supports multi-prompt scene transitions, element references, and optional audio generation. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Cost rate
$2.1000/ sec
kwaivgi/kling-video-o3-pro/text-to-video
Kling AIText To Video

kling-video-o3-pro

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Supports audio generation. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Cost rate
$0.5600/ sec
kwaivgi/kling-video-o3-std/text-to-video
Kling AIText To Video

kling-video-o3-std

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Supports audio generation. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Cost rate
$0.4200/ sec
kwaivgi/kling-v3.0-std/text-to-video
Kling AIText To Video

kling-v3.0-std

Kling 3.0 Standard delivers high-quality text-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Cost rate
$0.4200/ sec
kwaivgi/kling-v3.0-pro/text-to-video
Kling AIText To Video

kling-v3.0-pro

Kling 3.0 Pro delivers top-tier text-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Cost rate
$0.5600/ sec
kwaivgi/kling-v3.0-4k/text-to-video
Kling AIText To Video

kling-v3.0-4k

Kling V3.0 4K delivers top-tier 4K text-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and optional audio. Supports flexible aspect ratios, multi-prompt, and element references. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Cost rate
$2.1000/ sec