AI Models - Browse All AI APIs

Text to ImageImage EditingSyncUp to 1536$0.04/image

Cheapest OpenAI gpt-image-2 channel via yunwu.ai. Sync API, text-to-image and multi-image editing capped at 1024-class output for predictable cost.

$0.04 / call

GPT Image 2 Beta

OpenAI

Text to ImageImage EditingMulti-Image (up to 16)NSFW AllowedAsync$0.06/image

OpenAI GPT Image 2 (beta channel via kie.ai). Text-to-image and multi-image editing (up to 16 reference images), aspect-ratio controlled output. NSFW content allowed by default (filter off); pass nsfw_checker=true to enable SFW filtering.

$0.06 / call

GPT Image 2

OpenAI

Text to ImageImage EditingMulti-Image (up to 10)Async$0.05/image

OpenAI gpt-image-2 via RunningHub rhart-image-g-2. Text-to-image and multi-image editing (up to 10 reference images), controlled by aspect ratio.

$0.05 / call

VEO 3.1 Lite

Google

Text to VideoImage to Video4s / 6s / 8s$0.10/video

Google VEO 3.1 Lite via OpenAI-style /v1/videos API. Reference image support, 4s/6s/8s durations, cost-effective video generation.

$0.10 / call

VEO 3.1 Lite 4K

Google

Text to VideoImage to Video4K4s / 6s / 8s$0.11/video

Google VEO 3.1 Lite 4K via OpenAI-style /v1/videos API. 4K resolution, reference image support, 4s/6s/8s durations.

$0.11 / call

Nanobananapro-gemini

Google

Text to ImageImage Editing1K/2K/4K$0.025/image

Gemini 3 Pro Image via GeminiGen channel. Professional asset creation with advanced reasoning and high-fidelity text rendering.

$0.025 / call

Nanobanana2-gemini

Google

Text to ImageImage Editing1K/2K/4K$0.025/image

Gemini 3.1 Flash Image via GeminiGen channel. High-performance image generation optimized for speed and high-volume use.

$0.025 / call

Seedance 2 Fast

ByteDance

Text to VideoImage to Video4-15sMulti-Ratio$0.075/s

ByteDance Seedance 2 Omni Fast mode. 4-15s flexible duration, multiple aspect ratios, per-second pricing.

$0.075/s / call

Seedance 2 Pro

ByteDance

Text to VideoImage to Video4-15sCinematic$0.15/s

ByteDance Seedance 2 Omni Pro mode. 4-15s flexible duration, highest quality, cinematic output.

$0.15/s / call

VEO 3.1 Fast HD

Google

Text to VideoImage to Video720p8s

VEO 3.1 Fast HD (720p) video generation via GeminiGen. 8s fixed duration, 16:9 aspect ratio, reference image support.

$0.07 / call

VEO 3.1 Fast Full HD

Google

Text to VideoImage to Video1080p8s

VEO 3.1 Fast Full HD (1080p) video generation via GeminiGen. 8s fixed duration, 16:9 aspect ratio, reference image support.

$0.07 / call

Grok Video 3 (Official)

xAI

Text to VideoImage to Video6s/10s/15s720p

Official Grok Video 3 via GeminiGen API. Fast generation with customizable resolution, duration (6/10/15s), and reference image support.

$0.08 / call

VEO 3.1 Fast 4K (Beta)

RunningHub

Image to VideoStart-End Frame4K1080p

VEO 3.1 Fast 4K video generation via RunningHub. Requires start frame image. Supports start-end frame video generation.

$0.07-$0.10 / call

SparkPix Image

SparkPix

Text to ImageSub 1sText RenderingLoRA Support

Sub 1 second text-to-image model built for production use cases. State-of-the-art speed, quality, and text rendering.

$0.008 / call

SparkPix Image Edit

SparkPix

Image EditingMulti-ImageSub 1sText Rendering

Sub 1 second multi-image editing model. Fast, affordable AI image editing with precise prompt adherence and multi-image support.

$0.013 / call

P-Video

Pruna AI

T2V & I2VDraft ModeBuilt-in Audio720p/1080p1-20sMulti-Aspect

Fast video generation in ~10 seconds. Text/image/audio-to-video with draft mode for 4x faster previews. Built-in audio generation, up to 1080p 48FPS.

$0.05+ / call

Grok Imagine Image

xAI

Text to ImageMultiple SizesHD Quality

Multimodal AI image generation by X platform. Generates high-quality images from text descriptions.

$0.03 / call

Grok Imagine Image Pro

xAI

Text to ImageHD QualityPro DetailMultiple Sizes

Upgraded multimodal AI model by X platform with stronger understanding and finer detail generation for higher precision images.

$0.10 / call

Seedance 2.0 Beta (Cinematic)

Seedance

T2V & I2VUp to 4 Ref Images5/10/15sCinematic Lighting16:9/9:16/4:3/3:4

Hollywood-grade cinematic video generator. Dual-mode T2V & I2V with up to 4 reference images. Professional color grading, dramatic lighting, and smooth camera movement.

$1.00+ / call

DreamActor V2 (Motion Transfer)

ByteDance

Motion TransferMulti-PersonAnime SupportLip SyncMax 30s

ByteDance DreamActor V2 motion transfer. Drive any character image with reference video motion, supporting multi-person, anime and pets.

$0.06/s / call

Kling Lip-Sync Video

Kling

Lip SyncMulti-CharacterAudio AlignmentMinute-Level Duration

Kling AI lip-sync video generation. Frame-level lip synchronization with audio for real humans, 3D and 2D characters.

$0.065/5s / call

Kling Lip-Sync TTS

Kling

Text to SpeechMulti-LanguageVoice CloneSpeed Control

Kling text-to-speech synthesis with multi-language support, voice cloning, speed control and emotion styles.

$0.01 / call

Text to VideoImage to Video3-15sAudioLow Cost

Kling V3 (Stable-QN)

Kling

0Active

Kling V3 video via Stable-QN channel. Supports text-to-video and image-to-video, 3-15s with optional audio.

$0.07/s+ / call

Multi-ModalImage/Video Input3-15sAudioVideo Editing

Kling V3 Omni (Stable-QN)

Kling

0Active

Kling V3 Omni-Video via Stable-QN channel. Multi-modal input with image_list, video_list and keep-original-sound.

$0.07/s+ / call

Nanobanana-2-beta

Google

Text to ImageImage Editing1K/2K/4K QualityLow Cost

Budget-friendly Gemini 3.1 Flash image generation. Supports text-to-image and image editing at lower cost.

$0.025+ / call

MiniMax M2.5

MiniMax

Coding SOTATool CallingWeb SearchOffice Tasks

MiniMax M2.5 reaches or sets new SOTA in coding, tool calling, search, and office productivity tasks.

$0.442 / $1.765 / 1M

Text to VideoImage to VideoStart-End Frame1-16s540p-1080p

Vidu Q3 Turbo

Vidu

-75%Active

Fast video generation by Vidu Q3 Turbo. Supports text/image/start-end frame to video, 1-16s, 540p-1080p.

$0.028/s+ / call

GPT-5.4 Pro

OpenAI

Deep ReasoningMulti-turnFrontier

Uses more compute to think deeper and deliver consistently better answers. Supports multi-turn model interactions and advanced API features.

$6.618 / $52.942 / 1M

GPT-5.4

OpenAI

FrontierProfessionalComplex Tasks

Our frontier model for complex professional work.

$0.552 / $4.412 / 1M

Gemini 3.1 Flash Lite Preview

Google

Ultra FastCost EffectiveLightweight

Most cost-effective multimodal model with fastest performance for high-frequency lightweight tasks.

$0.055 / $0.331 / 1M

Latest ProEnhanced ReasoningMultimodal

Gemini 3.1 Pro Preview

Google

-86%Active

Latest Pro model with enhanced reasoning and multimodal capabilities.

$0.442 / $2.648 / 1M

Kling Custom Voice

Kling

Custom VoiceAudio UploadVideo ReferenceFor TTS/Lip Sync

Create custom voice profiles from audio samples. Upload .mp3/.wav/.mp4/.mov (5-30s) or reference a video ID.

$0.006 / per call

Kling Advanced Lip Sync

Kling

Lip SyncMulti-FaceCustom AudioPer-Second Pricing

Sync one or multiple faces in a video with custom audio. Supports precise timing control.

$0.006 / per 5s

Kling Motion Control

Kling

Motion Control720p / 1080pUp to 30s Video

Generate videos with character motion control. Provide a reference image and motion video to create animated content.

$0.045/s+ / call

Kling Face Recognition

Kling

Face DetectionVideo InputSession-based

Identify faces in video for advanced lip-sync. Returns session ID and face IDs.

$0.001 / per call

Claude Opus 4.6

Anthropic

Latest OpusUltimate PerformancePremium

Latest Opus model with ultimate performance and reasoning capabilities.

Claude Opus 4.6 (Thinking)

Anthropic

Extended ThinkingUltimate PerformanceMost Powerful

Claude Opus 4.6 with extended thinking capability for the most complex reasoning tasks.

Claude Sonnet 4.6

Anthropic

Latest SonnetBest PerformanceTop Efficiency

Latest Sonnet model with best performance and efficiency.

Claude Sonnet 4.6 (Thinking)

Anthropic

Extended ThinkingBest PerformanceDeep Reasoning

Claude Sonnet 4.6 with extended thinking capability for complex reasoning tasks.

Kling Omni-Image

Kling

Text to ImageImage Editing1k/2k QualityMulti-Image Input

AI image generation and editing by Kling. Supports 1k/2k resolution and multi-image input for creative editing.

$0.027+ / call

Kling Sound Effects

Kling

Text to SFX3-10sSound Effects

Generate sound effects from text descriptions. 3-10 second audio with natural quality.

$0.030 / per call

Text to ImageReference Image1K/2KLow Cost

Kling Omni-Image (Stable-QN)

Kling

-75%Active

Kling OmniImage via Stable-QN channel. Text-to-image with reference image support, 1K/2K resolution.

$0.023+ / call

Kling Video-to-Audio

Kling

Video DubbingSFX + BGMASMR Mode

Auto-generate sound effects and background music for videos. Supports ASMR mode for immersive content.

$0.003 / per call

Image Upscale2K / 4KDetail Enhancement

SeedVR 2.5 Image Upscale

SeedVR

-80%Active

AI image upscaling and enhancement. Upscale images to 2K or 4K resolution with high quality detail preservation.

$0.015+ / call

Kling TTS

Kling

TTSMultiple VoicesSpeed Control

Text-to-speech with multiple voice options. Adjustable speed and multi-language support.

$0.006 / per call

Grok-4.2

xAI

16-Agent ClusterReal-time DataSelf-Evolution

Trillion-parameter model with 16-Agent cluster collaboration, real-time data processing and self-evolution.

$0.618 / $3.089 / 1M

Nanobanana2

Google

Text to ImageImage Editing1K/2K/4K Quality$0.06-$0.10/image

Fast image generation powered by Gemini 3.1 Flash. Supports text-to-image and image editing with 1K/2K/4K quality.

$0.06+ / call

Minimax Speech 2.6 HD

Minimax

HD QualityVoice CloneAsync TTS

High-definition async TTS by Minimax (海螺). Rich expressiveness with natural prosody. Supports voice clone and voice design.

$0.083 / 1K chars

Nanobanana-2-lite

Google

Image Editing1K/2K/4K Quality$0.04/image

Budget-friendly image editing powered by Gemini 3.1 Flash via RunningHub. Image-to-image only with 1K/2K/4K quality.

$0.04 / call

Minimax Speech 02 Turbo

Minimax

FastVoice CloneCost-Effective

Fast and cost-effective async TTS by Minimax (海螺). Supports voice clone, voice design, and pronunciation dictionaries.

$0.048 / 1K chars

Grok 4.2 Image

xAI

Text to ImageImage EditingMask InpaintingMultiple Sizes

Image generation and editing powered by Grok 4.2. Supports text-to-image creation and image editing with mask inpainting.

$0.02 / call

Fast ResponseMultimodalCost Effective

Gemini 3 Flash Preview

Google

-50%Active

Fast and efficient multimodal model. Great for quick responses and simple tasks.

$0.111 / $0.662 / 1M

Advanced ReasoningMultimodalHigh Quality

Gemini 3 Pro Preview

Google

-86%Active

Advanced multimodal reasoning model with superior capabilities.

$0.442 / $2.648 / 1M

Extended ThinkingAdvanced ReasoningDeep Analysis

Gemini 3 Pro (Thinking)

Google

-86%Active

Gemini 3 Pro with extended thinking capability for complex reasoning tasks.

$0.442 / $2.648 / 1M

Doubao Seedream 5.0 Lite

Doubao

Text to ImageImage EditingMulti-Image Fusion2K/3K

Latest Doubao Seedream 5.0 Lite image generation. Supports text-to-image, image editing, and multi-image fusion with 2K/3K resolution.

$0.05 / call

GPT-5.2

OpenAI

Latest ModelAdvanced ReasoningPremium

Latest GPT model with advanced reasoning and enhanced capabilities.

$0.383 / $3.089 / 1M

GPT-5.2 Chat

OpenAI

Chat OptimizedAdvancedFast

GPT-5.2 optimized for conversational interactions.

$0.383 / $3.089 / 1M

Doubao Seedream 4.5

Doubao

Text to ImageImage Editing2K/4KMulti-Image Input

High quality Doubao Seedream 4.5 image generation. Supports text-to-image and image editing with 2K/4K resolution.

$0.05 / call

Premium Quality4K ResolutionAudio Generation5s / 8s

VEO 3.1 4K

Google

-95%Active

Google VEO 3.1 standard mode. Premium quality with audio generation support. 5s or 8s video output.

$0.111+ / call

GPT-5.1

OpenAI

High PerformanceBalancedBest Value

Powerful model with excellent performance and efficiency.

$0.265 / $2.206 / 1M

GPT-5.1 (2025-11-13)

OpenAI

StableReproducibleProduction

Snapshot version of GPT-5.1 for reproducible outputs.

$0.265 / $2.206 / 1M

Claude Opus 4.5

Anthropic

Latest OpusBest PerformancePremium

Latest Opus model with enhanced capabilities and improved reasoning.

Claude Opus 4.5 (Thinking)

Anthropic

Extended ThinkingBest PerformanceMost Powerful

Claude Opus 4.5 with extended thinking capability for the most complex reasoning tasks.

Web SearchReal-time DataGrounded

GPT-5 Search API

OpenAI

-60%Beta

GPT-5 with integrated web search for real-time information.

$1.471 / $11.765 / 1M

Web SearchStableProduction

GPT-5 Search (2025-10-14)

OpenAI

-60%Beta

Snapshot version of GPT-5 Search API for stable deployments.

$1.471 / $11.765 / 1M

Grok Video 3 (10s)

xAI

Audio + VideoText to VideoImage to Video10s

Latest Grok video model with synchronized audio and video generation, 10-second output.

$0.083 / call

Grok Video 3

xAI

Text to VideoImage to Video5s16:9 / 9:16

High-quality 5-second video generation powered by Grok. Supports horizontal and vertical aspect ratios.

$0.08 / call

Claude Haiku 4.5

Anthropic

Fast ResponseLow CostBasic Tasks

Fast and affordable model for lightweight tasks. Best for simple queries and quick responses.

$0.353 / $1.765 / 1M

Claude Haiku 4.5 (Thinking)

Anthropic

Extended ThinkingFast ResponseCost Effective

Claude Haiku 4.5 with extended thinking capability for complex reasoning tasks.

$0.353 / $1.765 / 1M

GPT-5

OpenAI

General PurposeReliableVersatile

Versatile model for general-purpose tasks.

$0.265 / $2.206 / 1M

Claude Sonnet 4.5

Anthropic

Latest ModelEnhanced PerformanceBest Value

Latest Sonnet model with improved performance and efficiency.

Claude Sonnet 4.5 (Thinking)

Anthropic

Extended ThinkingComplex ReasoningDeep Analysis

Claude Sonnet 4.5 with extended thinking capability for complex reasoning tasks.

Gemini 3 Pro Image (Pro)

Google

Text to ImageImage Editing1K/2K/4K Quality99% Success Rate

Premium image generation powered by Gemini 3 Pro. 99% success rate. Best quality and reliability.

$0.06+ / call

Gemini 3 Pro Image (Lite)

Google

Text to ImageImage Editing1K/2K/4K Quality$0.06-$0.10/image

High quality image generation powered by Gemini 3 Pro. 97% success rate. Supports text-to-image and image editing.

$0.06+ / call

High PerformanceMultimodalComplex Tasks

Gemini 2.5 Pro

Google

-86%Beta

Powerful multimodal model for complex tasks with excellent performance.

$0.184 / $1.471 / 1M

Extended ThinkingHigh PerformanceDeep Analysis

Gemini 2.5 Pro (Thinking)

Google

-86%Beta

Gemini 2.5 Pro with extended thinking capability for complex reasoning.

$0.184 / $1.471 / 1M

Fast ResponseCost EffectiveBest Value

Gemini 2.5 Flash

Google

-40%Beta

Fast and cost-effective multimodal model. Best balance of speed and quality.

$0.045 / $0.368 / 1M

Extended ThinkingFast ResponseCost Effective

Gemini 2.5 Flash (Thinking)

Google

-40%Beta

Gemini 2.5 Flash with extended thinking capability for reasoning tasks.

$0.045 / $0.368 / 1M

Text to ImageImage EditingMultiple Aspect Ratios

Gemini 2.5 Flash Image

Google

-50%Active

Fast image generation powered by Gemini 2.5 Flash. Supports text-to-image and image editing with natural language.

$0.206 / call

Fast ReasoningEfficientLow Cost

o3-mini

OpenAI

-75%Beta

Fast reasoning model with efficient output generation.

$0.486 / $1.942 / 1M

Advanced ReasoningDeep ThinkingAnalysis

o3

OpenAI

-70%Beta

Advanced reasoning model for complex analytical tasks.

$0.883 / $3.530 / 1M

Ultra FastLowest CostHigh Volume

Gemini Flash Lite

Google

-86%Beta

Lightweight and ultra-fast model. Best for simple tasks and high volume.

$0.015 / $0.059 / 1M

Claude Opus 4

Anthropic

Most CapableSuperior ReasoningComplex Tasks

Most capable model with superior reasoning and analysis capabilities.

$5.295 / $26.471 / 1M

Claude Opus 4 (Thinking)

Anthropic

Extended ThinkingSuperior ReasoningMost Powerful

Claude Opus 4 with extended thinking capability for the most complex reasoning tasks.

$5.295 / $26.471 / 1M

Claude Sonnet 4

Anthropic

BalancedCode GenerationAnalysis

Balanced model with excellent performance and cost efficiency. Great for most tasks.

Claude Sonnet 4 (Thinking)

Anthropic

Extended ThinkingComplex ReasoningBest Value

Claude Sonnet 4 with extended thinking capability for complex reasoning tasks.

1536 DimensionsFastCost Effective

Text Embedding 3 Small

OpenAI

-90%Active

Small embedding model, efficient and cost-effective for most use cases.

$0.018 / 1M tokens

Text Embedding 3 Large

OpenAI

3072 DimensionsHigh AccuracyFlexible

Large embedding model for higher accuracy and flexible dimensions.

$0.059 / 1M tokens