Compare 85+ AI Model APIs, One Key — 95% Cheaper

GLM-5.2

Zhipu

ReasoningTool CallingOpenAI-compatible

Zhipu GLM-5.2 — a reasoning model with strong function-calling / tool-use, served via the OpenAI-compatible chat-completions endpoint.

$1.4 / $4 / 1M

GPT-5.5

OpenAI

Agentic CodingDeep ReasoningWeb SearchMultimodal

OpenAI’s advanced reasoning model for agentic coding, knowledge work, scientific research, and complex multi-step task execution. Served via the Responses API with adjustable reasoning effort (low–xhigh), web search, and function calling.

$3 / $18 / 1M

Subtitle Eraser

API Models

Subtitle RemovalOn-screen TextRegion Targeting$0.01/s

NEWSubtitle Removal

Remove hardcoded subtitles and burned-in on-screen text from any video, leaving a clean background. Pick subtitle-only or any-text mode, quality or smaller-size output, and optionally target only chosen regions. Billed $0.01 per second of input video.

$0.01/s / call

Seedance 2.0

ByteDance

NEWMost Capable9 Ref Images

The fullest, most all-around capable Seedance 2.0 channel — full multimodal generation from text, a first / first+last frame, up to 9 reference images, reference video, reference audio, web search and native audio. Supports real people and asset creation. 480p / 720p / 1080p, 4-15s, per-second pricing.

Full MultimodalUp to 9 Ref ImagesReal-Person + Assets480p/720p/1080p4-15s

$0.13/s+ / call

Seedance 2.0 Fast

ByteDance

Full MultimodalUp to 9 Ref ImagesReal-Person + Assets480p/720pFast

NEWMost CapableFast

The faster, cheaper variant of the fullest Seedance 2.0 channel — same full multimodal capability (text / first+last frame / up to 9 reference images / reference video / reference audio / web search / native audio), supports real people and asset creation. 480p / 720p, 4-15s, per-second pricing.

$0.11/s+ / call

gemini-3.1-flash-image

Google

Text to ImageImage Editing512 / 1K / 2K / 4K$0.04+/image

Google's gemini-3.1-flash-image (GA release). High-quality image generation and conversational editing at low latency. Priced by resolution: 512 $0.04, 1K/2K $0.06, 4K $0.10.

$0.04+ / call

gemini-3-pro-image

Google

Text to ImageImage Editing1K / 2K / 4K$0.12+/image

Google's gemini-3-pro-image (GA release). Top-quality, high-fidelity image generation and editing with advanced reasoning. Priced by resolution: 1K/2K $0.12, 4K $0.21.

$0.12+ / call

Seedance 2.0 Real Person

ByteDance

Real-PersonUp to 3 ImagesDriving Video + Audio4-15s480p–4k

NEWReal-PersonMultimodal

Seedance 2.0 Real Person — full multimodal video generation that ACCEPTS real faces (unlike Ark-direct Seedance, which rejects them). Combine up to 3 reference images (character / scene / lighting), a driving video for camera and motion transfer, and an audio track for voice, plus a text prompt. 4-15s, 480p to 4k. With a driving video, billing follows the minimum-billing table (input + output seconds). Use only with consented subjects.

$0.15/s+ / call

Seedance 2.0 Real Person Fast

ByteDance

Real-PersonImage + Driving VideoFast4-15s480p–4k

NEWReal-PersonFast

Seedance 2.0 Real Person Fast — faster, lower-cost real-person video. One reference image plus an optional driving video for motion/camera transfer, with a text prompt. Real-person supported. 4-15s, 480p to 4k. Use only with consented subjects.

$0.12/s+ / call

LTX-2.3 Video

Lightricks

T2V & I2V480p / 720p / 1080p16:9 / 9:165-20s

NEWT2V + I2V

LTX-2.3 unified text-to-video and image-to-video. Send a prompt for T2V, or add one reference image for I2V — fast, with 480p / 720p / 1080p output. Billed per second by resolution.

$0.02/s+ / call

Minimax Speech 2.8 HD

Minimax

HD QualityEmotion-AwareVoice CloneVoice Design

Latest high-fidelity TTS by MiniMax (海螺). Predicts emotion and intonation from context for ultra-natural, expressive, personalized speech. Supports voice clone and voice design.

$0.07 / 1K chars

Minimax Speech 2.8 Turbo

Minimax

FastCost-EffectiveVoice CloneVoice Design

Latest fast, cost-effective async TTS by MiniMax (海螺). Great quality-to-price for high-volume synthesis. Supports voice clone and voice design.

$0.04 / 1K chars

Minimax Speech 02 HD

Minimax

HD QualityEmotion-AwareVoice CloneVoice Design

High-fidelity TTS by MiniMax (海螺). Predicts emotion and intonation from context to produce ultra-natural, expressive, personalized speech — built for social, podcasts, audiobooks, news, education and digital humans. Supports voice clone and voice design.

$0.07 / 1K chars

Grok Imagine Video 1.5 (Beta)

xAI

Image to VideoFlat per-clip price480p / 720p5-15s

NEWBetaFlat Price

Grok Imagine Video 1.5 (Beta) — an alternative RunningHub channel for xAI Grok image-to-video. Turn one reference image into a cinematic clip with an optional prompt. Simple flat per-clip pricing by duration (5 / 8 / 10 / 12 / 15s), 480p or 720p.

$0.35+ / call

Grok Imagine Video 1.5

xAI

NEW#1 I2V ArenaNative Audio

xAI Grok Imagine Video 1.5 Preview — image-to-video with native synchronized audio. #1 on the Image-to-Video Arena, with lifelike motion, strong prompt adherence and consistent characters. 480p / 720p, 1-15s.

Image to VideoNative Audio480p / 720p1-15s

$0.098/s+ / call

Kling V3 Image

Kling

Text to ImageImage to Image1K/2K

Kling V3 image generation. Text-to-image and single-reference image-to-image, 1K/2K resolution. $0.05 per image.

$0.05 / call

Kling V3 Omni

Kling

Multi-ImageElement ConsistencySeries Output1K/2K/4K

Kling V3 Omni image generation. Multi-image reference & fusion, element consistency, single/series output, 1K/2K/4K — 1K/2K $0.05, 4K $0.10 per image.

Omni Flash (Stable)

Google

T2V / I2V / V2VUp to 7 Ref ImagesVoice + Character720p / 1080p / 4k

NEWStableFull Suite

Omni Flash (Stable) — lower-cost, full-suite Gemini Omni video. Text / image (up to 7 refs) / video-to-video, plus reusable voices and consistent characters. 720p / 1080p / 4k, 4 / 6 / 8 / 10s, 16:9 or 9:16, optional seed.

$0.35+ / call

Claude Opus 4.8

Anthropic

Most CapableLong-Horizon AutonomyAgentic WorkflowsProduction Quality

Anthropic's most capable model yet — built to autonomously carry long, complex work end to end. Ideal for big projects, building agents, and high-stakes scenarios demanding top quality and autonomy.

$3 / $13 / 1M

Claude Opus 4.7

Anthropic

1M Context128K OutputAdaptive ThinkingLatest Opus

Latest Opus model with 1M context, 128K max output, and adaptive thinking — same tools and platform features as Opus 4.6.

$3.676 / $18.382 / 1M

Claude Opus 4.7 (Thinking)

Anthropic

Extended Thinking1M Context128K OutputMost Powerful

Claude Opus 4.7 with extended thinking explicitly enabled for the most complex reasoning tasks.

$3.676 / $18.382 / 1M

GA ReleaseAgentic ExecutionLong-horizon TasksProduction-ready

Gemini 3.5 Flash

Google

-50%Активна

GA release. Our most intelligent Flash model — consistent leadership on agentic execution, coding, and long-horizon tasks at scale.

$0.662 / $3.971 / 1M

Grok Imagine Image

xAI

Text to ImageMultiple SizesHD Quality

Multimodal AI image generation by X platform. Generates high-quality images from text descriptions.

Grok Imagine Image Pro

xAI

Text to ImageHD QualityPro DetailMultiple Sizes

Upgraded multimodal AI model by X platform with stronger understanding and finer detail generation for higher precision images.

$0.10 / call

Seedance 2.0 (Official)

ByteDance

T2V & I2VMultimodal Reference4-15s480p/720p/1080p

NEWDirect OfficialStable

ByteDance Seedance 2.0 cinematic video — direct official Volcengine API, stable and high-concurrency. Text, image and multimodal generation with friendly per-second pricing.

$0.092/s+ / call

Omni Flash (Gemini)

Google

T2V & I2V720p / 1080p / 4k4-10s1 or 3 Ref Images

NEWUnified T2V + I2V4K

Gemini Omni Flash — unified video generator for both text-to-video and image-to-video (1 or 3 reference images). 720p / 1080p / 4k, 4 / 6 / 8 / 10s, optional 16:9 or 9:16 framing. One slug, two modes — drop in a prompt, optionally drop in images.

$0.30+ / call

Seedance 2.0 Cinematic

ByteDance

Real-Person RefCinematicHigh Quality

Film-grade edition of Seedance 2.0 — cinematic lighting, mood and camera motion, and it ACCEPTS real-person / realistic human reference images (unlike Ark-direct Seedance 2.0, which rejects real faces). Up to 4 reference images for identity-locked image-to-video — ideal for film-grade portrait and character work. Quality tier sits above the Ark variants; generation takes longer (typically 60-180s). Use only with consented subjects.

Real-Person ReferenceFilm-GradeT2V & I2VUp to 4 Ref Images

$1.00+ / call

Seedance 2.0 Fast (Official)

ByteDance

Text to VideoImage to VideoMultimodal Reference4-15s480p/720p

NEWDirect OfficialStable

Seedance 2.0 Fast — direct official Volcengine API, faster and lower-cost. Text, image and multimodal video generation, stable under high concurrency.

$0.071/s+ / call

DreamActor V2 (Motion Transfer)

ByteDance

Motion TransferMulti-PersonAnime SupportLip SyncMax 30s

ByteDance DreamActor V2 motion transfer. Drive any character image with reference video motion, supporting multi-person, anime and pets.

$0.06/s / call

Kling Lip-Sync Video

Kling

Lip SyncMulti-CharacterAudio AlignmentMinute-Level Duration

Kling AI lip-sync video generation. Frame-level lip synchronization with audio for real humans, 3D and 2D characters.

$0.065/5s / call

Kling Lip-Sync TTS

Kling

Text to SpeechMulti-LanguageVoice CloneSpeed Control

Kling text-to-speech synthesis with multi-language support, voice cloning, speed control and emotion styles.

$0.01 / call

Doubao Seedream 5.0 Lite (Official)

Doubao

Text to ImageImage to Image2K / 4KPNG · No WatermarkOfficial Ark API

Doubao Seedream 5.0 Lite via ByteDance Volcano Ark official API. Unified text-to-image and image-to-image (pass image for I2I, omit for T2I). 2K / 4K output, no watermark, PNG.

$0.055 / call

Omni Video V3.1-Lite (Start-End, Official Stable)

Google

Start + End FrameBoth Required720p / 1080p16:9 / 9:16Native AudioOfficial Stable

Smooth cinematic transitions between a required first frame and required last frame. Outputs 720p or 1080p with native audio. Official stable channel — pricier than V3.1-fast but reliable, ideal for production.

$0.72-$1.152 / call

GPT Image 2 Lite

OpenAI

Image to ImageMulti-Image (up to 10)1K/2K/4KSync$0.03/image

Cheapest OpenAI gpt-image-2 channel. Sync image-to-image edit API with multi-image fusion (up to 10), 1K/2K/4K output and quality control. Flat $0.03 per image.

$0.03 / call

GPT Image 2 Beta

OpenAI

Text to ImageImage EditingMulti-Image (up to 16)1K / 2K / 4K$0.03+/image

OpenAI GPT Image 2 (beta channel). Text-to-image and multi-image editing (up to 16 reference images), aspect-ratio controlled output. Independent channel from the primary gpt-image-2 route for redundancy. Priced by resolution: $0.03 (1K) / $0.045 (2K) / $0.06 (4K).

$0.03+ / call

GPT Image 2

OpenAI

Text to ImageImage EditingMulti-Image (up to 10)Async1K/2K/4K

OpenAI gpt-image-2. Text-to-image and multi-image editing (up to 10 reference images), aspect-ratio control, flat $0.03 per image at Medium/High quality.

$0.03 / call

Nanobananapro-gemini

Google

Text to ImageImage Editing1K/2K/4K$0.025/image

Gemini 3 Pro Image via a budget channel. Professional asset creation with advanced reasoning and high-fidelity text rendering.

Nanobanana2-gemini

Google

Text to ImageImage Editing1K/2K/4K$0.025/image

Gemini 3.1 Flash Image via a budget channel. High-performance image generation optimized for speed and high-volume use.

VEO 3.1 Fast HD

Google

Text to VideoImage to Video720p8s

VEO 3.1 Fast HD (720p) video generation. 8s fixed duration, 16:9 aspect ratio, reference image support.

$0.07 / call

VEO 3.1 Fast Full HD

Google

Text to VideoImage to Video1080p8s

VEO 3.1 Fast Full HD (1080p) video generation. 8s fixed duration, 16:9 aspect ratio, reference image support.

$0.07 / call

VEO 3.1 Fast 4K (Beta)

Google

Image to VideoStart-End Frame4K1080p

VEO 3.1 Fast 4K video generation. Requires start frame image. Supports start-end frame video generation.

$0.07-$0.10 / call

SparkPix Image

SparkPix

Text to ImageSub 1sText RenderingLoRA Support

Sub 1 second text-to-image model built for production use cases. State-of-the-art speed, quality, and text rendering.

$0.008 / call

SparkPix Image Edit

SparkPix

Image EditingMulti-ImageSub 1sText Rendering

Sub 1 second multi-image editing model. Fast, affordable AI image editing with precise prompt adherence and multi-image support.

$0.013 / call

P-Video

Pruna AI

T2V & I2VDraft ModeBuilt-in Audio720p/1080p1-20sMulti-Aspect

Fast video generation in ~10 seconds. Text/image/audio-to-video with draft mode for 4x faster previews. Built-in audio generation, up to 1080p 48FPS.

Nanobanana-2-beta

Google

Text to ImageImage Editing1K/2K/4K Quality1K/2K $0.05

Budget-friendly Gemini 3.1 Flash image generation. Text-to-image and image editing — 1K/2K $0.05, 4K $0.08 per image.

MiniMax M2.5

MiniMax

Coding SOTATool CallingWeb SearchOffice Tasks

MiniMax M2.5 reaches or sets new SOTA in coding, tool calling, search, and office productivity tasks.

$0.442 / $1.765 / 1M

Gemini 3.1 Flash Lite

Google

Ultra FastCost EffectiveLightweight

Most cost-effective multimodal model with fastest performance for high-frequency lightweight tasks.

$0.055 / $0.331 / 1M

Latest ProEnhanced ReasoningMultimodal

Gemini 3.1 Pro Preview

Google

-86%Активна

Latest Pro model with enhanced reasoning and multimodal capabilities.

$0.442 / $2.648 / 1M

Kling Custom Voice

Kling

Custom VoiceAudio UploadVideo ReferenceFor TTS/Lip Sync

Create custom voice profiles from audio samples. Upload .mp3/.wav/.mp4/.mov (5-30s) or reference a video ID.

$0.006 / per call

Motion Control720p / 1080pUp to 30s Video

Kling Motion Control

Kling

-70%Активна

Generate videos with character motion control. Provide a reference image and motion video to create animated content.

$0.06/s+ / call

Kling Face Recognition

Kling

Face DetectionVideo InputSession-based

Identify faces in a video and return a session ID and face IDs for Kling lip-sync video generation.

$0.01 / per call

Claude Opus 4.6

Anthropic

Latest OpusUltimate PerformancePremium

Latest Opus model with ultimate performance and reasoning capabilities.

Claude Opus 4.6 (Thinking)

Anthropic

Extended ThinkingUltimate PerformanceMost Powerful

Claude Opus 4.6 with extended thinking capability for the most complex reasoning tasks.

Claude Sonnet 4.6

Anthropic

Latest SonnetBest PerformanceTop Efficiency

Latest Sonnet model with best performance and efficiency.

Claude Sonnet 4.6 (Thinking)

Anthropic

Extended ThinkingBest PerformanceDeep Reasoning

Claude Sonnet 4.6 with extended thinking capability for complex reasoning tasks.

Kling Omni-Image

Kling

Text to ImageImage Editing1k/2k QualityMulti-Image Input

AI image generation and editing by Kling (omni-image, model kling-image-o1). Supports 1K/2K resolution and multi-image input. $0.05 per image.

$0.05 / call

Kling Sound Effects

Kling

Text to SFX3-10sSound Effects

Generate sound effects from text descriptions. 3-10 second audio with natural quality.

$0.030 / per call

Kling Video-to-Audio

Kling

Video DubbingSFX + BGMASMR Mode

Auto-generate sound effects and background music for videos. Supports ASMR mode for immersive content.

$0.003 / per call

Image Upscale2K / 4KDetail Enhancement

SeedVR 2.5 Image Upscale

SeedVR

-80%Активна

AI image upscaling and enhancement. Upscale images to 2K or 4K resolution with high quality detail preservation.

$0.015+ / call

Kling TTS

Kling

TTSMultiple VoicesSpeed Control

Text-to-speech with multiple voice options. Adjustable speed and multi-language support.

$0.01 / per call

Kling V3 Omni

Kling

Multi-ModalText/Image/Video Input5-15sAudioKeep Original Sound

Kling V3 Omni-Video with extended duration and keep-original-sound support for video editing. Flat $0.15/s billing.

$0.15/s / call

16-Agent ClusterReal-time DataSelf-Evolution

Grok-4.2

xAI

-70%Активна

Trillion-parameter model with 16-Agent cluster collaboration, real-time data processing and self-evolution.

$0.618 / $3.089 / 1M

Nanobanana2

Google

Text to ImageImage Editing1K/2K/4K Quality1K/2K $0.05

Fast image generation powered by Gemini 3.1 Flash. Supports text-to-image and image editing — 1K/2K $0.05, 4K $0.08 per image.

Minimax Speech 2.6 HD

Minimax

HD QualityVoice CloneAsync TTS

High-definition async TTS by Minimax (海螺). Rich expressiveness with natural prosody. Supports voice clone and voice design.

$0.07 / 1K chars

Nanobanana-2-lite

Google

Image Editing1K/2K/4K Quality1K/2K $0.04

Budget-friendly image editing powered by Gemini 3.1 Flash. Image-to-image only — 1K/2K $0.04, 4K $0.07 per image.

$0.04+ / call

Minimax Speech 02 Turbo

Minimax

FastVoice CloneCost-Effective

Fast and cost-effective async TTS by Minimax (海螺). Supports voice clone, voice design, and pronunciation dictionaries.

$0.04 / 1K chars

Grok 4.2 Image

xAI

Text to ImageImage EditingMask InpaintingMultiple Sizes

Image generation and editing powered by Grok 4.2. Supports text-to-image creation and image editing with mask inpainting.

Fast ResponseMultimodalCost Effective

Gemini 3 Flash Preview

Google

-50%Активна

Fast and efficient multimodal model. Great for quick responses and simple tasks.

$0.111 / $0.662 / 1M

Advanced ReasoningMultimodalHigh Quality

Gemini 3 Pro Preview

Google

-86%Активна

Advanced multimodal reasoning model with superior capabilities.

$0.442 / $2.648 / 1M

Extended ThinkingAdvanced ReasoningDeep Analysis

Gemini 3 Pro (Thinking)

Google

-86%Активна

Gemini 3 Pro with extended thinking capability for complex reasoning tasks.

$0.442 / $2.648 / 1M

Kling V3

Kling

Text to VideoImage to Video3-15sAudioPer-Second Pricing

Latest Kling V3 video generation. Supports 3-15s flexible duration, text-to-video and image-to-video with optional audio.

$0.12/s+ / call

Eleven Flash v2.5

ElevenLabs

Ultra-Fast32 LanguagesLowest Cost

Ultra low latency model in 32 languages. Ideal for real-time conversational use cases.

$0.0425 / 1K chars

Eleven Turbo v2.5

ElevenLabs

Low Latency32 LanguagesSSML Support

High quality, low latency model in 32 languages. Best for developer use cases where speed matters.

$0.0425 / 1K chars

Eleven Multilingual v2

ElevenLabs

High Quality29 LanguagesEmotionally Rich

Most life-like, emotionally rich mode in 29 languages. Best for voice overs, audiobooks, post-production.

$0.085 / 1K chars

Eleven v3

ElevenLabs

70+ LanguagesAudio TagsMost Expressive

Most expressive model with 70+ languages. Supports audio tags like [laughs], [whispers] for emotional control.

$0.085 / 1K chars

ElevenLabs Dialogue

ElevenLabs

Multi-SpeakerNatural FlowConversation

Multi-speaker dialogue generation with natural conversation flow. Perfect for podcasts and audiobooks.

$0.085 / 1K chars

Voice Isolator

ElevenLabs

Noise RemovalSpeech ExtractionAudio Cleanup

Extract speech from background noise, music and ambient sounds. Clean audio extraction.

$0.102 / min

AI Dubbing

ElevenLabs

Video Dubbing29 LanguagesPreserve Emotion

Translate audio/video while preserving emotion, timing and tone. Automatic lip-sync.

$0.2805 / min

Doubao Seedream 4.5

Doubao

Text to ImageImage Editing2K/4KMulti-Image Input

High quality Doubao Seedream 4.5 image generation. Supports text-to-image and image editing with 2K/4K resolution.

$0.05 / call

Omni Video V3.1-fast (Start-End Frame, Budget)

Google

Start + End Frame8s720p / 1080p / 4K16:9 / 9:16Budget Channel

High-performance start-end frame video. Provide first + optional last frame, the model interpolates motion between them in seconds. Budget channel — cheaper than the official VEO, less stable.

$0.30 / call

Claude Opus 4.5

Anthropic

Latest OpusBest PerformancePremium

Latest Opus model with enhanced capabilities and improved reasoning.

Claude Opus 4.5 (Thinking)

Anthropic

Extended ThinkingBest PerformanceMost Powerful

Claude Opus 4.5 with extended thinking capability for the most complex reasoning tasks.

Grok Video 3 (10s)

xAI

Text to VideoImage to Video10s720p / 480p$0.01/s

Grok Video 3. 10-second video at $0.01/s. Supports text-to-video and image-to-video (up to 7 reference images).

$0.10 / call

Grok Video 3

xAI

Text to VideoImage to Video6 / 10 / 15s720p / 480p$0.01/s

Grok Video 3. Per-second pricing $0.01/s, 6 / 10 / 15 second output. Both T2V (omit images) and I2V (1-7 reference images) supported.

$0.06-$0.15 / call

Claude Haiku 4.5

Anthropic

Fast ResponseLow CostBasic Tasks

Fast and affordable model for lightweight tasks. Best for simple queries and quick responses.

$0.353 / $1.765 / 1M

Claude Haiku 4.5 (Thinking)

Anthropic

Extended ThinkingFast ResponseCost Effective

Claude Haiku 4.5 with extended thinking capability for complex reasoning tasks.

$0.353 / $1.765 / 1M

Claude Sonnet 4.5

Anthropic

Latest ModelEnhanced PerformanceBest Value

Latest Sonnet model with improved performance and efficiency.

Claude Sonnet 4.5 (Thinking)

Anthropic

Extended ThinkingComplex ReasoningDeep Analysis

Claude Sonnet 4.5 with extended thinking capability for complex reasoning tasks.

Gemini 3 Pro Image (Pro)

Google

Text to ImageImage Editing1K/2K/4K Quality99% Success Rate

Premium image generation powered by Gemini 3 Pro. 99% success rate. Best quality and reliability.

$0.06+ / call

Gemini 3 Pro Image (Lite)

Google

Text to ImageImage Editing1K/2K/4K Quality$0.06-$0.10/image

High quality image generation powered by Gemini 3 Pro. 97% success rate. Supports text-to-image and image editing.

$0.06+ / call

High PerformanceMultimodalComplex Tasks

Gemini 2.5 Pro

Google

-86%Бета

Powerful multimodal model for complex tasks with excellent performance.

$0.184 / $1.471 / 1M

Extended ThinkingHigh PerformanceDeep Analysis

Gemini 2.5 Pro (Thinking)

Google

-86%Бета

Gemini 2.5 Pro with extended thinking capability for complex reasoning.

$0.184 / $1.471 / 1M

Fast ResponseCost EffectiveBest Value

Gemini 2.5 Flash

Google

-40%Бета

Fast and cost-effective multimodal model. Best balance of speed and quality.

$0.045 / $0.368 / 1M

Extended ThinkingFast ResponseCost Effective

Gemini 2.5 Flash (Thinking)

Google

-40%Бета

Gemini 2.5 Flash with extended thinking capability for reasoning tasks.

$0.045 / $0.368 / 1M

Text to ImageImage EditingMultiple Aspect Ratios

Gemini 2.5 Flash Image

Google

-50%Активна

Fast image generation powered by Gemini 2.5 Flash. Supports text-to-image and image editing with natural language.

$0.206 / call

Ultra FastLowest CostHigh Volume

Gemini Flash Lite

Google

-86%Бета

Lightweight and ultra-fast model. Best for simple tasks and high volume.

$0.015 / $0.059 / 1M

Claude Opus 4

Anthropic

Most CapableSuperior ReasoningComplex Tasks

Most capable model with superior reasoning and analysis capabilities.

$5.295 / $26.471 / 1M

Claude Opus 4 (Thinking)

Anthropic

Extended ThinkingSuperior ReasoningMost Powerful

Claude Opus 4 with extended thinking capability for the most complex reasoning tasks.

$5.295 / $26.471 / 1M

Claude Sonnet 4

Anthropic

BalancedCode GenerationAnalysis

Balanced model with excellent performance and cost efficiency. Great for most tasks.

Claude Sonnet 4 (Thinking)

Anthropic

Extended ThinkingComplex ReasoningBest Value

Claude Sonnet 4 with extended thinking capability for complex reasoning tasks.

1536 DimensionsFastCost Effective

Text Embedding 3 Small

OpenAI

-90%Активна

Small embedding model, efficient and cost-effective for most use cases.

$0.018 / 1M tokens

Text Embedding 3 Large

OpenAI

3072 DimensionsHigh AccuracyFlexible

Large embedding model for higher accuracy and flexible dimensions.

$0.059 / 1M tokens