85+ AI 模型 API 对比 & 在线试用

AI Lip-Sync

API Models

Image + AudioTalking HeadPer-Second PricingAny Portrait

NEW

Turn a portrait image + an audio clip into a talking-head video. The output length follows the audio; billed per second.

$0.02/s / call

GPT Image 2 All

OpenAI

OpenAI-compatibleText to ImageImage Editing1K/2K/4Klow/medium/high

NEWOpenAI SDK同步

GPT Image 2（All）—— OpenAI 兼容的图片 API。把 OpenAI SDK、Codex、Cursor 的 base_url 指向 https://apimodels.app/api/v1、model 用 gpt-image-2-all，images.generate() / images.edit() 直接可用。完整的分辨率 × 质量矩阵:1K / 2K / 4K 搭配 low / medium / high(以及 auto),支持文生图与多图编辑(最多 16 张参考图)。同步返回、无需轮询,按分辨率 × 质量分档计费,$0.005/张起。生成速度:low 约 40 秒、medium 约 60-90 秒、high 约 2 分钟。一个 API Key,国内可直连,无需 OpenAI 组织认证。

$0.005+ / call

Claude Fable 5

Anthropic

Ultra-Long ContextMultimodalComplex ReasoningEnterprise-Grade

Claude Fable 5 是 Anthropic 最新、最强的公开可用大语言模型，在 API Models 上价格约为官方的一半（比官方便宜约 50%，输入 $5 / 输出 $25 每百万 token）。开箱支持 Claude Code、Anthropic SDK 及 Cursor 等 Anthropic 兼容客户端调用。具备超长上下文、多模态（识图）理解、复杂推理与企业级知识工作能力，并带严格安全保护。通过 Anthropic Messages API（/v1/messages）提供原生工具调用，请求参数与 Opus 4.8 一致。

$5 / $25 / 1M

Claude Sonnet 5

Anthropic

1M ContextAdaptive Thinking128K OutputTool Calling

Claude Sonnet 5 是 Anthropic 最新的 Sonnet 模型，默认且最大都是 100 万 token 上下文窗口（没有更小的上下文变体）、最大 128K 输出 token、自适应思维，工具与平台功能与 Claude Sonnet 4.6 相同，唯一例外是不支持优先级（Priority Tier）。在 API Models 上仅 $1 输入 / $4.5 输出（每百万 token），并支持提示词缓存（缓存命中 $0.1/M）。通过 Anthropic Messages API（/v1/messages）提供原生工具调用，Claude Code、Anthropic SDK 及 Cursor 等 Anthropic 兼容客户端可直接接入，请求参数与 Opus 4.8 一致。

$1 / $4.5 / 1M

GPT Image 2 Lite

OpenAI

Image to ImageMulti-Image (up to 10)1K onlyCheapestFastSync

The cheapest, fastest gpt-image-2 channel — 1K-only at a flat $0.015/image. Built for person editing & generation and light-text tutorial/diagram images. Not for 2K/4K HD or text-heavy full-screen layouts (use gpt-image-2 for those).

$0.015 / call

gemini-3.1-flash-image

Google

Text to ImageImage Editing512 / 1K / 2K / 4K$0.04+/image

Google's gemini-3.1-flash-image (GA release). High-quality image generation and conversational editing at low latency. Priced by resolution: 512 $0.04, 1K/2K $0.06, 4K $0.10.

$0.04+ / call

Text to ImageImage Editing1K / 2K / 4K$0.10+/image25-37.5% off official

gemini-3-pro-image

Google

-25%可用

Google's gemini-3-pro-image (GA release). Top-quality, high-fidelity image generation and editing with advanced reasoning. Priced by resolution: 1K/2K $0.10 (25% cheaper than official), 4K $0.15 (37.5% cheaper than official).

$0.10+ / call

Eleven Flash v2.5

ElevenLabs

Ultra-Fast32 LanguagesLowest Cost

超低延迟模型，支持 32 种语言。适合实时对话场景。

$0.0425 / 1K chars

Eleven Turbo v2.5

ElevenLabs

Low Latency32 LanguagesSSML Support

高质量低延迟模型，支持 32 种语言。适合对速度有要求的开发场景。

$0.0425 / 1K chars

Eleven Multilingual v2

ElevenLabs

High Quality29 LanguagesEmotionally Rich

最逼真、最富情感的模型，支持 29 种语言。适合配音、有声书和后期制作。

$0.085 / 1K chars

Eleven v3

ElevenLabs

70+ LanguagesAudio TagsMost Expressive

最具表现力的模型，支持 70+ 种语言。支持 [laughs]、[whispers] 等音频标签实现情感控制。

$0.085 / 1K chars

ElevenLabs Dialogue

ElevenLabs

Multi-SpeakerNatural FlowConversation

多角色对话生成，自然对话流。适合播客和有声书制作。

$0.085 / 1K chars

Voice Isolator

ElevenLabs

Noise RemovalSpeech ExtractionAudio Cleanup

从背景噪声、音乐和环境音中提取人声。高质量音频提取。

$0.102 / min

AI Dubbing

ElevenLabs

Video Dubbing29 LanguagesPreserve Emotion

翻译音频/视频并保留情感、节奏和语调。自动唇形同步。

$0.2805 / min

Seedance 2.0 · sd-A

ByteDance

2.0 / Fast / MiniFull Multimodal720p / 1080p HDAsset Library

NEW高清三档

Seedance 2.0 sd-A — full multimodal HD video (720p / 1080p) with an in-playground tier switch (2.0 / Fast / Mini): text, first/last frame, up to 9 reference images + video + audio, web search, native audio, and asset-library characters.

$0.073/s+ / call

Seedance 2 Mini

ByteDance

Multimodal T/I/V/ANative AI Audio480p / 720p4-15sRef Image/Video/Audio

NEWMini原生音频

ByteDance Seedance 2 Mini — multimodal text / image / video / audio → video with native AI audio. Reference images, reference video and reference audio, web search, 480p / 720p, 4-15s. Per-second pricing by resolution and whether AI audio is generated.

$0.042/s+ / call

DeepSeek V4 Flash

DeepSeek

High ThroughputTool Calling1M ContextOpenAI-compatible

DeepSeek V4 Flash — the lightweight, high-throughput, cost-effective member of the DeepSeek V4 family for general chat and basic text. 1M-token context, tool calling, streaming; OpenAI-compatible.

$0.12 / $0.24 / 1M

DeepSeek V4 Pro

DeepSeek

Reasoning / ThinkingAgentTool Calling1M Context

DeepSeek V4 Pro — DeepSeek’s high-performance model with top-tier reasoning and agent capabilities, 1M-token context, and full thinking (reasoning_content) output. Tool calling, streaming; OpenAI-compatible.

$0.37 / $0.74 / 1M

Qwen3.7 Max

Alibaba

Reasoning / ThinkingAgent1M ContextTool Calling

Alibaba Qwen3.7 Max — the most capable Qwen3.7 model: top-tier reasoning and agent ability, 1M-token context, hybrid thinking. Tool calling, streaming; OpenAI-compatible.

$1.76 / $5.29 / 1M

Qwen3.7 Plus

Alibaba

Reasoning / ThinkingBest Value1M ContextTool Calling

Alibaba Qwen3.7 Plus — the best-value Qwen3.7 model: strong reasoning at a fraction of Max’s price, 1M-token context, hybrid thinking. Tool calling, streaming; OpenAI-compatible.

$0.88 / $1.18 / 1M

GPT-5.4

OpenAI

FrontierAdjustable ReasoningWeb SearchMultimodal

GPT-5.4 是 OpenAI 面向复杂专业工作与 Agent 编码的前沿模型。通过 Responses API（/v1/responses）调用，推理强度用请求体的 reasoning.effort（low–xhigh）控制，支持联网搜索、函数调用与多模态输入。OpenAI Codex 设 wire_api = "responses" 即可直接接入。

$1.8 / $10.8 / 1M

GLM-5.2

Zhipu

ReasoningTool CallingOpenAI-compatible

Zhipu GLM-5.2 — a reasoning model with strong function-calling / tool-use, served via the OpenAI-compatible chat-completions endpoint.

$1 / $3.5 / 1M

GPT-5.5

OpenAI

Agentic CodingDeep ReasoningWeb SearchMultimodal

GPT-5.5 是 OpenAI 面向 Agent 编码、知识工作、科研与复杂多步任务的高级推理模型。通过 Responses API（/v1/responses）调用，支持可调推理强度（low–xhigh）、联网搜索、函数调用与多模态输入。OpenAI Codex 设 wire_api = "responses" 即可直接接入。

$3 / $18 / 1M

视频字幕擦除

API Models

Subtitle RemovalOn-screen TextRegion Targeting$0.01/s

NEW字幕擦除

视频字幕擦除。去除视频里的硬字幕和画面文字，重建干净背景；可选「字幕」或「全部文字」模式、质量或更小体积输出，可选区域定位，$0.01 每秒。

$0.01/s / call

Seedance 2.0

ByteDance

Full MultimodalUp to 9 Ref ImagesReal-Person + Assets480p/720p/1080p4-15s

NEW最全能最多9图

Seedance 2.0 最全能渠道 —— 全模态视频生成:文生 / 首帧 / 首尾帧 / 最多 9 张参考图 / 参考视频 / 参考音频 / 联网搜索 / 原生音频,任意组合。支持真人 + 素材创建。480p / 720p / 1080p,时长 4 / 5 / 6 / 8 / 10 / 12 / 15 秒,按秒计费 $0.13/s 起。

$0.13/s+ / call

Seedance 2.0 Fast

ByteDance

Full MultimodalUp to 9 Ref ImagesReal-Person + Assets480p/720pFast

NEW最全能极速

Seedance 2.0 Fast —— 最全能 Seedance 2.0 渠道的提速降价版,保留全模态能力(文生 / 首尾帧 / 最多 9 图 / 参考视频 / 参考音频 / 联网搜索 / 原生音频),支持真人 + 素材创建。480p / 720p,4-15 秒,按秒计费 $0.11/s 起。

$0.11/s+ / call

LTX-2.3 视频

Lightricks

T2V & I2V480p / 720p / 1080p16:9 / 9:165-20s

NEW文生+图生

LTX-2.3 视频 —— 文生视频 + 图生视频二合一:只填提示词即文生视频,加一张参考图即图生视频(自动识别)。支持 480p / 720p / 1080p、16:9 / 9:16,文生 5-15 秒、图生 5-20 秒。按秒计费:480p $0.02/s、720p $0.04/s、1080p $0.045/s。

$0.02/s+ / call

Minimax Speech 2.8 HD

Minimax

HD QualityEmotion-AwareVoice CloneVoice Design

MiniMax speech-2.8-hd 最新高保真 TTS,情感语调自然,支持声音克隆与声音设计。$0.07/千字符。

$0.07 / 1K chars

Minimax Speech 2.8 Turbo

Minimax

FastCost-EffectiveVoice CloneVoice Design

MiniMax speech-2.8-turbo 最新快速经济 TTS,性价比高、多语言强,支持声音克隆与声音设计。$0.04/千字符。

$0.04 / 1K chars

Minimax Speech 02 HD

Minimax

HD QualityEmotion-AwareVoice CloneVoice Design

MiniMax speech-02-hd 高保真 TTS,情感语调自然,支持声音克隆与声音设计。$0.07/千字符。

$0.07 / 1K chars

Grok Imagine Video 1.5(Beta)

xAI

Image to VideoFlat per-clip price480p / 720p5-15s

NEWBeta一口价

Grok Imagine Video 1.5（Beta）—— xAI Grok 图生视频的另一条 RunningHub 稳定渠道。上传 1 张参考图、可选填提示词，即可生成电影级短视频。一口价按时长计费（5 / 8 / 10 / 12 / 15 秒），480p 与 720p 同价，成本生成前即可预知。$0.35/条起。

$0.35+ / call

Grok Imagine Video 1.5

xAI

Image to VideoNative Audio480p / 720p1-15s

NEW图生视频榜首原生音频

xAI Grok Imagine Video 1.5 Preview —— 图生视频 + 原生同步音频。在盲测中登顶 Image-to-Video Arena（720p），超越 Seedance 2.0，具备逼真动作、强提示词遵循与一致的角色生成。480p / 720p、1-15 秒，按秒计费 $0.098/s 起。

$0.098/s+ / call

Kling V3 Image

Kling

Text to ImageImage to Image1K/2K

可灵 V3 图像生成。文生图 + 单图参考图生图(image_reference 主体/人脸),1K/2K 分辨率,$0.05/张。

$0.05 / call

Kling V3 Omni

Kling

Multi-ImageElement ConsistencySeries Output1K/2K/4K

可灵 V3 全能版(omni-image)。多图参考与融合(image_list)、元素一致性(element_list)、单图/组图输出,1K/2K/4K——1K/2K $0.05、4K $0.10/张。

全能视频 Omni Flash 稳定版

Google

T2V / I2V / V2VUp to 7 Ref ImagesVoice + Character720p / 1080p / 4k

NEW稳定版全套

Omni Flash (Stable) — lower-cost, full-suite Gemini Omni video. Text / image (up to 7 refs) / video-to-video, plus reusable voices and consistent characters. 720p / 1080p / 4k, 4 / 6 / 8 / 10s, 16:9 or 9:16, optional seed.

$0.35+ / call

Claude Opus 4.8

Anthropic

Most CapableLong-Horizon AutonomyAgentic WorkflowsProduction Quality

claude-opus-4-8 是目前 Anthropic 最强的"能一个人扛住长时间复杂工作的工具"，特别适合开发者做大项目、建 Agent，或者对质量和自主性要求极高的场景。

$3 / $13 / 1M

Claude Opus 4.7

Anthropic

1M Context128K OutputAdaptive ThinkingLatest Opus

Claude Opus 4.7 支持 100 万 token 上下文窗口、128K 最大输出 token、自适应思维，与 Opus 4.6 拥有相同的工具与平台能力。

$3.676 / $18.382 / 1M

Claude Opus 4.7 (Thinking)

Anthropic

Extended Thinking1M Context128K OutputMost Powerful

Claude Opus 4.7 扩展思维版，显式开启长链推理，适用于最复杂的任务。

$3.676 / $18.382 / 1M

GA ReleaseAgentic ExecutionLong-horizon TasksProduction-ready

Gemini 3.5 Flash

Google

-50%可用

Gemini 3.5 Flash 已正式发布 (GA)，性能稳定，可大规模用于生产。最智能的 Flash 模型，在智能体执行、编码与长链任务上持续领先。

$0.662 / $3.971 / 1M

Grok Imagine Image

xAI

Text to ImageMultiple SizesHD Quality

X 平台推出的多模态 AI 模型，根据文本描述生成高质量图像。支持多种尺寸和风格。

Grok Imagine Image Pro

xAI

Text to ImageHD QualityPro DetailMultiple Sizes

X 平台升级版多模态 AI 模型，更强理解力与生成细节，实现更高精度图像生成。

$0.10 / call

Seedance 2.0 (Official)

ByteDance

T2V & I2VMultimodal Reference4-15s480p/720p/1080p

NEW直连官方稳定可靠

ByteDance Seedance 2.0 电影级视频生成 —— 直连火山引擎官方 API，稳定、高并发。支持文生、图生、多模态参考生视频，按秒计费 $0.092/s 起。

$0.092/s+ / call

全能视频 Omni Flash

Google

T2V · I2V · Edit720p / 1080p4-10s1 or 3 Ref Images

文生 + 图生视频编辑

全能视频 Omni Flash —— 一个模型同时覆盖文生视频与图生视频：纯文本直接生成，或上传 1 张 / 3 张参考图做单图动态化或多图融合（注意：上游不支持 2 张）。支持 720p / 1080p / 4K、4 / 6 / 8 / 10 秒、可选 16:9 / 9:16，按档计费 $0.30 起，4K 档 $0.48 起。

$0.20+ / call

Seedance 2.0 影视版

ByteDance

Real-Person ReferenceFilm-GradeT2V & I2VUp to 4 Ref Images

支持真人参考影视级高品质

Seedance 2.0 影视版 —— ByteDance Seedance 2.0 的影视级（Cinematic）版本。最大亮点:支持真人 / 拟真人脸参考图(Ark 直连的 Seedance 2.0 会拒真人脸,本影视版渠道可用),最多 4 张参考图做身份锁定的图生视频,适合影视级人物 / 肖像创作。光影、运镜品质明显高于 Ark 直连版,代价是生成更慢(通常 60-180 秒)。文生 / 图生视频,5 / 10 / 15 秒,按时长计费 $1.00 / 5s 起。请仅对已获授权的对象使用。

$1.00+ / call

Seedance 2.0 Fast (Official)

ByteDance

Text to VideoImage to VideoMultimodal Reference4-15s480p/720p

NEW直连官方稳定可靠

Seedance 2.0 Fast —— 直连火山引擎官方 API，提速降价版，稳定高并发。文生 / 图生 / 多模态视频，$0.071/s 起。

$0.071/s+ / call

DreamActor V2 (Motion Transfer)

ByteDance

Motion TransferMulti-PersonAnime SupportLip SyncMax 30s

字节跳动即梦动作模仿 V2。单图+参考视频精准驱动角色动作，支持多人同框、二次元和宠物，$0.06/秒。

$0.06/s / call

Kling Lip-Sync Video

Kling

Lip SyncMulti-CharacterAudio AlignmentMinute-Level Duration

可灵 AI 对口型视频生成。帧级口型同步，支持真实人物、3D 及 2D 动画角色，支持本地音频和在线配音。

$0.065/5s / call

Kling Lip-Sync TTS

Kling

Text to SpeechMulti-LanguageVoice CloneSpeed Control

可灵对口型语音合成。支持多语言多方言、语速调节、情感风格、音色克隆。

$0.01 / call

Doubao Seedream 5.0 Lite (Official)

Doubao

Text to ImageImage to Image2K / 4KPNG · No WatermarkOfficial Ark API

最新 Doubao Seedream 5.0 Lite 图片生成。支持文生图、图片编辑和多图融合，2K/3K 分辨率。

$0.055 / call

Omni Video V3.1-Lite (Start-End, Official Stable)

Google

Start + End FrameBoth Required720p / 1080p16:9 / 9:16Native AudioOfficial Stable

Omni Video V3.1-Lite 首尾帧官方稳定版。首尾帧必填,生成平滑电影级过渡,720p/1080p 带原生音频。720p $0.72 起。

$0.72-$1.152 / call

GPT Image 2

OpenAI

Text to ImageImage EditingMulti-Image (up to 16)Async1K/2K/4K

OpenAI gpt-image-2. Text-to-image and multi-image editing (up to 16 reference images), aspect-ratio control, native 1K / 2K / 4K — $0.025 / $0.04 / $0.06 per image.

$0.025+ / call

Nanobananapro-gemini

Google

Text to ImageImage Editing1K/2K/4K$0.025/image

Gemini 3 Pro Image 经济渠道。$0.025/张,文生图+编辑,1K/2K/4K,高保真文字渲染。

Nanobanana2-gemini

Google

Text to ImageImage Editing1K/2K/4K$0.025/image

Gemini 3.1 Flash Image 经济渠道。$0.025/张,文生图+编辑,为速度与走量优化。

VEO 3.1 Fast HD

Google

Text to VideoImage to Video720p8s

VEO 3.1 快速 720p 档。固定 8 秒,支持参考图,$0.07/条。

$0.07 / call

VEO 3.1 Fast Full HD

Google

Text to VideoImage to Video1080p8s

VEO 3.1 快速 1080p 档。固定 8 秒,支持参考图,$0.07/条(与 720p 同价出 1080p)。

$0.07 / call

VEO 3.1 Fast 4K (Beta)

Google

Image to VideoStart-End Frame4K1080p

VEO 3.1 Fast 4K(Beta)。至少一张首帧,支持首尾帧,1080p/4K,5s $0.07、8s $0.10。

$0.07-$0.10 / call

SparkPix Image

SparkPix

Text to ImageSub 1sText RenderingLoRA Support

SparkPix 亚秒级文生图。1 秒内出图、$0.01/张,画质与文字渲染出色,支持 LoRA。

$0.01 / call

SparkPix Image Edit

SparkPix

Image EditingMulti-ImageSub 1sText Rendering

1 秒内完成的多图编辑模型。快速、实惠，支持精准提示词控制、文字渲染和多图编辑。$0.013/张。

$0.013 / call

P-Video

Pruna AI

T2V & I2VDraft ModeBuilt-in Audio720p/1080p1-20sMulti-Aspect

Pruna AI 快速视频生成。文生/图生/音频生视频,内置音频与对白,1080p 48FPS,草稿模式约 4 倍速预览。

Nanobanana-2-beta

Google

Text to ImageImage Editing1K/2K/4K Quality1K/2K $0.05

经济实惠的 Gemini 3.1 Flash 图片生成。支持文生图和图片编辑，成本更低。

MiniMax M2.5

MiniMax

Coding SOTATool CallingWeb SearchOffice Tasks

MiniMax M2.5 在编程、工具调用、搜索和办公效率任务上达到或刷新了 SOTA。

$0.442 / $1.765 / 1M

Gemini 3.1 Flash Lite

Google

Ultra FastCost EffectiveLightweight

最具性价比的多模态模型，速度最快，适用于高频轻量级任务。

$0.375 / $2.25 / 1M

Latest ProEnhanced ReasoningMultimodal

Gemini 3.1 Pro Preview

Google

-86%可用

最新 Pro 模型，具备增强推理和多模态能力。

$0.442 / $2.648 / 1M

Kling Custom Voice

Kling

Custom VoiceAudio UploadVideo ReferenceFor TTS/Lip Sync

从音频样本创建自定义声音。上传 .mp3/.wav/.mp4/.mov (5-30秒) 或引用视频 ID。

$0.006 / per call

Motion Control720p / 1080pUp to 30s Video

Kling Motion Control

Kling

-70%可用

角色动作控制视频生成。提供参考图片和动作视频，即可创建动画内容。

$0.06/s+ / call

Kling Face Recognition

Kling

Face DetectionVideo InputSession-based

视频人脸识别，传入 videoUrl 或 videoId，返回 sessionId 和 faceId，用于可灵对口型视频生成。

$0.001 / per call

Claude Opus 4.6

Anthropic

Latest OpusUltimate PerformancePremium

最新 Opus 模型，具备终极性能和推理能力。

Claude Opus 4.6 (Thinking)

Anthropic

Extended ThinkingUltimate PerformanceMost Powerful

Claude Opus 4.6 扩展思维版，适用于最复杂的推理任务。

Claude Sonnet 4.6

Anthropic

Latest SonnetBest PerformanceTop Efficiency

最新 Sonnet 模型，具备最佳性能和效率。

Claude Sonnet 4.6 (Thinking)

Anthropic

Extended ThinkingBest PerformanceDeep Reasoning

Claude Sonnet 4.6 扩展思维版，适用于复杂推理任务。

Kling Omni-Image

Kling

Text to ImageImage Editing1k/2k QualityMulti-Image Input

Kling AI 图片生成与编辑。支持 1k/2k 分辨率和多图输入，实现创意编辑。

$0.05 / call

Kling Sound Effects

Kling

Text to SFX3-10sSound Effects

从文字描述生成音效。3-10 秒音频，自然音质。

$0.030 / per call

Kling Video-to-Audio

Kling

Video DubbingSFX + BGMASMR Mode

为视频自动生成音效和背景音乐。支持 ASMR 模式，打造沉浸式内容。

$0.003 / per call

Kling TTS

Kling

TTSMultiple VoicesSpeed Control

文字转语音，多种声音可选。可调语速，支持多语言。

$0.01 / per call

Kling V3 Omni

Kling

Multi-ModalText/Image/Video Input5-15sAudioKeep Original Sound

Kling V3 Omni-Video，支持扩展时长和保留原声的视频编辑功能。

$0.15/s / call

Nanobanana2

Google

Text to ImageImage Editing1K/2K/4K Quality1K/2K $0.05

基于 Gemini 3.1 Flash 的快速图片生成。支持文生图和图片编辑，1K/2K/4K 画质。

Minimax Speech 2.6 HD

Minimax

HD QualityVoice CloneAsync TTS

Minimax (海螺) 高清异步 TTS。表现力丰富，韵律自然。支持声音克隆和声音设计。

$0.07 / 1K chars

Nanobanana-2-lite

Google

Image Editing1K/2K/4K Quality1K/2K $0.04

经济实惠的 Gemini 3.1 Flash 图片编辑。仅支持图生图，1K/2K/4K 画质。

$0.04+ / call

Minimax Speech 02 Turbo

Minimax

FastVoice CloneCost-Effective

Minimax (海螺) 快速经济异步 TTS。支持声音克隆、声音设计和发音词典。

$0.04 / 1K chars

Grok 4.2 Image

xAI

Text to ImageImage EditingMask InpaintingMultiple Sizes

基于 Grok 4.2 的图片生成与编辑。支持文生图和蒙版修复编辑。

Fast ResponseMultimodalCost Effective

Gemini 3 Flash Preview

Google

-50%可用

快速高效的多模态模型。适合快速响应和简单任务。

$0.111 / $0.662 / 1M

Advanced ReasoningMultimodalHigh Quality

Gemini 3 Pro Preview

Google

-86%可用

高级多模态推理模型，具备卓越能力。

$0.442 / $2.648 / 1M

Extended ThinkingAdvanced ReasoningDeep Analysis

Gemini 3 Pro (Thinking)

Google

-86%可用

Gemini 3 Pro 扩展思维版，适用于复杂推理任务。

$0.442 / $2.648 / 1M

Kling V3

Kling

Text to VideoImage to Video3-15sAudioPer-Second Pricing

最新可灵 V3 视频生成。3-15 秒可变时长，支持文生视频和图生视频，可选音频。按秒分档计费(标准 $0.12/s 起,专业+音频 $0.24/s)。

$0.12/s+ / call

Doubao Seedream 4.5

Doubao

Text to ImageImage Editing2K/4KMulti-Image Input

高质量 Doubao Seedream 4.5 图片生成。支持文生图和图片编辑，2K/4K 分辨率。

$0.05 / call

Omni Video V3.1-fast (Start-End Frame, Budget)

Google

Start + End Frame8s720p / 1080p / 4K16:9 / 9:16Budget Channel

Omni Video V3.1-fast 首尾帧版。给首帧(必填)+ 可选尾帧,生成 8 秒补间动作,带动态音频同步。经济档,固定 8 秒 $0.30/条(720p / 1080p / 4K 同价)。

$0.30 / call

Claude Opus 4.5

Anthropic

Latest OpusBest PerformancePremium

最新 Opus 模型，具备增强能力和改进推理。

Claude Opus 4.5 (Thinking)

Anthropic

Extended ThinkingBest PerformanceMost Powerful

Claude Opus 4.5 扩展思维版，适用于最复杂的推理任务。

Grok Video 3 (10s)

xAI

Text to VideoImage to Video10s720p / 480p$0.01/s

最新 Grok 视频模型，音频视频同步生成，10 秒输出。

$0.10 / call

Grok Video 3

xAI

Text to VideoImage to Video6 / 10 / 15s720p / 480p$0.01/s

基于 Grok 的高质量 5 秒视频生成。支持横屏和竖屏宽高比。

$0.06-$0.15 / call

Claude Haiku 4.5

Anthropic

Fast ResponseLow CostBasic Tasks

快速且经济的轻量级任务模型。适合简单查询和快速响应。

$0.353 / $1.765 / 1M

Claude Haiku 4.5 (Thinking)

Anthropic

Extended ThinkingFast ResponseCost Effective

Claude Haiku 4.5 扩展思维版，适用于复杂推理任务。

$0.353 / $1.765 / 1M

Claude Sonnet 4.5

Anthropic

Latest ModelEnhanced PerformanceBest Value

最新 Sonnet 模型，性能和效率均有提升。

Claude Sonnet 4.5 (Thinking)

Anthropic

Extended ThinkingComplex ReasoningDeep Analysis

Claude Sonnet 4.5 扩展思维版，适用于复杂推理任务。

Gemini 3 Pro Image (Pro)

Google

Text to ImageImage Editing1K/2K/4K Quality99% Success Rate

基于 Gemini 3 Pro 的高端图片生成。99% 成功率，最佳画质和可靠性。

$0.06+ / call

Gemini 3 Pro Image (Lite)

Google

Text to ImageImage Editing1K/2K/4K Quality$0.06-$0.10/image

基于 Gemini 3 Pro 的高质量图片生成。97% 成功率，支持文生图和图片编辑。

$0.06+ / call

High PerformanceMultimodalComplex Tasks

Gemini 2.5 Pro

Google

-86%测试

强大的多模态模型，性能优异，适用于复杂任务。

$0.184 / $1.471 / 1M

Extended ThinkingHigh PerformanceDeep Analysis

Gemini 2.5 Pro (Thinking)

Google

-86%测试

Gemini 2.5 Pro 扩展思维版，适用于复杂推理。

$0.184 / $1.471 / 1M

Fast ResponseCost EffectiveBest Value

Gemini 2.5 Flash

Google

-40%测试

快速且经济的多模态模型。速度与质量的最佳平衡。

$0.045 / $0.368 / 1M

Extended ThinkingFast ResponseCost Effective

Gemini 2.5 Flash (Thinking)

Google

-40%测试

Gemini 2.5 Flash 扩展思维版，适用于推理任务。

$0.045 / $0.368 / 1M

Text to ImageImage EditingMultiple Aspect Ratios

Gemini 2.5 Flash Image

Google

-50%可用

基于 Gemini 2.5 Flash 的快速图片生成。支持文生图和自然语言图片编辑。

$0.0295 / call

Ultra FastLowest CostHigh Volume

Gemini Flash Lite

Google

-86%测试

轻量超快模型。适合简单任务和大批量处理。

$0.015 / $0.059 / 1M

Claude Opus 4

Anthropic

Most CapableSuperior ReasoningComplex Tasks

最强大的模型，具备卓越的推理和分析能力。

$5.295 / $26.471 / 1M

Claude Opus 4 (Thinking)

Anthropic

Extended ThinkingSuperior ReasoningMost Powerful

Claude Opus 4 扩展思维版，适用于最复杂的推理任务。

$5.295 / $26.471 / 1M

Claude Sonnet 4

Anthropic

BalancedCode GenerationAnalysis

均衡型模型，性能优秀且成本高效。适用于大多数任务。

Claude Sonnet 4 (Thinking)

Anthropic

Extended ThinkingComplex ReasoningBest Value

Claude Sonnet 4 扩展思维版，适用于复杂推理任务。

1536 DimensionsFastCost Effective

Text Embedding 3 Small

OpenAI

-90%可用

小型嵌入模型，高效且经济，适用于大多数场景。

$0.018 / 1M tokens

Text Embedding 3 Large

OpenAI

3072 DimensionsHigh AccuracyFlexible

大型嵌入模型，精度更高，维度灵活可调。

$0.059 / 1M tokens