
claude-fable-5Claude Fable 5 is Anthropic's latest and most capable publicly available model — and on API Models it costs about 50% less than official Anthropic pricing ($5 input / $25 output per 1M tokens). Ultra-long context, multimodal (vision) understanding, complex reasoning and enterprise-grade knowledge work, with strict safety safeguards. Use it through the Anthropic Messages API (/v1/messages) — the native path for Claude Code, the Anthropic SDK, and Anthropic-compatible clients like Cursor — with native tool use (function calling). Request params match Claude Opus 4.8.
About half the official Anthropic price
Handles very large inputs end to end
Text + image (vision) comprehension
Strong multi-step reasoning and analysis
View complete API reference with all parameters and examples.
Enable real-time streaming responses with Server-Sent Events.
{
"model": "claude-fable-5",
"stream": true,
"max_tokens": 1024,
"messages": [...]
}Enable Claude to use tools and call functions.
{
"model": "claude-fable-5",
"max_tokens": 1024,
"tools": [{
"name": "get_weather",
"description": "Get current weather for a location",
"input_schema": {
"type": "object",
"properties": {
"location": {"type": "string", "description": "City name"}
},
"required": ["location"]
}
}],
"tool_choice": {"type": "auto"},
"messages": [{"role": "user", "content": "What's the weather in Tokyo?"}]
}Analyze PDF documents by sending them as base64 encoded content.
{
"model": "claude-fable-5",
"max_tokens": 1024,
"messages": [{
"role": "user",
"content": [{
"type": "document",
"source": {
"type": "base64",
"media_type": "application/pdf",
"data": "<base64_encoded_pdf>"
}
}, {
"type": "text",
"text": "Summarize this document."
}]
}]
}Get structured JSON responses that match your schema.
{
"model": "claude-fable-5",
"max_tokens": 1024,
"output_format": {
"type": "json_schema",
"schema": {
"type": "object",
"properties": {
"name": {"type": "string"},
"age": {"type": "integer"}
},
"required": ["name", "age"]
}
},
"messages": [{"role": "user", "content": "Extract info: John is 30 years old"}]
}Enable Claude to search the web for up-to-date information.
{
"model": "claude-fable-5",
"max_tokens": 1024,
"tools": [{
"type": "web_search_20250305",
"name": "web_search",
"max_uses": 5
}],
"messages": [{"role": "user", "content": "What's the latest news about AI?"}]
}| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | Yes | Model identifier (e.g., claude-fable-5) |
| messages | array | Yes | Array of message objects with role and content |
| max_tokens | integer | Yes | Maximum tokens in the response (1 - 128000) |
| system | string | No | System prompt to set context |
| stream | boolean | No | Enable streaming responses (SSE) |
| temperature | number | No | Sampling temperature (0.0 - 1.0) |
| top_p | number | No | Nucleus sampling threshold (0.0 - 1.0) |
| top_k | integer | No | Top-k sampling (0 - infinity) |
| stop_sequences | array | No | Sequences that stop generation |
| tools | array | No | Function calling tools definition |
| tool_choice | object | No | Tool selection strategy (auto/any/tool) |
| thinking | object | No | Enable extended thinking mode |
| output_format | object | No | Structured output with JSON schema |
View complete API reference with streaming, thinking, and more.
Billing: Cost = (input_tokens * input_price + output_tokens * output_price) / 1,000,000
Claude Fable 5 is a Large Language Model API provided by Anthropic. Claude Fable 5 is Anthropic's latest and most capable publicly available model — and on API Models it costs about 50% less than official Anthropic pricing ($5 input / $25 output per 1M tokens). Ultra-long context, multimodal (vision) understanding, complex reasoning and enterprise-grade knowledge work, with strict safety safeguards. Use it through the Anthropic Messages API (/v1/messages) — the native path for Claude Code, the Anthropic SDK, and Anthropic-compatible clients like Cursor — with native tool use (function calling). Request params match Claude Opus 4.8. Through API Models platform, you can access this model via a unified API at prices significantly lower than official rates. Current pricing: Input: $5, Output: $25 per 1M tokens.
Build intelligent conversational systems to automatically answer user queries and improve service efficiency.
Automatically write articles, emails, ad copy, and other text content to boost productivity.
Assist with code writing, debugging, and code review to accelerate software development.
Understand and analyze unstructured data, extract key insights, and generate summary reports.
Claude Fable 5 is available through API Models at: Input: $5, Output: $25 per 1M tokens. This is up to 95% cheaper than official pricing.
Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.
API Models offers the same Claude Fable 5 model at 60-95% lower cost through our aggregation platform. We provide a unified API interface so you do not need separate accounts for each provider - one API key to access all models.
Claude Fable 5 is Anthropic's latest and most capable publicly available LLM — ultra-long context, multimodal (vision) understanding, complex reasoning, and enterprise-grade knowledge work, with strict safety safeguards. Best for long-document/image tasks, complex multi-step reasoning, agentic coding, and high-quality knowledge work. On apimodels.app it's billed at $5 input / $25 output per 1M tokens — about half the official price.
Yes — Fable 5 supports native tool use through the Anthropic Messages API (/v1/messages), the format Claude Code and the Anthropic SDK use. Point ANTHROPIC_BASE_URL at https://apimodels.app/api/v1, set the model to claude-fable-5, and use your apimodels API key (see /docs/claude-code). Anthropic-API-compatible clients like Cursor work too when configured as a custom Anthropic model. Send tools in Anthropic format ({name, input_schema}); request params are identical to Claude Opus 4.8.
Call it through the Anthropic Messages API (/v1/messages) — it natively supports tool calling (tool_use) and image input (vision), the same way Claude Code and the Anthropic SDK work. Request params are identical to Claude Opus 4.8 — just set the model to claude-fable-5.
Input $5 / output $25 per 1M tokens — about half the official Anthropic price (~50% cheaper). Prompt caching is supported too: cache-read (hit) tokens are billed at $0.5/M — far below the input rate — and cache writes at $9/M. Great for long sessions with lots of repeated context.
Fable 5 is Anthropic's newest flagship — the strongest overall, with better multimodal — at $5/$25 on apimodels.app (~half the official price). Opus 4.8 is more established and cheaper per token ($3/$13) and is still excellent for long-horizon agentic work. Pick claude-fable-5 for the latest/most-capable, claude-opus-4-8 for the lower per-token cost — both share one API key so you can route by task.
On API Models, Claude Fable 5 runs alongside 60+ models on one API key and one balance, so choosing is about fit, not lock-in. It supports Ultra-Long Context, Multimodal, Complex Reasoning, ~50% Cheaper, and you can weigh it on price and capability against other Large Language Model models, then switch by changing a single model-name string — no new account or integration. Browse every Large Language Model option with live pricing at apimodels.app/models.
Claude Fable 5 supports: Ultra-Long Context, Multimodal, Complex Reasoning, ~50% Cheaper. See the API Models docs for full parameters and call examples.
Yes. API Models exposes Claude Fable 5 through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.
We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.