Models/Claude Sonnet 5

Claude Sonnet 5

claude-sonnet-5

Claude Sonnet 5 is Anthropic's newest Sonnet — a 1M-token context window (both the default and the maximum; there is no smaller-context variant), up to 128K output tokens, adaptive thinking, and the same tools and platform features as Claude Sonnet 4.6 (Priority Tier is the one exception — Sonnet 5 does not support it). On API Models it's just $1 input / $4.5 output per 1M tokens, with prompt caching (cache reads at $0.1/M). Use it through the Anthropic Messages API (/v1/messages) — the native path for Claude Code, the Anthropic SDK, and Anthropic-compatible clients like Cursor — with native tool use (function calling). Request params match Claude Opus 4.8.

1M ContextAdaptive Thinking128K OutputTool Calling

Input$1

Output$4.5

per 1M tokens

1M Token Context

Default and maximum — no smaller variant

Adaptive Thinking

Extended reasoning when the task needs it

128K Max Output

Long, complete responses in one call

Tools + Claude Code / Cursor

Native tool use via the Anthropic Messages API

API Documentation

View complete API reference with all parameters and examples.

View Docs

Advanced Features

Streaming

Enable real-time streaming responses with Server-Sent Events.

{
  "model": "claude-sonnet-5",
  "stream": true,
  "max_tokens": 1024,
  "messages": [...]
}

Function Calling (Tools)

Enable Claude to use tools and call functions.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "tools": [{
    "name": "get_weather",
    "description": "Get current weather for a location",
    "input_schema": {
      "type": "object",
      "properties": {
        "location": {"type": "string", "description": "City name"}
      },
      "required": ["location"]
    }
  }],
  "tool_choice": {"type": "auto"},
  "messages": [{"role": "user", "content": "What's the weather in Tokyo?"}]
}

PDF Support

Analyze PDF documents by sending them as base64 encoded content.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "messages": [{
    "role": "user",
    "content": [{
      "type": "document",
      "source": {
        "type": "base64",
        "media_type": "application/pdf",
        "data": "<base64_encoded_pdf>"
      }
    }, {
      "type": "text",
      "text": "Summarize this document."
    }]
  }]
}

Structured Output (JSON Schema)

Get structured JSON responses that match your schema.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "output_format": {
    "type": "json_schema",
    "schema": {
      "type": "object",
      "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"}
      },
      "required": ["name", "age"]
    }
  },
  "messages": [{"role": "user", "content": "Extract info: John is 30 years old"}]
}

Web Search

Enable Claude to search the web for up-to-date information.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "tools": [{
    "type": "web_search_20250305",
    "name": "web_search",
    "max_uses": 5
  }],
  "messages": [{"role": "user", "content": "What's the latest news about AI?"}]
}

API Parameters Reference

Parameter	Type	Required	Description
model	string	Yes	Model identifier (e.g., claude-sonnet-5)
messages	array	Yes	Array of message objects with role and content
max_tokens	integer	Yes	Maximum tokens in the response (1 - 128000)
system	string	No	System prompt to set context
stream	boolean	No	Enable streaming responses (SSE)
temperature	number	No	Sampling temperature (0.0 - 1.0)
top_p	number	No	Nucleus sampling threshold (0.0 - 1.0)
top_k	integer	No	Top-k sampling (0 - infinity)
stop_sequences	array	No	Sequences that stop generation
tools	array	No	Function calling tools definition
tool_choice	object	No	Tool selection strategy (auto/any/tool)
thinking	object	No	Enable extended thinking mode
output_format	object	No	Structured output with JSON schema

Full API Documentation

View complete API reference with streaming, thinking, and more.

View Documentation

Pricing

Input

per 1M tokens

Output

$4.5

per 1M tokens

Billing: Cost = (input_tokens * input_price + output_tokens * output_price) / 1,000,000

Last updated: 2026-07-02

TL;DR Claude Sonnet 5 is a Anthropic large language model model, callable via API Models' unified API (model name `claude-sonnet-5`). Pricing: Input: $1, Output: $4.5 per 1M tokens. One API key for all image / video / LLM / audio models.

About Claude Sonnet 5

Claude Sonnet 5 is a Large Language Model API provided by Anthropic. Claude Sonnet 5 is Anthropic's newest Sonnet — a 1M-token context window (both the default and the maximum; there is no smaller-context variant), up to 128K output tokens, adaptive thinking, and the same tools and platform features as Claude Sonnet 4.6 (Priority Tier is the one exception — Sonnet 5 does not support it). On API Models it's just $1 input / $4.5 output per 1M tokens, with prompt caching (cache reads at $0.1/M). Use it through the Anthropic Messages API (/v1/messages) — the native path for Claude Code, the Anthropic SDK, and Anthropic-compatible clients like Cursor — with native tool use (function calling). Request params match Claude Opus 4.8. Through API Models platform, you can access this model via a unified API with transparent pay-as-you-go pricing. Current pricing: Input: $1, Output: $4.5 per 1M tokens.

Key Features

1M Token Context -- Default and maximum — no smaller variant
Adaptive Thinking -- Extended reasoning when the task needs it
128K Max Output -- Long, complete responses in one call
Tools + Claude Code / Cursor -- Native tool use via the Anthropic Messages API

Use Cases

Chatbot & Customer Support

Build intelligent conversational systems to automatically answer user queries and improve service efficiency.

Content Generation

Automatically write articles, emails, ad copy, and other text content to boost productivity.

Code Assistant

Assist with code writing, debugging, and code review to accelerate software development.

Data Analysis

Understand and analyze unstructured data, extract key insights, and generate summary reports.

Why API Models

Unified API -- One API key to access all models, no need to register on multiple platforms
Transparent Pricing -- Pay-as-you-go with per-call pricing published on every model page, ideal for indie developers and startups
Instant Access -- Start using immediately after signup, supports Stripe and Alipay payments
Full Documentation -- Detailed API docs with code examples in cURL, Python, and Node.js

Frequently Asked Questions

How much does Claude Sonnet 5 cost?

Claude Sonnet 5 is available through API Models at: Input: $1, Output: $4.5 per 1M tokens. Billing is pay-as-you-go — you only pay for what you generate.

How to use Claude Sonnet 5 API?

Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.

What is the difference between API Models and the official Anthropic API?

API Models offers the same Claude Sonnet 5 model through our aggregation platform. We provide a unified API interface so you do not need separate accounts for each provider - one API key to access all models.

What is Claude Sonnet 5 and what makes it special?

Claude Sonnet 5 is Anthropic's newest Sonnet model: a 1M-token context window (both default and maximum — there's no smaller-context variant), up to 128K output tokens, adaptive thinking, and the same tools and platform features as Claude Sonnet 4.6, with one exception — it does not support Priority Tier. On apimodels.app it's just $1 input / $4.5 output per 1M tokens, balancing capability with a low price.

Can I use Sonnet 5 with Claude Code or Cursor? Does it support tools / function calling?

Yes — Sonnet 5 supports native tool use (function calling) through the Anthropic Messages API (/v1/messages), the format Claude Code and the Anthropic SDK use. Point ANTHROPIC_BASE_URL at https://apimodels.app/api/v1, set the model to claude-sonnet-5, and use your apimodels API key (see /docs/claude-code). Anthropic-API-compatible clients like Cursor work too when configured as a custom Anthropic model. Send tools in Anthropic format ({name, input_schema}); request params are identical to Claude Opus 4.8.

How is Sonnet 5 priced? Is there a cache discount?

Input $1 / output $4.5 per 1M tokens. Prompt caching is supported: cache-read (hit) tokens are billed at just $0.1/M — a tenth of the input rate — and cache writes at $2/M. The 1M-token context plus caching makes it ideal for long documents, long sessions, and agentic coding.

Sonnet 5 vs Sonnet 4.6 vs Opus 4.8 — which should I pick?

Sonnet 5 is the newest Sonnet — 1M context + adaptive thinking at $1/$4.5, the best value for most coding/chat/long-document work. Sonnet 4.6 is the previous Sonnet; Opus 4.8 ($3/$13) is for the hardest long-horizon agentic reasoning. Pick claude-sonnet-5 for the newest, cheapest, still-strong option — all share one API key so you can switch by task.

How does Claude Sonnet 5 compare to other Large Language Model models?

On API Models, Claude Sonnet 5 runs alongside 60+ models on one API key and one balance, so choosing is about fit, not lock-in. It supports 1M Context, Adaptive Thinking, 128K Output, Tool Calling, and you can weigh it on price and capability against other Large Language Model models, then switch by changing a single model-name string — no new account or integration. Browse every Large Language Model option with live pricing at apimodels.app/models.

What can Claude Sonnet 5 do?

Claude Sonnet 5 supports: 1M Context, Adaptive Thinking, 128K Output, Tool Calling. See the API Models docs for full parameters and call examples.

Can I access the Claude Sonnet 5 API from anywhere (incl. China)?

Yes. API Models exposes Claude Sonnet 5 through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.

What payment methods are supported?

We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.

Models/Claude Sonnet 5

Claude Sonnet 5

claude-sonnet-5

1M ContextAdaptive Thinking128K OutputTool Calling

Input$1

Output$4.5

per 1M tokens

1M Token Context

Default and maximum — no smaller variant

Adaptive Thinking

Extended reasoning when the task needs it

128K Max Output

Long, complete responses in one call

Tools + Claude Code / Cursor

Native tool use via the Anthropic Messages API

API Documentation

View complete API reference with all parameters and examples.

View Docs

Advanced Features

Streaming

Enable real-time streaming responses with Server-Sent Events.

{
  "model": "claude-sonnet-5",
  "stream": true,
  "max_tokens": 1024,
  "messages": [...]
}

Function Calling (Tools)

Enable Claude to use tools and call functions.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "tools": [{
    "name": "get_weather",
    "description": "Get current weather for a location",
    "input_schema": {
      "type": "object",
      "properties": {
        "location": {"type": "string", "description": "City name"}
      },
      "required": ["location"]
    }
  }],
  "tool_choice": {"type": "auto"},
  "messages": [{"role": "user", "content": "What's the weather in Tokyo?"}]
}

PDF Support

Analyze PDF documents by sending them as base64 encoded content.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "messages": [{
    "role": "user",
    "content": [{
      "type": "document",
      "source": {
        "type": "base64",
        "media_type": "application/pdf",
        "data": "<base64_encoded_pdf>"
      }
    }, {
      "type": "text",
      "text": "Summarize this document."
    }]
  }]
}

Structured Output (JSON Schema)

Get structured JSON responses that match your schema.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "output_format": {
    "type": "json_schema",
    "schema": {
      "type": "object",
      "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"}
      },
      "required": ["name", "age"]
    }
  },
  "messages": [{"role": "user", "content": "Extract info: John is 30 years old"}]
}

Web Search

Enable Claude to search the web for up-to-date information.

{
  "model": "claude-sonnet-5",
  "max_tokens": 1024,
  "tools": [{
    "type": "web_search_20250305",
    "name": "web_search",
    "max_uses": 5
  }],
  "messages": [{"role": "user", "content": "What's the latest news about AI?"}]
}

API Parameters Reference

Parameter	Type	Required	Description
model	string	Yes	Model identifier (e.g., claude-sonnet-5)
messages	array	Yes	Array of message objects with role and content
max_tokens	integer	Yes	Maximum tokens in the response (1 - 128000)
system	string	No	System prompt to set context
stream	boolean	No	Enable streaming responses (SSE)
temperature	number	No	Sampling temperature (0.0 - 1.0)
top_p	number	No	Nucleus sampling threshold (0.0 - 1.0)
top_k	integer	No	Top-k sampling (0 - infinity)
stop_sequences	array	No	Sequences that stop generation
tools	array	No	Function calling tools definition
tool_choice	object	No	Tool selection strategy (auto/any/tool)
thinking	object	No	Enable extended thinking mode
output_format	object	No	Structured output with JSON schema

Full API Documentation

View complete API reference with streaming, thinking, and more.

View Documentation

Pricing

Input

per 1M tokens

Output

$4.5

per 1M tokens

Billing: Cost = (input_tokens * input_price + output_tokens * output_price) / 1,000,000

Last updated: 2026-07-02

About Claude Sonnet 5

Key Features

1M Token Context -- Default and maximum — no smaller variant
Adaptive Thinking -- Extended reasoning when the task needs it
128K Max Output -- Long, complete responses in one call
Tools + Claude Code / Cursor -- Native tool use via the Anthropic Messages API

Use Cases

Chatbot & Customer Support

Build intelligent conversational systems to automatically answer user queries and improve service efficiency.

Content Generation

Automatically write articles, emails, ad copy, and other text content to boost productivity.

Code Assistant

Assist with code writing, debugging, and code review to accelerate software development.

Data Analysis

Understand and analyze unstructured data, extract key insights, and generate summary reports.

Why API Models

Unified API -- One API key to access all models, no need to register on multiple platforms
Transparent Pricing -- Pay-as-you-go with per-call pricing published on every model page, ideal for indie developers and startups
Instant Access -- Start using immediately after signup, supports Stripe and Alipay payments
Full Documentation -- Detailed API docs with code examples in cURL, Python, and Node.js

Frequently Asked Questions

How much does Claude Sonnet 5 cost?

Claude Sonnet 5 is available through API Models at: Input: $1, Output: $4.5 per 1M tokens. Billing is pay-as-you-go — you only pay for what you generate.

How to use Claude Sonnet 5 API?

Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.

What is the difference between API Models and the official Anthropic API?

What is Claude Sonnet 5 and what makes it special?

Can I use Sonnet 5 with Claude Code or Cursor? Does it support tools / function calling?

How is Sonnet 5 priced? Is there a cache discount?

Sonnet 5 vs Sonnet 4.6 vs Opus 4.8 — which should I pick?

How does Claude Sonnet 5 compare to other Large Language Model models?

What can Claude Sonnet 5 do?

Claude Sonnet 5 supports: 1M Context, Adaptive Thinking, 128K Output, Tool Calling. See the API Models docs for full parameters and call examples.

Can I access the Claude Sonnet 5 API from anywhere (incl. China)?

Yes. API Models exposes Claude Sonnet 5 through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.

What payment methods are supported?

We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.