API ModelsAPI Models
Models/Gemini 3.1 Flash Lite Preview
Google

Gemini 3.1 Flash Lite Preview

gemini-3.1-flash-lite-preview

Most cost-effective multimodal model with fastest performance for high-frequency lightweight tasks. Best for massive agentic tasks, simple data extraction, and ultra-low latency applications.

Ultra FastCost EffectiveLightweight
Input¥0.75
Output¥4.50
per 1M tokens

Ultra Fast

Fastest response time

Cost Effective

Lowest price per token

Lightweight

High-frequency tasks

Agentic

Massive agent workloads

API Documentation

View complete API reference with all parameters and examples.

View Docs

Full API Documentation

View complete API reference with streaming, thinking, and more.

View Documentation

Pricing

Input
¥0.75
per 1M tokens
Output
¥4.50
per 1M tokens

Billing: Cost = (input_tokens * input_price + output_tokens * output_price) / 1,000,000