
Gemini 3.1 Flash Lite Preview
gemini-3.1-flash-lite-previewMost cost-effective multimodal model with fastest performance for high-frequency lightweight tasks. Best for massive agentic tasks, simple data extraction, and ultra-low latency applications.
Ultra FastCost EffectiveLightweight
Input¥0.75
Output¥4.50
per 1M tokens
Ultra Fast
Fastest response time
Cost Effective
Lowest price per token
Lightweight
High-frequency tasks
Agentic
Massive agent workloads
API Documentation
View complete API reference with all parameters and examples.
Full API Documentation
View complete API reference with streaming, thinking, and more.
Pricing
Input
¥0.75
per 1M tokens
Output
¥4.50
per 1M tokens
Billing: Cost = (input_tokens * input_price + output_tokens * output_price) / 1,000,000