google

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

Input Cost
$0.25
per 1M tokens
Output Cost
$1.50
per 1M tokens
Context Window
1,048,576
tokens
Compare vs GPT-4o
Developer ID: google/gemini-3.1-flash-lite-preview

Related Models

google
Free/1M

Google: Gemma 3 4B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 32,768 ctx Compare →
google
Free/1M

Google: Gemma 3 12B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 32,768 ctx Compare →
google
Free/1M

Google: Gemma 3n 4B (free)

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, s...

📝 8,192 ctx Compare →