Product Launchllminference optimizationgooglemultimodal
Google Releases Gemini 3.1 Flash Lite
8.2
Relevance Score
Google today introduced Gemini 3.1 Flash Lite, a new AI model in the Gemini 3 family designed for faster responses and lower operating costs. The model is rolling out in preview to developers via the Gemini API in Google AI Studio and to enterprise customers through Vertex AI, with pricing at $0.25 per million input tokens and $1.50 per million output tokens. Google cites benchmarks showing 2.5× faster time to first answer token and 45% faster output while maintaining similar quality.



