name: optimizing-gemini-models description: The authoritative expert guide for selecting Gemini models in Diktalo. Enforces Gemini 2.5 and Gemini 3 standards and deprecates older versions.
Optimizing Gemini Models (The Bible)
[!IMPORTANT] CURRENT STANDARDS: GEMINI 3 and GEMINI 2.5 LEGACY (DO NOT USE): Gemini 1.0, 1.5, 2.0 (Flash/Pro/Lite)
This skill serves as the single source of truth for AI model selection in the SaaS. All AI features MUST adhere to the standards defined here, pulling directly from the official Google documentation.
Current Supported Models
Google AI offers two main active generations for production and previews:
Gemini 3 Series
- Gemini 3.1 Pro (
gemini-3.1-pro-preview/gemini-3-pro): Advanced intelligence, complex problem-solving skills, and powerful agentic capabilities. - Gemini 3 Flash (
gemini-3-flash-preview/gemini-3-flash): Frontier-class performance rivaling larger models at a fraction of the cost. - Nano Banana Pro (
gemini-3-pro-image-preview): State-of-the-art image generation and editing models for highly contextual native image creation.
Gemini 2.5 Series
- Gemini 2.5 Pro (
gemini-2.5-pro): Most advanced stable model for complex tasks, featuring deep reasoning and coding capabilities. - Gemini 2.5 Flash (
gemini-2.5-flash): Best price-performance model for low-latency, high-volume tasks that require reasoning. - Gemini 2.5 Flash-Lite (
gemini-2.5-flash-lite): Ultra-fast and cost-effective model for simpler tasks. - Nano Banana (
gemini-2.5-flash-image): State-of-the-art native image generation and editing.
Specialized Task Models
- Embeddings:
gemini-embedding-001(High-dimensional vector representations for advanced semantic search).
Model Selection Standards mapping
| Use Case | Recommended Model | Reason |
|---|---|---|
| Transcription | gemini-2.5-flash OR gemini-3-flash |
Optimized for speed, cost, and extreme context windows. |
| Chat & Reasoning | gemini-2.5-pro OR gemini-3-pro |
Best-in-class reasoning for complex instruction following and agentic workflows. |
| Summarization | gemini-2.5-flash |
Balanced performance for processing large transcripts. |
| Support Bot | gemini-2.5-flash |
Low latency for real-time user interaction. |
| Embeddings | gemini-embedding-001 |
MANDATORY. text-embedding-004 is fully deprecated. |
| Image Generation | gemini-2.5-flash-image |
Nano Banana is the standard for fast, creative native workflows. |
Deprecation Policy & Known Deprecations
As an expert, be aware that models enter deprecation schedules frequently. DO NOT USE:
- Any
gemini-1.0-*orgemini-1.5-*models. - Any
gemini-2.0-*models (e.g.gemini-2.0-flash,gemini-2.0-flash-lite). text-embedding-004(Replace immediately withgemini-embedding-001).
When a new model generation stabilizes:
- Update this SKILL first. Keep checking
https://ai.google.dev/gemini-api/docs/modelsandhttps://ai.google.dev/gemini-api/docs/deprecations. - Audit
api/ai.tsand replace all instances of the old version. - Delete any fallback logic that relies on dead models.
Implementation Checklist
When implementing or updating AI features:
- Check this SKILL: Confirm you are using the
Recommended Model. - Verify
latestAlias: If available, using the-latestsuffix (likegemini-2.5-flash-latest) is preferred over static dating unless stability is an issue. - Test for 404/500: Immediately test the new model endpoint. If it fails, check
https://ai.google.dev/gemini-api/docs/modelsand update this document.
Error Handling Reference
- 404 Not Found: You are using a Legacy/Deprecated model (e.g., 1.5-pro, 2.0-flash, text-embedding-004). UPGRADE IMMEDIATELY.
- 500 Internal Error: Often caused by using a
Previewmodel in production without proper error handling. Test cautiously.

