diff --git a/src/content/docs/docs/plugins/google-genai.md b/src/content/docs/docs/plugins/google-genai.md index 333bc269..cc00919a 100644 --- a/src/content/docs/docs/plugins/google-genai.md +++ b/src/content/docs/docs/plugins/google-genai.md @@ -79,6 +79,29 @@ const embeddings = await ai.embed({ }); ``` +## Available Models + +The Google Generative AI plugin provides access to a comprehensive range of models through the Gemini API. Here's a complete reference of all available models: + +| Model variant | Input(s) | Output | Optimized for | +|---------------|----------|--------|---------------| +| **Gemini 2.5 Pro**
`gemini-2.5-pro` | Audio, images, videos, text, and PDF | Text | Enhanced thinking and reasoning, multimodal understanding, advanced coding, and more | +| **Gemini 2.5 Flash**
`gemini-2.5-flash` | Audio, images, videos, and text | Text | Adaptive thinking, cost efficiency | +| **Gemini 2.5 Flash-Lite Preview**
`gemini-2.5-flash-lite-preview-06-17` | Text, image, video, audio | Text | Most cost-efficient model supporting high throughput | +| **Gemini 2.5 Flash Native Audio**
`gemini-2.5-flash-preview-native-audio-dialog`
`gemini-2.5-flash-exp-native-audio-thinking-dialog` | Audio, videos, and text | Text and audio, interleaved | High quality, natural conversational audio outputs, with or without thinking | +| **Gemini 2.5 Flash Preview TTS**
`gemini-2.5-flash-preview-tts` | Text | Audio | Low latency, controllable, single- and multi-speaker text-to-speech audio generation | +| **Gemini 2.5 Pro Preview TTS**
`gemini-2.5-pro-preview-tts` | Text | Audio | Low latency, controllable, single- and multi-speaker text-to-speech audio generation | +| **Gemini 2.0 Flash**
`gemini-2.0-flash` | Audio, images, videos, and text | Text | Next generation features, speed, and realtime streaming | +| **Gemini 2.0 Flash Preview Image Generation**
`gemini-2.0-flash-preview-image-generation` | Audio, images, videos, and text | Text, images | Conversational image generation and editing | +| **Gemini 2.0 Flash-Lite**
`gemini-2.0-flash-lite` | Audio, images, videos, and text | Text | Cost efficiency and low latency | +| **Gemini 1.5 Flash** *(Deprecated)*
`gemini-1.5-flash` | Audio, images, videos, and text | Text | Fast and versatile performance across a diverse variety of tasks | +| **Gemini 1.5 Flash-8B** *(Deprecated)*
`gemini-1.5-flash-8b` | Audio, images, videos, and text | Text | High volume and lower intelligence tasks | +| **Gemini 1.5 Pro** *(Deprecated)*
`gemini-1.5-pro` | Audio, images, videos, and text | Text | Complex reasoning tasks requiring more intelligence | +| **Gemini Embedding**
`gemini-embedding-001` | Text | Text embeddings | Measuring the relatedness of text strings | +| **Imagen 4**
`imagen-4.0-generate-preview-06-06`
`imagen-4.0-ultra-generate-preview-06-06` | Text | Images | Our most up-to-date image generation model | +| **Imagen 3**
`imagen-3.0-generate-002` | Text | Images | High quality image generation model | +| **Veo 2**
`veo-2.0-generate-001` | Text, images | Video | High quality video generation | + ## Gemini Files API You can use files uploaded to the Gemini Files API with Genkit: