Skip to content
Draft
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
23 changes: 23 additions & 0 deletions src/content/docs/docs/plugins/google-genai.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,29 @@ const embeddings = await ai.embed({
});
```

## Available Models

The Google Generative AI plugin provides access to a comprehensive range of models through the Gemini API. Here's a complete reference of all available models:

| Model variant | Input(s) | Output | Optimized for |
|---------------|----------|--------|---------------|
| **Gemini 2.5 Pro**<br/>`gemini-2.5-pro` | Audio, images, videos, text, and PDF | Text | Enhanced thinking and reasoning, multimodal understanding, advanced coding, and more |
| **Gemini 2.5 Flash**<br/>`gemini-2.5-flash` | Audio, images, videos, and text | Text | Adaptive thinking, cost efficiency |
| **Gemini 2.5 Flash-Lite Preview**<br/>`gemini-2.5-flash-lite-preview-06-17` | Text, image, video, audio | Text | Most cost-efficient model supporting high throughput |
| **Gemini 2.5 Flash Native Audio**<br/>`gemini-2.5-flash-preview-native-audio-dialog`<br/>`gemini-2.5-flash-exp-native-audio-thinking-dialog` | Audio, videos, and text | Text and audio, interleaved | High quality, natural conversational audio outputs, with or without thinking |
| **Gemini 2.5 Flash Preview TTS**<br/>`gemini-2.5-flash-preview-tts` | Text | Audio | Low latency, controllable, single- and multi-speaker text-to-speech audio generation |
| **Gemini 2.5 Pro Preview TTS**<br/>`gemini-2.5-pro-preview-tts` | Text | Audio | Low latency, controllable, single- and multi-speaker text-to-speech audio generation |
| **Gemini 2.0 Flash**<br/>`gemini-2.0-flash` | Audio, images, videos, and text | Text | Next generation features, speed, and realtime streaming |
| **Gemini 2.0 Flash Preview Image Generation**<br/>`gemini-2.0-flash-preview-image-generation` | Audio, images, videos, and text | Text, images | Conversational image generation and editing |
| **Gemini 2.0 Flash-Lite**<br/>`gemini-2.0-flash-lite` | Audio, images, videos, and text | Text | Cost efficiency and low latency |
| **Gemini 1.5 Flash** *(Deprecated)*<br/>`gemini-1.5-flash` | Audio, images, videos, and text | Text | Fast and versatile performance across a diverse variety of tasks |
| **Gemini 1.5 Flash-8B** *(Deprecated)*<br/>`gemini-1.5-flash-8b` | Audio, images, videos, and text | Text | High volume and lower intelligence tasks |
| **Gemini 1.5 Pro** *(Deprecated)*<br/>`gemini-1.5-pro` | Audio, images, videos, and text | Text | Complex reasoning tasks requiring more intelligence |
| **Gemini Embedding**<br/>`gemini-embedding-001` | Text | Text embeddings | Measuring the relatedness of text strings |
| **Imagen 4**<br/>`imagen-4.0-generate-preview-06-06`<br/>`imagen-4.0-ultra-generate-preview-06-06` | Text | Images | Our most up-to-date image generation model |
| **Imagen 3**<br/>`imagen-3.0-generate-002` | Text | Images | High quality image generation model |
| **Veo 2**<br/>`veo-2.0-generate-001` | Text, images | Video | High quality video generation |

## Gemini Files API

You can use files uploaded to the Gemini Files API with Genkit:
Expand Down