
Add more models to default pack [Ongoing] #29

Open
scosman opened this issue Nov 12, 2024 · 8 comments
Labels: enhancement (New feature or request)

Comments

@scosman
Collaborator

scosman commented Nov 12, 2024

We aim to maintain a well-tested default pack of models: each model-provider pair is tested, including verifying that structured output works reliably.

What models should we add next? React 👍 to a model's comment to show support, and I'll prioritize based on that list.

See the separate issue tracking the end-user option to add models.
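For a sense of what "well-tested" means in practice, here is a minimal sketch of the kind of structured-output smoke test implied above. It is not Kiln's actual test suite; it assumes OpenRouter's OpenAI-compatible endpoint, illustrative model IDs, and an OPENROUTER_API_KEY environment variable.

```python
# Hypothetical smoke test: for each model in the default pack, request JSON-only
# output and verify it parses. Model IDs and the env var name are assumptions.
import json
import os

import pytest
from openai import OpenAI

DEFAULT_PACK_MODELS = [
    "qwen/qwen-2.5-72b-instruct",
    "meta-llama/llama-3.3-70b-instruct",
    "deepseek/deepseek-chat",
]

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)


@pytest.mark.parametrize("model_id", DEFAULT_PACK_MODELS)
def test_structured_output(model_id):
    resp = client.chat.completions.create(
        model=model_id,
        messages=[
            {"role": "system", "content": 'Reply with JSON only, e.g. {"answer": 4}'},
            {"role": "user", "content": "What is 2 + 2?"},
        ],
        response_format={"type": "json_object"},
    )
    data = json.loads(resp.choices[0].message.content)
    assert data["answer"] == 4
```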

@scosman
Collaborator Author

scosman commented Nov 12, 2024

✅ Request: Qwen 2.5

scosman added the enhancement (New feature or request) label on Nov 12, 2024
@scosman
Collaborator Author

scosman commented Dec 10, 2024

✅ Llama 3.3 70b

scosman changed the title from "Add more models to default pack" to "Add more models to default pack [Ongoing]" on Dec 10, 2024
@scosman
Collaborator Author

scosman commented Dec 30, 2024

✅ Deepseek V3

@leonardmq
Contributor

Adding Google as a provider would be a nice one, as Gemini 2.0 is coming to prod in the next few weeks.

@scosman
Collaborator Author

scosman commented Jan 6, 2025

@leonardmq I can add 2.0 when it's out. Is there a reason to want direct Google/Vertex access instead of Gemini via OpenRouter?

@leonardmq
Contributor

> @leonardmq I can add 2.0 when it's out. Is there a reason to want direct Google/Vertex access instead of Gemini via OpenRouter?

My assumption was that having Google/Vertex as a provider would enable adding support for fine-tuning for models from Google/Vertex, similar to how Kiln currently supports fine-tuning for OpenAI models. From what I have seen, it seems OpenRouter does not support submitting fine-tuning jobs, though I may have missed some functionality there.

My primary use case right now involves creating synthetic datasets and fine-tuning models for specific tasks, then identifying the best-performing and most cost-effective one. Gemini 2.0 Flash is likely to be quite relevant there, so it would be great if we could fine-tune it as easily as we can OpenAI models.

@scosman
Collaborator Author

scosman commented Jan 6, 2025

Ah, fine-tuning support should be its own issue (feel free to create it). Fine-tuning is a lot more work than a model addition, which is just a few lines of config. You're correct that OpenRouter is inference-only. I need to look at the Vertex docs; I know they have some API-driven fine-tuning, but I still need to check their serverless hosting options. In the meantime, you could export a JSONL and use their UI?
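As a rough illustration of the "export a JSONL" workaround, here is a minimal sketch that writes chat-style training examples to a JSONL file for manual upload. The example data and file name are made up, and the record schema shown is the OpenAI chat fine-tuning format; Google/Vertex expects its own dataset schema, so check their docs before uploading.

```python
# Illustrative export only: dump (prompt, completion) pairs as chat-format JSONL.
import json

examples = [
    ("Summarize: The cat sat on the mat.", "A cat sat on a mat."),
    ("Summarize: It rained all day in Paris.", "Paris had a rainy day."),
]

with open("finetune_dataset.jsonl", "w", encoding="utf-8") as f:
    for prompt, completion in examples:
        record = {
            "messages": [
                {"role": "system", "content": "You are a concise summarizer."},
                {"role": "user", "content": prompt},
                {"role": "assistant", "content": completion},
            ]
        }
        # One JSON object per line, as fine-tuning UIs typically expect.
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```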

@scosman
Collaborator Author

scosman commented Jan 21, 2025

Deepseek R1 - I’ll add this when I’m back
