An intelligent model selection system that automatically chooses the best LLM for a given task based on performance history, cost, and latency requirements.
- Automatic model selection based on task requirements
- Support for multiple LLM providers (OpenAI, Anthropic, Mistral)
- Performance tracking and evaluation
- Cost and latency optimization
- SQLite database for storing task history
- Extensible architecture for adding new models and evaluation metrics
llm_router/
├── config/
│   └── model_registry.yaml   # Model configurations and pricing
├── data/
│   └── task_history.db       # SQLite database for task history
├── src/
│   ├── models.py             # Data models and types
│   ├── model_runner.py       # LLM API integration
│   ├── evaluator.py          # Output quality evaluation
│   ├── router_agent.py       # Model selection logic
│   ├── task_runner.py        # Pipeline orchestration
│   └── example.py            # Usage example
└── requirements.txt          # Project dependencies
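The registry file describes each model's provider, pricing, and latency characteristics. A hypothetical entry in `config/model_registry.yaml` might look like the following; the key names and numbers are assumptions for illustration, not the project's actual schema:

models:
  gpt-4o:
    provider: openai
    input_cost_per_1k_tokens: 0.0025    # illustrative pricing, USD
    output_cost_per_1k_tokens: 0.01
    avg_latency_ms: 1800
    strengths:
      - code_generation
      - reasoning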
- Clone the repository:
git clone https://github.com/yourusername/llm_router.git
cd llm_router
- Create a virtual environment and install dependencies:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -r requirements.txt
- Set up environment variables:
export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key"
export MISTRAL_API_KEY="your-mistral-key"
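Before running anything, it can help to confirm the keys are actually visible to Python. A minimal check using only the standard library:

import os

# The three keys exported above; the router needs at least the providers you plan to use.
required = ["OPENAI_API_KEY", "ANTHROPIC_API_KEY", "MISTRAL_API_KEY"]
missing = [key for key in required if not os.environ.get(key)]
if missing:
    raise RuntimeError(f"Missing API keys: {', '.join(missing)}")

A basic usage example (see also `src/example.py`) then looks like this: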
from src.models import TaskSpec, TaskType
from src.task_runner import TaskRunner
# Initialize task runner
runner = TaskRunner()
# Create a task
task = TaskSpec(
    task_id="unique-id",
    prompt="Your task prompt here",
    task_type=TaskType.CODE_GENERATION,
    importance=0.8,
    latency_budget_ms=5000,
)
# Run the task
result = runner.run_task(task)
# Access results
print(f"Selected model: {result.selected_model}")
print(f"Output: {result.output.output_text}")
print(f"Quality score: {result.evaluation_score}")
The router uses a weighted scoring system to select the best model:
score = α * quality - β * cost - γ * latency
Where:
- quality: Historical performance and task-specific capabilities
- cost: Token pricing and usage
- latency: Response time and budget compliance
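A minimal sketch of how that score could be computed is shown below. The weight defaults, the normalization, and the example numbers are illustrative assumptions, not the exact logic in `src/router_agent.py`:

def score_model(quality: float, cost: float, latency: float,
                alpha: float = 1.0, beta: float = 0.5, gamma: float = 0.3) -> float:
    """Higher is better: reward expected quality, penalize cost and latency.

    Assumes the three inputs are normalized to comparable scales, e.g. quality
    in [0, 1], cost in USD per task, latency in seconds.
    """
    return alpha * quality - beta * cost - gamma * latency

# Illustrative candidate stats: (quality, cost, latency)
candidates = {
    "gpt-4o": (0.92, 0.030, 2.1),
    "claude-3-5-sonnet": (0.90, 0.018, 1.8),
    "mistral-large": (0.84, 0.012, 1.5),
}
best = max(candidates, key=lambda name: score_model(*candidates[name]))

Raising β or γ shifts selection toward cheaper or faster models, which is useful for low-importance tasks with tight latency budgets.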
- Add the model configuration to `config/model_registry.yaml`
- Implement the API integration in `src/model_runner.py`
- Extend `src/evaluator.py` with new evaluation metrics (see the sketch below)
- Implement custom scoring logic
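For the evaluation step, a custom metric can usually be expressed as a plain scoring function. The function below is a guess at how such a metric might slot into `src/evaluator.py`, not the project's actual evaluator API:

def keyword_coverage(output_text: str, required_keywords: list[str]) -> float:
    """Hypothetical custom metric: fraction of required keywords present in the output."""
    if not required_keywords:
        return 1.0
    text = output_text.lower()
    hits = sum(1 for keyword in required_keywords if keyword.lower() in text)
    return hits / len(required_keywords)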
The project is designed to be easily integrated with:
- Streamlit for visualization (see the sketch below)
- LangSmith for experiment tracking
- Custom monitoring systems
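As an example of the first option, a minimal Streamlit page over the stored task history could look like this; the task_history table and its columns are assumptions about the schema of data/task_history.db:

import sqlite3

import pandas as pd
import streamlit as st

st.title("LLM Router - Task History")

# Hypothetical schema: adjust the table/column names to match data/task_history.db.
conn = sqlite3.connect("data/task_history.db")
df = pd.read_sql_query(
    "SELECT * FROM task_history ORDER BY rowid DESC LIMIT 100", conn
)
conn.close()

st.dataframe(df)

Saved as, say, dashboard.py, it can be launched with streamlit run dashboard.py.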
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.