You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| 5 |[vLLM‑SR](https://vllm-semantic-router.com/) [[Code]](https://github.com/vllm-project/semantic-router) [[HF]](https://huggingface.co/llm-semantic-router)| 🎓 vLLM SR Team | 64.32 |67.28| $1.67|4.79|12.54|79.33|0.19|35.00|
37
+
| 🥇 |[vLLM‑SR](https://vllm-semantic-router.com/) [[Code]](https://github.com/vllm-project/semantic-router) [[HF]](https://huggingface.co/llm-semantic-router)| 🎓 vLLM SR Team | 67.23| 66.53| $0.06|94.10|90.12|100.00|—|90.95|
@@ -102,6 +102,7 @@ Create a config file in `./router_inference/config/<router_name>.json`. An examp
102
102
{
103
103
"pipeline_params": {
104
104
"router_name": "your-router",
105
+
"router_cls_name": "your_router_class_name",
105
106
"models": [
106
107
"gpt-4o-mini",
107
108
"claude-3-haiku-20240307",
@@ -111,7 +112,7 @@ Create a config file in `./router_inference/config/<router_name>.json`. An examp
111
112
}
112
113
```
113
114
114
-
For each model in your config, add an entry with the pricing per million tokens in this format at [`model_cost/cost.json`](./model_cost/cost.json):
115
+
For each model in your config, add an entry with the pricing per million tokens in this format at [`model_cost/model_cost.json`](./model_cost/model_cost.json):
115
116
116
117
```json
117
118
{
@@ -129,12 +130,13 @@ For each model in your config, add an entry with the pricing per million tokens
129
130
130
131
Create your own router class by inheriting from `BaseRouter` and implementing the `_get_prediction()` method. See [`router_inference/router/example_router.py`](./router_inference/router/example_router.py) for a complete example.
131
132
132
-
Then, modify [`router_inference/generate_prediction_file.py`](./router_inference/generate_prediction_file.py#L150) to use your router class:
133
+
Then, modify [`router_inference/router/__init__.py`](./router_inference/router/__init__.py) to include your router class:
133
134
134
135
```python
135
-
#Replace ExampleRouter with your router class
136
+
#Import your router class
136
137
from router_inference.router.my_router import MyRouter
0 commit comments