Skip to content

Commit bc14e9b

Browse files
committed
weave-router: cheap5 (5-model cost-aware, oracle 83.7 offline)
1 parent ec266ba commit bc14e9b

3 files changed

Lines changed: 718 additions & 719 deletions

File tree

router_inference/config/weave-router.json

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,11 +7,10 @@
77
"deepseek/deepseek-v4-pro",
88
"gemini-3.1-flash-lite-preview",
99
"gemini-3.1-pro-preview",
10-
"qwen/qwen3-235b-a22b-2507",
11-
"claude-sonnet-4-5"
10+
"qwen/qwen3-235b-a22b-2507"
1211
],
13-
"description": "Weave Router (oracle 6-model trimmed pool, margin=0.05): drop low-coverage models",
14-
"router_version": "oracle-6m",
12+
"description": "Weave Router (cheap5): 5-model cost-aware oracle, margin=0.05",
13+
"router_version": "cheap5",
1514
"router_homepage": "https://workweave.ai"
1615
}
1716
}

router_inference/predictions/weave-router-robustness.json

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -968,7 +968,7 @@
968968
{
969969
"global index": "MMLUPro_economics_6114",
970970
"prompt": "Kindly examine the subsequent multiple-choice inquiries and select the option that most plausibly represents the correct choice based on the given alternatives.\n\nContext: None\n\nQuestion: Analyze the following equation to identify the type of model it most accurately characterizes:\n\n\\[ y_{{it}} = \\alpha + \\beta_{{it}} + \\mu_i + \\nu_{{it}} \\]\n\nChoices: \nA. A strictly temporal series model\nB. A model of difference-in-differences\nC. A model with random effects\nD. A model using pooled ordinary least squares\nE. A model involving mixed effects\nF. A model with spatial autocorrelation\nG. A model of quantile regression\nH. A model of panel data\nI. A model with entity fixed effects\nJ. A model with time fixed effects\n\nIndicate the correct option by using \\(\\boxed{X}\\), where \\(X\\) corresponds to the accurate choice. Construct the explanation or feedback in no more than three sentences.",
971-
"prediction": "claude-sonnet-4-5",
971+
"prediction": "gemini-3.1-flash-lite-preview",
972972
"cost": 0.0,
973973
"for_optimality": false
974974
},
@@ -1640,7 +1640,7 @@
16401640
{
16411641
"global index": "MathQA_202",
16421642
"prompt": "Carefully examine the subsequent multiple-choice queries and determine the option that most plausibly represents the correct response from those provided.\n\nContext: None\n\nQuestion: Is the mean value between two collections of numbers nearer to the one containing a greater quantity of numbers?\n\nOptions:\nA. 70\nB. 80\nC. 85\nD. 90\nE. 95\n\nPresent the correct letter choice in \\boxed{X}, where X signifies the accurate option. Limit the explanation or feedback to no more than three sentences.",
1643-
"prediction": "claude-sonnet-4-5",
1643+
"prediction": "gemini-3.1-flash-lite-preview",
16441644
"cost": 0.0,
16451645
"for_optimality": false
16461646
},
@@ -2809,7 +2809,7 @@
28092809
{
28102810
"global index": "WMT19-de-en_46",
28112811
"prompt": "Translt the folowing scentnce from Englsih to Grman.\n\nTe trvel waring is also a responce to a nw Misouri lw that woud mak it mor difficult to sue a buisness for housign or employmnt discrimnation.\n\nProide yor final anser in \\boxed{} format.",
2812-
"prediction": "qwen/qwen3-235b-a22b-2507",
2812+
"prediction": "deepseek/deepseek-v4-flash",
28132813
"cost": 0.0,
28142814
"for_optimality": false
28152815
},
@@ -2823,7 +2823,7 @@
28232823
{
28242824
"global index": "WMT19-de-en_883",
28252825
"prompt": "Pladis, the company that owns McVitie's, announced it has poured over £5 million into the location during recent years.",
2826-
"prediction": "qwen/qwen3-235b-a22b-2507",
2826+
"prediction": "gemini-3.1-flash-lite-preview",
28272827
"cost": 0.0,
28282828
"for_optimality": false
28292829
},
@@ -2837,7 +2837,7 @@
28372837
{
28382838
"global index": "WMT19-fi-en_610",
28392839
"prompt": "Trnslate teh folowing sentnce from Englesh to Finnish.\n\nOn Satrday, rains r expecetd in mnay prts of souther and cetnral arreas with a chnce of thundrstorms.\n\nProvdie your fnal answre in \\boxed{{}} frmat.",
2840-
"prediction": "claude-sonnet-4-5",
2840+
"prediction": "qwen/qwen3-235b-a22b-2507",
28412841
"cost": 0.0,
28422842
"for_optimality": false
28432843
},

0 commit comments

Comments
 (0)