From efbcde289713b83c75bf1c8ae633fea018dab261 Mon Sep 17 00:00:00 2001 From: Ryan Zhang Date: Tue, 24 Feb 2026 15:27:34 -0500 Subject: [PATCH] Add MESS+ Added a new research paper on AI model routing. --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index a27d7bc..0bb4e00 100644 --- a/README.md +++ b/README.md @@ -26,6 +26,7 @@ A curated list of awesome solutions and research in AI model routing. Other awes *Research papers on AI model routing (ordered by recency)* +- [MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees, 2025-05-26](https://arxiv.org/abs/2505.19947): A stochastic optimization algorithm for cost-optimal LLM request routing while providing rigorous SLA compliance guarantees. - [EmbedLLM: Learning Compact Representations of Large Language Models, 2024-10-03](https://arxiv.org/abs/2410.02223): A matrix-factorization based LLM router which extracts "model embeddings" that captures LLM characteristics. - [RouteLLM: Learning to Route LLMs with Preference Data, 2024-07-21](https://arxiv.org/abs/2406.18665): A training framework for strong vs weak LLM routers leveraging human preference data and data augmentation. - [MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs, 2024-07-15](https://arxiv.org/abs/2407.10834): Dynamically route each query to the optimal LLM for classification tasks.