Pinned
-
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
-
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
-
Vahe1994/AQLM
Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…
-
QwenLM/Qwen3
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
-
QwenLM/Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.