Jun-Howie

vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50.4k 8.2k

xorbitsai/inference Public

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8.1k 695

Vahe1994/AQLM Public

Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

Python 1.3k 184

QwenLM/Qwen3 Public

Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

Shell 22.1k 1.5k

QwenLM/Qwen2.5-VL Public

Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

Jupyter Notebook 11.1k 800

LLMxMapReduce Public

Forked from thunlp/LLMxMapReduce

Python 3