Skip to content
View Jun-Howie's full-sized avatar

Block or report Jun-Howie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 50.4k 8.2k

  2. xorbitsai/inference xorbitsai/inference Public

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

    Python 8.1k 695

  3. Vahe1994/AQLM Vahe1994/AQLM Public

    Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

    Python 1.3k 184

  4. QwenLM/Qwen3 QwenLM/Qwen3 Public

    Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

    Shell 22.1k 1.5k

  5. QwenLM/Qwen2.5-VL QwenLM/Qwen2.5-VL Public

    Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

    Jupyter Notebook 11.1k 800

  6. LLMxMapReduce LLMxMapReduce Public

    Forked from thunlp/LLMxMapReduce

    Python 3