feat: add vLLM backend integration #201

Merged

LarFii merged 2 commits into HKUDS:main from sotastack:feature/vllm-integration on Feb 20, 2026
Conversation

@teamauresta (Contributor)

  • Add examples/vllm_integration_example.py with full working example
  • Add docs/vllm_integration.md with setup guide and performance tips
  • Update env.example with vLLM configuration section

vLLM provides an OpenAI-compatible API with continuous batching, PagedAttention, and tensor parallelism for production RAG workloads.
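Because vLLM exposes an OpenAI-compatible API, a client only needs a base URL, an API key, and a model name. The sketch below is illustrative, not code from this PR; the environment variable names and defaults are hypothetical stand-ins for whatever the updated env.example actually defines.

```python
import os

# Hypothetical helper: resolve vLLM endpoint settings from the environment.
# Variable names (VLLM_API_BASE, VLLM_API_KEY, VLLM_MODEL) are illustrative,
# not necessarily those added to env.example in this PR.
def vllm_openai_config(env=None):
    env = os.environ if env is None else env
    return {
        "base_url": env.get("VLLM_API_BASE", "http://localhost:8000/v1"),
        # vLLM does not validate keys unless the server is started with --api-key
        "api_key": env.get("VLLM_API_KEY", "EMPTY"),
        "model": env.get("VLLM_MODEL", "Qwen/Qwen2.5-7B-Instruct"),
    }

cfg = vllm_openai_config(env={})
print(cfg["base_url"])  # http://localhost:8000/v1
```

Any OpenAI-style client can then be pointed at `base_url` unchanged, which is what makes vLLM a drop-in backend for RAG pipelines that already speak the OpenAI protocol.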

Description

[Briefly describe the changes made in this pull request.]

Related Issues

[Reference any related issues or tasks addressed by this pull request.]

Changes Made

[List the specific changes made in this pull request.]

Checklist

  • Changes tested locally
  • Code reviewed
  • Documentation updated (if necessary)
  • Unit tests added (if applicable)

Additional Notes

[Add any additional notes or context for the reviewer(s).]

@LarFii (Collaborator) commented Feb 18, 2026

@cursor review

@LarFii (Collaborator) commented Feb 18, 2026

@codex review

@chatgpt-codex-connector (bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 47ce1635db

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

```python
def embedding_func_factory(self):
    """Create a completely serializable embedding function."""
    return EmbeddingFunc(
        embedding_dim=1024,  # bge-m3 default dimension
```


P2: Make embedding dimension configurable for chosen model

The embedding wrapper hardcodes embedding_dim=1024 even though the model is read from environment variables, so switching to any non-1024 embedding model (including the guide's own OpenAI embedding option) will initialize LightRAG with the wrong vector size and can fail during insert/query with dimension mismatch errors. Read the dimension from EMBEDDING_DIM (or model metadata) instead of fixing it to 1024.

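The suggested fix can be sketched as reading the dimension from the environment with a validated fallback. The variable name `EMBEDDING_DIM` comes from the review comment; the default of 1024 matches the bge-m3 dimension noted in the diff, and the helper's name is illustrative.

```python
import os

# Sketch of the suggested fix: resolve the embedding dimension from the
# EMBEDDING_DIM environment variable instead of hardcoding 1024.
def resolve_embedding_dim(env=None, default=1024):
    env = os.environ if env is None else env
    raw = env.get("EMBEDDING_DIM")
    if raw is None:
        return default  # fall back to the bge-m3 dimension
    dim = int(raw)
    if dim <= 0:
        raise ValueError(f"EMBEDDING_DIM must be a positive integer, got {raw!r}")
    return dim

print(resolve_embedding_dim(env={"EMBEDDING_DIM": "1536"}))  # 1536
```

The factory in the diff would then pass `embedding_dim=resolve_embedding_dim()` to `EmbeddingFunc`, so switching to a non-1024 model only requires changing one environment variable.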

```markdown
## Prerequisites

1. **NVIDIA GPU(s)** with CUDA support (compute capability ≥ 7.0)
2. **Python 3.9+**
```


P2: Correct Python prerequisite to 3.10+

This guide says Python 3.9+ is supported, but the package metadata requires Python >=3.10 (pyproject.toml), so users on 3.9 following this setup will fail at installation before they can run the integration. Updating this prerequisite avoids a broken onboarding path.

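Since the package metadata declares `requires-python = ">=3.10"`, the example script could also fail fast at runtime rather than letting a 3.9 user hit an opaque install or import error. A minimal sketch, with an illustrative function name:

```python
import sys

# Sketch of a fail-fast version guard matching requires-python = ">=3.10".
def check_python(minimum=(3, 10), current=None):
    current = tuple(sys.version_info[:2]) if current is None else current
    if current < minimum:
        raise RuntimeError(
            "This integration requires Python %d.%d+, found %d.%d"
            % (minimum + current)
        )
    return True

check_python(current=(3, 11))  # passes silently
```

Calling `check_python()` at the top of the example surfaces the mismatch with a clear message instead of a setup-time failure.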

@LarFii (Collaborator) commented Feb 20, 2026

@codex review

@chatgpt-codex-connector (bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f4384b9b38


```markdown
## Prerequisites

1. **NVIDIA GPU(s)** with CUDA support (compute capability ≥ 7.0)
2. **Python 3.9+**
```


P2: Raise documented Python minimum to 3.10

The new guide states Python 3.9+, but this repository declares requires-python = ">=3.10" in pyproject.toml, so users following this doc on Python 3.9 will fail during installation before they can run the vLLM example. Please align the prerequisite here with the actual package requirement to avoid a broken setup path.


@LarFii LarFii merged commit 20164f6 into HKUDS:main Feb 20, 2026
1 check passed
@teamauresta teamauresta deleted the feature/vllm-integration branch February 23, 2026 03:36

2 participants