Conversation

edlee123 (Contributor) commented Jun 24, 2025

Description

Allows ChatQnA to be used with hundreds of OpenAI-compatible endpoints (e.g. OpenRouter.ai, Hugging Face, Denvr) and improves the developer experience, making it quick to try OPEA even in low-resource environments.

Key Changes Made:

  • Created ChatQnA/docker_compose/intel/cpu/xeon/README_endpoint_openai.md: instructions to spin up the example.
  • Created ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml: replaces vLLM with an OpenAI-compatible endpoint.
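The new compose file works by pointing ChatQnA at a remote OpenAI-compatible API instead of running vLLM locally. A minimal sketch of what such a service definition can look like — the service and variable names here are illustrative, not necessarily the ones used in compose_endpoint_openai.yaml:

```yaml
services:
  # The local vLLM serving container is dropped; the backend talks
  # directly to a remote OpenAI-compatible provider instead.
  chatqna-backend-server:
    environment:
      # Base URL of any OpenAI-compatible provider, e.g. OpenRouter.ai
      OPENAI_API_BASE: https://openrouter.ai/api/v1
      # Provider API key, supplied from the host environment
      OPENAI_API_KEY: ${OPENAI_API_KEY}
      # Model identifier understood by the chosen provider
      MODEL_ID: anthropic/claude-3.7-sonnet
```

Because only environment variables change, the same compose file can target any of the providers listed under Tests below by swapping the base URL and model ID.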

Also:

  • Fixed the align_generator function to properly detect and skip streamed chunks whose content is null, as emitted by OpenAI-compatible endpoints. Previously the raw null JSON was rendered in the UI.
  • Added better error handling and debug logging to ease troubleshooting of endpoint issues.
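The null-chunk fix can be sketched as follows. OpenAI-compatible streams emit role-only and finish chunks whose delta.content is null, and those must be filtered out before the text reaches the UI. This is an illustrative reconstruction, not the exact code in chatqna.py — the function name and shapes are assumptions:

```python
import json

def align_generator_sketch(sse_lines):
    """Yield only the text of chunks whose delta content is non-null.

    `sse_lines` is an iterable of raw SSE data lines from an
    OpenAI-compatible /chat/completions stream.
    """
    for line in sse_lines:
        payload = line.removeprefix("data: ").strip()
        if not payload or payload == "[DONE]":
            continue
        try:
            chunk = json.loads(payload)
        except json.JSONDecodeError:
            # Skip malformed chunks instead of surfacing raw text to the UI.
            continue
        choices = chunk.get("choices") or []
        content = choices[0].get("delta", {}).get("content") if choices else None
        if content is None:
            # Role-only or finish chunks carry "content": null — drop them.
            continue
        yield content
```

Skipping (rather than erroring on) null-content chunks matters because providers differ in how many such bookkeeping chunks they send per response.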

Issues

N/A

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

N/A

Tests

  • OpenRouter.ai: anthropic/claude-3.7-sonnet
  • Denvr: meta-llama/Llama-3.1-70B-Instruct
  • Hugging Face Inference Endpoint: microsoft/phi-4
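All three providers above expose the same /chat/completions contract, so a single request builder can be reused to smoke-test each endpoint. The helper below is an illustrative sketch (names and defaults are assumptions, not code from this PR); it returns the pieces of the request so they can be inspected or sent with any HTTP client:

```python
import json

def build_chat_request(base_url, api_key, model, prompt):
    """Build an OpenAI-compatible /chat/completions request.

    Returns (url, headers, body) for use with any HTTP client.
    """
    url = base_url.rstrip("/") + "/chat/completions"
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        # ChatQnA consumes the streamed SSE response.
        "stream": True,
    })
    return url, headers, body
```

Swapping only `base_url` and `model` is what lets the same ChatQnA deployment run against OpenRouter.ai, Denvr, or a Hugging Face Inference Endpoint.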

edlee123 and others added 30 commits June 24, 2025 18:08
…w null json. Also improved exception handling and logging

Signed-off-by: Ed Lee <[email protected]>
Integrate MultimodalQnA set_env to ut scripts.
Add README.md for UT scripts.

Signed-off-by: ZePan110 <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
…nt (opea-project#1996)

Signed-off-by: Mustafa <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Ed Lee <[email protected]>
Signed-off-by: Yi Yao <[email protected]>
Co-authored-by: Copilot <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
Signed-off-by: ZePan110 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Ed Lee <[email protected]>
…archQnA and Translation (opea-project#2038)

update secrets token name for ProductivitySuite, RerankFinetuning, SearchQnA and Translation
Fix shellcheck issue

Signed-off-by: ZePan110 <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
…rkflowExecAgent (opea-project#2039)

update secrets token name for InstructionTuning, MultimodalQnA and WorkflowExecAgent
Fix shellcheck issue

Signed-off-by: ZePan110 <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
@Copilot Copilot AI review requested due to automatic review settings June 24, 2025 23:34

github-actions bot commented Jun 24, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

Copilot AI left a comment


Pull Request Overview

This pull request introduces an OpenAI-compatible endpoint for ChatQnA, updates the deployment documentation, and includes improvements in error handling and logging.

  • Added new Docker Compose file (compose_endpoint_openai.yaml) to support OpenAI-like endpoints.
  • Updated README files for clearer deployment instructions and configuration details.
  • Fixed the align_generator function in chatqna.py to better handle and filter null content chunks.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

  • CodeGen/docker_compose/intel/cpu/xeon/README.md — Updated docker compose command and environment variable documentation; notes a markdown table formatting issue.
  • ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml — Added new compose file for OpenAI-compatible endpoint integration.
  • ChatQnA/docker_compose/intel/cpu/xeon/README_endpoint_openai.md — New documentation with detailed instructions for deploying ChatQnA using the new endpoint.
  • ChatQnA/chatqna.py — Improved logging and error handling in input/output alignment and generator functions.
Comments suppressed due to low confidence (1)

CodeGen/docker_compose/intel/cpu/xeon/README.md:111

  • The table row for LLM_ENDPOINT appears to be broken into two columns due to an unintended pipe character. Please merge the content into a single cell to ensure the URL displays correctly.
| `LLM_ENDPOINT`                          | Internal URL for the LLM serving endpoint (used by `codegen-llm-server`). Configured in `compose.yaml`.             | `http://codegen-vllm                           | tgi-server:9000/v1/chat/completions` |

@edlee123 edlee123 requested a review from letonghan July 2, 2025 05:09
@edlee123
Copy link
Contributor Author

edlee123 commented Jul 2, 2025

Hi @yao531441 @letonghan if either of you can, I'm looking for one more reviewer please :)

@joshuayao joshuayao added this to the v1.4 milestone Aug 14, 2025
@joshuayao joshuayao added this to OPEA Aug 14, 2025
@joshuayao joshuayao moved this to In progress in OPEA Aug 14, 2025
@edlee123 edlee123 changed the title ChatQnA Example with OpenAI-Compatible Endpoint Bump: ChatQnA Example with OpenAI-Compatible Endpoint Aug 26, 2025