test: switch LLM API tests to qwen3.7-max by cmgzn · Pull Request #991 · datajuicer/data-juicer

cmgzn · 2026-06-10T05:59:03Z

Updates LLM API-based tests to use qwen3.7-max instead of qwen2.5-72b-instruct.

This avoids failures caused by restricted or unavailable access to the previous model in CI.

Validation:

Ran lightweight Python compile check for affected test directories
Verified no lint errors
Did not run full test suite locally; full validation should run in official CI
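For reference, a lightweight compile check of this kind can be sketched with the standard-library `compileall` module. The exact command used for this PR is not stated, so the helper name `compile_check` and the directories passed to it are assumptions for illustration only.

```python
import compileall


def compile_check(directories):
    """Byte-compile every .py file under each directory.

    Returns the list of directories in which at least one file failed to
    compile. This only catches syntax errors; it does not import or run tests.
    """
    failed = []
    for directory in directories:
        # compile_dir returns a true value only if every file compiled cleanly.
        # quiet=1 prints errors only; force=True ignores cached bytecode.
        ok = compileall.compile_dir(str(directory), quiet=1, force=True)
        if not ok:
            failed.append(str(directory))
    return failed
```

Calling `compile_check(["tests/ops"])` (a hypothetical path) returns an empty list when every file parses. Such a check says nothing about whether the new model produces the expected outputs, which is why the full CI run is still needed.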

gemini-code-assist

Code Review

This pull request updates the API model and tokenizer name from 'qwen2.5-72b-instruct' to 'qwen3.6-plus' across multiple test files. However, 'qwen3.6-plus' appears to be an invalid or non-existent model name, which will cause API calls and tokenizer loading to fail during test execution. It is recommended to correct this to a valid model name, such as 'qwen-plus'.


fengrui-z

Code uses qwen3.7-max everywhere, but PR title says qwen3.6-plus. Fix the title.

sampling_params is misaligned in several dialog_* test files — not matching the other keyword args.

Unrelated changes:

uv.lock swaps bs4 → beautifulsoup4 — not related to model migration.
test_llm_analysis_filter.py rewrites RFT test data and adds min_score=0.7 — behavioral change beyond a model swap.

PR description says tests weren't run locally. qwen3.7-max may produce different output formats; recommend at least running the affected tests before merge.

cmgzn · 2026-06-16T02:21:03Z

Thanks for the thorough review! Here are my responses:

PR title says qwen3.6-plus but code uses qwen3.7-max: The original qwen3.6-plus had 2–3 tests that repeatedly failed, so I switched to qwen3.7-max. I'll update the PR title and description once all tests pass.
uv.lock swaps bs4 → beautifulsoup4: This is a leftover from PR #977 ("Replace bs4 stub with beautifulsoup4 in dependencies"), which swapped "bs4" for "beautifulsoup4" in the pyproject.toml dependencies but didn't sync uv.lock. Opening a separate PR for this didn't seem worthwhile, so I've included the fix here.
test_llm_analysis_filter.py RFT test data rewrite & min_score=0.7: After the model swap, the RFT tests became flaky — LLM scoring tests are inherently unstable. I adjusted the test data to widen the quality gap between samples and raised min_score to improve reliability.
sampling_params misalignment in dialog_* test files: Fixed! All sampling_params keyword args are now properly aligned with the other keyword arguments across test_dialog_topic_detection_mapper.py, test_dialog_sentiment_detection_mapper.py, test_dialog_intent_detection_mapper.py, and test_dialog_sentiment_intensity_mapper.py.
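The reasoning behind widening the quality gap can be illustrated with a small sketch. It does not use the actual filter or the PR's test data: `keep_by_score`, `is_stable`, and all scores below are hypothetical, and shifting every score by the same amount is a deliberately crude stand-in for run-to-run variation in LLM scoring.

```python
def keep_by_score(scores, min_score):
    """Return the names whose score meets the threshold (stand-in for a score-based filter)."""
    return [name for name, score in scores.items() if score >= min_score]


def is_stable(nominal_scores, min_score, jitter):
    """Check that the kept set is unchanged when every score shifts by +/- jitter."""
    expected = keep_by_score(nominal_scores, min_score)
    for shift in (-jitter, jitter):
        shifted = {name: score + shift for name, score in nominal_scores.items()}
        if keep_by_score(shifted, min_score) != expected:
            return False
    return True


# Hypothetical scores, not taken from the PR's test data.
narrow_gap = {"good": 0.75, "poor": 0.65}
wide_gap = {"good": 0.95, "poor": 0.30}

narrow_is_stable = is_stable(narrow_gap, min_score=0.7, jitter=0.1)
wide_is_stable = is_stable(wide_gap, min_score=0.7, jitter=0.1)
```

With samples that sit close to the threshold, a small scoring shift changes which samples pass, so an assertion on the kept set becomes flaky. Samples placed far on either side of `min_score` tolerate the same shift.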

fengrui-z

LGTM

test: switch LLM API tests to qwen3.6-plus

08fdb2e

cmgzn requested review from HYLcool, cyruszhang, fengrui-z and yxdyc June 10, 2026 05:59

cmgzn temporarily deployed to Testing June 10, 2026 05:59 — with GitHub Actions Inactive

cmgzn had a problem deploying to Testing June 10, 2026 05:59 — with GitHub Actions Failure

cmgzn marked this pull request as ready for review June 10, 2026 05:59

gemini-code-assist Bot reviewed Jun 10, 2026

Comment thread tests/ops/aggregator/test_entity_attribute_aggregator.py Outdated

Comment thread tests/ops/mapper/test_text_chunk_mapper.py Outdated

test: update test cases for LLM analysis filter

3cbd360

cmgzn had a problem deploying to Testing June 12, 2026 02:52 — with GitHub Actions Failure

cmgzn temporarily deployed to Testing June 12, 2026 02:52 — with GitHub Actions Inactive

test: update api model and sampling params in tests

5551496

cmgzn temporarily deployed to Testing June 12, 2026 05:41 — with GitHub Actions Inactive

cmgzn had a problem deploying to Testing June 12, 2026 05:41 — with GitHub Actions Failure

test: update api model version in tests

6439804

cmgzn temporarily deployed to Testing June 12, 2026 06:14 — with GitHub Actions Inactive

cmgzn had a problem deploying to Testing June 12, 2026 06:14 — with GitHub Actions Failure

cmgzn had a problem deploying to Testing June 12, 2026 07:52 — with GitHub Actions Failure

cmgzn had a problem deploying to Testing June 12, 2026 08:33 — with GitHub Actions Failure

test: update rft data analysis and answers for clarity

3a8fec6

cmgzn had a problem deploying to Testing June 15, 2026 02:40 — with GitHub Actions Error

cmgzn temporarily deployed to Testing June 15, 2026 08:58 — with GitHub Actions Inactive

cmgzn had a problem deploying to Testing June 15, 2026 08:58 — with GitHub Actions Failure

fengrui-z reviewed Jun 15, 2026

cmgzn temporarily deployed to Testing June 16, 2026 01:58 — with GitHub Actions Inactive

cmgzn changed the title ~~test: switch LLM API tests to qwen3.6-plus~~ test: switch LLM API tests to qwen3.7-max Jun 16, 2026

test: refactor test cases for dialog mappers

c7eed4f

cmgzn temporarily deployed to Testing June 16, 2026 02:16 — with GitHub Actions Inactive

cmgzn had a problem deploying to Testing June 16, 2026 02:16 — with GitHub Actions Failure

cmgzn deployed to Testing June 16, 2026 03:20 — with GitHub Actions Active

fengrui-z approved these changes Jun 16, 2026

cmgzn merged commit e622254 into main Jun 17, 2026
7 of 8 checks passed

cmgzn merged 6 commits into main from chore/update-llm-test-model-qwen36-plus

2 participants
