[CI] Fix ngram & suffix test oom #4755

fluctlux · 2025-12-05T14:17:09Z

What this PR does / why we need it?

Avoid oom during CI by using with VllmRunner instead of LLM(), and enable test_ngram_correctness

How was this patch tested?

Before:

After:

CI passed.

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

This pull request aims to fix an Out-Of-Memory (OOM) error in CI for ngram and suffix tests. The changes involve replacing direct LLM instantiation with the VllmRunner context manager, which ensures proper resource cleanup. Additionally, the multiprocessing start method is set to 'spawn' to avoid issues with NPU context inheritance. The test test_ngram_correctness is also re-enabled.

The changes are logical and address the OOM issue effectively. My main feedback is regarding how the environment variable is set. Using os.environ at the module level can introduce side effects to other tests. I've suggested a more robust approach using pytest's monkeypatch fixture to scope the change correctly.

gemini-code-assist · 2025-12-05T14:18:25Z

tests/e2e/singlecard/spec_decode_v1/test_v1_spec_decode.py


 from tests.e2e.conftest import VllmRunner

+os.environ["VLLM_WORKER_MULTIPROC_METHOD"] = "spawn"


Modifying os.environ directly at the module level can lead to side effects in other tests, as it's a global state change that persists throughout the pytest session. This can make tests flaky and hard to debug.

A safer and more idiomatic pytest approach is to use the monkeypatch fixture to manage environment variables. This ensures that the change is properly scoped and cleaned up after the tests in this module are done.

You could define a module-scoped autouse fixture like this, which would replace the direct modification of os.environ:

import pytest @pytest.fixture(scope="module", autouse=True) def set_spawn_method(monkeypatch): monkeypatch.setenv("VLLM_WORKER_MULTIPROC_METHOD", "spawn")

This would make the test suite more robust against side-effects.

github-actions · 2025-12-05T14:50:45Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

wangxiyuan

Hope the test can works now

Signed-off-by: fluctlux <[email protected]>

wangxiyuan · 2025-12-08T01:26:46Z

really nice change. Thanks very much！！

### What this PR does / why we need it? Avoid oom during CI by using `with VllmRunner` instead of `LLM()`, and enable `test_ngram_correctness` ### How was this patch tested? CI passed. - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: fluctlux <[email protected]> Co-authored-by: wangxiyuan <[email protected]> Signed-off-by: yuxingcyx <[email protected]>

### What this PR does / why we need it? Avoid oom during CI by using `with VllmRunner` instead of `LLM()`, and enable `test_ngram_correctness` ### How was this patch tested? CI passed. - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: fluctlux <[email protected]> Co-authored-by: wangxiyuan <[email protected]> Signed-off-by: tanqingshan (A) <[email protected]>

### What this PR does / why we need it? Avoid oom during CI by using `with VllmRunner` instead of `LLM()`, and enable `test_ngram_correctness` ### How was this patch tested? CI passed. - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: fluctlux <[email protected]> Co-authored-by: wangxiyuan <[email protected]>

fluctlux marked this pull request as ready for review December 5, 2025 14:17

gemini-code-assist bot reviewed Dec 5, 2025

View reviewed changes

fluctlux changed the title ~~[CI] Fix ngram & suffix test OOM~~ [CI] Fix ngram & suffix test oom Dec 5, 2025

github-actions bot added the module:tests label Dec 5, 2025

fluctlux force-pushed the fix-ngram-ci branch from 6e64010 to 44dd0b0 Compare December 5, 2025 15:05

wangxiyuan added ready read for review ready-for-test start test by label for PR labels Dec 6, 2025

wangxiyuan approved these changes Dec 6, 2025

View reviewed changes

fluctlux added 2 commits December 6, 2025 17:23

fix ngram & suffix ci oom issue

943c0b9

Signed-off-by: fluctlux <[email protected]>

enable e2e/test_v1_spec_decode

ad3444f

Signed-off-by: fluctlux <[email protected]>

fluctlux force-pushed the fix-ngram-ci branch from 44ec0db to ad3444f Compare December 6, 2025 09:25

Merge branch 'main' into fix-ngram-ci

1c3debb

wangxiyuan merged commit 9fbcfa3 into vllm-project:main Dec 8, 2025
13 of 15 checks passed

fluctlux deleted the fix-ngram-ci branch December 8, 2025 01:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI] Fix ngram & suffix test oom #4755

[CI] Fix ngram & suffix test oom #4755

Uh oh!

fluctlux commented Dec 5, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 5, 2025

Uh oh!

github-actions bot commented Dec 5, 2025

Uh oh!

wangxiyuan left a comment

Uh oh!

Uh oh!

wangxiyuan commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		from tests.e2e.conftest import VllmRunner

		os.environ["VLLM_WORKER_MULTIPROC_METHOD"] = "spawn"

[CI] Fix ngram & suffix test oom #4755

[CI] Fix ngram & suffix test oom #4755

Uh oh!

Conversation

fluctlux commented Dec 5, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 5, 2025

Uh oh!

wangxiyuan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wangxiyuan commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fluctlux commented Dec 5, 2025 •

edited by github-actions bot

Loading