tests: port upstream kernel tests and adapt test infrastructure for MACA compat by Dayuxiaoshui · Pull Request #255 · MetaX-MACA/vLLM-metax

Dayuxiaoshui · 2026-05-05T11:27:24Z

Port missing upstream vLLM kernel tests to tests/kernels/core/:
- test_apply_rotary_emb.py
- test_cpu_activation.py
- test_fused_qk_norm_rope.py
- test_fused_rms_norm_gated.py
- test_fused_silu_mul_block_quant.py
- test_fused_q_kv_rmsnorm.py
- test_minimax_reduce_rms.py
- test_rotary_embedding_mla_cache_fused.py
- test_vit_bilinear_pos_embed.py
- test_vit_fp8_attn.py
- test_vit_fp8_quant.py
- test_vit_fp8_scaling.py
Adapt test infrastructure for upstream API changes:
- tests/conftest.py: add default_vllm_config fixture; compat imports for ModelDType, RunnerOption, ConvertOption, Logprob, InputContext, etc.; inject infer_schema monkey-patch for PyTorch 2.6+ list[int] compat
- tests/utils.py: compat imports for FlexibleArgumentParser, GB_bytes, cuda_device_count_stateless, get_open_port; add ensure_current_vllm_config
- tests/kernels/utils.py: compat imports for AttentionBackend, AttentionMetadata, AttentionType; fallback constants for _Backend, STR_BACKEND_ENV_VAR, etc.
- tests/kernels/quant_utils.py: compat import for round_up
- tests/models/utils.py: compat imports for InputContext, Logprob
- tests/models/registry.py: compat imports for ModelDType, TokenizerMode

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

…ACA compat 1. Port missing upstream vLLM kernel tests to tests/kernels/core/: - test_apply_rotary_emb.py - test_cpu_activation.py - test_fused_qk_norm_rope.py - test_fused_rms_norm_gated.py - test_fused_silu_mul_block_quant.py - test_fused_q_kv_rmsnorm.py - test_minimax_reduce_rms.py - test_rotary_embedding_mla_cache_fused.py - test_vit_bilinear_pos_embed.py - test_vit_fp8_attn.py - test_vit_fp8_quant.py - test_vit_fp8_scaling.py 2. Adapt test infrastructure for upstream API changes: - tests/conftest.py: add default_vllm_config fixture; compat imports for ModelDType, RunnerOption, ConvertOption, Logprob, InputContext, etc.; inject infer_schema monkey-patch for PyTorch 2.6+ list[int] compat - tests/utils.py: compat imports for FlexibleArgumentParser, GB_bytes, cuda_device_count_stateless, get_open_port; add ensure_current_vllm_config - tests/kernels/utils.py: compat imports for AttentionBackend, AttentionMetadata, AttentionType; fallback constants for _Backend, STR_BACKEND_ENV_VAR, etc. - tests/kernels/quant_utils.py: compat import for round_up - tests/models/utils.py: compat imports for InputContext, Logprob - tests/models/registry.py: compat imports for ModelDType, TokenizerMode

Dayuxiaoshui · 2026-05-05T11:27:47Z

cc @ILikeIneine

gemini-code-assist

Code Review

This pull request introduces a comprehensive set of new kernel tests covering CPU activations, rotary embeddings, fused RMS norms, block quantization, and Vision Transformer (ViT) specific kernels like bilinear position embedding and FP8 attention. It also updates test configurations and utilities with compatibility shims and robust import handling to ensure the test suite works across different vLLM versions. A critical security vulnerability was noted in tests/conftest.py due to the inclusion of /tmp in the system path, which could lead to arbitrary code execution.

gemini-code-assist · 2026-05-05T11:29:51Z

+import sys
+sys.path.insert(0, '/tmp')
+import patch_infer_schema


Modifying sys.path to include a world-writable directory like /tmp is a critical security vulnerability. This allows for arbitrary code execution if a malicious patch_infer_schema.py file is placed in /tmp. The test suite would then import and execute this malicious code, potentially compromising the execution environment.

Instead of modifying sys.path with a hardcoded, insecure path, consider placing the patch_infer_schema.py file within the test directory structure and importing it using a relative path. For example, if it's a test utility, it could live in a tests/utils directory.

ILikeIneine · 2026-05-06T02:27:19Z

@Dayuxiaoshui Have you ever run pytest on metax backend devices?

Dayuxiaoshui · 2026-05-06T08:20:39Z

@ILikeIneine We have run pytest on the MetaX C500 backend. The test environment was MACA SDK 3.3.0.2 with PyTorch 2.6.0+metax3.3.0.2, using upstream vLLM source via PYTHONPATH. Among the 12 ported kernel test files, 8 execute normally (pass/skip). Two files have substantive failures: test_fused_silu_mul_block_quant.py fails 178 cases due to numerical mismatch in the silu_and_mul_per_block_quant kernel output against the CPU reference, and test_rotary_embedding_mla_cache_fused.py fails 322 cases because the concat_and_cache_mla_rope_fused op is not registered in the current MACA environment. The remaining 4 files are skipped entirely as they depend on upstream modules (e.g., deepseek_v4_ops, qwen3_vl, triton fp8) not available in this environment.

Dayuxiaoshui · 2026-05-07T01:08:23Z

cc @ILikeIneine

ILikeIneine · 2026-05-07T03:25:31Z

What your vllm-metax version? We'd like to re-run it with the latest vllm-metax release v0.18.0 in this docker image. Or following the build guild with the latest v0.20.0-dev.

Dayuxiaoshui · 2026-05-07T03:28:17Z

What your vllm-metax version? We'd like to re-run it with the latest vllm-metax release v0.18.0 in this docker image. Or following the build guild with the latest v0.20.0-dev.

no problem,I will do this work

gemini-code-assist Bot reviewed May 5, 2026

View reviewed changes

ILikeIneine force-pushed the master branch from 3e2816c to 3029d1e Compare June 8, 2026 08:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests: port upstream kernel tests and adapt test infrastructure for MACA compat#255

tests: port upstream kernel tests and adapt test infrastructure for MACA compat#255
Dayuxiaoshui wants to merge 1 commit into
MetaX-MACA:masterfrom
Dayuxiaoshui:master

Dayuxiaoshui commented May 5, 2026

Uh oh!

Dayuxiaoshui commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 5, 2026

Uh oh!

ILikeIneine commented May 6, 2026 •

edited

Loading

Uh oh!

Dayuxiaoshui commented May 6, 2026

Uh oh!

Dayuxiaoshui commented May 7, 2026

Uh oh!

ILikeIneine commented May 7, 2026

Uh oh!

Dayuxiaoshui commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Dayuxiaoshui commented May 5, 2026

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

Dayuxiaoshui commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 5, 2026

Choose a reason for hiding this comment

Uh oh!

ILikeIneine commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Dayuxiaoshui commented May 6, 2026

Uh oh!

Dayuxiaoshui commented May 7, 2026

Uh oh!

ILikeIneine commented May 7, 2026

Uh oh!

Dayuxiaoshui commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ILikeIneine commented May 6, 2026 •

edited

Loading