Fix segmentation fault and calculation error in AveragePool2dKernel #2091

yucai-intel · 2025-09-22T05:47:14Z

Fixed the following issues found by test/test_nn.py::TestNNDeviceTypeXPU::test_avg_pool_large_tensor2_xpu

A segmentation fault caused by a data type conversion error that invalidated the memory address.
A calculation error caused by data overflow.

yucai-intel · 2025-09-22T05:48:33Z

Copilot

Pull Request Overview

This PR fixes critical issues in the AveragePool2dKernel implementation for XPU devices, specifically addressing a segmentation fault and calculation error that were causing test failures.

Replaced direct index access with XPU_KERNEL_LOOP macros for safer kernel iteration
Changed data types from index_t to int64_t for better overflow handling
Added group_size limit to prevent exceeding hardware capabilities

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-09-23T06:33:16Z

src/ATen/native/xpu/sycl/AveragePool2dKernels.cpp

+      const int64_t height,
+      const int64_t width,
+      const int64_t pooled_height,
+      const int pooled_width,


Inconsistent data type usage: pooled_width uses int while other dimension parameters use int64_t. This inconsistency could lead to overflow issues or unexpected behavior when dealing with large tensors. Consider changing to const int64_t pooled_width for consistency.

Copilot · 2025-09-23T06:33:17Z

src/ATen/native/xpu/sycl/AveragePool2dKernels.cpp

+  const int64_t height_;
+  const int64_t width_;
+  const int64_t pooled_height_;
+  const int pooled_width_;


Inconsistent data type in member variable: pooled_width_ uses int while other dimension member variables use int64_t. This should be const int64_t pooled_width_ to match the constructor parameter and prevent potential overflow issues.

CuiYifeng

Please check if test_avg_pool_large_tensor2_xpu is activated in CI.
The rest of this PR looks good to me.

CuiYifeng · 2025-09-23T13:30:58Z

src/ATen/native/xpu/sycl/AveragePool2dKernels.cpp

+  const uint32_t group_size =
+      std::min(static_cast<int>(syclMaxWorkItemsPerSubSlice()), 1024);


This hard-code may have a negative impact on some platforms, especially future platforms.

May I know why here need 1024?

guangyey

One nit, otherwise, LGTM.

yucai-intel added 2 commits September 22, 2025 13:39

Update AveragePool2dKernels.cpp

31d5ae8

Merge branch 'main' into yucai/averagepool2d/fix

4932ebb

yucai-intel mentioned this pull request Sep 22, 2025

reduction got accuracy issue on large tensor cases #2008

Open

Update AveragePool2dKernels.cpp

6b6d11e

CuiYifeng requested review from chunhuanMeng and Copilot September 23, 2025 06:32

Copilot AI reviewed Sep 23, 2025

View reviewed changes

chunhuanMeng approved these changes Sep 23, 2025

View reviewed changes

CuiYifeng requested changes Sep 23, 2025

View reviewed changes

CuiYifeng requested a review from guangyey September 24, 2025 08:56

guangyey approved these changes Sep 24, 2025

View reviewed changes

yucai-intel added 2 commits September 26, 2025 09:06

rm 1024

85792d6

Merge branch 'main' into yucai/averagepool2d/fix

e28da29

yucai-intel requested a review from CuiYifeng September 26, 2025 05:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix segmentation fault and calculation error in AveragePool2dKernel #2091

Fix segmentation fault and calculation error in AveragePool2dKernel #2091

yucai-intel commented Sep 22, 2025

Uh oh!

yucai-intel commented Sep 22, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Sep 23, 2025

Uh oh!

Copilot AI Sep 23, 2025

Uh oh!

CuiYifeng left a comment

Uh oh!

CuiYifeng Sep 23, 2025 •

edited

Loading

Uh oh!

guangyey Sep 24, 2025

Uh oh!

yucai-intel Sep 26, 2025

Uh oh!

guangyey left a comment

Uh oh!

Uh oh!

		const uint32_t group_size =
		std::min(static_cast<int>(syclMaxWorkItemsPerSubSlice()), 1024);

Fix segmentation fault and calculation error in AveragePool2dKernel #2091

Are you sure you want to change the base?

Fix segmentation fault and calculation error in AveragePool2dKernel #2091

Conversation

yucai-intel commented Sep 22, 2025

Uh oh!

yucai-intel commented Sep 22, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

CuiYifeng left a comment

Choose a reason for hiding this comment

Uh oh!

CuiYifeng Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

guangyey Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

yucai-intel Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

guangyey left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CuiYifeng Sep 23, 2025 •

edited

Loading