Add aten_convolution_backward function #1707
base: main
Conversation
# if stride[0] != 1:  # dilation
#     dz_height = z_height * stride[0] - stride[0] + 1
#     dz_width = z_width * stride[1] - stride[1] + 1
#     pos = []
#     for j in range(z_height):
#         for i in range(0, dz_width, stride[1]):
#             pos.append(i + j * dz_width * stride[0])
#
#     index_tensor = op.Constant(value_ints=pos)
#     index_tensor = op.Reshape(index_tensor, z_shape)
#     # this should not work because kernel_shape is an attribute
#     dz = op.MaxUnpool(
#         grad_output, index_tensor,
#         kernel_shape=[dz_height - z_height + 1, dz_width - z_width + 1],
#     )
#
# # Computing padding size
Check notice — Code scanning / CodeQL: Commented-out code (Note)
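The commented-out index computation can be checked in isolation. A minimal sketch of that loop as a standalone helper (the function name is hypothetical; it assumes stride is an (h, w) pair, matching the variables in the snippet above):

```python
def dilation_positions(z_height, z_width, stride):
    """Flattened positions of z's elements inside the stride-dilated grid.

    Mirrors the commented-out loop: row j of z lands on row j * stride[0]
    of the dilated grid, whose width is dz_width.
    """
    dz_width = z_width * stride[1] - stride[1] + 1
    pos = []
    for j in range(z_height):
        for i in range(0, dz_width, stride[1]):
            pos.append(i + j * dz_width * stride[0])
    return pos

# For a 2x2 z and stride (2, 2) the dilated grid is 3x3 and the four
# elements land on its corners.
print(dilation_positions(2, 2, (2, 2)))  # → [0, 2, 6, 8]
```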
Codecov Report — Attention: Patch coverage is
Additional details and impacted files:

@@ Coverage Diff @@
##             main    #1707      +/-   ##
==========================================
- Coverage   75.24%   75.23%   -0.01%
==========================================
  Files         242      242
  Lines       25861    25923     +62
  Branches     4660     4671     +11
==========================================
+ Hits        19458    19504     +46
- Misses       5517     5528     +11
- Partials      886      891      +5
Test Results: 26 files −1, 26 suites −1, 2h 27m 9s ⏱️ −56m 19s. For more details on these failures and errors, see this check. Results for commit 1eb33c3. ± Comparison against base commit c57e9e7. This pull request skips 1 test.
Is it possible to add a unit test?
def train_loop(
    model: Any,
    *args,
    loss_fn: Any | None = None,
    optimizer: Any | None = None,
    dump_onnx_models: bool = False,
    dump_prefix: str = "dump_train_loop",
    dump_clean_first: bool = True,
) -> tuple[Any, tuple[Any, ...]] | tuple[Any, tuple[Any, ...], list[str]]:
Check notice — Code scanning / CodeQL: Returning tuples with varying lengths (Note): tuple of size 2, tuple of size 3
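The CodeQL note points at the conditional return shape in train_loop: callers get a 2-tuple normally and a 3-tuple when dumping is enabled, so they must branch on the flag. A hedged sketch of the flagged pattern and one conventional fix (the function bodies and names here are illustrative, not the PR's actual code):

```python
def varying(dump: bool = False):
    """Pattern CodeQL flags: the returned tuple's size depends on a flag."""
    loss, outputs = 0.0, ("y",)
    if dump:
        return loss, outputs, ["dump_train_loop_0.onnx"]  # size-3 tuple
    return loss, outputs                                  # size-2 tuple

def uniform(dump: bool = False):
    """One fix: always return a 3-tuple; the list is empty when unused."""
    loss, outputs = 0.0, ("y",)
    onnx_files = ["dump_train_loop_0.onnx"] if dump else []
    return loss, outputs, onnx_files

print(len(varying()), len(varying(True)))  # → 2 3
print(len(uniform()), len(uniform(True)))  # → 3 3
```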
Added.
class TestBackward(unittest.TestCase):
    @unittest.skipIf(sys.platform == "win32", reason="not supported yet on Windows")
    @unittest.skipIf(not has_transformers(), reason="transformers is missing")
import onnxscript.tools.transformers_models
import onnxscript.tools.transformers_models.llama
I wonder why ruff doesn't warn about the unused imports.
Roadmap:
We need col2im and im2col to finish this job, but ONNX only provides col2im, NOT im2col.

A. Compute dW
Transpose X to [1, 0, 2, 3] and dZ to [1, 0, 2, 3], then apply the common op.Conv to them to get dW; the result also needs to be transposed back to [1, 0, 2, 3].
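For the stride-1, no-padding case, the dW-as-convolution identity in A can be checked numerically with NumPy. This sketch uses a single batch and single channel, where the [1, 0, 2, 3] transposes are the identity; it is an illustration of the idea, not the PR's implementation:

```python
import numpy as np

def conv2d(x, w):
    """Valid cross-correlation of one 2-D map with one 2-D kernel."""
    kh, kw = w.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = (x[i:i + kh, j:j + kw] * w).sum()
    return out

rng = np.random.default_rng(0)
X = rng.standard_normal((5, 5))   # input
W = rng.standard_normal((3, 3))   # kernel
Z = conv2d(X, W)                  # forward output, 3x3
dZ = rng.standard_normal(Z.shape) # upstream gradient

# dW via the trick: convolve the input with the output gradient as kernel.
dW = conv2d(X, dZ)

# Reference gradient accumulated from the forward definition.
dW_ref = np.zeros_like(W)
for i in range(Z.shape[0]):
    for j in range(Z.shape[1]):
        dW_ref += dZ[i, j] * X[i:i + 3, j:j + 3]

assert np.allclose(dW, dW_ref)
```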
B. Compute dX
It is similar but more complicated:
To Do list:
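For reference, in the stride-1, no-padding case part B reduces to a "full" convolution of dZ with the 180°-rotated kernel. A single-channel NumPy sketch under those assumptions (illustrative only; the general stride/padding case is what the roadmap items above address):

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((5, 5))   # input
W = rng.standard_normal((3, 3))   # kernel
dZ = rng.standard_normal((3, 3))  # gradient w.r.t. the 3x3 output

# Pad dZ by (kernel_size - 1) on every side, then cross-correlate with the
# flipped kernel: this is the "full" convolution giving dX.
pad = np.pad(dZ, 2)
Wf = W[::-1, ::-1]
dX = np.zeros_like(X)
for i in range(5):
    for j in range(5):
        dX[i, j] = (pad[i:i + 3, j:j + 3] * Wf).sum()

# Reference: scatter each dZ element back over the window it came from.
dX_ref = np.zeros_like(X)
for i in range(3):
    for j in range(3):
        dX_ref[i:i + 3, j:j + 3] += dZ[i, j] * W

assert np.allclose(dX, dX_ref)
```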