[AscendNPU-IR][A5] A5 support for floorOp and floorDivOp by Ismayil06 · Pull Request #950 · tile-ai/tilelang-ascend

Ismayil06 · 2026-04-30T01:31:17Z

Created new npuir_floor and npuir_floordiv operations. Added a new VfloordivCodegen and added npuir_floor to UnaryVecOpCodegen.

github-actions · 2026-04-30T01:31:27Z

👋 Hi! Thank you for contributing to the TileLang project.

Please remember to run bash format.sh in the root directory of the project to ensure your changes are properly linted and formatted. This will help ensure your contribution passes the format check.

We appreciate you taking this step! Our team will review your contribution, and we look forward to your awesome work!

🚀

gemini-code-assist

Code Review

This pull request introduces support for floor and floordiv operations in the NPU IR backend, encompassing operator definitions, codegen implementations, and corresponding unit tests. The review identified several critical issues: the vector codegen for floor division lacks necessary broadcasting and sliced insertion logic, and the expression visitation for floating-point floor division incorrectly updates the expression map with intermediate results. Additionally, the integer floor division path still fails to match Python semantics for negative numbers, a test fixture name mismatch will cause execution failures, and docstrings in the Python wrapper need to be moved inside the function bodies.

gemini-code-assist · 2026-04-30T01:33:54Z

+void CodeGenTileLangNPUIRDEV::VfloordivCodegen(const CallNode *op) {
+  tvm::tl::NpuirFloorDiv npuirop(op->args, this->vmap);
+  auto loc = builder.getUnknownLoc();
+
+  Value src0 = GenExtractSliceFromRegion(npuirop.src0, npuirop.src0_range);
+  Value src1 = GenExtractSliceFromRegion(npuirop.src1, npuirop.src1_range);
+
+  auto srcType = getElementTypeOrSelf(src0.getType());
+  Value result;
+
+  if (srcType.isa<FloatType>()) {
+    auto divResult = builder.create<mlir::arith::DivFOp>(loc, src0, src1);
+    result = builder.create<mlir::math::FloorOp>(loc, divResult);
+  } else {
+    result = builder.create<mlir::arith::DivSIOp>(loc, src0, src1);
+  }
+
+  SetVarValue(npuirop.dst, result);
+}


The implementation of VfloordivCodegen is missing critical logic present in other vector operations:

Broadcasting: MLIR arithmetic operations require operands to have matching shapes. If src0 and src1 have different shapes (e.g., one operand has a dimension that must be broadcast to match the other), the op creation will fail. You should use broadcastOrTranspose to align operand shapes.

Sliced Insertion: SetVarValue(npuirop.dst, result) overwrites the entire buffer variable mapping. If the operation is performed on a slice (as indicated by dst_range), it must use ReshapeCastAndInsertSlice to update only the relevant portion of the tensor and maintain SSA consistency.
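The two gaps described above can be illustrated outside MLIR. The following is a minimal pure-Python sketch, not the backend's code: `broadcast_rows` and `floordiv_into_slice` are hypothetical helpers that only stand in for the roles the review assigns to broadcastOrTranspose and ReshapeCastAndInsertSlice, whose real signatures are not shown in this thread.

```python
import math


def broadcast_rows(tile, rows):
    """Repeat a single-row tile so it has `rows` rows (hypothetical shape alignment)."""
    if len(tile) == rows:
        return tile
    if len(tile) == 1:
        return [list(tile[0]) for _ in range(rows)]
    raise ValueError("shapes are not broadcast-compatible")


def floordiv_into_slice(dst, row_start, src0, src1):
    """Return a copy of dst where only the target rows hold floor(src0 / src1)."""
    rows = max(len(src0), len(src1))
    a = broadcast_rows(src0, rows)
    b = broadcast_rows(src1, rows)
    result = [
        [math.floor(x / y) for x, y in zip(row_a, row_b)]
        for row_a, row_b in zip(a, b)
    ]
    # Insert into the slice only; rows outside the slice keep their old values.
    updated = [list(row) for row in dst]
    updated[row_start:row_start + rows] = result
    return updated


dst = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
src0 = [[7.0, -7.0], [9.0, 4.0]]
src1 = [[2.0, 2.0]]  # one row, broadcast across both rows of src0
out = floordiv_into_slice(dst, 1, src0, src1)
```

Here row 0 of `dst` is left untouched, which is the property lost when the whole destination mapping is overwritten with the result of a sliced operation.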

gemini-code-assist · 2026-04-30T01:33:54Z

+    auto divResult = BinaryOpCodegen<mlir::arith::DivFOp, std::nullptr_t>(op, nullptr,
                                                                   lhs, rhs);
+    mlirVal = builder.create<mlir::math::FloorOp>(builder.getUnknownLoc(), divResult);


There is a correctness issue with how prim_expr_map is updated here. BinaryOpCodegen internally calls UpdatePrimExprMap(op, mlirVal), associating the FloorDivNode (op) with the intermediate arith::DivFOp result. Subsequent calls to MakeValue for the same expression will return the un-floored division result from the map. You should manually check the map, compute the result, and then update the map with the final mlirVal.

} else if (op->dtype.is_float()) {
  auto result = CheckPrimExprMap(op);
  if (result.first) return result.second;
  auto divResult = builder.create<mlir::arith::DivFOp>(builder.getUnknownLoc(), lhs, rhs);
  mlirVal = builder.create<mlir::math::FloorOp>(builder.getUnknownLoc(), divResult);
  UpdatePrimExprMap(op, mlirVal);
}
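The caching hazard in this comment can be reproduced with a small stand-alone model. This is a sketch under assumptions: `ExprLowering`, `binary_op`, and the dictionary `expr_map` are hypothetical stand-ins for the codegen class, BinaryOpCodegen, and prim_expr_map, and they model only the "helper caches its own result under the caller's key" behavior described by the reviewer.

```python
import math


class ExprLowering:
    """Toy model of an expression lowering pass with a memo table."""

    def __init__(self):
        self.expr_map = {}

    def binary_op(self, key, fn, lhs, rhs):
        # Like the helper in the review, this caches its result under `key`.
        value = fn(lhs, rhs)
        self.expr_map[key] = value
        return value

    def floordiv_buggy(self, key, lhs, rhs):
        if key in self.expr_map:
            return self.expr_map[key]
        div = self.binary_op(key, lambda a, b: a / b, lhs, rhs)
        # The floored value is returned but never stored in the map.
        return math.floor(div)

    def floordiv_fixed(self, key, lhs, rhs):
        if key in self.expr_map:
            return self.expr_map[key]
        value = math.floor(lhs / rhs)
        # Cache only the final value for this expression.
        self.expr_map[key] = value
        return value


buggy = ExprLowering()
first_buggy = buggy.floordiv_buggy("e", 7.0, 2.0)
second_buggy = buggy.floordiv_buggy("e", 7.0, 2.0)  # served from the stale cache

fixed = ExprLowering()
first_fixed = fixed.floordiv_fixed("e", 7.0, 2.0)
second_fixed = fixed.floordiv_fixed("e", 7.0, 2.0)
```

In the buggy variant the second lookup returns the un-floored quotient, which matches the failure mode the reviewer describes for repeated visits of the same FloorDivNode.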

gemini-code-assist · 2026-04-30T01:33:54Z

+        ((1024, 16384), "float32"),
+    ]
+)
+def floordiv_case(request):


The fixture in this file is named floordiv_case, but the test function test_floor_dev on line 59 requests a fixture named floor_case. pytest will therefore fail the test with a "fixture 'floor_case' not found" error.

Suggested change

-def floordiv_case(request):
+def floor_case(request):

gemini-code-assist · 2026-04-30T01:33:54Z

  auto rhs = MakeValue(op->b);
-  // FIXME: The floor div in python is not the same as arith.divsi in negative
-  // scenarios.
  mlir::Value mlirVal;


The removal of this FIXME is premature. mlir::arith::DivSIOp performs truncation towards zero, which does not match Python's floor division semantics for negative integers (e.g., -5 // 2 should be -3, but arith.divsi results in -2). While the float path now correctly uses math::FloorOp, the integer path remains incorrect for negative scenarios.
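The difference between truncating and flooring integer division, and one common way to derive the latter from the former, can be sketched in plain Python. `trunc_div` is a hypothetical model of round-toward-zero signed division (the behavior the review attributes to arith.divsi); the adjustment in `floor_div_from_trunc` is a generic technique, not something taken from this PR. MLIR's arith dialect also defines a floordivsi op, though whether it is usable in this backend is not established in the thread.

```python
def trunc_div(a, b):
    """Signed integer division that rounds toward zero."""
    q = abs(a) // abs(b)
    return q if (a < 0) == (b < 0) else -q


def floor_div_from_trunc(a, b):
    """Floor division built from truncating division plus a correction step."""
    q = trunc_div(a, b)
    r = a - q * b  # remainder carries the sign of the dividend
    # If the division was inexact and the remainder's sign differs from the
    # divisor's, truncation rounded up, so step the quotient down by one.
    if r != 0 and ((r < 0) != (b < 0)):
        q -= 1
    return q


truncated = trunc_div(-5, 2)
floored = floor_div_from_trunc(-5, 2)
```

For the reviewer's example, `truncated` is -2 while `floored` is -3, which agrees with Python's `-5 // 2`.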

gemini-code-assist · 2026-04-30T01:33:54Z

+"""npuir floor at tile-level."""
+def npuir_floor(A, B):
+    return AscendUnaryOp("floor", A, B).buildTirCall()
+"""npuir floordiv at tile-level."""
+def npuir_floordiv(A, B, C):
+    return AscendBinaryOp("floordiv", A, B, C).buildTirCall()


The docstrings for npuir_floor and npuir_floordiv are placed outside the function definitions as standalone string literals. They should be moved inside the functions to be correctly recognized as docstrings.

Suggested change

-"""npuir floor at tile-level."""
-def npuir_floor(A, B):
-    return AscendUnaryOp("floor", A, B).buildTirCall()
-"""npuir floordiv at tile-level."""
-def npuir_floordiv(A, B, C):
-    return AscendBinaryOp("floordiv", A, B, C).buildTirCall()
+def npuir_floor(A, B):
+    """npuir floor at tile-level."""
+    return AscendUnaryOp("floor", A, B).buildTirCall()
+def npuir_floordiv(A, B, C):
+    """npuir floordiv at tile-level."""
+    return AscendBinaryOp("floordiv", A, B, C).buildTirCall()
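The effect of docstring placement is easy to check at runtime. In this small sketch the functions `misplaced` and `proper` are hypothetical stand-ins for the wrappers in the diff, with trivial bodies so the example runs without the TileLang package.

```python
"""floor at tile-level (written above the function)."""
def misplaced(a, b):
    return (a, b)


def proper(a, b):
    """floor at tile-level (written inside the function)."""
    return (a, b)


# A string literal above a def is just an expression statement,
# so the function itself ends up with no docstring.
misplaced_doc = misplaced.__doc__
proper_doc = proper.__doc__
```

Tools such as `help()` read `__doc__`, so only the second form attaches the description to the function.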

gemini-code-assist Bot reviewed Apr 30, 2026

Ismayil06 force-pushed the a5-floordiv branch from b0b5e52 to 6ceb7ca · May 4, 2026 02:12

[AscendNPU-IR][A5] A5 support for floorOp and floorDivOp

6ceb7ca

Ismayil06 wants to merge 1 commit into tile-ai:npuir-dev from Ismayil06:a5-floordiv
