[None][fix] Enhancing code robustness and adding boundary checks for ITensor #8855

Fan-Yunfan · 2025-11-01T07:58:27Z

Problem

cpp/include/tensorrt_llm/runtime/iTensor.h

In the getDimension() method, the static_assert reads the static constant MAX_DIMS through the runtime object "shape". This compiles because MAX_DIMS is a static constexpr member whose value does not depend on any instance, but it can confuse readers: why can "shape", clearly a runtime object, appear in a compile-time check?

    using Shape = nvinfer1::Dims;
    ......
    [[nodiscard]] virtual Shape const& getShape() const = 0;
    ......
    template <SizeType32 n>
    [[nodiscard]] DimType64 getDimension() const
    {
        auto const shape = getShape();
        static_assert(n < shape.MAX_DIMS && n >= -shape.MAX_DIMS,
            "Trying to access the dimension of a tensor, when its maximal shape cannot have that dimension.");
        if constexpr (n < 0)
        {
            return shape.d[shape.nbDims + n];
        }
        else
        {
            return shape.d[n];
        }
    }

TensorRT/include/NvInferRuntimeBase.h

class Dims64
{
public:
    //! The maximum rank (number of dimensions) supported for a tensor.
    static constexpr int32_t MAX_DIMS{8};

    //! The rank (number of dimensions).
    int32_t nbDims;

    //! The extent of each dimension.
    int64_t d[MAX_DIMS];
};

//!
//! Alias for Dims64.
//!
using Dims = Dims64;

In the volumeNonNegative() method, "vol" has type std::int64_t but is converted to std::size_t on return. On 32-bit platforms, where std::size_t is 32 bits wide, this conversion silently truncates any volume larger than std::numeric_limits<std::size_t>::max().

static std::int64_t volume(Shape const& dims)
{
    {
        return dims.nbDims < 0 ? -1
            : dims.nbDims == 0
            ? 0
            : std::accumulate(dims.d, dims.d + dims.nbDims, std::int64_t{1}, std::multiplies<>{});
    }
}
......
static std::size_t volumeNonNegative(Shape const& shape)
{
    auto const vol = volume(shape);
    TLLM_CHECK_WITH_INFO(0 <= vol, "Invalid tensor shape");
    return static_cast<std::size_t>(vol);
}
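
To make the truncation concrete, here is a minimal sketch. It assumes a hypothetical stand-in type `DemoDims` in place of nvinfer1::Dims, and it models a 32-bit std::size_t with std::uint32_t so the effect is visible on a 64-bit build as well. The names `DemoDims`, `demoVolume` and `narrowUnchecked` are illustrative and do not exist in TensorRT-LLM.

```cpp
#include <cstdint>
#include <functional>
#include <numeric>

// Hypothetical stand-in for nvinfer1::Dims, reduced to the fields used here.
struct DemoDims
{
    static constexpr std::int32_t MAX_DIMS{8};
    std::int32_t nbDims;
    std::int64_t d[MAX_DIMS];
};

// Same arithmetic as volume() above: the product of the extents as a signed 64-bit value.
inline std::int64_t demoVolume(DemoDims const& dims)
{
    return dims.nbDims < 0 ? -1
        : dims.nbDims == 0
        ? 0
        : std::accumulate(dims.d, dims.d + dims.nbDims, std::int64_t{1}, std::multiplies<>{});
}

// Models the unchecked static_cast<std::size_t>(vol) on a platform with a 32-bit size_t.
inline std::uint32_t narrowUnchecked(std::int64_t vol)
{
    return static_cast<std::uint32_t>(vol); // keeps only the low 32 bits
}
```

With a shape of {50000, 100000}, `demoVolume` yields 5000000000, while `narrowUnchecked` reduces it modulo 2^32 to 705032704. The result is a plausible-looking but wrong element count, and no error is raised.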

Solution

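Replace static member access through an object instance (which looks like a runtime access, although the language treats it as a constant expression because the value does not depend on the instance) with access through the class type scope, which is unambiguously compile-time.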

template <SizeType32 n>
[[nodiscard]] DimType64 getDimension() const
{
    auto const shape = getShape();
    static_assert(n < Shape::MAX_DIMS && n >= -Shape::MAX_DIMS,
        "Trying to access the dimension of a tensor, when its maximal shape cannot have that dimension.");
    ......
}
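
The two spellings can be compared side by side in a small sketch. It uses a hypothetical `DemoDims` type standing in for nvinfer1::Dims and a free function `dimensionOf` in place of the ITensor member, and it assumes C++17 for `if constexpr`. Both static_asserts are accepted because the value of a static constexpr member does not depend on the object used to name it.

```cpp
#include <cstdint>

// Hypothetical stand-in for nvinfer1::Dims; only the members used below.
struct DemoDims
{
    static constexpr std::int32_t MAX_DIMS{8};
    std::int32_t nbDims;
    std::int64_t d[MAX_DIMS];
};

// Hypothetical free-function version of getDimension<n>().
template <std::int32_t n>
std::int64_t dimensionOf(DemoDims const& source)
{
    auto const shape = source; // local copy, like `auto const shape = getShape();`

    // Instance form: compiles, but reads as if it depended on the runtime object.
    static_assert(n < shape.MAX_DIMS && n >= -shape.MAX_DIMS, "dimension index out of range");
    // Class-scope form: same value, visibly independent of any instance.
    static_assert(n < DemoDims::MAX_DIMS && n >= -DemoDims::MAX_DIMS, "dimension index out of range");

    if constexpr (n < 0)
    {
        return shape.d[shape.nbDims + n]; // negative n counts from the last dimension
    }
    else
    {
        return shape.d[n];
    }
}
```

For a shape {2, 3, 4}, `dimensionOf<0>` returns 2 and `dimensionOf<-1>` returns 4, while `dimensionOf<8>` fails to compile under either spelling. The change is therefore about readability, not behavior.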

Add boundary checks for 32-bit platforms

if constexpr (sizeof(std::size_t) == 4)
{
    // Only needed when std::size_t is 32 bits wide and cannot hold every non-negative std::int64_t.
    TLLM_CHECK_WITH_INFO(vol <= static_cast<std::int64_t>(std::numeric_limits<std::size_t>::max()),
        "Tensor volume exceeds 32-bit size_t maximum capacity.");
}
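
The same idea can be sketched as a standalone helper that can be exercised on any platform. `checkedVolumeCast` is a hypothetical name, not a TensorRT-LLM function, and it throws standard exceptions where the real code would use TLLM_CHECK_WITH_INFO. The target type is a template parameter so that a 32-bit std::size_t can be modelled with std::uint32_t.

```cpp
#include <cstdint>
#include <limits>
#include <stdexcept>
#include <type_traits>

// Hypothetical helper: converts a signed 64-bit volume to an unsigned target type,
// throwing instead of silently truncating.
template <typename Target>
Target checkedVolumeCast(std::int64_t vol)
{
    static_assert(std::is_unsigned_v<Target>, "Target must be an unsigned integer type");
    if (vol < 0)
    {
        throw std::invalid_argument("Invalid tensor shape");
    }
    // Compare in the unsigned 64-bit domain so the limit itself is never narrowed.
    if (static_cast<std::uint64_t>(vol) > std::numeric_limits<Target>::max())
    {
        throw std::overflow_error("Tensor volume exceeds the target type's maximum");
    }
    return static_cast<Target>(vol);
}
```

Comparing as std::uint64_t is a design choice of this sketch. Casting the maximum of a 64-bit std::size_t to std::int64_t would typically produce -1, which is why the check above is guarded by `sizeof(std::size_t) == 4`. The unsigned comparison needs no such guard.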

Summary by CodeRabbit

Bug Fixes
- Improved handling of edge cases in tensor dimension calculations to prevent overflow on 32-bit platforms.
- Enhanced error detection for out-of-range values to ensure more robust operation.

…rsion Signed-off-by: fanyunfan <[email protected]>

coderabbitai · 2025-11-01T08:02:05Z

📝 Walkthrough

Updates ITensor interface in a single header file: fixes static assertion reference to Shape::MAX_DIMS and adds 32-bit size_t overflow validation in the volumeNonNegative method.

Changes

Cohort / File(s)	Summary
ITensor interface updates `cpp/include/tensorrt_llm/runtime/iTensor.h`	Fixed static_assert reference in getDimension method from `shape.MAX_DIMS` to `Shape::MAX_DIMS`; added 32-bit size_t overflow check in volumeNonNegative method to validate volume fits within std::size_t.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

- Verify static_assert reference correction is syntactically valid and semantically correct
- Confirm overflow check logic is appropriate for 32-bit platform compatibility

Pre-merge checks and finishing touches

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Description Check	⚠️ Warning	The pull request description provides detailed problem and solution information, but it significantly deviates from the required template structure. The author provided "Problem" and "Solution" sections with technical details, but failed to include the required "Description" section, completely omitted the "Test Coverage" section (which should explain how the changes are tested), and did not include the PR Checklist confirmation. While the technical content is substantive and on-topic, the missing critical template sections—particularly test coverage information and the required checklist confirmation—represent a major gap in following the repository's contribution guidelines.	The author should restructure the description to follow the template: add a clear "Description" section summarizing the issue and solution, add a "Test Coverage" section documenting what tests safeguard these changes (especially tests for the 32-bit overflow check and the static_assert change), and finally review and confirm the PR Checklist items by checking the final confirmation box. This will ensure the PR meets the repository's documentation standards and provides reviewers with complete information about test coverage.
Docstring Coverage	⚠️ Warning	Docstring coverage is 33.33% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Title Check	✅ Passed	The pull request title "[None][fix] Enhancing code robustness and adding boundary checks for ITensor" follows the required template format with the ticket identifier [None] and type [fix], and it clearly summarizes both main changes in the pull request: the static_assert update for clarity and the addition of 32-bit overflow boundary checks. The title is concise, specific, and avoids vague phrasing, accurately reflecting the changes to the ITensor header file.

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

cpp/include/tensorrt_llm/runtime/iTensor.h (1)
2-2: Update copyright year to include 2025.

The file is being modified in 2025, so the copyright year range should be updated.

Apply this diff:
- * Copyright (c) 2022-2024, NVIDIA CORPORATION.  All rights reserved.
+ * Copyright (c) 2022-2025, NVIDIA CORPORATION.  All rights reserved.
As per coding guidelines.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d798d66 and fbc88cc.

📒 Files selected for processing (1)

cpp/include/tensorrt_llm/runtime/iTensor.h (2 hunks)

🧰 Additional context used

📓 Path-based instructions (7)

**/*.{h,hpp,hh,hxx,cpp,cxx,cc,cu,cuh}