Avoid certain panics in decoding. #83

partim · 2025-04-11T12:20:24Z

This PR makes most of the decoding code panic free.

Specifically, it removes the use of slice indexing as well as unwrap and expect on options and results.

It does not do this for encoding, which requires some redesign of the traits, or for the string types, which will be handled in separate PRs to keep this PR easier to review.

The PR introduces a breaking change in the decode::Source trait by changing how invalid indexes are treated. The bytes method now returns an optional Bytes, returning None if the indexes are invalid. If the advance method is called with an invalid length, the source is expected to go into some error state. Finally, the error type of take_opt_u8 was changed to allow an explicit error.
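The changed contract could look roughly like the sketch below. This is a simplified illustration, not the crate's actual `decode::Source` trait: the names `SketchSource`, `SliceSource` and `is_failed`, the method signatures, and the use of `Vec<u8>` in place of `Bytes` are all assumptions made to keep the example free of dependencies.

```rust
/// Hypothetical, simplified stand-in for a decoding source.
trait SketchSource {
    /// Returns the octets in `start..end`, or `None` if the range is invalid.
    fn bytes(&self, start: usize, end: usize) -> Option<Vec<u8>>;

    /// Skips `len` octets. An invalid length puts the source into an
    /// error state instead of panicking.
    fn advance(&mut self, len: usize);

    /// Whether an earlier call left the source in an error state.
    fn is_failed(&self) -> bool;
}

/// Hypothetical source backed by an in-memory slice.
struct SliceSource<'a> {
    data: &'a [u8],
    failed: bool,
}

impl<'a> SliceSource<'a> {
    fn new(data: &'a [u8]) -> Self {
        SliceSource { data, failed: false }
    }
}

impl<'a> SketchSource for SliceSource<'a> {
    fn bytes(&self, start: usize, end: usize) -> Option<Vec<u8>> {
        // `get` returns `None` for out-of-range or inverted ranges.
        self.data.get(start..end).map(|s| s.to_vec())
    }

    fn advance(&mut self, len: usize) {
        let data: &'a [u8] = self.data;
        match data.get(len..) {
            Some(rest) => self.data = rest,
            None => {
                // Invalid length: remember the failure, keep no data.
                self.data = &[];
                self.failed = true;
            }
        }
    }

    fn is_failed(&self) -> bool {
        self.failed
    }
}
```

How the real trait reports the error state after a bad `advance` is not described here, so the `is_failed` flag is only one possible choice.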

This PR raises the minimum supported Rust version to 1.74.

bal-e

Looks good, @partim, but I'd suggest taking a slightly different approach to failing silently when unexpected conditions occur. I'd at least suggest sprinkling debug_assert!() all over the place -- it's the same tool one would use when writing unsafe code, which can fail worse than silently.

bal-e · 2025-04-14T09:13:19Z

src/decode/content.rs

        if self.source.request(remaining)? < remaining {
-            Err(self.source.content_err("unexpected end of data"))
-        }
-        else {
-            Ok(&self.source.slice()[..remaining])
+            return Err(self.source.content_err("unexpected end of data"))
        }
+        self.source.slice().get(..remaining).ok_or_else(|| {
+            self.source.content_err("unexpected end of data")
+        })


At a cursory glance, the checked get makes the initial check redundant, right? You should be able to simply omit it -- if not enough data is retrieved, the get will fail.
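A minimal sketch of that point on a plain slice, assuming nothing about the real `Source`: `take_prefix` and `ShortData` are hypothetical names. It only shows that the checked `get` already covers the "too short" case. In the actual code the call to `request` presumably still has to happen so that the data is available; only the comparison against `remaining` would go away.

```rust
/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
struct ShortData;

/// Returns the first `len` octets of `data`, or an error if `data` is shorter.
fn take_prefix(data: &[u8], len: usize) -> Result<&[u8], ShortData> {
    // No explicit `data.len() < len` check: `get` already returns `None`
    // when the range reaches past the end of the slice.
    data.get(..len).ok_or(ShortData)
}
```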

bal-e · 2025-04-14T09:14:47Z

src/decode/content.rs

-        self.source.limit().unwrap()
+        // The source is guaranteed to have a limit by the code creating
+        // the primitive, so we can just return 0 here if it isn’t. This
+        // isn’t ideal and we would rater guarantee things through types,


typo: "rater" -> "rather"

bal-e · 2025-04-14T09:16:56Z

src/decode/source.rs

    pos: usize,

    /// The offset for the reported position.
    ///
    /// This is the value reported by `Source::pos` when `self.pos` is zero.
+    /// This is 0, unles the value was created with `Self::with_offset`.


typo: "unles" -> "unless"

bal-e · 2025-04-14T12:36:11Z

src/tag.rs

+                data[i] = source.slice().get(i).copied().ok_or_else(|| {
+                    source.content_err("source returned short slice")
+                })?;


Given how much you need to check that the Source is working correctly, perhaps it would be worthwhile to change its API to be less fallible.

partim

I have a plan to rewrite both Tag and Length in a follow-up PR to be more robust. Should have mentioned that in the PR comment but forgot.

bal-e · 2025-04-14T12:40:19Z

src/string/octet.rs

-        // Unwrapping here is okay. The only error that can happen is that
-        // the tag is longer that we support. However, we already checked that
-        // there’s only OctetString or End of Value tags which we _do_
-        // support.
+        // Taking the tag and length shouldn’t fail since we checked already,
+        // so just returning `None` should be fine.


Does this mean that a bug here would lead to a silent/hidden failure rather than a panic? Maybe it would be good to leave a debug_assert!() or similar here, so that you have some avenue of detecting these bugs. That might apply to the whole PR.

partim

I am quite unhappy with all the string type handling, so I’ll probably do a rewrite here as well.
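The suggested pattern, sketched on the limit fallback from src/decode/content.rs: check the invariant with `debug_assert!` and keep the graceful fallback for release builds. `limit_or_zero` is a hypothetical helper, not code from the PR.

```rust
/// Hypothetical helper: returns the limit, or 0 if the invariant
/// "a limit is always set" has been violated.
fn limit_or_zero(limit: Option<usize>) -> usize {
    // Panics in debug builds so the bug is noticed during testing;
    // compiled out in release builds, where the fallback below applies.
    debug_assert!(limit.is_some(), "source is expected to have a limit");
    limit.unwrap_or(0)
}
```

With this shape, a violated invariant fails loudly under `cargo test` and in debug builds, while a release build still avoids the panic.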

bal-e · 2025-04-14T12:41:43Z

src/int.rs

+        assert_eq!(make([0]).as_slice(), b"\0");
+        assert_eq!(make([0, 0]).as_slice(), b"\0");
+
+        assert_eq!(make([0x10]).as_slice(), b"\x10");
+        assert_eq!(make([0, 0x10]).as_slice(), b"\x10");
+        assert_eq!(make([0, 0, 0x10]).as_slice(), b"\x10");
+
+        assert_eq!(make([0x10, 0xF0]).as_slice(), b"\x10\xF0");
+        assert_eq!(make([0, 0x10, 0xF0]).as_slice(), b"\x10\xF0");
+        assert_eq!(make([0, 0, 0x10, 0xF0]).as_slice(), b"\x10\xF0");
+
+        assert_eq!(make([0xF0]).as_slice(), b"\0\xF0");
+        assert_eq!(make([0, 0xF0]).as_slice(), b"\0\xF0");
+        assert_eq!(make([0, 0, 0xF0]).as_slice(), b"\0\xF0");
+
+        assert_eq!(make([0xF0, 0xF0]).as_slice(), b"\0\xF0\xF0");
+        assert_eq!(make([0, 0xF0, 0xF0]).as_slice(), b"\0\xF0\xF0");
+        assert_eq!(make([0, 0, 0xF0, 0xF0]).as_slice(), b"\0\xF0\xF0");


It's probably a lot of work to support, but proptest would be a great way to fuzz the whole library.

partim

There are a bunch of fuzzing tests already, currently implemented manually with cargo fuzz. I’ll have a look at proptest and see if that makes it easier to be more complete.
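To give an idea of what a property-style test could cover here, the sketch below checks two properties by exhaustively enumerating short inputs with the standard library only. It does not use proptest. `normalize_unsigned` is a hypothetical stand-in for `make`: its behaviour is inferred from the expected values in the assertions above (leading zero octets dropped, one zero octet kept in front of a set high bit), not taken from the crate.

```rust
/// Hypothetical stand-in for the `make` helper in the tests above:
/// produces a minimal encoding of an unsigned big-endian integer.
fn normalize_unsigned(input: &[u8]) -> Vec<u8> {
    // Drop leading zero octets, but always keep the last octet.
    let first = input
        .iter()
        .position(|&b| b != 0)
        .unwrap_or(input.len().saturating_sub(1));
    let digits = input.get(first..).unwrap_or(&[]);
    let mut out = Vec::with_capacity(digits.len() + 1);
    // A set high bit would read as negative, so prepend a zero octet.
    if digits.first().is_some_and(|&b| (b & 0x80) != 0) {
        out.push(0);
    }
    out.extend_from_slice(digits);
    out
}

/// Interprets up to eight big-endian octets as an unsigned value.
fn value_of(octets: &[u8]) -> u64 {
    octets.iter().fold(0u64, |acc, &b| (acc << 8) | u64::from(b))
}

/// Checks two properties for every input of one or two octets:
/// the value is preserved and the output is minimal.
fn check_all_short_inputs() -> bool {
    for len in 1..=2usize {
        for n in 0..(1u32 << (8 * len)) {
            let bytes = n.to_be_bytes();
            let input = match bytes.get((4 - len)..) {
                Some(input) => input,
                None => return false,
            };
            let out = normalize_unsigned(input);
            let same_value = value_of(&out) == value_of(input);
            let minimal = match out.as_slice() {
                [] => false,
                // A leading zero is only allowed in front of a set high bit.
                [0, next, ..] => (*next & 0x80) != 0,
                // Otherwise the first octet must not have its high bit set.
                [first, ..] => (*first & 0x80) == 0,
            };
            if !(same_value && minimal) {
                return false;
            }
        }
    }
    true
}
```

A proptest version would state the same two properties and let the framework generate longer and more varied inputs.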

partim · 2025-04-14T13:01:50Z

I am now considering changing the whole thing more drastically. I want to get rid of the built-in support for Bytes, so I think we can merge Source::reserve and Source::slice into one method that gives you exactly as many octets as you ask for or errors out. That would leave Source::advance which we could get rid of by having this new method not return a blank slice but a newtype that you then have to either consume or return an error. We’d have to try the latter to see if it actually works in practice.

But it would avoid the need for debug assertions.
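One possible shape of that idea, as a rough sketch only. `ExactSource`, `Chunk`, `OutOfData`, `take`, `octets` and `consume` are hypothetical names, and the source is reduced to an in-memory slice. Rust has no linear types, so the compiler cannot force a call to `consume`; the sketch only makes the chunk hard to ignore via `#[must_use]` and blocks any other use of the source while the chunk is alive.

```rust
/// Hypothetical error type for this sketch.
#[derive(Debug)]
struct OutOfData;

/// Hypothetical source over an in-memory slice.
struct ExactSource<'a> {
    data: &'a [u8],
}

/// Exactly the requested number of octets, tied to the source.
/// While this value exists the source is mutably borrowed, so the caller
/// has to consume the chunk or drop it (for example by returning an error).
#[must_use = "call `consume` to advance the source"]
struct Chunk<'s, 'a> {
    source: &'s mut ExactSource<'a>,
    len: usize,
}

impl<'a> ExactSource<'a> {
    /// Replaces separate "request" and "slice" steps: hands out exactly
    /// `len` octets or fails.
    fn take(&mut self, len: usize) -> Result<Chunk<'_, 'a>, OutOfData> {
        if self.data.len() < len {
            return Err(OutOfData);
        }
        Ok(Chunk { source: self, len })
    }
}

impl<'s, 'a> Chunk<'s, 'a> {
    /// The octets covered by this chunk.
    fn octets(&self) -> &'a [u8] {
        let data: &'a [u8] = self.source.data;
        // The fallback is never used: `take` checked the length.
        data.get(..self.len).unwrap_or(&[])
    }

    /// Advances the source past the octets and releases the borrow.
    fn consume(self) {
        let Chunk { source, len } = self;
        let data: &'a [u8] = source.data;
        source.data = data.get(len..).unwrap_or(&[]);
    }
}
```

Dropping a `Chunk` without calling `consume` leaves the source where it was, which is the case a real design would have to decide how to treat.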

bal-e · 2025-04-14T13:10:02Z

The change to Source sounds good. I'm curious to see how the newtype approach works out -- it can be quite tricky.

partim added 4 commits April 11, 2025 14:12

Avoid panics in decoding.

2b610ca

Fix documentation indentation.

e6e96f4

Use new io::Error::other.

f9fee2b

Only run Clippy on stable.

7a9d50d

bal-e approved these changes Apr 14, 2025
