return error result only from Iterator.next() #14

mykmelez · 2018-11-19T22:29:17Z

This is an alternative to #13 that minimizes the number of methods that return an error result down to just the Iterator.next() implementation in Iter by using std::iter::once() to create an iterator that returns an error result when a failure occurs in the Cursor.iter*() methods.

A potential downside of this approach is that it returns an Iterator trait object from the Cursor.iter*() methods, which could have runtime performance implications in theory (although I'm unsure that it does in practice).

(I've previously requested integration of these changes upstream in danburkert#45 and would like to move forward with these changes downstream while awaiting that integration.)

… too

mykmelez · 2018-11-19T22:32:55Z

@ncalexan I've submitted this separately from #13 and requested review from you for both branches because I'd like your input on the tradeoff between the ergonomic benefits of minimizing the number of fallible methods vs. the potential performance (or other) implications of using a trait object in this branch.

(Also happy for any insight you have into another way we could make the Cursor.iter*() methods infallible without using a trait object.)

ncalexan

I didn't review this thoroughly, 'cuz I want to talk through options. It's weird to me that any of these iterators can produce errors. iter_dup_of with an unknown key isn't an error; it should always yield an empty iterator. iter_from shouldn't error if the value isn't found, it should produce an empty iterator. And if these things for reasons I don't understand do error, it should be on the use of the iterator (.next()), not at creation time. (Almost certainly.)

Let's talk about this more soon.

ncalexan · 2018-11-20T20:50:05Z

src/cursor.rs

@@ -57,12 +57,12 @@ pub trait Cursor<'txn> {
    /// For databases with duplicate data items (`DatabaseFlags::DUP_SORT`), the
    /// duplicate data items of each key will be returned before moving on to
    /// the next key.
-    fn iter_from<K>(&mut self, key: K) -> Result<Iter<'txn>> where K: AsRef<[u8]> {
+    fn iter_from<K>(&mut self, key: K) -> Box<Iterator<Item=Result<(&'txn [u8], &'txn [u8])>>> where K: AsRef<[u8]> {


The way to avoid the Box is easy enough -- push the branch down into the underlying Iter struct. That is, make it:

/// An iterator over the values in an LMDB database. pub enum Iter<'txn> { Err(Result::E), Ok(...), }

where the bits that are currently in Iter are in the Ok branch as well.

The way to avoid the Box is easy enough -- push the branch down into the underlying Iter struct.

Ok, I've done this in a558fab.

ncalexan · 2018-11-20T20:52:40Z

src/cursor.rs

        match self.get(Some(key.as_ref()), None, ffi::MDB_SET_RANGE) {
            Ok(_) | Err(Error::NotFound) => (),
-            Err(error) => return Err(error),
+            Err(error) => return Box::new(iter::once(Err(error))),


Here, I see how you will only get an error once (and then the iterator ends). With the other approach (Item = Result<...>), how do you ensure you don't get Err forever? This feels similar to fuse in spirit.

Here, I see how you will only get an error once (and then the iterator ends). With the other approach (Item = Result<...>), how do you ensure you don't get Err forever? This feels similar to fuse in spirit.

I don't currently ensure that; rather, I leave it to the caller to decide what to do on error. I'm not entirely sure if it's possible for LMDB to return a value after returning an error. If so, then consumers should be able to continue iterating in that case.

Otherwise, consumers that call Iterator.next() directly can easily stop iterating at the first error, since they need to match on the return value anyway to distinguish between the Some(Ok), Some(Err) and None types; and collections can easily automatically stop collecting at the first error; so it seems like consumers wouldn't gain much by the Iter returning None after that.

Instead of boxing Iter/IterDup to generalize across both successful and failed attempts to get an iterator, we make Iter and IterDup be enums with Ok and Err variants, where the Ok variant behaves like the current implementations, and the Err variant always returns an error.

mykmelez · 2018-12-01T01:02:31Z

I didn't review this thoroughly, 'cuz I want to talk through options. … Let's talk about this more soon.

Does this implementation match your expectations based on the conversation we had?

mykmelez · 2018-12-01T01:07:06Z

Travis failed only on Rust 1.20.0 with errors like:

error[E0308]: mismatched types
   --> src/cursor.rs:265:13
    |
265 |             Iter::Ok { cursor, ref mut op, next_op, _marker } => {
    |             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ expected mutable reference, found enum `cursor::Iter`
    |
    = note: expected type `&mut cursor::Iter<'txn>`
               found type `cursor::Iter<'_>`

I'm unsure what to do about that. Perhaps requiring newer Rust is the simplest solution.

mykmelez · 2018-12-01T01:13:25Z

I'm unsure what to do about that. Perhaps requiring newer Rust is the simplest solution.

af923dc fixes Rust 1.20.0+ by explicitly identifying references in the match patterns.

mykmelez · 2018-12-12T00:09:25Z

@ncalexan Does this look good with the latest set of changes in response to your review comments?

ncalexan

OK, this makes much more sense to me now! Sorry for delayed review.

(I reviewed the squashed diff 'cuz there was a lot of inter-commit churn; I expect that's okay in this situation.)

update minor version for breaking change in #14

mykmelez added 10 commits October 22, 2018 14:34

make Cursor::iter_*() methods return Result instead of panicking

3fcf930

demonstrate various uses of API

c3cb55c

fix test failure; clarify iterator collection type

99f24f7

return Iter that produces error result from Cursor.iter*

58d46ec

alias Box<Iter> type to BoxedIter; make iter_dup_from return iterator…

6414b83

… too

empty commit to force CI rebuild

7f43959

empty commit to force CI rebuild

04c7de8

remove test that is no longer relevant

6f06c1c

remove unnecessary commented-out code

09bd1de

Merge branch 'return-error-result' into return-error-result-from-iter

a32a673

mykmelez requested a review from ncalexan November 19, 2018 22:29

ncalexan suggested changes Nov 20, 2018

View reviewed changes

mykmelez added 3 commits November 30, 2018 15:50

Merge branch 'master' into return-error-result-from-iter

ea1386f

Merge branch 'master' into return-error-result-from-iter

09d7656

explicitly identify reference in match patterns

af923dc

mykmelez mentioned this pull request Dec 3, 2018

return error result from fallible iteration methods #13

Closed

ncalexan approved these changes Dec 19, 2018

View reviewed changes

mykmelez merged commit cfaf37d into mozilla:master Dec 19, 2018

mykmelez added a commit to mykmelez/lmdb-rs that referenced this pull request Dec 19, 2018

update minor version for breaking change in mozilla#14

67d627c

mykmelez mentioned this pull request Dec 20, 2018

fix build bustage on obsolete and nightly Rust #22

Merged

mykmelez added a commit that referenced this pull request Dec 20, 2018

Merge pull request #21 from mykmelez/publish-0.11.0

6ab0155

update minor version for breaking change in #14

mykmelez mentioned this pull request Jan 12, 2019

update lmdb-rkv to latest version 0.11 mozilla/rkv#105

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

return error result only from Iterator.next() #14

return error result only from Iterator.next() #14

Uh oh!

mykmelez commented Nov 19, 2018

Uh oh!

mykmelez commented Nov 19, 2018

Uh oh!

ncalexan left a comment

Uh oh!

ncalexan Nov 20, 2018

Uh oh!

mykmelez Dec 1, 2018

Uh oh!

ncalexan Nov 20, 2018

Uh oh!

mykmelez Dec 1, 2018

Uh oh!

mykmelez commented Dec 1, 2018

Uh oh!

mykmelez commented Dec 1, 2018

Uh oh!

mykmelez commented Dec 1, 2018

Uh oh!

mykmelez commented Dec 12, 2018

Uh oh!

ncalexan left a comment

Uh oh!

Uh oh!

return error result only from Iterator.next() #14

return error result only from Iterator.next() #14

Uh oh!

Conversation

mykmelez commented Nov 19, 2018

Uh oh!

mykmelez commented Nov 19, 2018

Uh oh!

ncalexan left a comment

Choose a reason for hiding this comment

Uh oh!

ncalexan Nov 20, 2018

Choose a reason for hiding this comment

Uh oh!

mykmelez Dec 1, 2018

Choose a reason for hiding this comment

Uh oh!

ncalexan Nov 20, 2018

Choose a reason for hiding this comment

Uh oh!

mykmelez Dec 1, 2018

Choose a reason for hiding this comment

Uh oh!

mykmelez commented Dec 1, 2018

Uh oh!

mykmelez commented Dec 1, 2018

Uh oh!

mykmelez commented Dec 1, 2018

Uh oh!

mykmelez commented Dec 12, 2018

Uh oh!

ncalexan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!