Skip to content

Commit

Permalink
[ADT] Add a fallible_iterator wrapper.
Browse files Browse the repository at this point in the history
A fallible iterator is one whose increment or decrement operations may fail.
This would usually be supported by replacing the ++ and -- operators with
methods that return error:

    class MyFallibleIterator {
    public:
      // ...
      Error inc();
      Errro dec();
      // ...
    };

The downside of this style is that it no longer conforms to the C++ iterator
concept, and can not make use of standard algorithms and features such as
range-based for loops.

The fallible_iterator wrapper takes an iterator written in the style above
and adapts it to (mostly) conform with the C++ iterator concept. It does this
by providing standard ++ and -- operator implementations, returning any errors
generated via a side channel (an Error reference passed into the wrapper at
construction time), and immediately jumping the iterator to a known 'end'
value upon error. It also marks the Error as checked any time an iterator is
compared with a known end value and found to be inequal, allowing early exit
from loops without redundant error checking*.

Usage looks like:

    MyFallibleIterator I = ..., E = ...;

    Error Err = Error::success();
    for (auto &Elem : make_fallible_range(I, E, Err)) {
      // Loop body is only entered when safe.

      // Early exits from loop body permitted without checking Err.
      if (SomeCondition)
        return;

    }
    if (Err)
      // Handle error.

* Since failure causes a fallible iterator to jump to end, testing that a
  fallible iterator is not an end value implicitly verifies that the error is a
  success value, and so is equivalent to an error check.

Reviewers: dblaikie, rupprecht

Subscribers: mgorny, dexonsmith, kristina, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57618

llvm-svn: 353237
  • Loading branch information
lhames committed Feb 5, 2019
1 parent 7b7a4ef commit 3e040e0
Show file tree
Hide file tree
Showing 7 changed files with 624 additions and 38 deletions.
80 changes: 69 additions & 11 deletions llvm/docs/ProgrammersManual.rst
Expand Up @@ -935,28 +935,86 @@ Building fallible iterators and iterator ranges

The archive walking examples above retrieve archive members by index, however
this requires considerable boiler-plate for iteration and error checking. We can
clean this up by using ``Error`` with the "fallible iterator" pattern. The usual
C++ iterator patterns do not allow for failure on increment, but we can
incorporate support for it by having iterators hold an Error reference through
which they can report failure. In this pattern, if an increment operation fails
the failure is recorded via the Error reference and the iterator value is set to
the end of the range in order to terminate the loop. This ensures that the
dereference operation is safe anywhere that an ordinary iterator dereference
would be safe (i.e. when the iterator is not equal to end). Where this pattern
is followed (as in the ``llvm::object::Archive`` class) the result is much
cleaner iteration idiom:
clean this up by using the "fallible iterator" pattern, which supports the
following natural iteration idiom for fallible containers like Archive:

.. code-block:: c++

Error Err;
for (auto &Child : Ar->children(Err)) {
// Use Child - we only enter the loop when it's valid
// Use Child - only enter the loop when it's valid

// Allow early exit from the loop body, since we know that Err is success
// when we're inside the loop.
if (BailOutOn(Child))
return;

...
}
// Check Err after the loop to ensure it didn't break due to an error.
if (Err)
return Err;

To enable this idiom, iterators over fallible containers are written in a
natural style, with their ``++`` and ``--`` operators replaced with fallible
``Error inc()`` and ``Error dec()`` functions. E.g.:

.. code-block:: c++

class FallibleChildIterator {
public:
FallibleChildIterator(Archive &A, unsigned ChildIdx);
Archive::Child &operator*();
friend bool operator==(const ArchiveIterator &LHS,
const ArchiveIterator &RHS);

// operator++/operator-- replaced with fallible increment / decrement:
Error inc() {
if (!A.childValid(ChildIdx + 1))
return make_error<BadArchiveMember>(...);
++ChildIdx;
return Error::success();
}

Error dec() { ... }
};

Instances of this kind of fallible iterator interface are then wrapped with the
fallible_iterator utility which provides ``operator++`` and ``operator--``,
returning any errors via a reference passed in to the wrapper at construction
time. The fallible_iterator wrapper takes care of (a) jumping to the end of the
range on error, and (b) marking the error as checked whenever an iterator is
compared to ``end`` and found to be inequal (in particular: this marks the
error as checked throughout the body of a range-based for loop), enabling early
exit from the loop without redundant error checking.

Instances of the fallible iterator interface (e.g. FallibleChildIterator above)
are wrapped using the ``make_fallible_itr`` and ``make_fallible_end``
functions. E.g.:

.. code-block:: c++

class Archive {
public:
using child_iterator = fallible_iterator<FallibleChildIterator>;

child_iterator child_begin(Error &Err) {
return make_fallible_itr(FallibleChildIterator(*this, 0), Err);
}
child_iterator child_end() {
return make_fallible_end(FallibleChildIterator(*this, size()));
}
iterator_range<child_iterator> children(Error &Err) {
return make_range(child_begin(Err), child_end());
}
};

Using the fallible_iterator utility allows for both natural construction of
fallible iterators (using failing ``inc`` and ``dec`` operations) and
relatively natural use of c++ iterator/loop idioms.

.. _function_apis:

More information on Error and its related utilities can be found in the
Expand Down
243 changes: 243 additions & 0 deletions llvm/include/llvm/ADT/fallible_iterator.h
@@ -0,0 +1,243 @@
//===--- fallible_iterator.h - Wrapper for fallible iterators ---*- C++ -*-===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//

#ifndef LLVM_ADT_FALLIBLE_ITERATOR_H
#define LLVM_ADT_FALLIBLE_ITERATOR_H

#include "llvm/ADT/PointerIntPair.h"
#include "llvm/ADT/iterator_range.h"
#include "llvm/Support/Error.h"

#include <type_traits>

namespace llvm {

/// A wrapper class for fallible iterators.
///
/// The fallible_iterator template wraps an underlying iterator-like class
/// whose increment and decrement operations are replaced with fallible versions
/// like:
///
/// @code{.cpp}
/// Error inc();
/// Error dec();
/// @endcode
///
/// It produces an interface that is (mostly) compatible with a traditional
/// c++ iterator, including ++ and -- operators that do not fail.
///
/// Instances of the wrapper are constructed with an instance of the
/// underlying iterator and (for non-end iterators) a reference to an Error
/// instance. If the underlying increment/decrement operations fail, the Error
/// is returned via this reference, and the resulting iterator value set to an
/// end-of-range sentinel value. This enables the following loop idiom:
///
/// @code{.cpp}
/// class Archive { // E.g. Potentially malformed on-disk archive
/// public:
/// fallible_iterator<ArchiveChildItr> children_begin(Error &Err);
/// fallible_iterator<ArchiveChildItr> children_end();
/// iterator_range<fallible_iterator<ArchiveChildItr>>
/// children(Error &Err) {
/// return make_range(children_begin(Err), children_end());
/// //...
/// };
///
/// void walk(Archive &A) {
/// Error Err = Error::success();
/// for (auto &C : A.children(Err)) {
/// // Loop body only entered when increment succeeds.
/// }
/// if (Err) {
/// // handle error.
/// }
/// }
/// @endcode
///
/// The wrapper marks the referenced Error as unchecked after each increment
/// and/or decrement operation, and clears the unchecked flag when a non-end
/// value is compared against end (since, by the increment invariant, not being
/// an end value proves that there was no error, and is equivalent to checking
/// that the Error is success). This allows early exits from the loop body
/// without requiring redundant error checks.
template <typename Underlying> class fallible_iterator {
private:
template <typename T>
using enable_if_struct_deref_supported = std::enable_if<
!std::is_void<decltype(std::declval<T>().operator->())>::value,
decltype(std::declval<T>().operator->())>;

public:
/// Construct a fallible iterator that *cannot* be used as an end-of-range
/// value.
///
/// A value created by this method can be dereferenced, incremented,
/// decremented and compared, providing the underlying type supports it.
///
/// The error that is passed in will be initially marked as checked, so if the
/// iterator is not used at all the Error need not be checked.
static fallible_iterator itr(Underlying I, Error &Err) {
(void)!!Err;
return fallible_iterator(std::move(I), &Err);
}

/// Construct a fallible iteratro that can be used as an end-of-range value.
///
/// A value created by this method can be dereferenced (if the underlying
/// value points at a valid value) and compared, but not incremented or
/// decremented.
static fallible_iterator end(Underlying I) {
return fallible_iterator(std::move(I), nullptr);
}

/// Forward dereference to the underlying iterator.
auto operator*() -> decltype(*std::declval<Underlying>()) { return *I; }

/// Forward const dereference to the underlying iterator.
auto operator*() const -> decltype(*std::declval<const Underlying>()) {
return *I;
}

/// Forward structure dereference to the underlying iterator (if the
/// underlying iterator supports it).
template <typename T = Underlying>
typename enable_if_struct_deref_supported<T>::type operator->() {
return I.operator->();
}

/// Forward const structure dereference to the underlying iterator (if the
/// underlying iterator supports it).
template <typename T = Underlying>
typename enable_if_struct_deref_supported<const T>::type operator->() const {
return I.operator->();
}

/// Increment the fallible iterator.
///
/// If the underlying 'inc' operation fails, this will set the Error value
/// and update this iterator value to point to end-of-range.
///
/// The Error value is marked as needing checking, regardless of whether the
/// 'inc' operation succeeds or fails.
fallible_iterator &operator++() {
assert(getErrPtr() && "Cannot increment end iterator");
if (auto Err = I.inc())
handleError(std::move(Err));
else
resetCheckedFlag();
return *this;
}

/// Decrement the fallible iterator.
///
/// If the underlying 'dec' operation fails, this will set the Error value
/// and update this iterator value to point to end-of-range.
///
/// The Error value is marked as needing checking, regardless of whether the
/// 'dec' operation succeeds or fails.
fallible_iterator &operator--() {
assert(getErrPtr() && "Cannot decrement end iterator");
if (auto Err = I.dec())
handleError(std::move(Err));
else
resetCheckedFlag();
return *this;
}

/// Compare fallible iterators for equality.
///
/// Returns true if both LHS and RHS are end-of-range values, or if both are
/// non-end-of-range values whose underlying iterator values compare equal.
///
/// If this is a comparison between an end-of-range iterator and a
/// non-end-of-range iterator, then the Error (referenced by the
/// non-end-of-range value) is marked as checked: Since all
/// increment/decrement operations result in an end-of-range value, comparing
/// false against end-of-range is equivalent to checking that the Error value
/// is success. This flag management enables early returns from loop bodies
/// without redundant Error checks.
friend bool operator==(const fallible_iterator &LHS,
const fallible_iterator &RHS) {
// If both iterators are in the end state they compare
// equal, regardless of whether either is valid.
if (LHS.isEnd() && RHS.isEnd())
return true;

assert(LHS.isValid() && RHS.isValid() &&
"Invalid iterators can only be compared against end");

bool Equal = LHS.I == RHS.I;

// If the iterators differ and this is a comparison against end then mark
// the Error as checked.
if (!Equal) {
if (LHS.isEnd())
(void)!!*RHS.getErrPtr();
else
(void)!!*LHS.getErrPtr();
}

return Equal;
}

/// Compare fallible iterators for inequality.
///
/// See notes for operator==.
friend bool operator!=(const fallible_iterator &LHS,
const fallible_iterator &RHS) {
return !(LHS == RHS);
}

private:
fallible_iterator(Underlying I, Error *Err)
: I(std::move(I)), ErrState(Err, false) {}

Error *getErrPtr() const { return ErrState.getPointer(); }

bool isEnd() const { return getErrPtr() == nullptr; }

bool isValid() const { return !ErrState.getInt(); }

void handleError(Error Err) {
*getErrPtr() = std::move(Err);
ErrState.setPointer(nullptr);
ErrState.setInt(true);
}

void resetCheckedFlag() {
*getErrPtr() = Error::success();
}

Underlying I;
mutable PointerIntPair<Error *, 1> ErrState;
};

/// Convenience wrapper to make a fallible_iterator value from an instance
/// of an underlying iterator and an Error reference.
template <typename Underlying>
fallible_iterator<Underlying> make_fallible_itr(Underlying I, Error &Err) {
return fallible_iterator<Underlying>::itr(std::move(I), Err);
}

/// Convenience wrapper to make a fallible_iterator end value from an instance
/// of an underlying iterator.
template <typename Underlying>
fallible_iterator<Underlying> make_fallible_end(Underlying E) {
return fallible_iterator<Underlying>::end(std::move(E));
}

template <typename Underlying>
iterator_range<fallible_iterator<Underlying>>
make_fallible_range(Underlying I, Underlying E, Error &Err) {
return make_range(make_fallible_itr(std::move(I), Err),
make_fallible_end(std::move(E)));
}

} // end namespace llvm

#endif // LLVM_ADT_FALLIBLE_ITERATOR_H

0 comments on commit 3e040e0

Please sign in to comment.