Amend 1192 (RangeInclusive) to use an enum. #1320

Stebalien · 2015-10-13T19:27:57Z

This PR proposes that RangeInclusive be an enum with Empty/NonEmpty variants instead of a struct with a finished field:

pub enum RangeInclusive<T> {
    Empty {
        at: T,
    },
    NonEmpty {
        start: T,
        end: T,
    }
}

Rational:

finished is very iterator specific. Regardless of what happens, I think this field should be called empty.
start/end don't make sense if the range is empty. Using an enum prevents users from using the start/end of spent ranges. Basically, this makes it impossible for users to do something like foo(my_range.take(10)); bar(my_range) and forget to check finished in bar.
If we ever get more space optimizations (specifically, utf8 code point ones) 'a'...'z' should be the same size as 'a'..'z'.
Don't have to allocate the next start when the end of the range is reached (slight constant factor gain..., maybe).

Rational: 1. The word "finished" is very iterator specific. Really, this field is trying to indicate that the range is actually empty. 2. `start`/`end` don't make sense if the range is empty. Using an enum prevents coders from using the `start`/`end` of spent ranges. Basically, this makes it impossible for the coder to do something like `foo(my_range.take(10)); bar(my_range)` and forget to check `finished` in `bar`. 3. If we ever get better enum optimizations (specifically, utf8 code point ones) `'a'...'z'` should be the same size as `'a'..'z'`; the Empty variant can be encoded as an invalid code point.

huonw · 2015-10-28T23:02:02Z

Hm, I wonder if there's any use for storing the end point, i.e. Empty { end: T } (or even Empty { start: T, end :T }). This at least respects ownership better, in that you don't lose the Ts.

huonw · 2015-10-29T05:32:37Z

To expand a little/respond to your rational:

empty is definitely nicer/less opinionated than finished
people are still prevented from dismissing the empty case even if it has fields, and I think it doens't not make no sense, e.g. 0...1 represents decreasing segments as you walk through it:
```
[ 0 1 ] 2 3 ..
0 [ 1 ] 2 3 ...
0 1 [ ] 2 3 ...
```
Once at that point, one can expand it in either direction, e.g. 0 1 [ 2 3 ] ....
the space optimisations don't seem imperative for this type: I imagine it will generally either be transient, or, if it is being stored, the Empty case is important (e.g. being stored and used as an iterator). (And, it's easy to store (T, T) instead, if the space is important.)

Stebalien · 2015-10-29T14:19:47Z

I agree that keeping the end is useful but I don't really get the ownership argument for keeping both.

bluss · 2015-11-03T13:24:15Z

I think we should back out of RangeInclusive. We need fully general ranges instead which are open/closed/unbounded of both ends.

huonw · 2015-11-03T13:32:09Z

@Stebalien it can be expensive to create/copy/destroy values in Rust (this is in contrast to managed languages, where copying is often just copying a single pointer, or at most manipulating some reference counts), so minimising how often this happens implicitly/without programmer control is good. E.g. if you have a RangeInclusive<BigInt>, it is unfortunate to destroy one end-point when you get to Empty and then have to copy the other if the range is extended.

@bluss I agree that having fully general ranges would be nice, but there has been a lot of discussion/thinking about it and AFAIK there has been no nice (and backwards compatible) syntax raised.

bluss · 2015-11-03T14:08:07Z

Syntax is a second-order issue for me, which types we want to introduce into the core of rust is much more important.

Stebalien · 2015-11-03T14:13:37Z

@huonw std::iter::Step already allocates new objects so you'd have to change that interface to make storing both endpoints useful.

We need fully general ranges instead which are open/closed/unbounded of both ends.

I (obviously) agree. However, it may be a bit late for that given that we're stuck with Range as-is but I'd love to hear proposals.

huonw · 2015-11-03T14:18:23Z

I think std::iter::Step essentially creates as few extra objects as it can, i.e. it only duplicates things when it absolutely has to, in order satisfy the type signatures of the traits it is used in (i.e. Iterator::next cannot return references pointing into the Range itself), whereas dropping/reduplicating an object in this case isn't necessary.

Stebalien · 2015-11-03T15:11:51Z

Scratch that, I thought Step was used for iterating in general. Not storing start actually saves an allocation because you don't have to allocate a new one when transitioning from NonEmpty to Empty.

bluss · 2015-11-03T18:54:38Z

I (obviously) agree. However, it may be a bit late for that given that we're stuck with Range as-is but I'd love to hear proposals.

Range<T> is just a struct of two values, what the endpoints mean can be changed by convention (without breaking existing practice). Among the possibilities is to let a...b or another inclusive range syntax produce Range { start: Bound::Include(a), end: Bound::Exclude(b) }.

Range<usize> has a pretty fixed interpretation as it is now, but Range<Bound<usize>> need not have. These are just rather loose ideas.

Stebalien · 2015-11-13T22:12:12Z

@huonw, I've updated the proposal to avoid throwing away the endpoint. I do only need to keep one because, on the last iteration, I can just return the start as-is without advancing it leaving me with the end only.

huonw · 2015-11-18T23:49:18Z

I can just return the start as-is without advancing it leaving me with the end only.

Oh, that's a good point.

nikomatsakis · 2016-01-15T09:46:46Z

Hear ye, hear ye! This RFC is now entering final comment period.

nikomatsakis · 2016-01-15T09:50:25Z

I am 👍 on this RFC but I would request that the author add a "# History" or "# Amendments" section and simply note the change that is occurring. It's nice when reading the text of an RFC if you don't have to consult the git history to know when it was updated. For example:

# Amendments

- In rust-lang/rfcs#1320, this RFC was amended to change the `RangeInclusive` type from a struct with a `finished` field to an enum.

bluss · 2016-01-15T11:35:42Z

Where did the discussion of more complete range syntax go? Mentioned in #1254

huonw · 2016-01-15T11:38:49Z

That topic seems more relevant for stabilisation of the actual ... sugar, orthogonal to this particular RFC which is mostly an implementation detail (i.e. this RFC is an improvement for the ... sugar, even if we don't end up going with it in the end).

(Maybe you're thinking of https://internals.rust-lang.org/t/vs-for-inclusive-ranges/1539 ?)

bluss · 2016-01-15T12:03:32Z

I'm not thinking of that, that's an old discussion. #1254 was much more recent, indicating aturon and lang team wanted to look at this.

@nikomatsakis

At @nikomatsakis request.

aturon · 2016-01-19T23:22:02Z

@bluss Yes, I am on the same page: I want to back out ... an instead settle on a general syntax that can cover inclusive/exclusive on both sides, just like mathematical range notation. I just haven't had time to write an RFC for it. (I'd be happy to collaborate on an RFC if you're up for that...)

However, as @huonw said, this RFC is just an amendment to the previous one, so it's somewhat orthogonal to the larger question.

aturon · 2016-01-20T23:08:36Z

Libs team consensus: this is a clear improvement over the original RFC.

liigo · 2016-01-21T17:50:28Z

text/1192-inclusive-ranges.md

@@ -37,12 +41,11 @@ pub struct RangeToInclusive<T> {
 }
 ```

-Writing `a...b` in an expression desugars to `std::ops::RangeInclusive
-{ start: a, end: b, finished: false }`. Writing `...b` in an
+Writing `a...b` in an expression desugars to `std::ops::RangeInclusive::NonEmpty { start: a, end: b }`. Writing `...b` in an
 expression desugars to `std::ops::RangeToInclusive { end: b }`.


oops. soorry!

nikomatsakis · 2016-01-22T21:23:52Z

Huzzah! The lang team has decided to accept this RFC.

nikomatsakis · 2016-01-22T21:26:23Z

I kept the same tracking issue, but added a "to do" item to implement the changes suggested here.

durka · 2016-01-26T17:15:50Z

Implemented in (in progress) rust-lang/rust#30884.

eddyb · 2016-04-05T04:07:21Z

I wonder whether, from a performance perspective, how does the enum approach compare to using x+1...x as the empty state and having a MAX...MAX-1 special case when x == MAX?

I can't tell if this was suggested at some point, but even if it wasn't, I doubt the implementation will change.

nrc added the T-libs-api Relevant to the library API team, which will review and decide on the RFC. label Oct 15, 2015

alexcrichton assigned huonw Oct 28, 2015

Don't throw away endpoint after exhausting range

6641758

Stebalien mentioned this pull request Jan 13, 2016

implement RFC 1192 inclusive ranges rust-lang/rust#30884

Merged

huonw added T-lang Relevant to the language team, which will review and decide on the RFC. I-nominated labels Jan 14, 2016

nikomatsakis added final-comment-period Will be merged/postponed/closed in ~10 calendar days unless new substational objections are raised. and removed I-nominated labels Jan 15, 2016

Add amendments section to note change.

2025389

At @nikomatsakis request.

liigo reviewed Jan 21, 2016
View reviewed changes

nikomatsakis mentioned this pull request Jan 22, 2016

Tracking issue for ..= inclusive ranges (RFC #1192) -- originally ... rust-lang/rust#28237

Closed

8 tasks

nikomatsakis merged commit 2025389 into rust-lang:master Jan 22, 2016

scottmcm mentioned this pull request Apr 25, 2017

Make RangeInclusive just a two-field struct (amend 1192) #1980

Merged

Centril added the A-ranges Proposals relating to ranges. label Nov 23, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Amend 1192 (RangeInclusive) to use an enum. #1320

Amend 1192 (RangeInclusive) to use an enum. #1320

Stebalien commented Oct 13, 2015

huonw commented Oct 28, 2015

huonw commented Oct 29, 2015

Stebalien commented Oct 29, 2015

bluss commented Nov 3, 2015

huonw commented Nov 3, 2015

bluss commented Nov 3, 2015

Stebalien commented Nov 3, 2015

huonw commented Nov 3, 2015

Stebalien commented Nov 3, 2015

bluss commented Nov 3, 2015

Stebalien commented Nov 13, 2015

huonw commented Nov 18, 2015

nikomatsakis commented Jan 15, 2016

nikomatsakis commented Jan 15, 2016

bluss commented Jan 15, 2016

huonw commented Jan 15, 2016

bluss commented Jan 15, 2016

aturon commented Jan 19, 2016

aturon commented Jan 20, 2016

liigo Jan 21, 2016

liigo Jan 21, 2016

nikomatsakis commented Jan 22, 2016

nikomatsakis commented Jan 22, 2016

durka commented Jan 26, 2016

eddyb commented Apr 5, 2016

Amend 1192 (RangeInclusive) to use an enum. #1320

Amend 1192 (RangeInclusive) to use an enum. #1320

Conversation

Stebalien commented Oct 13, 2015

huonw commented Oct 28, 2015

huonw commented Oct 29, 2015

Stebalien commented Oct 29, 2015

bluss commented Nov 3, 2015

huonw commented Nov 3, 2015

bluss commented Nov 3, 2015

Stebalien commented Nov 3, 2015

huonw commented Nov 3, 2015

Stebalien commented Nov 3, 2015

bluss commented Nov 3, 2015

Stebalien commented Nov 13, 2015

huonw commented Nov 18, 2015

nikomatsakis commented Jan 15, 2016

nikomatsakis commented Jan 15, 2016

bluss commented Jan 15, 2016

huonw commented Jan 15, 2016

bluss commented Jan 15, 2016

aturon commented Jan 19, 2016

aturon commented Jan 20, 2016

liigo Jan 21, 2016

Choose a reason for hiding this comment

liigo Jan 21, 2016

Choose a reason for hiding this comment

nikomatsakis commented Jan 22, 2016

nikomatsakis commented Jan 22, 2016

durka commented Jan 26, 2016

eddyb commented Apr 5, 2016