Tracking Issue for `Iterator::collect_into` #94780

frengor · 2022-03-09T20:08:42Z

Feature gate: #![feature(iter_collect_into)]

This is a tracking issue for adding the collect_into method to the Iterator trait.
Iterator::collect_into lets an iterator to be collected into a collection which implements the Extend trait, consuming the iterator and adding every of its item to the collection.
Adding this method has also the benefit of making the Extend trait more discoverable.

Public API

trait Iterator {
    type Item;

    fn collect_into<E: Extend<Self::Item>>(self, collection: &mut E) -> &mut E
    where
        Self: Sized;
}

Steps / History

Implementation: Add Iterator::collect_into #93057
Final comment period (FCP)
Stabilization PR

Unresolved Questions

Is it worth it to have this API? The Iterator interface is already pretty large, and use cases can easily be written differently without this API.
- See Add Iterator::collect_into #93057 (review)
- And Add Iterator::collect_into #93057 (comment)

The text was updated successfully, but these errors were encountered:

Corfucinas · 2022-05-31T05:01:45Z

This would be a useful method when using MPSC and appending to different vectors depending on the object.

mqudsi · 2022-11-08T18:22:14Z

Not sure if this is where any discussion should go but I would like it to formally request from now that consideration should be given to the ability to (try to?) collect into a fixed-size buffer well, primarily for purposes of avoiding heap allocation. Presumably this would just be a separate feature (try_collect_into?) but if the design of that feature would in any way conflict with the current collect_into feature, then I would like us to discuss these issues here and now before collect_into stabilizes any further.

collect_into is nice because it lets you reduce allocations by directly extending a previous buffer/collection rather than allocating into a wholly separate one and then forcing you to merge the two, which (boilerplate aside) might be less efficient. On the other hand, collecting into a fixed-size buffer could let you use iterators to directly collect into a stack-allocated array/slice, which is pretty much guaranteed to be a huge performance win. I am happy to open a separate issue to specifically track such a feature, but again, I just wanted to mention it here so that if there's something the collect_into feature should (or shouldn't) have in order to facilitate such a try_collect_into in the future, we can hopefully fix it.

The idea is that sometimes you want to use the iterator façade to simplify code transforming an existing collection and your input set is either of a known size or a capped size. Being able to collect into a stack-allocated array or mutable slice thereof would allow collecting the results of an enumeration cleanly. The semantics can (but don't necessarily have to) differ from those of collect_into, as certain things (like tracking the number of elements collected) become harder and there's the question of what to do when the provided fixed-length destination doesn't fit everything (return false and just leave the remainder unread/uncollected in the iterator? panic?).

You can currently mock this yourself by implementing Extend<A> (though the Extend api assumes infallibility) and then using .inspect() before .collect_into() to track the number of elements actually written to the collection.

EDIT:

It may very well just make more sense to loop over the items and add them to the array rather than building a huge façade around that. (Technically the same goes for collect_into as well...)

feature-engineer · 2023-01-18T15:07:18Z

Regarding the last point:

Is it worth it to have this API? The Iterator interface is already pretty large, and use cases can easily be written differently without this API.

I have an example use case, and would like to know how it should be written differently without this API:

I'm implementing a limited size priority queue (i.e. I want to keep only the top N elements of the queue at any given time).
For this I create a struct which has a collection and a size as its members.
I create this struct with a given size field - which is why it has to be created before it is being collected to.
Using this queue after implementing its insert fn, and extend fn is trivial when this API exists:

q = SizedQueue::new(5);
<some iter over large data>.collect_into(q);

How would it look with an alternative API?

frengor · 2023-01-18T16:47:48Z

q = SizedQueue::new(5);
<some iter over large data>.collect_into(q);
How would it look with an alternative API?

q = SizedQueue::new(5);
q.extend(<some iter over large data>);

The two codes are equivalent (in fact, collect_into is implemented calling Extend::extend).

frengor · 2023-01-18T18:54:27Z

Presumably this would just be a separate feature (try_collect_into?)

Yeah, I think it should be. However, seeing that try_collect doesn't handle fallible allocations, the name of either try_collect_into or try_collect should be changed to maintain coherence (also see the unresolved questions of try_collect).

but if the design of that feature would in any way conflict with the current collect_into feature, then I would like us to discuss these issues here and now before collect_into stabilizes any further.

I don't think it will really impact collect_into, I suppose it would mostly require adding something like a TryExtend trait (it would be good to add a TryFromIterator trait, too), but maybe there is a better way to handle fallible allocations (?).

It may very well just make more sense to loop over the items and add them to the array rather than building a huge façade around that. (Technically the same goes for collect_into as well...)

That would probably just be the implementation of TryExtend for slices I think, and having it implemented once in the stdlib doesn't require users to duplicate the same code over and over. collect_into uses Extend for this exact reason.

frengor added C-tracking-issue Category: A tracking issue for an RFC or an unstable feature. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. labels Mar 9, 2022

nazar-pc mentioned this issue Mar 30, 2023

Consider adding collect_into rayon-rs/rayon#1039

Open

cuviper mentioned this issue Mar 30, 2023

Add Iterator::collect_into<E: Extend>(e: E)? #45840

Closed

egkoppel mentioned this issue Mar 10, 2024

List of nightly features required popcorn-2/popcorn-2#74

Open

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking Issue for `Iterator::collect_into` #94780

Tracking Issue for `Iterator::collect_into` #94780

frengor commented Mar 9, 2022 •

edited by m-ou-se

Corfucinas commented May 31, 2022

mqudsi commented Nov 8, 2022 •

edited

feature-engineer commented Jan 18, 2023 •

edited

frengor commented Jan 18, 2023

frengor commented Jan 18, 2023

Tracking Issue for Iterator::collect_into #94780

Tracking Issue for Iterator::collect_into #94780

Comments

frengor commented Mar 9, 2022 • edited by m-ou-se

Public API

Steps / History

Unresolved Questions

Corfucinas commented May 31, 2022

mqudsi commented Nov 8, 2022 • edited

feature-engineer commented Jan 18, 2023 • edited

frengor commented Jan 18, 2023

frengor commented Jan 18, 2023

Tracking Issue for `Iterator::collect_into` #94780

Tracking Issue for `Iterator::collect_into` #94780

frengor commented Mar 9, 2022 •

edited by m-ou-se

mqudsi commented Nov 8, 2022 •

edited

feature-engineer commented Jan 18, 2023 •

edited