Skip to content

Latest commit

 

History

History
35 lines (24 loc) · 1.31 KB

chapter_2_4.md

File metadata and controls

35 lines (24 loc) · 1.31 KB

The Consolidate Operator

The consolidate operator takes an input collection, and does nothing other than possibly changing its physical representation. It leaves the same sets of elements at the same times with the same logical counts.

What consolidate does do is ensure that each element at each time has at most one physical tuple. Generally, we might have multiple updates to the same element at the same time, expressed as independent updates. The consolidate operator adds all of these updates together before moving the update along.

As an example, if we were to inspect

    manages
        .map(|(m2, m1)| (m1, m2))
        .concat(&manages)
        .inspect(|x| println!("{:?}", x));

we might see two copies of the same element:

    ((0, 0), 0, 1)
    ((0, 0), 0, 1)

However, by introducing consolidate

    manages
        .map(|(m2, m1)| (m1, m2))
        .concat(&manages)
        .consolidate()
        .inspect(|x| println!("{:?}", x));

we are guaranteed to see at most one (0,0) update at each time:

    ((0, 0), 0, 2)

The consolidate operator is mostly useful before inspecting data, but it can also be important for efficiency; knowing when to spend the additional computation to consolidate the representation of your data is an advanced topic!