Replace `MultiIterable` usages with `Sequence` #2504

FloEdelmann · 2021-01-15T17:01:32Z

It seems to work fine this way, but I'm not sure whether I hurt performance through unnecessary iterating or copying. If I understand Kotlin's Sequences correctly, though, it should behave as intended.

smichel17

I think this is a little more kotlin-y

smichel17 · 2021-01-17T02:35:28Z

app/src/main/java/de/westnordost/streetcomplete/data/osm/osmquest/OsmFilterQuestType.kt

+        var elements = sequenceOf<Element>()
+        if (filter.includesElementType(Element.Type.NODE)) elements += mapData.nodes
+        if (filter.includesElementType(Element.Type.WAY)) elements += mapData.ways
+        if (filter.includesElementType(Element.Type.RELATION)) elements += mapData.relations
+        return elements.filter { element -> filter.matches(element) }.asIterable()


I am personally allergic to if statements without braces (except when replacing the ternary operator), and using them on one liners like this makes them very busy. Here's a rewrite to avoid that, and use a more functional style:

    val elements = mapOf(
        Element.Type.NODE to mapData.nodes,
        Element.Type.WAY to mapData.ways,
        Element.Type.RELATION to mapData.relations
    ).filterKeys(filter::includesElementType).values.flatten()
    return elements.asSequence().filter(filter::matches).asIterable()

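If you don't like using the function references (understandable: they smell faintly of magic, although as far as I know they are compiled to ordinary function objects and are not invoked through reflection at runtime):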

    ).filterKeys { filter.includesElementType(it) }.values.flatten()
    return elements.asSequence().filter { filter.matches(it) }.asIterable()

Alternative version using sequence() (maybe better performance?)

    return sequence {
        mapOf(
            Element.Type.NODE to mapData.nodes,
            Element.Type.WAY to mapData.ways,
            Element.Type.RELATION to mapData.relations
        ).filterKeys(filter::includesElementType).forEach { yieldAll(it.value) }
    }.filter(filter::matches).asIterable()

Whoa, really? Readability aside, you:

1. `mapOf`: you create a hash map and add three entries.
2. `filterKeys`: you create yet another hash map containing only the entries whose keys return true for `filter::includesElementType`.
3. `.values`: you access the value collection of the hash map. (Not sure if this requires extra processing. It would if the value collection is created lazily, or if it is a copy of all values rather than a view onto the hash map.)
4. `flatten`: you create a new list by iterating through all the values in the map and adding every element to an array list.
5. Since step 4 has already copied the complete lists, the last line (converting to a sequence) doesn't really make sense.

That's a lot of transforming and copying of data just to avoid a few ifs.

Your last suggestion would at least work as far as I understand because you don't flatten everything into a list.
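To make the copying in step 4 concrete, here is a minimal, self-contained sketch. `CountingIterable` is a hypothetical helper (not part of StreetComplete) that counts how many elements are read from it, and the strings and map keys are stand-ins for the real map data. The sketch only compares how many elements each approach reads before the first result is available; it says nothing about absolute timings.

```kotlin
// Hypothetical helper: an Iterable that counts how many elements are read from it.
class CountingIterable<T>(private val items: List<T>) : Iterable<T> {
    var reads = 0
        private set

    override fun iterator(): Iterator<T> {
        val inner = items.iterator()
        return object : Iterator<T> {
            override fun hasNext(): Boolean = inner.hasNext()
            override fun next(): T {
                reads++
                return inner.next()
            }
        }
    }
}

// Same shape as the map-based suggestion: flatten() on a collection of
// Iterables builds a new list, so every element is read up front.
fun eagerVersionReads(): Int {
    val nodes = CountingIterable(listOf("n1", "n2", "n3"))
    val ways = CountingIterable(listOf("w1", "w2"))
    val elements = mapOf("NODE" to nodes, "WAY" to ways).values.flatten()
    elements.asSequence().first() // only one element is needed here
    return nodes.reads + ways.reads
}

// Same shape as the sequence { yieldAll(...) } suggestion: elements are
// read only as the consumer asks for them.
fun lazyVersionReads(): Int {
    val nodes = CountingIterable(listOf("n1", "n2", "n3"))
    val ways = CountingIterable(listOf("w1", "w2"))
    val elements = sequence<String> {
        yieldAll(nodes)
        yieldAll(ways)
    }
    elements.first() // only one element is needed here
    return nodes.reads + ways.reads
}
```

With these stand-in inputs, `eagerVersionReads()` returns 5 (everything was copied by `flatten()`), while `lazyVersionReads()` returns 1.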

On the sequence { } syntax, I think it would look like this then. Since anyone who reads this also needs to understand sequences, this would be fine too. Not sure if flatten keeps the sequence a sequence; it should, otherwise the code below doesn't make sense.

    return sequence {
        if (filter.includesElementType(Element.Type.NODE)) yield(mapData.nodes)
        if (filter.includesElementType(Element.Type.WAY)) yield(mapData.ways)
        if (filter.includesElementType(Element.Type.RELATION)) yield(mapData.relations)
    }.flatten().filter(filter::matches).asIterable()

> Not sure if flatten keeps the sequence a sequence

It does: https://github.com/JetBrains/kotlin/blob/8fa848bed384568a8ceb9731ec9d45663aa64521/libraries/stdlib/src/kotlin/collections/Sequences.kt#L94-L99
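A small sketch of what that means in practice, using plain integer lists as hypothetical stand-ins for the map data: `flatten()` on a `Sequence` of lists returns another `Sequence`, and the outer block is only run as far as the consumer needs.

```kotlin
// Returns the first flattened element together with a log of what the
// sequence block actually executed to produce it.
fun firstOfFlattened(): Pair<Int, List<String>> {
    val log = mutableListOf<String>()
    val flattened: Sequence<Int> = sequence<List<Int>> {
        log += "yield first list"
        yield(listOf(1, 2))
        log += "yield second list"
        yield(listOf(3, 4))
    }.flatten()
    // Asking for one element runs the block only up to the first yield.
    return flattened.first() to log
}
```

Here `firstOfFlattened()` returns `1` paired with a log that contains only "yield first list"; the second list is never produced.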

What about yieldAll though?

    return sequence {
        if (filter.includesElementType(Element.Type.NODE)) yieldAll(mapData.nodes)
        if (filter.includesElementType(Element.Type.WAY)) yieldAll(mapData.ways)
        if (filter.includesElementType(Element.Type.RELATION)) yieldAll(mapData.relations)
    }.filter { filter.matches(it) }.asIterable()

Right, that should work too. (Disclaimer: I am not an expert on Kotlin sequences)

Well, I did phrase it as a question ("maybe better performance?"). Disclaimer: I am also not an expert on Kotlin sequences, although I re-read most of the documentation before writing my version, so some details are fresh in my mind.

4) This is mostly what I was talking about — that my second version was more efficient than the first version, because it avoids flatten() on a list, which will evaluate eagerly.

5) Objects are copied by reference in Java, so while there is a performance impact, it may be smaller than it seems. Thus, a lazy filter may still provide a meaningful performance benefit. It depends on the cost of running filter.matches.

1-3) Because of the small number of elements (3 max), this is essentially a fixed cost. I don't know exactly how big that cost (2x hash map creation, up to 6 hash map insertions, up to 3 hash map accesses) is. I did not consider it in my performance analysis, but it may have a meaningful impact.

However, the main reason I say "maybe" is that I don't know the implementation of elements += mapData.nodes. In particular, **I do not know if it is eagerly copying the nodes into an array inside the elements sequence (which would then be evaluated lazily), or just holding a reference to mapData.nodes.** I have the same question about yieldAll(), although I would guess that it is more likely to be optimized in this way. This is the key question that needs answering to analyze the efficiency.
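One way to probe this question without reading the standard library source is to mutate the source list after building the sequence. The sketch below uses a hypothetical mutable list of strings in place of mapData.nodes. If the sequence had taken an eager copy at the time of `+=` (or when the `sequence { }` block was declared), the element added afterwards would be missing from the result.

```kotlin
// Concatenation with += on a Sequence variable (Sequence<T>.plus(Iterable<T>)).
fun plusHoldsReference(): List<String> {
    val nodes = mutableListOf("n1", "n2")
    var elements = sequenceOf<String>()
    elements += nodes
    nodes += "n3" // mutate the source after the concatenation
    return elements.toList()
}

// The same probe for yieldAll() inside a sequence { } block.
fun yieldAllHoldsReference(): List<String> {
    val nodes = mutableListOf("n1", "n2")
    val elements = sequence<String> { yieldAll(nodes) }
    nodes += "n3" // mutate the source before the sequence is consumed
    return elements.toList()
}
```

Both functions return `[n1, n2, n3]`, which suggests that neither form copies the source when the sequence is built. This only probes construction time; it does not measure the per-element overhead during iteration.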

The block that you pass to sequence() is a suspend function (docs here). Each time you ask for values in the sequence, the block runs only up until you yield enough values, then suspends. There is some cost to suspending (should be low, since coroutines are lightweight, but I don't know how lightweight), and a memory cost to capture the closure.
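The suspend-and-resume behaviour described above can be made visible with a trace. This is a minimal sketch with made-up values; it records when the block runs relative to when the consumer receives each value.

```kotlin
// Records the interleaving of the sequence block and its consumer.
fun traceSequenceBlock(): List<String> {
    val trace = mutableListOf<String>()
    val numbers = sequence<Int> {
        trace += "block started"
        yield(1)
        trace += "resumed after 1"
        yield(2)
        trace += "resumed after 2"
        yield(3)
    }
    // Nothing in the block has run yet at this point.
    trace += "sequence created"
    for (n in numbers.take(2)) {
        trace += "consumed $n"
    }
    return trace
}
```

The resulting trace is "sequence created", "block started", "consumed 1", "resumed after 1", "consumed 2". The block is never resumed after the second value because the consumer stops asking.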

So this is why I am uncertain. There are many different unknown variables.

I suspect that the most efficient approach is either the latest @FloEdelmann wrote, with ifs and yieldAll(), or

    return sequenceOf(
        Pair(Element.Type.NODE, mapData.nodes),
        Pair(Element.Type.WAY, mapData.ways),
        Pair(Element.Type.RELATION, mapData.relations)
    )
        .filter { (type, _) -> filter.includesElementType(type) }
        .flatMap { (_, elements) -> elements }
        .filter { filter.matches(it) }
        .asIterable()

I considered writing it this way initially, but I thought the map version was more readable than a list of pairs.
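For anyone who wants to try the pairs-based pipeline in isolation, here is a self-contained sketch. `Kind`, `select`, and the string lists are hypothetical stand-ins for `Element.Type`, the quest type method, and `mapData`; `asSequence()` is added inside `flatMap` only so that the sketch does not depend on the `flatMap` overload that accepts an `Iterable`.

```kotlin
// Hypothetical stand-in for Element.Type.
enum class Kind { NODE, WAY, RELATION }

// Selects the lists whose kind is wanted, flattens them lazily and
// filters the individual elements with the given predicate.
fun select(
    wanted: Set<Kind>,
    nodes: List<String>,
    ways: List<String>,
    relations: List<String>,
    matches: (String) -> Boolean
): Iterable<String> =
    sequenceOf(
        Kind.NODE to nodes,
        Kind.WAY to ways,
        Kind.RELATION to relations
    )
        .filter { (kind, _) -> kind in wanted }
        .flatMap { (_, elements) -> elements.asSequence() }
        .filter(matches)
        .asIterable()
```

For example, `select(setOf(Kind.NODE, Kind.RELATION), listOf("n1", "n2"), listOf("w1"), listOf("r1")) { it.endsWith("1") }.toList()` gives `[n1, r1]`.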

FloEdelmann · 2021-01-17T11:44:41Z

@westnordost Your opinion on @smichel17's suggestions?

westnordost · 2021-01-17T13:25:24Z

I think they make it harder to read. To know about what all those kotlin flow functions (or how'd you call them) do requires additional expertise.
Chaining such functions makes the code harder to understand with each added transformation of the data.

Knowing that an if statement does not need braces when just one statement follows does not. And to boot, this (if statements don't always need braces) is how the code in all of the app is written.

smichel17 · 2021-01-17T17:16:50Z

> to boot, this (if statements don't always need braces) is how the code in all of the app is written.

Yes, I know… not being happy just sticking to an existing style for consistency's sake is a weakness of mine as a programmer.

> To know about what all those kotlin flow functions (or how'd you call them) do requires additional expertise.

In my opinion, if statements without braces are easier to miss or misinterpret when scanning to understand the control flow. Thus, it is easier to accidentally change them to something unintended. Also, if you want to add a second statement after the if, you need to change the entire line, making the git commit history messier. Yes, it does require additional expertise. But so do static types, and I think you would agree they are useful. So it is just a difference of opinion about whether the trade-off is worth it.

I think we've had this conversation before, so I will not say anything more on the topic. (You are still welcome to respond — I don't want to insist on myself having the final word.)

smichel17

Oop, forgot to comment as a "review"

FloEdelmann added 2 commits January 15, 2021 17:37

Replace MultiIterable usages with Sequence

ebe948b

Drop unused FlattenIterable and MultiIterable

cd4190f

westnordost reviewed Jan 15, 2021

app/src/main/java/de/westnordost/streetcomplete/data/osm/osmquest/OsmFilterQuestType.kt

westnordost reviewed Jan 15, 2021

app/src/main/java/de/westnordost/streetcomplete/util/LatLonRaster.kt

Avoid expensive toList call

066b159

westnordost reviewed Jan 15, 2021

app/src/main/java/de/westnordost/streetcomplete/data/osm/osmquest/OsmFilterQuestType.kt

Shorten conversion with asIterable()

f0c867b

westnordost approved these changes Jan 16, 2021

smichel17 suggested changes Jan 17, 2021

Make sequence creation more concise

c63f323

Co-Authored-By: smichel17 <github@smichel.me>

FloEdelmann requested review from smichel17 and westnordost January 17, 2021 15:16

westnordost approved these changes Jan 17, 2021

smichel17 approved these changes Jan 17, 2021

westnordost merged commit b509b39 into streetcomplete:master Jan 17, 2021

FloEdelmann deleted the iterable branch January 17, 2021 19:21

FloEdelmann mentioned this pull request Jun 21, 2022

Upgrade ktlint to v0.46.0 #4140

Closed
