RPC calls for visits within a time interval #1

elliottwilliams · 2017-01-15T13:41:34Z

As Proper starts to consume Timetable data, I'm realizing that a more useful way to query arrival data is by asking for all the arrivals within a time interval. Proper's station views (by default) show all arrivals for the next hour; this RPC would suit that use case nicely. Here's how I envision the RPC looking:

timetable.visits_between(station, route, start_time, end_time, n) -> [(ETA, ETD)]

start_time and end_time are standard (%Y%m%d %H:%M:%S) timestamps. n allows a count limit to be specified, but it can be omitted.

The text was updated successfully, but these errors were encountered:

faultyserver · 2017-01-15T16:23:03Z

This is definitely possible (and fairly easy) with the current system. In fact I've already implemented it.

Right now, there exists a map with a complex key of (Route, Trip, Station, StopTime). The keys are ordered lexicographically by each of the key parts successively. That is, all Stops on a Route are contiguous in memory, etc. Asking for all StopTimes in any timespan is just making two searches into the tree and iterating between them. In fact, visits_after and visits_before are just special cases of visits_between where one of the bounding points is the first/last StopTime at a Station.

The question is: does this hierarchy make sense? Or should it be rearranged in a way that's more efficient for the queries that will be made? For example, having the Station be the top-level key-part and the Route be the last-level key-part would make queries for all arrivals at a Station (regardless of Route) much more efficient.

elliottwilliams · 2017-01-15T18:34:12Z

The two views in 1.0 that feed off of Timetable data will be station views, so inverting your map to have the top-level key be Station would make sense to me.

Perhaps "the right thing" to do here is to have a map keyed by route-trip-station, and another by station-route-trip, and pick the more efficient for a given RPC call, but station-route-trip is the most useful to Proper for now.

elliottwilliams · 2017-01-15T18:34:52Z

And if you're cool with the visits_between call I specified, that's perfect for me. I'll add it to the wiki and start using it in Proper.

elliottwilliams · 2017-01-15T18:42:35Z

I made two minor changes to the call I added to the wiki:

Clarified that visits returned must have a departure time t such that begin_time <= t < end_time. This is so that the client can get the next interval of visits by searching starting with end_time.
Removed the limit n parameter. This makes doing interval-based "pagination" the way I specified above safer by guaranteeing that visits won't be missed.

faultyserver · 2017-01-16T17:16:23Z

All that makes sense to me.

Having two maps wouldn't be too bad, since each one can just store pointers (or rather, references). The only changes will be adding a new key class for the different order. Will try this out tonight.

faultyserver · 2017-01-29T06:11:10Z

Update on implementation: there's still only one map at this point, but it's key is (Station, StopTime, Route, Trip). This means that the default order of results for a query will be time-ordered, then route-ordered, and finally trip-ordered (though trip ordering is mostly negligible).

Update on performance: I'm currently working with Citybus' most recent archive, which has more than 250,000 StopTimes shared across 800 Stops and just under 9,000 Trips. The total memory usage with this single map is 213MB. My best guess at the implications of adding a second map will increase that memory usage to ~300MB. Processing time (parsing, interpolating stop times, and creating the map) before the system is available is ~4 seconds.

Overall, it's actually a lot better than I was anticipating based on the dataset, but could probably be improved farther with better usage of references and move-constructors, but that's a future concern.

…on, getting ready for filter_iterator. CSV Parsing can now be done all at once (as before, but now called `all`) or as a stream (via `initialize` and `next`). Also, `visit_list_key` and its usages have also been re-arranged to be `(Station, StopTime, Route, Trip)` to better accomodate its primary use case (See #1). For uses better managed by a different arrangement, another key/map will likely be made. In general, Timetable is now ready to have a `filter_iterator` that takes a custom predicate and only returns results that pass the predicate condition. The primary use of this will be to filter out visits whose service is not being offered in a given timeframe (a la `Timetable::Calendar`), or to return only visits from a given set of routes.

faultyserver · 2017-03-17T01:47:45Z

This has been referenced in a few other issues as well (#3 in particular), but this RPC has both been added to the wiki and implemented in Timetable, so I'm going to close this issue as completed.

faultyserver added the enhancement label Jan 16, 2017

faultyserver mentioned this issue Jan 21, 2017

Create visits RPCs for all routes at a station #3

Closed

faultyserver closed this as completed Mar 17, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RPC calls for visits within a time interval #1

RPC calls for visits within a time interval #1

elliottwilliams commented Jan 15, 2017 •

edited

Loading

faultyserver commented Jan 15, 2017

elliottwilliams commented Jan 15, 2017

elliottwilliams commented Jan 15, 2017

elliottwilliams commented Jan 15, 2017

faultyserver commented Jan 16, 2017

faultyserver commented Jan 29, 2017 •

edited

Loading

faultyserver commented Mar 17, 2017

RPC calls for visits within a time interval #1

RPC calls for visits within a time interval #1

Comments

elliottwilliams commented Jan 15, 2017 • edited Loading

faultyserver commented Jan 15, 2017

elliottwilliams commented Jan 15, 2017

elliottwilliams commented Jan 15, 2017

elliottwilliams commented Jan 15, 2017

faultyserver commented Jan 16, 2017

faultyserver commented Jan 29, 2017 • edited Loading

faultyserver commented Mar 17, 2017

elliottwilliams commented Jan 15, 2017 •

edited

Loading

faultyserver commented Jan 29, 2017 •

edited

Loading