Core: Use Neo4j Spark connector for importing data #48

mattigrthr · 2021-07-19T10:08:04Z

Instead of sending the insertion queries through the Neo4j Python driver, we should read the preprocessed population and OSM Parquet files with Spark and use the Neo4j Spark connector to make the inserts. The connector batches the writes automatically, which should speed up the entire population process.
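A rough sketch of what this could look like with PySpark, not the final implementation. The node label, key column, file path, and credentials are placeholders I made up for illustration; the option names (`labels`, `node.keys`, `batch.size`) and the `org.neo4j.spark.DataSource` format are the ones the Neo4j Spark connector documents, but the connector version and the actual graph schema still need to be decided.

```python
# Sketch only: label, key columns, path, and credentials below are hypothetical.
NEO4J_FORMAT = "org.neo4j.spark.DataSource"  # data source name of the Neo4j Spark connector


def neo4j_node_options(url, user, password, label, key_columns, batch_size=5000):
    """Build connector options for writing DataFrame rows as nodes with one label."""
    return {
        "url": url,
        "authentication.basic.username": user,
        "authentication.basic.password": password,
        "labels": f":{label}",
        # With save mode "Overwrite", the connector merges nodes on these columns.
        "node.keys": ",".join(key_columns),
        # Number of rows sent per batch; the connector does the batching itself.
        "batch.size": str(batch_size),
    }


def import_parquet_as_nodes(parquet_path, options):
    """Read a Parquet file with Spark and write its rows to Neo4j as nodes."""
    # Imported lazily so the option builder works without a Spark installation.
    from pyspark.sql import SparkSession

    # Assumes the connector jar is already on the Spark classpath.
    spark = SparkSession.builder.appName("neo4j-import").getOrCreate()
    df = spark.read.parquet(parquet_path)
    df.write.format(NEO4J_FORMAT).mode("Overwrite").options(**options).save()
```

Usage would be along the lines of `import_parquet_as_nodes("path/to/result.parquet", neo4j_node_options("bolt://localhost:7687", "neo4j", "password", "PopulationCell", ["h3_index"]))`, where every argument is again a placeholder.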

This depends on #47.

mattigrthr added the enhancement and core labels Jul 19, 2021

mattigrthr added this to the Speed-up OSM and Population Processing milestone Jul 19, 2021

mattigrthr added this to To do in Kuwala via automation Jul 19, 2021

mattigrthr self-assigned this Aug 11, 2021

mattigrthr moved this from To do to In progress in Kuwala Aug 11, 2021

mattigrthr mentioned this issue Aug 11, 2021

Speed up OSM processing and switch to Python #52

Merged

mattigrthr closed this as completed in #52 Aug 21, 2021

Kuwala automation moved this from In progress to Done Aug 21, 2021
