Apache Sedona (incubating) is a cluster computing system for processing large-scale spatial data. Sedona extends Apache Spark / SparkSQL with a set of out-of-the-box Spatial Resilient Distributed Datasets (SRDDs)/ SpatialSQL that efficiently load, process, and analyze large-scale spatial data across machines.
Sedona contains several modules:
| Name | API | Spark compatibility | Introduction |
|---|---|---|---|
| Core | RDD | Spark 2.X/1.X | SpatialRDDs and Query Operators. |
| SQL | SQL/DataFrame | SparkSQL 2.1+ | SQL interfaces for Sedona core. |
| Viz | RDD, SQL/DataFrame | RDD - Spark 2.X/1.X, SQL - Spark 2.1+ | Visualization for Spatial RDD and DataFrame. |
| Zeppelin | Apache Zeppelin | Spark 2.1+, Zeppelin 0.8.1+ | Plugin for Apache Zeppelin |
Sedona supports several programming languages: Scala, Java, SQL, Python and R.
Contact
Twitter: Sedona@Twitter
Sedona JIRA: Bugs, Pull Requests, and other similar issues
- issues@sedona.apache.org: general questions or tutorials
- dev@sedona.apache.org: project development

