Explore, transform, and analyze FHIR data with Apache Spark



Bunsen lets users load, transform, and analyze FHIR data with Apache Spark. It offers Java and Python APIs that convert FHIR resources into Spark Datasets, which can then be explored with the full power of that platform, including Spark SQL. For details, see the Bunsen documentation.
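The idea of turning FHIR resources into a table and querying it with SQL can be sketched without a Spark cluster. The example below is only an illustration of that idea and does not use Bunsen's API: it relies on the Python standard library, swaps in an in-memory SQLite table as a stand-in for a Spark Dataset, and works on a small made-up bundle. The helper names `observation_rows` and `average_value` are hypothetical.

```python
import json
import sqlite3

# A tiny, made-up FHIR bundle used purely for illustration.
BUNDLE_JSON = """
{
  "resourceType": "Bundle",
  "type": "collection",
  "entry": [
    {"resource": {"resourceType": "Patient", "id": "p1", "gender": "female"}},
    {"resource": {"resourceType": "Observation", "id": "o1",
                  "subject": {"reference": "Patient/p1"},
                  "code": {"coding": [{"system": "http://loinc.org", "code": "8867-4"}]},
                  "valueQuantity": {"value": 72, "unit": "beats/minute"}}},
    {"resource": {"resourceType": "Observation", "id": "o2",
                  "subject": {"reference": "Patient/p1"},
                  "code": {"coding": [{"system": "http://loinc.org", "code": "8867-4"}]},
                  "valueQuantity": {"value": 88, "unit": "beats/minute"}}}
  ]
}
"""


def observation_rows(bundle):
    """Flatten the Observation entries of a bundle into (id, subject, code, value) tuples."""
    rows = []
    for entry in bundle.get("entry", []):
        resource = entry.get("resource", {})
        if resource.get("resourceType") != "Observation":
            continue  # skip Patient and any other resource types
        coding = resource["code"]["coding"][0]
        rows.append((
            resource["id"],
            resource["subject"]["reference"],
            coding["code"],
            resource["valueQuantity"]["value"],
        ))
    return rows


def average_value(rows, code):
    """Load the rows into an in-memory SQLite table (a stand-in for a Spark
    Dataset) and compute the average value for one code with plain SQL."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE observation (id TEXT, subject TEXT, code TEXT, value REAL)"
    )
    conn.executemany("INSERT INTO observation VALUES (?, ?, ?, ?)", rows)
    (avg,) = conn.execute(
        "SELECT AVG(value) FROM observation WHERE code = ?", (code,)
    ).fetchone()
    conn.close()
    return avg


rows = observation_rows(json.loads(BUNDLE_JSON))
print(average_value(rows, "8867-4"))
```

With Bunsen, the flattening step is handled by the library and the resulting Dataset carries the full FHIR resource structure rather than the four hand-picked columns used here; see the Bunsen documentation for the actual API.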


Bunsen is built and tested with Apache Maven and uses the standard Maven lifecycle to build, install, and deploy.

User documentation is built with Sphinx. PySpark must be installed in the environment to generate the Python documentation. With that in place, run make html in the docs directory to build the documentation, and make deploy in the same directory to publish it to the GitHub Pages site.


Bunsen is hosted in the Maven Central repository.


Bunsen's Java code should follow the Google Java Style Guide.


Please use GitHub issues to record any requests or issues for this project.




Copyright 2017 Cerner Innovation, Inc.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at


Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.