Illustrates how to 1) transfer IBM Informix data to Apache Spark using JDBC and 2) continue doing analytics with Spark and Java.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
data
src
.gitignore
LICENSE
README.md
pom.xml

README.md

IBM Informix 2 Apache Spark

Illustrates how to transfer IBM Informix data to Apache Spark using JDBC, and more.

This code is support to a series of articles on IBM's developerWorks:

Part 1: Collecting the data

Scope: The first article covers data ingestion.

Link: https://www.ibm.com/developerworks/opensource/library/ba-offloading-informix-data-spark/index.html

Code: All code is in the net.jgp.labs.informix2spark.l0xx and net.jgp.labs.informix2spark.l1xx packages.

Part 2: Basic analysis of your data

Scope: Basic analytics and gaining basic insight from the data.

Link: https://www.ibm.com/developerworks/opensource/library/ba-offloading-informix-data-spark-2/index.html

Code: All code is in the net.jgp.labs.informix2spark.l2xx packages.

Part 3: More Complex Analysis

Scope: Going deeper in the dataframe API to start doing joins and more complex analytics.

Link: https://www.ibm.com/developerworks/opensource/library/ba-offloading-informix-data-spark-3/index.html

Code: All code is in the net.jgp.labs.informix2spark.l3xx packages.

Part 4: Leverage data against other data sources

Scope: It's time to discover some of the power of Spark by adding two other (external) data sources.

Link: https://www.ibm.com/developerworks/opensource/library/ba-offloading-informix-data-spark-4/index.html

Code: All code is in the net.jgp.labs.informix2spark.l4xx packages.

Part 5: Machine learning to the rescue

Scope: Machine Learning will help you extrapolate future orders.

Link: https://www.ibm.com/developerworks/opensource/library/ba-offloading-informix-data-spark-5/index.html

Code: All code is in the net.jgp.labs.informix2spark.l5xx packages.