This is a desktop application that simulates working of a data warehouse process for a telecommunication company, at a very basic level.
The basic steps of any datawarehouse are-
- Extraction- Extracting from various data stores like text, SQL, Access, CSV etc.
- Transformation- Transforms data into certain format(uniform)
- Loading- Load the data into a database.
The application also has a couple of mining algorithms embedded in it. currently it has Bayes classification, KNN and K means clustering. Various other algorithms like Apriori, Heirarchical clustering etc can also be included.
To run:
-Download an extract Zip
-Copy it into your netbeans projects directory
-Build, Compile and Run.
Or
Refer to the Dropbox link. It has screenshots and built JAR file.
Or
Clone the JAR in dist/ETL.jar
Also, to use SQL and JDBC data stores, necessary things may have to be installed like the drivers for JDBC and the MySQL.
Would love pull requests and contributions! :)