Skip to content

sunnypatel165/ETL-Tool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ETL-Tool

This is a desktop application that simulates working of a data warehouse process for a telecommunication company, at a very basic level.

The basic steps of any datawarehouse are-

  • Extraction- Extracting from various data stores like text, SQL, Access, CSV etc.
  • Transformation- Transforms data into certain format(uniform)
  • Loading- Load the data into a database.

The application also has a couple of mining algorithms embedded in it. currently it has Bayes classification, KNN and K means clustering. Various other algorithms like Apriori, Heirarchical clustering etc can also be included.

To run:

-Download an extract Zip
-Copy it into your netbeans projects directory
-Build, Compile and Run.

Or
Refer to the Dropbox link. It has screenshots and built JAR file.

Or
Clone the JAR in dist/ETL.jar

Note for current version:

Paths used in the code are not relative/real, they are absolute paths refering to my PC, hence one has to change a few paths before getting it to work.
Also, to use SQL and JDBC data stores, necessary things may have to be installed like the drivers for JDBC and the MySQL.
Would love pull requests and contributions! :)

About

ETL tool for a telecomm company's data warehouse

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages