data-science-challenge

Get NASA's solar flare data here:

Data columns explained (ignore the ones that are not listed):

The challenge:

Cleanse and extract the data in a useable format for machine learning. Ideally into a database. Please use Python
Perform analysis on the data to find "interestingness" or correlations among the columns (if any).
Perform anomaly detection on the Dur (duration in seconds) column. It's up to you how you approach this. The timestamps for the data are also in random intervals, it's your choice how you approach this.

Caveat: Do not use Numenta's Nupic algorithms for anomaly detection (its what we use). Anything else is fine, Tensorflow, from scratch, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md

Provide feedback