machine-learning-challenge

For this assignment, two different models were fit and tested on the provided exoplanet data.

Model 1:

Before tuning, the logistic regression model produced the following accuracy scores:

Training Data Score: 0.8046919702460423
Testing Data Score: 0.8175057208237986

After tuning, the logistic regression model produced the following accuracy scores:

Training Data Score: 0.7959183673469388
Testing Data Score: 0.8054919908466819
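The scores reported for Model 1 read like the output of scikit-learn's `score` method, which for a classifier returns mean accuracy (the fraction of correctly predicted labels). The sketch below illustrates that, assuming the notebook used `LogisticRegression.score`. It runs on synthetic data from `make_classification`, not on the exoplanet dataset, so its numbers will not match the ones above.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption); the real notebook would load exoplanet_data.csv.
X, y = make_classification(n_samples=300, n_features=8, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# For classifiers, score() is the fraction of correctly predicted labels.
train_score = model.score(X_train, y_train)
test_score = model.score(X_test, y_test)

# The same testing score computed explicitly from the predictions.
manual_test_score = accuracy_score(y_test, model.predict(X_test))

print(f"Training Data Score: {train_score}")
print(f"Testing Data Score: {test_score}")
```

Because `score` and `accuracy_score` agree, a value such as 0.8055 can be read as roughly 80.5% of test samples classified correctly.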

Model 2:

A neural network model provided the following data:

Loss: 0.4614897072315216, Accuracy: 0.8592677116394043

Comparing the two models, the neural network achieved the higher accuracy (85.9%), versus a testing score of 80.5% for the tuned logistic regression model.

Machine Learning Homework - Exoplanet Exploration

Before You Begin

Create a new repository for this project called machine-learning-challenge. Do not add this homework to an existing repository.
Clone the new repository to your computer.
Give each model you choose its own Jupyter notebook; do not use more than one model per notebook.
Save your best model to a file. This will be the model used to test your accuracy and used for grading.
Commit your Jupyter notebooks and model file and push them to GitHub.
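One common way to save a fitted scikit-learn model to a file is `joblib.dump`. The following is a minimal sketch under stated assumptions: the filename `best_model.sav` and the synthetic training data are hypothetical, and the assignment does not prescribe a particular serialization library.

```python
import os
import tempfile

import joblib
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Synthetic stand-in data (assumption); replace with the preprocessed exoplanet data.
X, y = make_classification(n_samples=200, n_features=6, random_state=0)
best_model = LogisticRegression(max_iter=1000).fit(X, y)

# "best_model.sav" is a hypothetical filename written to a temporary directory.
model_path = os.path.join(tempfile.mkdtemp(), "best_model.sav")
joblib.dump(best_model, model_path)

# Reload the file and confirm the restored model predicts identically.
loaded_model = joblib.load(model_path)
same_predictions = bool((loaded_model.predict(X) == best_model.predict(X)).all())
print(same_predictions)
```

Reloading the file and comparing predictions is a quick check that the saved model is the one intended for grading.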

Note

Keep in mind that this homework is optional! However, you will gain a much greater understanding of testing and tuning different Classification models if you do complete it.

Background

Over a period of nine years in deep space, the NASA Kepler space telescope has been out on a planet-hunting mission to discover hidden planets outside of our solar system.

To help process this data, you will create machine learning models capable of classifying candidate exoplanets from the raw dataset.

In this homework assignment, you will need to:

Preprocess the raw data
Tune the models
Compare two or more models

Instructions

Preprocess the Data

Preprocess the dataset prior to fitting the model.
Perform feature selection and remove unnecessary features.
Use MinMaxScaler to scale the numerical data.
Separate the data into training and testing data.
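The preprocessing steps above can be sketched as follows. This is an illustration on synthetic NumPy data, not the exoplanet dataset, and the feature selection shown (keeping the first three columns) is purely hypothetical. Note one deliberate choice: the sketch splits the data before scaling and fits `MinMaxScaler` on the training portion only, so that test data does not influence the scaling parameters.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

rng = np.random.default_rng(0)

# Synthetic stand-in: 100 rows, 5 numeric features on very different scales.
X = rng.normal(size=(100, 5)) * np.array([1.0, 10.0, 100.0, 0.1, 1000.0])
y = rng.integers(0, 2, size=100)

# Hypothetical feature selection: keep only the first three columns.
X_selected = X[:, :3]

# Split first so that the test data does not influence the scaler.
X_train, X_test, y_train, y_test = train_test_split(
    X_selected, y, random_state=42, stratify=y
)

# Fit on the training data only, then apply the same transform to both sets.
scaler = MinMaxScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)

print(X_train_scaled.min(axis=0), X_train_scaled.max(axis=0))
```

After scaling, every training feature spans the range 0 to 1; test features may fall slightly outside that range because they are transformed with the training minimum and maximum.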

Tune Model Parameters

Use GridSearch to tune model parameters.
Tune and compare at least two different classifiers.
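In scikit-learn, grid search is provided by the `GridSearchCV` class. The sketch below tunes the regularization strength `C` of a logistic regression classifier. The parameter grid, the 5-fold setting, and the synthetic data are assumptions for illustration; the values actually searched in the notebooks are not stated here.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in data (assumption); replace with the scaled exoplanet features.
X, y = make_classification(n_samples=300, n_features=8, random_state=1)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

# Hypothetical grid of regularization strengths to try.
param_grid = {"C": [0.01, 0.1, 1, 10, 100]}

# Each candidate is evaluated with 5-fold cross-validation on the training data.
grid = GridSearchCV(LogisticRegression(max_iter=1000), param_grid, cv=5)
grid.fit(X_train, y_train)

print(grid.best_params_)           # best parameter combination found
print(grid.best_score_)            # mean cross-validated accuracy for that combination
print(grid.score(X_test, y_test))  # accuracy of the refit best model on held-out data
```

`best_score_` is an average over cross-validation folds of the training data, so it can differ from the held-out testing score, and a tuned model is not guaranteed to score higher on the test set than an untuned one.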

Reporting

Create a README that reports a comparison of each model's performance as well as a summary about your findings and any assumptions you can make based on your model (is your model good enough to predict new exoplanets? Why or why not? What would make your model be better at predicting new exoplanets?).

Resources

Hints and Considerations

Start by cleaning the data, removing unnecessary columns, and scaling the data.
Not all variables are significant; be sure to remove any insignificant variables.
Make sure your sklearn package is up to date.
Try a simple model first, and then tune the model using GridSearch.

Submission

Create a Jupyter Notebook for each model and host the notebooks on GitHub.
Create a file for your best model and push it to GitHub.
Include a README.md file that summarizes your assumptions and findings.
Submit the link to your GitHub project to Bootcamp Spot.
Ensure your repository has regular commits (i.e. 20+ commits) and a thorough README.md file.

© 2019 Trilogy Education Services, a 2U, Inc. brand. All Rights Reserved.
