
This project, part of my Master's in Business Analytics at The State University of New York at Buffalo, utilizes Python, SQL, and machine learning algorithms to analyze the RMS Titanic disaster, revealing insights into survival rates by gender, age groups, and socio-economic class.


nrhartnett/TitanicMachineLearning


Titanic-Machine-Learning

This group project was completed for EAS-503: Python for Data Scientists, a course in my Master of Science in Business Analytics at The State University of New York at Buffalo. It combines Python, SQL, SQLite3, database normalization, data parsing, and machine learning to analyze the RMS Titanic disaster.

The raw data is parsed from CSV files and normalized in Python, then loaded into a normalized SQLite3 database. Pandas is used for data exploration and visualization, and the analysis reports survival rates by gender, age group, and socio-economic class.

In the independent segment of the project, machine learning models are trained on the following features: ticket class, number of parents/children aboard, number of siblings/spouses aboard, port of embarkation, sex, and age. The algorithms used are Logistic Regression, SVM, Decision Trees, a KNN classifier, and Random Forest, applied to better understand the factors that influenced passenger survival.
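As a rough illustration of the parse, normalize, and load steps described above, the following is a minimal sketch using only the Python standard library. It is not the notebook's actual code: the table names (`Sex`, `Port`, `Passenger`), the helper functions, and the five sample rows are all assumptions made for this example. The rows are invented and are not real passenger records; only the column names follow the Kaggle CSV header.

```python
import csv
import io
import sqlite3

# Invented sample rows for illustration only (NOT real passenger data).
# Column names match the Kaggle Titanic CSV header.
RAW_CSV = """PassengerId,Survived,Pclass,Sex,Age,Embarked
1,0,3,male,30,S
2,1,1,female,40,C
3,1,3,female,25,S
4,0,2,male,,Q
5,1,1,male,50,S
"""


def lookup_id(conn, table, id_col, value_col, value):
    """Insert value into a lookup table if missing and return its key."""
    # table and column names are fixed strings chosen in this sketch,
    # so building the SQL with f-strings is safe here.
    conn.execute(
        f"INSERT OR IGNORE INTO {table} ({value_col}) VALUES (?)", (value,)
    )
    row = conn.execute(
        f"SELECT {id_col} FROM {table} WHERE {value_col} = ?", (value,)
    ).fetchone()
    return row[0]


def build_database(csv_text):
    """Parse CSV text and load it into a small normalized in-memory schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE Sex (
            sex_id INTEGER PRIMARY KEY,
            label  TEXT UNIQUE NOT NULL
        );
        CREATE TABLE Port (
            port_id INTEGER PRIMARY KEY,
            code    TEXT UNIQUE NOT NULL
        );
        CREATE TABLE Passenger (
            passenger_id INTEGER PRIMARY KEY,
            survived     INTEGER NOT NULL,
            pclass       INTEGER NOT NULL,
            sex_id       INTEGER NOT NULL REFERENCES Sex(sex_id),
            age          REAL,
            port_id      INTEGER REFERENCES Port(port_id)
        );
        """
    )
    for row in csv.DictReader(io.StringIO(csv_text)):
        sex_id = lookup_id(conn, "Sex", "sex_id", "label", row["Sex"])
        # Empty strings in the CSV become NULL rather than bogus values.
        port_id = (
            lookup_id(conn, "Port", "port_id", "code", row["Embarked"])
            if row["Embarked"]
            else None
        )
        age = float(row["Age"]) if row["Age"] else None
        conn.execute(
            "INSERT INTO Passenger VALUES (?, ?, ?, ?, ?, ?)",
            (
                int(row["PassengerId"]),
                int(row["Survived"]),
                int(row["Pclass"]),
                sex_id,
                age,
                port_id,
            ),
        )
    conn.commit()
    return conn


def survival_rate_by_sex(conn):
    """Return {sex label: fraction survived} by joining back to the lookup."""
    rows = conn.execute(
        """
        SELECT s.label, AVG(p.survived)
        FROM Passenger AS p
        JOIN Sex AS s ON s.sex_id = p.sex_id
        GROUP BY s.label
        ORDER BY s.label
        """
    ).fetchall()
    return dict(rows)
```

Calling `survival_rate_by_sex(build_database(RAW_CSV))` returns a dictionary keyed by sex label. Moving repeated text values such as sex and port of embarkation into lookup tables is one common way to normalize this kind of data; the schema in the notebook may differ.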

The full project may be viewed within the Jupyter Notebook file: "Project Notebook.ipynb".

The data utilized in this project may be found at: https://www.kaggle.com/competitions/titanic.
