MDST-FARS-share

This repo is a start where Tiapei Xie, Sheng Yang and Sean Ma are collaborating together for the goal of becoming the top notch Data Scientiest. We started with the the MDST-FARS competition.

Mission:

Helping each other to become the top Data Scientist in our field.

Goals:

We use competitions in Kaggle to sharpen our skill sets. Our main goal is to get placed in the public leader board. Pratical goals for now are:

Further test FARS data set with ensemble technologies (Tianpei & Sean).
Practice deep learning on FARS data set (Tianpei lead).
Continue to learn from 2 public Kaggle competitions (Home Depot; Santander) (Sheng lead).
Learn cloud technologies such as AWS (Sean lead).
Compete in a public Kaggle competition (maybe in May or June).

Models and AUC scores:

Model code	Submission CSV	Public Score	Private Score	Note
Model_xgboost-production_weighted.py	fars_submit_xgb004-production_weighted_missing.csv	0.87135	0.86657	No change using missing option
Model_xgboost-production_weighted.py	fars_submit_xgb004-production_weighted.csv	0.87135	0.86657
Model_xgboost-production_weighted_Tian_acc_per.py	fars_submit_xgb005-production_weighted_Tian_acc_per.csv	0.85757	0.85050	Seems like 2 tables with dummies are ok

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
Tianpei		Tianpei
images		images
.gitignore		.gitignore
Model_accident_person.ipynb		Model_accident_person.ipynb
Model_accident_person.py		Model_accident_person.py
Model_accident_person_001.py		Model_accident_person_001.py
Model_xgboost-production-Copy1.ipynb		Model_xgboost-production-Copy1.ipynb
Model_xgboost-production.ipynb		Model_xgboost-production.ipynb
Model_xgboost-production.py		Model_xgboost-production.py
Model_xgboost-production_weighted.pbs		Model_xgboost-production_weighted.pbs
Model_xgboost-production_weighted.pbs.o18707803		Model_xgboost-production_weighted.pbs.o18707803
Model_xgboost-production_weighted.pbs.o18707844		Model_xgboost-production_weighted.pbs.o18707844
Model_xgboost-production_weighted.pbs.o18708206		Model_xgboost-production_weighted.pbs.o18708206
Model_xgboost-production_weighted.py		Model_xgboost-production_weighted.py
Model_xgboost-production_weighted_Tian_acc_per.pbs		Model_xgboost-production_weighted_Tian_acc_per.pbs
Model_xgboost-production_weighted_Tian_acc_per.pbs.o18709071		Model_xgboost-production_weighted_Tian_acc_per.pbs.o18709071
Model_xgboost-production_weighted_Tian_acc_per.pbs.o18825666		Model_xgboost-production_weighted_Tian_acc_per.pbs.o18825666
Model_xgboost-production_weighted_Tian_acc_per.pbs.o18825690		Model_xgboost-production_weighted_Tian_acc_per.pbs.o18825690
Model_xgboost-production_weighted_Tian_acc_per.py		Model_xgboost-production_weighted_Tian_acc_per.py
Model_xgboost-production_weighted_Tian_per_veh.pbs		Model_xgboost-production_weighted_Tian_per_veh.pbs
Model_xgboost-production_weighted_Tian_per_veh.pbs.o18709074		Model_xgboost-production_weighted_Tian_per_veh.pbs.o18709074
Model_xgboost-production_weighted_Tian_per_veh.pbs.o18825691		Model_xgboost-production_weighted_Tian_per_veh.pbs.o18825691
Model_xgboost-production_weighted_Tian_per_veh.pbs.o18826952		Model_xgboost-production_weighted_Tian_per_veh.pbs.o18826952
Model_xgboost-production_weighted_Tian_per_veh.py		Model_xgboost-production_weighted_Tian_per_veh.py
Model_xgboost.ipynb		Model_xgboost.ipynb
Model_xgboost_acc_only.py		Model_xgboost_acc_only.py
README.md		README.md
R_cleandata.ipynb		R_cleandata.ipynb
T_j_acc_per_001.pbs		T_j_acc_per_001.pbs
T_j_acc_per_001.pbs.o18706919		T_j_acc_per_001.pbs.o18706919
fars_submit.csv		fars_submit.csv
fars_submit_j_acc_per_ext_1.csv		fars_submit_j_acc_per_ext_1.csv
fars_submit_xgb001.csv		fars_submit_xgb001.csv
fars_submit_xgb002.csv		fars_submit_xgb002.csv
fars_submit_xgb003.csv		fars_submit_xgb003.csv
fars_submit_xgb003_production.csv		fars_submit_xgb003_production.csv
fars_submit_xgb004_production_weighted.csv		fars_submit_xgb004_production_weighted.csv
fars_submit_xgb004_production_weighted_missing.csv		fars_submit_xgb004_production_weighted_missing.csv
fars_submit_xgb005_production_weighted_Tian_acc_per.csv		fars_submit_xgb005_production_weighted_Tian_acc_per.csv
weight_train.csv		weight_train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MDST-FARS-share

Mission:

Helping each other to become the top Data Scientist in our field.

Goals:

Models and AUC scores:

About

Releases

Packages

Contributors 2

Languages

seantma/MDST-FARS-share

Folders and files

Latest commit

History

Repository files navigation

MDST-FARS-share

Mission:

Helping each other to become the top Data Scientist in our field.

Goals:

Models and AUC scores:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages