Data science project in week 6 of the data science bootcamp @neuefische

This is a collaborative project, authored together with @froukje

Summary:

Evaluating the risk whether a person tends to consume drugs is not only important for each individual person, but for our entiry society as medical treatment and resozialization of an adicted person is costly. That is each individual of our society has a responsibility to prevent him-/herself to get in touch with drugs. To facilitate that an app was developed to help to self-assess a tendency to become a drug consumer. To do this we used data from a survey, which asked people questions about their demographic background and personality. The dataset is described here.

Structure of the repository:

In this repository you find the whole process of our approach within our notebook.

In addition we created a business presentation for our fictitious recommendation app. Don't take this presentation too seriously 😉.

Business Case:

We analysed the given dataset with a fictitious business case in mind. We want to develop an app that by testing the users personality makes recommendations whether the user might tend to do drugs in the future. The goal is to prevent drug use and sensitize the user. The result of the app is shown in an encouraging way, it might look like to following:

Key Takeaways:

Logistic Regression gives the best results for our business case
All classifiers deliver very similar values for the precision
Using all feature not always gives the best model. (That is for our business case there is no need to survey caffeine, chocolate, alcohol)

Future Work:

Definition of targets:
- Other target groups or more might be considered, maybe there is one specific drug which is highly interested
Definition of metric:
- As discussed in the last section recall is also a possible metric for this business case. An optimization on recall might change the results.
- Unsettle (recall) vs. don't unsettle (precision) people
Our survey data is from people who are already taking drugs. For pur prediction we use personality characteristics, which might change when taking drugs. We however assume that this is not the case. Otherwise we could not use this data to make predictions on potential drug users. This might be clarified talking to an expert.
In our business case a possibility to overcome this and also to test the predictions might be an optional check whether a tested person turned into a drug user or not after a certain time.
Our model is biased for ethnicity “white”. If possible observations fromdifferent ethnicities should be included.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.ipynb_checkpoints		.ipynb_checkpoints
DS-Project-Personality-vs-Risk-of-Drug-Use.ipynb		DS-Project-Personality-vs-Risk-of-Drug-Use.ipynb
DS-Project-Presentation-Personality-vs-Drug-Use.pdf		DS-Project-Presentation-Personality-vs-Drug-Use.pdf
README.md		README.md
app_template.png		app_template.png
drug_consumption.xls		drug_consumption.xls
drugs4.png		drugs4.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data science project in week 6 of the data science bootcamp @neuefische

Summary:

Structure of the repository:

Business Case:

Key Takeaways:

Future Work:

About

Releases

Packages

Languages

HssDix/DS-Project2-Personality-vs-Risk-of-Drug-Use

Folders and files

Latest commit

History

Repository files navigation

Data science project in week 6 of the data science bootcamp @neuefische

Summary:

Structure of the repository:

Business Case:

Key Takeaways:

Future Work:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages