Click_prediction_using_SVM-Doc2Vec

This REPO contains a Data Science Challenge which consists of predicting whether an AD is going to be clicked or not depending on some given features. After some preprocessing and data analysis we have to choose which features are the most important as well as to determine the embeddings of the text contained in the Topic Line of the Advertisement.

We use some text embedding technics like Doc2Vec and then we apply a classification SVM model to predict the class (0,1) of each feature vector.

In the end we used Hold-Out Validation to assess some trained SVM models and got the following results :

where Model 1 :

Input features :
- Day of the Month
- Hour of the Day
- Daily Time Spent on Site
- Age
- Area Income
- Daily Internet Usage
- Gender
- Embeddings of Ad Topic Line (using Doc2Vec)

and Model 2 :

Input features :
- Day of the Month
- Hour of the Day
- Daily Time Spent on Site
- Age
- Area Income
- Daily Internet Usage
- Embeddings of Ad Topic Line (using Doc2Vec)

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.gitignore		.gitignore
README.md		README.md
challenge.ipynb		challenge.ipynb
submissions.csv		submissions.csv
submissions_v2.csv		submissions_v2.csv
test.csv		test.csv
train.csv		train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

README.md

README.md

challenge.ipynb

challenge.ipynb

submissions.csv

submissions.csv

submissions_v2.csv

submissions_v2.csv

test.csv

test.csv

train.csv

train.csv

Repository files navigation

Click_prediction_using_SVM-Doc2Vec

About

Releases

Packages

Languages

MohamedAmineDHIAB/AD_click_prediction_using_SVM-Doc2Vec

Folders and files

Latest commit

History

Repository files navigation

Click_prediction_using_SVM-Doc2Vec

About

Resources

Stars

Watchers

Forks

Languages