Fraud-Detection

Detecting anomaly patterns in surveys using autoencoders

This was an internship assessment project that I was required to do, I have developed my skill using unsupervised learning with this project and had to expand my knowledge on autoencoders and how they work.

A breif report of the assessment is found in the file Report.pdf

Below you can see all the entries that were completed, the green dashed line is the threshold line and any point above that is colored red and is labeled as Fraud

Out of the 155,000 data entries, we have gathered 12,341 data points that were labeled as "Complete" to where the user opened and completed the survey this rewarding the user with a revenue per completion.

We then split the completed datapoints into a 80/20 for training and testing with the testing consisting of 2,468 datapoints.

Out of the 2,468 datapoints we have detected 27 fraudulent activities and the user_id and the survey_id of these fraudulent activities can be found in the table below.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
Anomaly_data.csv		Anomaly_data.csv
BitBurst_autoencoder_na4.h5		BitBurst_autoencoder_na4.h5
Duration-csv.py		Duration-csv.py
README.md		README.md
Report.pdf		Report.pdf
Test.py		Test.py
Train.py		Train.py
events.csv		events.csv
events_duration.csv		events_duration.csv
events_duration_na.csv		events_duration_na.csv
events_test.csv		events_test.csv
json-to-csv.py		json-to-csv.py
users.csv		users.csv
users.json		users.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fraud-Detection

About

Uh oh!

Releases

Packages

Languages

Regularized-ML/Fraud-Detection

Folders and files

Latest commit

History

Repository files navigation

Fraud-Detection

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages