Fraud Case Study

Objective:

Our team of three people was given 2 days to create a model and user interface to detect possible fraud. The system pulls json files from a API and classifies them storing the result in a mongoDB. A web user interface is then accessible for client review of activity.

Files:

build_model.py : This is used to build a pickle the tested model for use. It also contains the class FraudModel which incapsulates the functionality for the API's and Data Cleaning / preprocessing.
DataCleaner.py : A class that executes the required proposing for the data before being modeled. Also cleans any new data before the model predicts the probability of fraud.

model.pkl : The stored model used in analysis of new data.

save_to_db.py: a helper file used to connect to the db and send/receive data

live.py: this launches the web server (app.py) then it runs a function that hits the endpoint every second continuously updating the database

Model:

We used a RandomForest model on 12 fields. The classifier has three possible outcomes. The Model was trained with the idea that the 'acct_tyep' field will translate to the below.
0 : Not Fraud - 'premium' 1 : Maybe Fraud - 'spammer_warn', 'spammer_limited', 'spammer_noinvite', 'locked', 'tos_lock', 'tos_warn', 'fraudster_att', 'spammer_web', 'spammer' 2 : Fraud - 'fraudster_event', 'fraudster'

Results:

The current model has a cross validation score of 92%.

Conclusion:

The following steps should still be taken: 1) Modifications to the Model. a) More analysis on the fields to find importance. b) Adding a TF-IDF classification on the Description field.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.ipynb_checkpoints		.ipynb_checkpoints
templates		templates
DataCleaner.py		DataCleaner.py
README.md		README.md
app.py		app.py
build_model.py		build_model.py
init_db.py		init_db.py
live.py		live.py
model.pkl		model.pkl
save_to_db.py		save_to_db.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fraud Case Study

Objective:

Our team of three people was given 2 days to create a model and user interface to detect possible fraud. The system pulls json files from a API and classifies them storing the result in a mongoDB. A web user interface is then accessible for client review of activity.

Files:

Model:

Results:

The current model has a cross validation score of 92%.

Conclusion:

The following steps should still be taken: 1) Modifications to the Model. a) More analysis on the fields to find importance. b) Adding a TF-IDF classification on the Description field.

About

Releases

Packages

Languages

ruppdj/Fraud-Case-Study

Folders and files

Latest commit

History

Repository files navigation

Fraud Case Study

Objective:

Our team of three people was given 2 days to create a model and user interface to detect possible fraud. The system pulls json files from a API and classifies them storing the result in a mongoDB. A web user interface is then accessible for client review of activity.

Files:

Model:

Results:

The current model has a cross validation score of 92%.

Conclusion:

The following steps should still be taken: 1) Modifications to the Model. a) More analysis on the fields to find importance. b) Adding a TF-IDF classification on the Description field.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages