SCARFF (SCAlable Real-time Fraud Finder) is a framework which enables credit card fraud detection.
SCARFF is an open source platform that processes and analyses streaming credit card transaction data and returns reliable fraud alerts in near real time. It integrates Big Data tools (Kafka, Spark and Cassandra) with a machine learning approach that deals with data imbalance, non-stationarity and feedback latency.
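To make the three learning challenges concrete, here is a minimal plain-Python sketch. It is not SCARFF's Spark code, and this README does not say which techniques SCARFF uses. The sketch assumes one common combination: undersampling genuine transactions (imbalance), training only on a sliding window of recent days (non-stationarity), and holding each day back until its labels arrive (feedback latency). The names `undersample`, `SlidingTrainingWindow`, and the `"fraud"` field are hypothetical.

```python
import random
from collections import deque


def undersample(labeled, genuine_per_fraud=1, seed=0):
    """Keep every fraud and a random subset of genuine transactions.

    Illustrative assumption: each transaction is a dict with a boolean
    "fraud" field.
    """
    rng = random.Random(seed)
    frauds = [t for t in labeled if t["fraud"]]
    genuine = [t for t in labeled if not t["fraud"]]
    keep = min(len(genuine), genuine_per_fraud * len(frauds))
    return frauds + rng.sample(genuine, keep)


class SlidingTrainingWindow:
    """Hold the most recent `max_days` daily batches whose labels have arrived."""

    def __init__(self, max_days, label_delay_days):
        self.label_delay_days = label_delay_days
        self.pending = deque()                # batches still waiting for labels
        self.window = deque(maxlen=max_days)  # labeled batches used for training

    def add_day(self, batch):
        self.pending.append(batch)
        # A batch becomes usable only once `label_delay_days` newer batches exist.
        while len(self.pending) > self.label_delay_days:
            self.window.append(self.pending.popleft())

    def training_set(self):
        labeled = [t for batch in self.window for t in batch]
        return undersample(labeled)
```

With `max_days=2` and `label_delay_days=1`, the most recent day is excluded because its labels are still pending, and older days drop out of the window as new ones arrive. The balanced set returned by `training_set()` could then be passed to any classifier.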
At the core of SCARFF is a Spark application, whose implementation is presented here.
A Docker image containing the tools needed to run a streaming fraud detection demo is available at: https://hub.docker.com/r/fabriziocarcillo/scarff/
To run the demo, follow the commands on the Docker Hub page or the video tutorial:
The image contains:
- Kafka, Spark and Cassandra;
- a compiled version of SCARFF;
- an artificial dataset and the program to stream it.