Skip to content

sabbaghm/HackNEU2017

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HackNEU 2017

Outlier Northeastern University 2017 Hackathon (HackNEU) repository includes source files and documents for developing the Outlier project.

Overview

We have developed an anomaly detection system using machine learning and adaptive filter methods, to detect abnormal patterns in power traces of computers, servers, and household electric consumption. The motivation behind this system was to detect anomalous power dynamics by a device which is integrated with Google Firebase notification platform to notify users on their mobile devices using push notifications, twitter and/or slack channels, of the detected event. The project was initially developed to notify system admins of organizations of anomalous power traces usage which can be used by hackers to determine the general behavior and possibly learn the secret encryption keys of the machine. Hence as a test case, we used a power trace from an unmasked version of Advanced Encryption Standard (AES) encryption, also we used a household power consumption dataset from the University of California, Irvine to evaluate our platform (more details about the datasets can be found here). The power traces with regular trends were categorized as normal by our machine learning model. Whereas the anomalous trace/plot clearly indicated outliers in the trace, which immediately alerts the user/admin.

Approach

In our first method we used a multivariate Gaussian model based on this work by Aaqib Saeed to learn the typical power trends and detect the outliers. Moreover, various data pre-processing and post-processing challenges including restructuring our dataset for our algorithm and gathering our output data to an analyzable form have been addressed. The following figures shows the output of our tool operated on an unmasked AES power traces (red point shows outliers which alert the user/admin about and anomalous event):

Multivariate Gaussian model detected anomalous points in the unmasked AES power traces

Multivariate Gaussian model detected anomalous points in the unmasked AES power traces

In our second method, we used a Kalman filter to track the trend of power signals and detect the abnormalities in the signal amplitude. We based our implementation for this method on this code. In this method, we defined outliers as signal points whose amplitude are higher than the Kalman filter prediction for. The first version of the Kalman filter could not track the signal properly. After changing the process variance Q in the kalman filter code from 1e-5 to 0.5 the filter started to track the signal closely.

Here are the results of our tool categorizing time points from different signal traces of our household power consumption dataset into normal (red) and abnormal (blue) trends: Global Active Power vs Time Points

Global Active Power vs Time Points

Global Reactive Power vs Time Points

Global Reactive Power vs Time Points

Voltage Deviation vs Time Points

Voltage Deviation vs Time Points

Global Intensity vs Time Points

Global Intensity vs Time Points

Sub Meter 1 vs Time Points

Sub Meter 1 vs Time Points

Sub Meter 2 vs Time Points

Sub Meter 2 vs Time Points

Sub Meter 3 vs Time Points

Sub Meter 3 vs Time Points

For creating a remote alert system, we developed a custom Android app using Android Studio which accepts notifications from Google Firebase platform. In the other end, the Firebase platform receives an alert from our anomaly detection system in case an anomalous behavior detected in the system. Here is a screenshot of a push notification received on a Samsung Android phone in our live demo:

Notification

Push notification received from our anomaly detection system in a live demo

Furthermore, We integrated Slack and Twitter APIs to post alerts on those social media sites as well.

The API integration code for the notifications can be found here. Note that all of the detection and notification codes except the Android app which is written in Java, are developed using python.

Challenges Faced

  • Handling large datasets (10000x3125)
  • Preprocessing the datasets to remove NULL or empty fields
  • Converting the input datasets into appropriate formats for applying the machine learning model
  • Integrating Google Firebase notification platform to send push notification to users through a custom-designed Android app
  • Integrating Slack and Twitter posting APIs and authentication procedures to our system

License

This project is licensed under the MIT License - see the LICENSE file for details

Project members

Name Email Contribution
Majid Sabbagh sabbagh.m@husky.neu.edu Problem Modeling - Finding Datasets - Android App Design - Remote Notification Integration
Samkeet Shah shah.sam@husky.neu.edu Problem Modeling - Preprocessing Datasets - Multivariate Gaussian Implementation - Plotting
Shantanu Kawlekar kawlekar.s@husky.neu.edu Problem Modeling - Preprocessing Datasets - Kalman Filter Implementation - Plotting
Yixing Zhang zhang.yixin@husky.neu.edu Problem Modeling - Finding Datasets - Report Preparation

Acknowledgments

We would like to thank all the mentors especially, Rohan Jahagirdar and Shreyas Mahimkar for their great help during the Hackathon. We also thank all the hackathon officials for creating an awesome environment for the participants to develope their ideas.

About

Northeastern University Hackathon (HackNEU) repository includes source files and documents for developing the hackathon projects.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •