Merge pull request #60 from BA-HanseML/pre_master

Pre master
BA-HanseML · May 12, 2020 · 99bff37
2 parents 231cc79 + fbf3b05
commit 99bff37
Showing 171 changed files with 236,630 additions and 2,348 deletions.
diff --git a/.gitattributes b/.gitattributes
@@ -0,0 +1 @@
+* text=auto eol=lf
diff --git a/.gitignore b/.gitignore
@@ -1,7 +1,13 @@
 # Project based
-data/*
+
+# --- dataset ---
 dataset/*
+!dataset/*/*.ipynb
 !dataset/dataset_struct.md
+*.pkl
+*.npy
+
+# --- reference --- 
 ref/*.pdf
 !ref/paper_list.md
 !ref/web_list.md

diff --git a/MIMII_main.ipynb b/MIMII_main.ipynb
diff --git a/README.md b/README.md
@@ -1,9 +1,132 @@
-# NF_Prj_MIMII_Dataset
-
-### based on the data set from zenodo
-https://zenodo.org/record/3384388#.XpNAUpnRYuV
-
-### based on the base line project on gitHub
-https://github.com/MIMII-hitachi/mimii_baseline
-
-###
+# Can a Machine Hear If a Machine Is Broken? 
+**Unsupervised Anomaly Detection by Airborne Sound of Industrial Machinery**
+
+A machine learning approach to machine anomaly detection on the MIMII dataset.
+
+![story](doc/media_main/story.png)
+
+## Description of the Study in This Repo
+
+This repo is a concept study on the MIMII dataset: detecting anomalies of machines or machine parts such as fans, sliders, pumps and valves by means of classic machine learning and deep learning methods. 
+
+In condition monitoring of machinery, it is common to use structure-borne sound and order tracking (RPM, etc.) to detect malfunctions. For various reasons, such as ease of retrofitting, the need for a mobile solution, the size of the machine part, or operational needs like zero downtime, airborne sound anomaly detection can be preferable. The proof-of-concept study conducted here shows that by applying machine learning anomaly detection to acoustic sensing, an ML/AI sensor can be constructed that performs well enough to compete with a trained technician who detects anomalies by listening to the machine, with the potential benefit of staying at the machine part of interest 24/7. The development can be seen as groundwork for an embedded smart sensor as part of an IoT plant supervision system (like SCADA).
+
+Anomaly detection with machine learning mostly means unsupervised learning, as the base assumption is that abnormal operation is unknown. Abnormal operation can be very diverse in nature, so even if recordings of abnormal operation existed, training on them would lead to overfitting. Furthermore, a smart sensor that depends on such recordings would be less useful in application. 
+
+This means a smart sensor microphone system needs to be trained by placing it in front of a healthy machine part for a reasonable training time, so that it becomes an armed detector that has learned what is normal under representative background noise. Herein also lies a limitation of the study: in a real-world scenario several machine parts are connected, while the dataset focuses on a single part at a time. However, the same technique could also be applicable to general abnormality detection for an ensemble of machine parts.
+
+To reach an optimal architecture, various machine learning techniques are explored and eventually combined into a diverse ensemble. The following list summarizes the techniques explored:
+
+*	Stochastic model (a multi-dimensional normal distribution is fitted and outliers are defined by significance)
+*	Random isolation trees (decision trees are built on the assumption that outliers need only a few decisions to be isolated, as they are spatially separated in one or more features)
+*	Autoencoder (the reconstruction error of an undercomplete autoencoder is taken as the anomaly score)
+*	Pseudo supervision (normal observations are augmented/distorted to train a binary classifier)
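
As an illustration of the first technique in the list above, here is a minimal sketch of a multi-dimensional normal model with an outlier threshold. It is not the code used in this study: the synthetic features, the function names and the 99 % quantile used as the significance level are all assumptions made for the example.

```python
import numpy as np


def fit_gaussian(features):
    """Estimate the mean and inverse covariance of the 'normal' feature vectors."""
    mean = features.mean(axis=0)
    cov = np.cov(features, rowvar=False)
    cov += 1e-6 * np.eye(cov.shape[0])  # small ridge keeps the matrix invertible
    return mean, np.linalg.inv(cov)


def mahalanobis_sq(features, mean, cov_inv):
    """Squared Mahalanobis distance of every row to the fitted distribution."""
    centered = features - mean
    return np.einsum("ij,jk,ik->i", centered, cov_inv, centered)


rng = np.random.default_rng(0)

# Assumption: 500 synthetic 4-dimensional feature vectors stand in for healthy recordings.
normal_train = rng.normal(0.0, 1.0, size=(500, 4))
mean, cov_inv = fit_gaussian(normal_train)

# Assumption: the 99 % quantile of the training distances is the significance threshold.
threshold = np.quantile(mahalanobis_sq(normal_train, mean, cov_inv), 0.99)

# Five more normal samples plus one sample far away from the training data.
test = np.vstack([rng.normal(0.0, 1.0, size=(5, 4)), np.full((1, 4), 8.0)])
is_outlier = mahalanobis_sq(test, mean, cov_inv) > threshold
print(is_outlier)
```

The threshold is taken from the training distances themselves, so only healthy data is needed, which matches the unsupervised setting described above.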
+
+The picture below sketches an exemplary classification ensemble. Its main parts, listed after the picture, are:
+
+![dievens](doc/media_main/DiverseEnsamble_general_examble.png)
+
+*	Feature extraction pre-filter, like BSS (blind source separation) or a denoising filter
+*	Feature extraction, like Welch spectra (PSDs) or MEL spectra  
+*	Classifier, like RFC (random forest classifier)
+*	Stochastic models for outlier detection, like GMM (Gaussian mixture model), etc.
+*	Unsupervised outlier detection, like a neural network autoencoder 
+*	Or outlier classification, like IFC (isolation forest classifier)
+*	The time-frame-based ensemble collects the different classifications over the processed time, as some classifiers work on longer buffer parts than others. This may help to regulate the training of the algorithms and can improve training speed.
+
+To make any machine learning algorithm able to work with audio, various signal processing steps (the feature extraction pre-filter) are necessary. These may be of a classic nature or may themselves make use of machine learning methods, like clustering on the time buffer for activation detection. 
+In application, the reaction time of such an anomaly detection is around 10 s with the current construction and training chain. There are some indications that this could eventually be reduced.
+
+## About the Dataset MIMII
+
+The dataset has a general description on its Zenodo page, where it is open for [download](https://zenodo.org/record/3384388#.XpNAUpnRYuV)
+
+The related [paper](https://arxiv.org/pdf/1909.09347.pdf) by the MIMII dataset creators (Harsh Purohit, Ryo Tanabe, Kenji Ichige, Takashi Endo, Yuki Nikaido, Kaori Suefusa, and Yohei Kawaguchi) can be found here: https://arxiv.org/pdf/1909.09347.pdf
+
+At this point we would like to sincerely thank them for sharing the dataset; it poses a great challenge!
+
+Some further interpretation from our end can be found in the sub chapter: [about the dataset](doc/about_the_dataset.md)
+
+Furthermore, the MIMII creators also provided an autoencoder-based baseline model on GitHub:
+https://github.com/MIMII-hitachi/mimii_baseline
+This repo was of great help in getting into the topic and inspired us.
+
+### Dataset audition show room
+
+If you would like to tune in, listen to a small selection from the MIMII dataset and have a look at some spectra, visit the showroom:<br/>
+![click_show_room](doc/media_main/show_room_click.png) <br/><br/>
+[link to the showroom](https://ba-hanseml.github.io/MIMII_show_room/showroom.html)
+
+## Structure of Study = How to read the Repo
+
+The repo has folders that correspond either to setup or to the main chapters of the study. Here you can find some overview information and links to the in-depth chapters.
+
+### Study parts and Algorithm part
+
+#### Folder/part: feature_extraction_diagram
+The feature extraction diagram is a chain of filters and feature extractions with, for example, the MEL spectrum as output. The folder hosts the tuning and testing of the components, both the ones finally used and the ones only explored, as well as the batch creation scripts for pre-processing the dataset. More in the sub chapter [feature extraction diagrams](doc/feature_extraction.md)
+
+#### Folder/part: Modeling
+The modeling folder hosts all the explored machine learning variants, sorted into subfolders, each with a general description: 
+* [unsupervised](modeling/unsupervised/unsupervised_modeling.md)
+* [pseudo_supervised](modeling/pseudo_supervised/pseudo_supervised.md)
+
+### Setup
+
+#### Folder: dataset
+Hosts the unpacked dataset from the Zenodo site. The expected structure is described in the sub chapter: [dataset folder structure](dataset/dataset_struct.md).
+Additionally, it has folders for the features extracted by the diagrams (see above).
+
+#### Folder: utility
+Utility functions and classes stored in py-files rather than Jupyter notebooks. In the folder utility/Workshop some applications and basic tests of the utilities can be found.
+
+#### Folder: doc
+All sub chapters and media material for documentation.
+
+#### Folder: ref
+References like papers, etc. see below.
+
+#### Folder: env
+Information about the Conda environment, Jupyter settings and GPU elements in TensorFlow. Note that this work was done on Windows 10 with Anaconda and Jupyter.
+
+#### Folder: misc
+Miscellaneous material and interesting side experiments, partially unsorted.
+
+# Future Work
+
+## Not Explored Options And Missing Ends
+A list of general potential for improvement and of techniques that were not fully or not at all explored, mostly due to time restrictions. Sub chapter: [improvements](doc/improvements.md)
+
+## Application Notes
+Notes for the application as a smart sensor. A short discussion of implications and follow-up work for deployment. Sub chapter: [application notes](doc/application_notes.md) 
+
+## Workflow Improvements
+Some reflections on the chosen workflow and setup. Sub chapter: [workflow improvements](doc/workflow_improvment.md) 
+
+
+# Credits and References
+
+## THANKS to...
+
+To the creators of the MIMII dataset
+
+To the creators of the gigantic audio processing library [librosa](https://librosa.github.io/)
+
+To the creators of the library [pyroomacoustics](https://pyroomacoustics.readthedocs.io/en/pypi-release/)
+
+To all the developers of python, scipy, numpy, scikit learn and tensorflow ... and all the great python stuff that we can build on.
+
+To Mike X Cohen for his great Python DSP [Udemy courses](https://www.udemy.com/user/mike-x-cohen/) 
+
+To all the founders and minds of machine learning: you created an awesome universe to explore.
+
+To the team at [neuefische GmbH](https://www.neuefische.de), that made this project possible by training us in the data science bootcamp.
+
+To so many more from the web like towardsdatascience.com and stackoverflow ...
+
+## References 
+Find all references: papers, source code and other web sources in the following sub chapters: 
+
+* [Papers and Books](ref/paper_list.md)   
+* [Source code / GitHubs](ref/github_list.md)
+* [Websites / Videos](ref/web_list.md)
diff --git a/dataset/dataset_struct.md b/dataset/dataset_struct.md
@@ -58,4 +58,8 @@ md 6dB
 7z.exe -o0dB -y x .\0_dB_pump.zip
 7z.exe -o0dB -y x .\0_dB_valve.zip
 7z.exe -o0dB -y x .\0_dB_slider.zip
-```
+```
+
+# Additional Folders from feature extraction
+Feature extraction will create folders named after the extraction diagram, like:
+exdia_v1 - for feature extraction diagram version 1. Find more about feature extraction diagrams in [feature extraction](../doc/feature_extraction.md)
diff --git a/doc/application_notes.md b/doc/application_notes.md
@@ -0,0 +1,18 @@
+## machine activation
+- if a machine is not permanently active, this needs to be known to the algorithm; otherwise false anomalies might be detected during inactivity
+
+## calibration phase
+- TODO: explain the possible ways to make the machine learn what is normal, the total time needed, etc.
+
+
+## what is abnormal in a spectrum or in time
+- after an anomaly is detected, knowing which feature made the algorithm think it is abnormal is vital for debugging and for retraining after a misclassification
+
+## if multilabel use
+- if several clusters are found by unsupervised methods, the match to the cluster spectrum can be recorded and used to count how often a similar event occurs. This can give insight for machine monitoring, e.g. for defining which modes can be correlated with other machine modes, or what is not explainable and therefore worth investigating.
+
+## setting recall or precision modes
+- by simple threshold tuning it might be possible to trade false negatives (FN, missed anomalies) against false positives (FP, false alarms). The idea behind this: anomalies that are hard to hear are maybe not too severe failures,
+while many false alarms would lead to the smart sensor being disabled in practice, as the microphone can by design only be an extra sensor, neither safety-critical nor process-critical.
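
The trade-off between a recall mode and a precision mode can be sketched with a simple threshold comparison. The anomaly scores, labels and the two threshold values below are made up for illustration and do not come from the study.

```python
def confusion_counts(scores, labels, threshold):
    """Count false positives and false negatives for one decision threshold.

    A clip is flagged as abnormal when its anomaly score exceeds the threshold.
    Labels: 0 = normal, 1 = abnormal.
    """
    fp = sum(1 for s, y in zip(scores, labels) if s > threshold and y == 0)
    fn = sum(1 for s, y in zip(scores, labels) if s <= threshold and y == 1)
    return fp, fn


# Hypothetical anomaly scores of eight clips and their true labels.
scores = [0.10, 0.20, 0.35, 0.40, 0.55, 0.60, 0.80, 0.90]
labels = [0, 0, 0, 1, 0, 1, 1, 1]

# Low threshold: recall mode, no anomaly is missed but there are false alarms.
recall_mode = confusion_counts(scores, labels, threshold=0.30)

# High threshold: precision mode, no false alarms but quiet anomalies are missed.
precision_mode = confusion_counts(scores, labels, threshold=0.70)

print("recall mode    (FP, FN):", recall_mode)
print("precision mode (FP, FN):", precision_mode)
```

Which side to favour depends on the deployment: for an extra sensor that is not safety-critical, fewer false alarms may matter more than catching every faint anomaly.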
+
+
diff --git a/doc/feature_extraction.md b/doc/feature_extraction.md
@@ -0,0 +1,27 @@
+# Feature Extraction 
+
+## Introduction 
+
+Audio feature extraction is a diverse set of filters that extract features from the raw recording so that a machine learning algorithm can process them. Along the way, cleaning operations like de-noising or de-reverberation are done. Further, one can attempt to exploit multi-channel recordings made with spatially separated microphone arrays; this dataset is recorded with 8 microphones in a circle. The features could be formulated in the time domain but are usually left in the frequency domain, so that the output is a form of spectrum, like Welch in 1D or an STFT or similar in 2D. Other outputs are possible, like envelopes, BSS (blind source separation) de-mixing estimations or DOA (direction of arrival), as well as basic statistics like SNR (signal-to-noise ratio) estimation or standard deviation.
+
+In this study, we settled with:
+* MEL spectrum
+* Welch spectrum (called PSD here even though it is not the density but the spectral power, V^2 instead of V^2/Hz)
+* FastICA mixing matrix estimation
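
The difference between spectral power (V^2) and power spectral density (V^2/Hz) in a Welch estimate comes down to the normalization of the windowed FFT. The following is a minimal NumPy sketch of that difference, not the extraction code of this study; the function name, the Hann window, the segment length and the 16 kHz sample rate are assumptions for the example.

```python
import numpy as np


def welch(signal, fs, nperseg=1024, scaling="spectrum"):
    """Welch estimate with a Hann window and 50 % overlap.

    scaling="spectrum" returns power per bin (V^2),
    scaling="density" returns power spectral density (V^2/Hz).
    """
    window = np.hanning(nperseg)
    step = nperseg // 2
    starts = range(0, len(signal) - nperseg + 1, step)
    segments = np.stack([signal[s:s + nperseg] * window for s in starts])
    power = np.abs(np.fft.rfft(segments, axis=1)) ** 2
    if scaling == "spectrum":
        power /= window.sum() ** 2
    else:
        power /= fs * (window ** 2).sum()
    # One-sided spectrum: double every bin except DC and Nyquist (nperseg is even).
    power[:, 1:-1] *= 2.0
    freqs = np.fft.rfftfreq(nperseg, d=1.0 / fs)
    return freqs, power.mean(axis=0)


fs = 16000  # assumed sample rate in Hz
t = np.arange(fs) / fs
tone = np.sin(2 * np.pi * 1000 * t)  # 1 kHz sine with amplitude 1 V

freqs, power_spectrum = welch(tone, fs, scaling="spectrum")
_, density = welch(tone, fs, scaling="density")

peak = int(np.argmax(power_spectrum))
print(freqs[peak], power_spectrum[peak])  # peak power is close to A^2 / 2
```

With the spectrum scaling, the peak of a sine directly shows its mean square value, which is convenient when tonal machine noise is of interest. The two scalings differ only by the equivalent noise bandwidth of the window.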
+
+We also explored other forms; find remarks on that exploration below.
+
+For pre-filtering we settled on:
+* nn-filter denoising
+* FastICA
+* K-means clustering on the STFT to find activation
+
+Further audio augmentation for pseudo supervision is a topic in feature extraction.
+
+In the documentation many links lead to Jupyter notebooks in the folder feature_extraction_diagrams.
+
+
+## Diagrams
+
+
+## Pre-Pre-Processing and Time Slicing
diff --git a/doc/imporvments.md b/doc/imporvments.md
@@ -0,0 +1,11 @@
+
+
+# pywavelet
+
+# floating SNR+
+
+# Mic Cross-Correlation
+
+# MFCC
+
+# pseudo supervised CNN and CNN U-net
diff --git a/doc/media_feature_extraction/DiagramV0.png b/doc/media_feature_extraction/DiagramV0.png
diff --git a/doc/media_feature_extraction/exdia_v1_essential.png b/doc/media_feature_extraction/exdia_v1_essential.png
diff --git a/doc/media_feature_extraction/html5_play_audio.html b/doc/media_feature_extraction/html5_play_audio.html
@@ -0,0 +1,4 @@
+<html lang="en">
+<script src="http://api.html5media.info/1.1.8/html5media.min.js"></script>
+<audio src="00000018.mp3" controls preload></audio>
+</html>