
Embedded sound classifier

Overview

This application collects audio samples with the on-board MEMS microphone, analyzes them with a pre-trained neural network, and sends the resulting classification over the serial port. A desktop Python script reads and displays the results.
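As a rough illustration of the desktop side, the sketch below reads classification labels from the serial port with pyserial. The port name, baud rate, and line-based output format are assumptions; the actual protocol is implemented in client.py.

```python
# Minimal sketch of a serial reader, assuming the board prints one
# classification label per line. Port name and baud rate are examples;
# the real protocol lives in client.py.
import sys
import serial

def main():
    port = sys.argv[1]  # e.g. /dev/ttyUSB0 or COM1
    with serial.Serial(port, baudrate=115200, timeout=1) as ser:
        while True:
            line = ser.readline().decode(errors="ignore").strip()
            if line:
                print("Detected:", line)  # e.g. "silence", "whistle", "clap"

if __name__ == "__main__":
    main()
```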

Requirements

  • STM32F4 Discovery board (the project was developed with an STM32F407VGT6)
  • For sound classification (normal usage):
    • Python 3. You can check your version with python --version
    • pyserial: pip install pyserial
    • An RS232-USB cable
  • For neural network training:
    • Python 3, pyserial and an RS232-USB cable, as in the previous case
    • Keras 2.2.4: pip install keras==2.2.4
    • Tensorflow 2.0.0-alpha0: pip install tensorflow==2.0.0-alpha0
    • GCC
  • For pre-trained Keras model to C conversion:
    • STM32CubeMX
    • X-CUBE-AI: can be installed from within STM32CubeMX
  • For embedded software compilation:
    • An ARM cross-compilation toolchain able to build the Miosix kernel
    • Make, or any CMake-compatible IDE (see "How to use" below)

How to use

  • For sound classification (normal usage):
    1. Connect the serial cable pins to PA2 (board TX) and PA3 (board RX)
    2. Connect the board through the USB cable
    3. Launch the client with python client.py serial_port_name, replacing serial_port_name with the name of the serial port (e.g. /dev/ttyUSB0, COM1)
    4. Press the board user button, make the desired sounds, then press the button again to stop recording
  • For neural network training:
    1. Compile the FFT extraction program with gcc FFT_extract.c -o FFT_extract
    2. Connect the cables as in the previous case
    3. Launch the FFT receiver with python FFT_receive.py serial_port_name
    4. Press the user button, make the desired sounds, then press the button again to stop recording
    5. The results are written to fft.csv and must be labeled manually: append a final column containing 0 for silence, 1 for whistle, or 2 for clap (a hypothetical labeling helper is sketched after this list)
    6. Go into the neural-network folder, place the new data in training_data.csv and run python trainer.py. The trained model is saved to model.h5 (an illustrative training sketch also follows this list)
  • For pre-trained Keras model to C library conversion: the procedure is explained in the docs/x-cube-ai.pdf file provided by ST
  • For embedded software compilation: run make in the miosix-kernel folder, or compile with your preferred CMake-compatible IDE
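For step 5 of the training workflow, a hypothetical helper like the one below could append the class label to every row captured in a session, assuming the whole recording contains a single sound class. The repository does not ship such a script, and the labeling can just as well be done by hand or in a spreadsheet.

```python
# Hypothetical labeling helper (not part of the repository): appends the
# given class label to every row of fft.csv and accumulates the labeled
# rows into training_data.csv.
import csv
import sys

label = sys.argv[1]  # "0" = silence, "1" = whistle, "2" = clap
with open("fft.csv") as src, open("training_data.csv", "a", newline="") as dst:
    writer = csv.writer(dst)
    for row in csv.reader(src):
        if row:
            writer.writerow(row + [label])
```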
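For orientation on step 6, here is a minimal sketch of a Keras training script for the three classes. The network shape, hyperparameters, and the assumption that training_data.csv holds FFT feature columns followed by a final label column are illustrative only; the actual architecture is defined in trainer.py.

```python
# Illustrative training sketch, not the actual trainer.py.
# Assumes training_data.csv has FFT feature columns followed by a final
# label column (0 = silence, 1 = whistle, 2 = clap).
import numpy as np
from keras.models import Sequential
from keras.layers import Dense
from keras.utils import to_categorical

data = np.loadtxt("training_data.csv", delimiter=",")
x, y = data[:, :-1], to_categorical(data[:, -1], num_classes=3)

model = Sequential([
    Dense(32, activation="relu", input_shape=(x.shape[1],)),
    Dense(16, activation="relu"),
    Dense(3, activation="softmax"),  # silence / whistle / clap
])
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x, y, epochs=50, batch_size=16, validation_split=0.2)
model.save("model.h5")  # input for the X-CUBE-AI conversion step
```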
