Introduction to AI Machine Learning with Sound Course Material

This repo contains all the code and applications needed to complete the Introduction to AI Machine Learning with Sound course. The instructions are provided in the online course.

audio/birdsong

Contains the wav files used as test and training samples.

birdsongs

Contains the CSVs for the different types of birds used in this course.

images

Contains the images used for the Visual Recognition service. These are used in lab 5.

labs

Contains extra code to be used in the labs.

noderedflows

Contains a sample Prediction Flow that you can use to build a Node-RED application that runs a prediction using input from either the microphone or a file inject.

output

Contains the sound CSVs.

src

Contains the OSP Service and OSP Converter applications. Details are below.

OSP Converter

Performs Signal Processing against a directory of audio files.

This application works against the audio files in https://www.kaggle.com/mmoreaux/audio-cats-and-dogs. CSV files have also been generated for the audio files in https://www.kaggle.com/rtatman/british-birdsong-dataset.

Download the audio zip from Kaggle and unzip it to the audio directory. The application is in the src directory. To install the prerequisites, run

pip install -r requirements.txt

Run the application

python ospconverter.py

The resulting CSV file will be in the output folder. The application produces a simplified table representing an audio spectrogram that can be used to create machine learning models to recognize audio sounds.
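To make the idea of a spectrogram table concrete, here is a minimal sketch in plain Python. It is not the code in ospconverter.py: the function names, the non-overlapping framing, and the naive DFT are all assumptions chosen for illustration, and a synthetic tone stands in for decoded wav data.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Return the magnitude of each non-negative frequency bin of a real-valued frame."""
    n = len(frame)
    magnitudes = []
    for k in range(n // 2 + 1):
        # Naive O(n^2) discrete Fourier transform; fine for a small illustration.
        total = sum(
            sample * cmath.exp(-2j * math.pi * k * i / n)
            for i, sample in enumerate(frame)
        )
        magnitudes.append(abs(total))
    return magnitudes


def spectrogram_rows(samples, frame_size):
    """Split samples into non-overlapping frames; return one row of bin magnitudes per frame."""
    rows = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        rows.append(dft_magnitudes(samples[start:start + frame_size]))
    return rows


# Synthetic 1 kHz tone sampled at 8 kHz (hypothetical input, not course data).
sample_rate = 8000
frame_size = 64
tone = [math.sin(2 * math.pi * 1000 * i / sample_rate) for i in range(256)]

rows = spectrogram_rows(tone, frame_size)
# Locate the strongest bin in the first frame and convert it to hertz.
peak_bin = max(range(len(rows[0])), key=rows[0].__getitem__)
peak_hz = peak_bin * sample_rate / frame_size
```

Each row describes one slice of time and each column one frequency band, which is the general shape of table a machine learning model can be trained on. A real converter would typically use an FFT library rather than this quadratic-time loop.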

An enhancement would be to take a digital signature, e.g. the top 10 and bottom 10 amplitudes and the frequencies at which they occur, producing a table with 41 columns per row, the first column being the class identifier. We leave that as an exercise for our users, though we might consider adding it ourselves in the near future.
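One possible shape for that signature is sketched below. It assumes the spectrum is already available as (frequency, amplitude) pairs, puts the class identifier first, and then lays out the 20 selected pairs as frequency followed by amplitude, giving 1 + 2 × 20 = 41 values. The function name, the column order, and the sample data are hypothetical and are not part of this repo.

```python
import heapq


def signature_row(label, spectrum, n=10):
    """Build a row: label, then (frequency, amplitude) for the n largest and n smallest amplitudes."""
    if len(spectrum) < 2 * n:
        raise ValueError("need at least 2 * n (frequency, amplitude) pairs")

    def by_amplitude(pair):
        return pair[1]

    top = heapq.nlargest(n, spectrum, key=by_amplitude)      # loudest bins first
    bottom = heapq.nsmallest(n, spectrum, key=by_amplitude)  # quietest bins first

    row = [label]
    for freq, amp in top + bottom:
        row.extend([freq, amp])
    return row


# Made-up spectrum: 32 bins spaced 125 Hz apart, each with a distinct amplitude.
spectrum = [(k * 125.0, float((k * 7) % 32)) for k in range(32)]
row = signature_row("cat", spectrum)
```

With n left at 10 the row has 41 entries, matching the column count described above. Whether the label is a string or a numeric class code is left open here.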

OSP Service

Provides a web-based API that performs signal processing against a single audio file. This application serves a web page at /audio, which can be used to test the OSP processing.

The application also provides an endpoint, /audio/nodered, suitable for use with a Node-RED file upload or microphone node.

Run the application

python ospservice.py

License

Apache License, available in the LICENSE file.
