audio-segment

MFCC-based audio grain sorting

How this script works:

Global parameters:

number of MFCCs used,
size of the FFT used in calculating MFCCs,
length of the frame,
0th MFCC toggle,
onset detection toggle,
windowing toggle,
smoothing toggle,
smoothing window width,
sigma parameter of the Gauss filter,
segmentation hop length.
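The parameters listed above could be gathered in one place, for example as a small dataclass. This is only a sketch: the class name `GrainParams`, every field name, and every default value are hypothetical and are not taken from the notebooks.

```python
from dataclasses import dataclass


@dataclass
class GrainParams:
    # Hypothetical names and defaults, chosen only for illustration.
    n_mfcc: int = 13            # number of MFCCs used
    n_fft: int = 2048           # size of the FFT used in calculating MFCCs
    frame_length: int = 2048    # length of the frame, in samples
    use_mfcc0: bool = False     # 0th MFCC toggle
    use_onsets: bool = False    # onset detection toggle
    use_window: bool = True     # windowing toggle
    use_smoothing: bool = True  # smoothing toggle
    smooth_width: int = 64      # smoothing window width, in samples
    sigma: float = 8.0          # sigma parameter of the Gauss filter
    hop_length: int = 2048      # segmentation hop length, in samples


# Override only the values that differ from the defaults.
params = GrainParams(n_mfcc=20, use_onsets=True)
```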

Loading a file using a GUI. [ipyfilechooser] / [google.colab]
Segmentation:

into frames of set width, [librosa]
on onset times. [librosa]
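The two segmentation modes can be illustrated with plain numpy. The script itself uses librosa for this step; the sketch below swaps in numpy slicing so that it stays self-contained, and it assumes the onset positions are already available as sample indices (for instance from `librosa.onset.onset_detect` with `units="samples"`). The function names are hypothetical.

```python
import numpy as np


def split_fixed(y, frame_length, hop_length):
    """Cut y into frames of equal length; a trailing partial frame is dropped."""
    n_frames = max(0, 1 + (len(y) - frame_length) // hop_length)
    # Row i holds the sample indices of frame i.
    idx = np.arange(frame_length)[None, :] + hop_length * np.arange(n_frames)[:, None]
    return y[idx]


def split_on_onsets(y, onset_samples):
    """Cut y at the given sample indices; the segments may differ in length."""
    return np.split(y, onset_samples)


y = np.arange(10, dtype=float)
frames = split_fixed(y, frame_length=4, hop_length=4)  # 2 frames of 4 samples
segments = split_on_onsets(y, [3, 7])                  # 3 segments: 3, 4 and 3 samples
```

Fixed-width frames stack into a single 2-D array, whereas onset-based segments have to be kept as a list because their lengths vary.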

Windowing of the signal in each frame using a Hamming function. [numpy]
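A minimal sketch of the windowing step, assuming fixed-width frames stored as the rows of a 2-D array. The helper name `window_frames` is hypothetical. For onset-based segments of varying length, a window of matching length would have to be built per segment instead.

```python
import numpy as np


def window_frames(frames):
    """Multiply every frame (row) by a Hamming window of the same length."""
    window = np.hamming(frames.shape[1])
    return frames * window  # broadcasting applies the window to each row


frames = np.ones((3, 8))
windowed = window_frames(frames)
```

The Hamming window tapers each frame towards its edges (down to 0.08 of the original amplitude at the first and last sample), which reduces spectral leakage in the FFT that follows.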
Calculation of MFCCs for each frame, without centering. [librosa]
Example frame sorting using MFCC values, as follows:

using a k-d tree structure: [scipy.spatial]
- by querying for all the neighbours of frame[0], sorted by distance in ascending order,
- by finding the shortest path that traverses the entire tree starting from frame[0],
using correlation clustering represented as a hierarchy dendrogram, sorted by distance in ascending order. [scipy.cluster]
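The three orderings can be sketched with scipy as shown below. Two points are assumptions rather than the script's confirmed behaviour: the "shortest path" is approximated here by a greedy nearest-unvisited-neighbour walk (an exact shortest path through all frames is a travelling-salesman-type problem), and the clustering uses average linkage with the correlation metric. All function names are hypothetical.

```python
import numpy as np
from scipy.spatial import cKDTree
from scipy.cluster.hierarchy import linkage, dendrogram


def order_by_distance(features, start=0):
    """All frames sorted by ascending distance from features[start]."""
    tree = cKDTree(features)
    _, idx = tree.query(features[start], k=len(features))
    return np.atleast_1d(idx)


def greedy_path(features, start=0):
    """Greedy walk: always step to the nearest frame not yet visited."""
    tree = cKDTree(features)
    n = len(features)
    visited = np.zeros(n, dtype=bool)
    visited[start] = True
    path = [start]
    while len(path) < n:
        _, idx = tree.query(features[path[-1]], k=n)
        nxt = next(int(i) for i in np.atleast_1d(idx) if not visited[i])
        visited[nxt] = True
        path.append(nxt)
    return np.array(path)


def dendrogram_order(features):
    """Leaf order of a hierarchical clustering built on correlation distance."""
    Z = linkage(features, method="average", metric="correlation")
    info = dendrogram(Z, no_plot=True, distance_sort="ascending")
    return np.array(info["leaves"])


# Four toy "MFCC vectors"; only the first coordinate differs between them.
features = np.array([[0.0, 0.0, 1.0],
                     [1.0, 0.0, 1.0],
                     [-2.0, 0.0, 1.0],
                     [3.5, 0.0, 1.0]])
by_distance = order_by_distance(features)  # [0, 1, 2, 3]
walk = greedy_path(features)               # [0, 1, 3, 2]
leaves = dendrogram_order(features)        # a permutation of 0..3
```

The first ordering ranks every frame against frame[0] only, while the greedy walk re-queries from the most recently placed frame, so consecutive frames in the output tend to be more similar to each other.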

Output construction via concatenation of frames in the arrangements calculated in the sorting step above, optionally smoothing out signal discontinuities between them using a one-dimensional Gaussian filter. [scipy.ndimage]
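One possible reading of this step is sketched below: the frames are concatenated in the chosen order, and a Gaussian-filtered copy of the signal replaces the original samples only inside a short window around each seam. How the script actually applies the smoothing window width and sigma is not stated here, so the seam-local approach, the function name, and the default values are assumptions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter1d


def concatenate_frames(frames, order, smooth=True, width=64, sigma=8.0):
    """Join frames in the given order, optionally smoothing around each seam."""
    out = np.concatenate([frames[i] for i in order]).astype(float)
    if not smooth:
        return out
    smoothed = gaussian_filter1d(out, sigma=sigma)  # filtered copy of the whole signal
    half = width // 2
    pos = 0
    for i in list(order)[:-1]:
        pos += len(frames[i])  # sample index where the next frame begins
        lo, hi = max(0, pos - half), min(len(out), pos + half)
        out[lo:hi] = smoothed[lo:hi]  # replace samples only near the seam
    return out


frames = np.array([[0.0, 0.0, 0.0, 0.0],
                   [1.0, 1.0, 1.0, 1.0]])
raw = concatenate_frames(frames, [0, 1], smooth=False)
soft = concatenate_frames(frames, [0, 1], width=2, sigma=1.0)
```

Samples outside the seam windows are left untouched, so the smoothing softens the clicks at frame boundaries without low-pass filtering the whole output.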

All results displayed are plotted using matplotlib.

This project references online resources which elaborate further on the concepts of audio feature extraction and math-based workflows in Python, such as musicinformationretrieval.com as well as documentation of matplotlib, numpy, librosa and scipy libraries.
