MelBank

The objective of this project is to separate the voices of multiple speakers recorded on a single channel.

It supports not only speech-noise separation but also speech-speech separation.

Demo

Demo audio cannot be played on GitHub. To listen to the demo audio, see this.

Requirements

Python ~> 3.8
TensorFlow

Installation

$ git clone <this repo>
$ cd <this repo>

$ pipenv install

You also need to install PortAudio.

macOS - brew install portaudio
Ubuntu - sudo apt-get install portaudio19-dev
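
After installing, you may want to confirm that the PortAudio shared library is visible to Python. The snippet below is a minimal sketch that uses only the standard library; the helper name `portaudio_available` is hypothetical and not part of this repository. A negative result is not conclusive, since the library may live in a directory that the system loader does not search by default.

```python
import ctypes.util


def portaudio_available():
    """Return True if a PortAudio shared library can be located.

    Hypothetical helper for illustration; it is not part of this project.
    """
    # find_library returns a library name or path, or None when nothing is found.
    return ctypes.util.find_library("portaudio") is not None


if __name__ == "__main__":
    print("PortAudio found" if portaudio_available() else "PortAudio not found")
```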

Usage

1. Create training data

$ pipenv run record # Record each sound source to be separated
$ pipenv run build  # Build the training data
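
This README does not describe how the build step assembles the "teacher data" (training data). As a general illustration only, supervised single-channel separation is commonly trained on pairs of a mixed waveform and the clean source to recover. The sketch below assumes NumPy and mono waveforms; the function `make_training_pair` and its `snr_db` parameter are hypothetical and are not taken from this repository.

```python
import numpy as np


def make_training_pair(target, interferer, snr_db=0.0):
    """Mix two mono waveforms and return (mixture, target).

    Hypothetical helper for illustration; not this project's build step.
    """
    # Trim both signals to the shorter length so they can be summed.
    n = min(len(target), len(interferer))
    target = np.asarray(target[:n], dtype=np.float32)
    interferer = np.asarray(interferer[:n], dtype=np.float32)

    # Scale the interferer so the target-to-interferer power ratio equals snr_db.
    target_power = np.mean(target ** 2) + 1e-12
    interferer_power = np.mean(interferer ** 2) + 1e-12
    gain = np.sqrt(target_power / (interferer_power * 10.0 ** (snr_db / 10.0)))

    # The mixture is the model input; the clean target is the training label.
    mixture = target + gain * interferer
    return mixture, target
```

With `snr_db=0.0` both sources have equal power in the mixture, which roughly corresponds to the speech-speech case; a positive value makes the interfering source quieter.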

2. Training

$ pipenv run train

3. Start demo!

$ pipenv run demo

For detailed usage information, run the following command.

$ pipenv run help
