TSE-with-ref-selection

Target speaker extraction (TSE) with speaker reference selection.

Dependencies

pip install -r requirements.txt

git clone https://github.com/JorisCos/LibriMix.git

| Method | ACC (%), ov=30% |
| --- | --- |
| Speaker diarization, using the longest segments | 86.0 |
| Split into 5 s chunks, then clustering | 79.5 |
| Overlap detection and clustering with a 5 s sliding window | 99.5 |
| Split into regions, then overlap detection and clustering over regions | 89.2 |
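The 5 s chunking and 5 s sliding-window rows above both rely on cutting a recording into fixed-length windows. The following is a minimal sketch of that step only, not the code used in this repository: the function name `sliding_chunks`, the 16 kHz sample rate, and the hop size are assumptions made for illustration. With `hop_s` equal to `win_s` it yields non-overlapping chunks; a smaller `hop_s` yields a sliding window.

```python
def sliding_chunks(num_samples, sample_rate=16000, win_s=5.0, hop_s=5.0):
    """Return (start, end) sample indices of fixed-length windows.

    Hypothetical helper: sample_rate and hop_s are assumed values,
    they are not taken from this repository.
    """
    win = int(round(win_s * sample_rate))
    hop = int(round(hop_s * sample_rate))
    if win <= 0 or hop <= 0:
        raise ValueError("win_s and hop_s must be positive")

    bounds = []
    start = 0
    while start < num_samples:
        # The last window is truncated at the end of the recording.
        end = min(start + win, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break
        start += hop
    return bounds


if __name__ == "__main__":
    # A 12 s recording at the assumed 16 kHz sample rate.
    for start, end in sliding_chunks(12 * 16000, hop_s=2.5):
        print(start, end)
```

Each window would then be passed to whatever speaker embedding and clustering step is used; that part is not sketched here because the README does not specify it.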

For the extraction model, refer to SpEx+.

# train
./train.sh

# evaluation
python cse_test.py

| Method | SI-SDR (dB), model=x8515, ov=30% | SI-SDR (dB), model=xmax, ov=30%, ground-truth segmentation |
| --- | --- | --- |
| Speaker diarization, using the longest segments | 8.77 | 13.62 |
| Split into 5 s chunks, then clustering | 8.75 | 14.00 |
| Overlap detection and clustering with a 5 s sliding window | 8.89 | 14.32 |
| Split into regions, then overlap detection and clustering over regions | 9.11 | 14.44 |
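For readers unfamiliar with the metric in the table above, here is a minimal NumPy sketch of the standard scale-invariant SDR definition. It is not taken from `cse_test.py`, whose implementation may differ (for example in mean removal or the stabilizing constant); the function name `si_sdr` and the `eps` value are assumptions.

```python
import numpy as np


def si_sdr(estimate, reference, eps=1e-8):
    """Scale-invariant signal-to-distortion ratio in dB (standard definition).

    Hypothetical helper for illustration; eps is an assumed stabilizer.
    """
    estimate = np.asarray(estimate, dtype=np.float64)
    reference = np.asarray(reference, dtype=np.float64)
    if estimate.shape != reference.shape or estimate.ndim != 1:
        raise ValueError("estimate and reference must be 1-D and equal length")

    # Remove the DC offset from both signals.
    estimate = estimate - estimate.mean()
    reference = reference - reference.mean()

    # Project the estimate onto the reference to get the target component.
    scale = np.dot(estimate, reference) / (np.dot(reference, reference) + eps)
    target = scale * reference
    noise = estimate - target

    ratio = (np.dot(target, target) + eps) / (np.dot(noise, noise) + eps)
    return 10.0 * np.log10(ratio)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.arange(16000) / 16000.0
    ref = np.sin(2 * np.pi * 440.0 * t)
    est = ref + 0.1 * rng.standard_normal(ref.shape)
    print(f"SI-SDR: {si_sdr(est, ref):.2f} dB")
```

Because the estimate is projected onto the reference, rescaling the estimate leaves the score unchanged, which is why the metric is called scale-invariant.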
