Dataset #21

DenisSouth · 2019-01-01T06:53:02Z

Which dataset should I use for training network?

wq2012 · 2019-01-01T15:29:55Z

It depends on what you want to work on.

You can use any dataset that satisfies the definition of supervised clustering, meaning you can extract sequences of features, and associate those features with ground truth labels. Features can be speaker embeddings, face embeddings, etc.

Example datasets include NIST SRE 2000 CALLHOME for speaker diarization. But for any dataset, you need to process them yourself to extract features and align the features with labels. This library only provides the API for the clustering part.

More details are in the README.md file and the paper on arXiv.

wq2012 self-assigned this Jan 1, 2019

wq2012 added the question Further information is requested label Jan 1, 2019

wq2012 closed this as completed Jan 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset #21

Dataset #21

DenisSouth commented Jan 1, 2019

wq2012 commented Jan 1, 2019

Dataset #21

Dataset #21

Comments

DenisSouth commented Jan 1, 2019

wq2012 commented Jan 1, 2019