Pitch detection #313

vincentqb · 2019-10-24T21:29:07Z

Since the pitch is important in translating languages such as mandarin, we want a pitch detection algorithm. See also notebook.

We implement this using normalized cross-correlation function (NCCF) and median smoothing, mentioned in RAPT. Kaldi also uses NCCF, but uses an algorithm based on viterbi instead of median smoothing for smoothing, see here.

Both pure tones from here are detected successfully.
Move out of example

Closes #257

examples/pitch_dectection/pitch.py

cpuhrsch · 2019-10-29T18:56:41Z

Would these transforms already work with @torch.jit.script ? :)

vincentqb · 2019-10-29T22:30:15Z

Would these transforms already work with @torch.jit.script ? :)

Yup :)

PetrochukM · 2020-08-21T03:29:26Z

What's the difference between this and something like CREPE, pYIN and SPICE? Which is best to use for a typical voice-over?

vincentqb · 2020-10-19T19:46:09Z

What's the difference between this and something like CREPE, pYIN and SPICE? Which is best to use for a typical voice-over?

The algorithm included here is a non-neural-network algorithm (based on RAPT, and related to Kaldi's) to identify pitch that can be used as a baseline. The algorithms you mention look promising, but I'd simply experiment and find what's best for a particular case :) Is that what you meant?

vincentqb added 14 commits October 24, 2019 17:20

pitch detection validation.

53787ac

cleaning code a little.

009f272

explicit sentence.

dcbfa43

NCCF.

cffda86

typo.

8850da1

readme.

8264475

no need to return frame_size.

a8c368c

adding docstring.

4edd828

improved explanation.

0e9eb87

message when fails too.

a33b310

works with channels.

4e9fde7

fix repeat.

066c03f

comment.

6650b2c

remove todo note.

8a08c44

vincentqb requested review from cpuhrsch and zhangguanheng66 October 25, 2019 17:15

vincentqb commented Oct 25, 2019

View reviewed changes

examples/pitch_dectection/pitch.py Outdated Show resolved Hide resolved

vincentqb commented Oct 25, 2019

View reviewed changes

examples/pitch_dectection/pitch.py Outdated Show resolved Hide resolved

vincentqb added 5 commits October 25, 2019 13:45

reformatting.

67653c9

flake8.

4022f7c

minor formatting.

bd6364b

turning into a functional.

651730a

moving out internal function.

b950bd3

make torchscriptable.

1be1769

cpuhrsch approved these changes Oct 30, 2019

View reviewed changes

vincentqb merged commit 26237c8 into pytorch:master Oct 30, 2019

vincentqb deleted the pitch branch October 30, 2019 13:56

vincentqb mentioned this pull request Oct 30, 2019

assets for testing pitch #322

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pitch detection #313

Pitch detection #313

vincentqb commented Oct 24, 2019 •

edited

cpuhrsch commented Oct 29, 2019

vincentqb commented Oct 29, 2019

PetrochukM commented Aug 21, 2020

vincentqb commented Oct 19, 2020 •

edited

Pitch detection #313

Pitch detection #313

Conversation

vincentqb commented Oct 24, 2019 • edited

cpuhrsch commented Oct 29, 2019

vincentqb commented Oct 29, 2019

PetrochukM commented Aug 21, 2020

vincentqb commented Oct 19, 2020 • edited

vincentqb commented Oct 24, 2019 •

edited

vincentqb commented Oct 19, 2020 •

edited