Further CQT enhancements #279

bmcfee · 2015-11-09T18:53:43Z

The current cqt implementation returns only magnitude. If we're going to support inverse CQT (#165), phase will be critical for good reconstruction.

This PR is the first step in this direction.

It's currently implemented as API-compatible with the existing CQT; complex output is enabled by setting real=False. In 0.5, we should remove this parameter and always return complex-valued transforms.
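To illustrate why a complex-valued output is a superset of the magnitude-only one, here is a minimal NumPy sketch. The array `C` is a hypothetical stand-in for a complex CQT matrix (bins by frames); it is random data, not the output of librosa.

```python
import numpy as np

# Hypothetical stand-in for a complex-valued CQT (bins x frames).
# This is random data used only to illustrate the relationship.
rng = np.random.default_rng(0)
C = rng.standard_normal((4, 6)) + 1j * rng.standard_normal((4, 6))

# A magnitude-only transform keeps just this part.
magnitude = np.abs(C)

# The phase is what a magnitude-only transform discards.
phase = np.angle(C)

# With both parts available, the complex coefficients can be rebuilt exactly.
C_rebuilt = magnitude * np.exp(1j * phase)
```

A caller who only wants magnitudes can always apply `np.abs` to the complex result, whereas the phase cannot be recovered from magnitudes alone.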

As a side issue, this PR fixes util.sync to retain the dtype of the input data.

bmcfee · 2015-11-09T18:57:45Z

Paging @ebattenberg for feedback. Not urgent, but I reordered some of the abs operations in a not-entirely-equivalent way. This may affect the accuracy of pseudo and hybrid cqt, though whether it's improved or not, I can't say just yet.

UPDATE:

hybrid cqt tests still pass with the new order of operations (abs(filters.dot(D))), and some informal sanity checks show a slight reduction in error compared to the old way (abs(filters).dot(abs(D))).
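The two orderings mentioned above are not interchangeable, which a small NumPy sketch can show. The matrices `filters` and `D` here are random complex placeholders (assumptions for illustration), not the actual filter bank or STFT used in the PR.

```python
import numpy as np

# Random complex placeholders for a filter bank and an STFT-like matrix.
rng = np.random.default_rng(0)
filters = rng.standard_normal((3, 8)) + 1j * rng.standard_normal((3, 8))
D = rng.standard_normal((8, 5)) + 1j * rng.standard_normal((8, 5))

# New order: project the complex data, then take magnitudes.
phase_aware = np.abs(filters.dot(D))

# Old order: take magnitudes first, then project.
magnitude_only = np.abs(filters).dot(np.abs(D))

# By the triangle inequality, the magnitude-first result is an
# elementwise upper bound on the phase-aware result.
gap = magnitude_only - phase_aware
```

In the phase-aware ordering, terms with opposing phases can cancel before the magnitude is taken, so the two results generally differ.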

ebattenberg · 2015-11-12T22:53:45Z

Changing the Pseudo CQT to take phase into account means it is no longer a pseudo CQT, but rather a CQT that is undersampled in time.

bmcfee · 2015-11-13T13:11:57Z

Fair enough. Is this really a meaningful distinction, or is there a reference implementation/description of pcqt that we adhere to?

dpwe · 2015-11-13T16:51:50Z

It's a big deal. Imagine the "impulse response" corresponding to a
high-frequency, wide-bandwidth CQT bin. It's a little windowed (complex)
sinusoid whose window might be 1ms or shorter. Now tell me the phase of
this kernel over a 10ms window - there could be several entire instances of
the kernel within the window, each with unrelated phases. So a single phase
value doesn't really mean anything.

If you do a single calculation against the complex Fourier coefficients,
you're most likely taking the inner product against one particular
positioning of the IR, which will invite severe time-aliasing (under
sampling).

Taking the average magnitude, the total energy coming through the filters
defined by the impulse response over any length of window, does make sense.

DAn.
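The argument above can be sketched numerically. The following is a toy illustration under assumed numbers (8 kHz sample rate, a 10 ms frame, a 1 ms Hann-windowed complex sinusoid at 2 kHz); none of these values or helper names come from the PR. It compares a single inner product at one fixed kernel position against the average energy of the filter output over the whole frame.

```python
import numpy as np

# Assumed toy parameters, chosen only for illustration.
sr = 8000          # sample rate in Hz
frame_len = 80     # 10 ms analysis frame
kernel_len = 8     # 1 ms kernel
freq = 2000.0      # kernel centre frequency in Hz
fixed_pos = 20     # the one position where the single inner product is taken

n = np.arange(kernel_len)
kernel = np.hanning(kernel_len) * np.exp(2j * np.pi * freq * n / sr)


def frame_with_burst(offset):
    """Return a frame that is silent except for one copy of the kernel."""
    frame = np.zeros(frame_len, dtype=complex)
    frame[offset:offset + kernel_len] = kernel
    return frame


def single_position_response(frame):
    """Inner product with the kernel at one fixed position in the frame."""
    return np.vdot(kernel, frame[fixed_pos:fixed_pos + kernel_len])


def average_energy(frame):
    """Mean squared magnitude of the filter output over every position."""
    # np.correlate conjugates its second argument, so this slides the
    # kernel across the frame as a matched filter.
    out = np.correlate(frame, kernel, mode="valid")
    return np.mean(np.abs(out) ** 2)


offsets = [20, 22, 45]
responses = {k: single_position_response(frame_with_burst(k)) for k in offsets}
energies = {k: average_energy(frame_with_burst(k)) for k in offsets}
```

Moving the burst by two samples flips the sign of the single-position response, and moving it further makes the response vanish, while the averaged energy is the same for all three offsets.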

bmcfee · 2015-11-13T17:30:16Z

> If you do a single calculation against the complex Fourier coefficients,
> you're most likely taking the inner product against one particular
> positioning of the IR, which will invite severe time-aliasing (under
> sampling).

Ok, this makes sense. Thanks.

Backing up a bit: maybe it's just silly to offer complex mode for pseudo-cqt in the first place? I suppose that anyone who cares enough about phase (e.g., for later cqt inversion or what-have-you) would just use the full cqt anyway. Does that seem reasonable to everyone else?

ebattenberg · 2015-11-14T05:34:27Z

> Backing up a bit: maybe it's just silly to offer complex mode for pseudo-cqt in the first place? I suppose that anyone that cares enough about phase (eg for later cqt inversion or what-have-you) would just use the full cqt anyway. Does that seem reasonable to everyone else?

Yeah. The Pseudo CQT is an alternative to averaging and downsampling the CQT for higher frequency bands. It should be more efficient but is not exactly equivalent. Though I don't know how well inverting the CQT will go if you average and downsample in time to get down to the desired hop size.

ebattenberg · 2015-11-14T05:38:06Z

> Is this really a meaningful distinction, or is there a reference implementation/description of pcqt that we adhere to?

I just googled "pseudo CQT" to get the exact definition, but just got links to librosa. ;)

bmcfee · 2015-11-14T14:20:48Z

> I just googled "pseudo CQT" to get the exact definition, but just got links to librosa

I guess the spirit of my question was more: is "pseudo CQT" a precise term, or do we have some wiggle room in its definition? At any rate, DAn's comment makes sense, and I'm happy to keep pcqt as magnitude-only.

bmcfee · 2016-02-01T15:20:56Z

I don't think there's anything left to do on this one. I have a working (but terribly slow) inverse-cqt implemented in a notebook, but I don't think perfecting inverse-cqt should hold up complex-cqt.

Does anyone object to merging?

bmcfee · 2016-02-02T21:23:06Z

In working out #302, it occurred to me that we should probably rename the resolution parameter for cqt functions, because: A) it's not all that descriptive, and B) it conflicts with resolution in tuning, which has a much more precise interpretation. Since this PR has other cqt changes, we may as well consolidate.

Do folks on this thread have any thoughts on a better name for resolution? I'd hesitate to use something like q_factor since the resolution parameter is just one contributing factor to Q here, and I don't want to be misleading. Eric H suggested something like width. Maybe filter_scale?

(And don't worry, we have a warning shim that will provide backward compatibility now, so the old argument name will continue to work.)
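For readers unfamiliar with this kind of shim, here is a minimal sketch of how a renamed keyword argument can stay backward compatible. The helper `rename_kw`, the function `transform_stub`, and the default value of 1 are all hypothetical names and values chosen for illustration; this is not the shim used in librosa.

```python
import warnings


def rename_kw(old_name, old_value, new_name, new_value):
    """Hypothetical helper: prefer the old keyword if the caller still uses it."""
    if old_value is None:
        # The caller did not pass the deprecated name; use the new one.
        return new_value
    warnings.warn(
        "'{}' is deprecated; use '{}' instead.".format(old_name, new_name),
        DeprecationWarning,
        stacklevel=3,
    )
    return old_value


def transform_stub(filter_scale=1, resolution=None):
    """Hypothetical function that accepts both the new and the old name."""
    filter_scale = rename_kw("resolution", resolution,
                             "filter_scale", filter_scale)
    # A real transform would go on to use filter_scale here.
    return filter_scale
```

Calling `transform_stub(resolution=2)` still works but emits a `DeprecationWarning`, while `transform_stub(filter_scale=2)` is silent.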

bmcfee · 2016-02-04T03:50:08Z

> I don't think there's anything left to do on this one. I have a working (but terribly slow) inverse-cqt implemented in a notebook, but I don't think perfecting inverse-cqt should hold up complex-cqt.

A fast inverse CQT implementation is here; the gist won't store audio buffers, so you'll have to download it and run it on your own favorite example. I think this basically demonstrates that the complex cqt is doing the right thing.

(Caveats abound: icqt sounds lousy for resolution > 1, and frequency over-sampling actually degrades quality because it uses filter^H, the conjugate transpose of the filter basis, instead of a proper basis inversion. SNR is also rather low, mostly, I suspect, due to the low-pass filtering inherent in this cqt implementation, but to my ear it sounds pretty good.)
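The difference between applying the conjugate transpose and performing a proper basis inversion can be seen in a small linear-algebra sketch. The matrix `basis` below is a random over-complete stand-in (an assumption for illustration), not the CQT filter basis, and the pseudo-inverse is just one possible form of proper inversion.

```python
import numpy as np

# Random over-complete analysis basis: more "bins" than signal dimensions.
# This is a placeholder, not an actual CQT filter bank.
rng = np.random.default_rng(0)
n_bins, n_samples = 12, 8
basis = (rng.standard_normal((n_bins, n_samples))
         + 1j * rng.standard_normal((n_bins, n_samples)))

x = rng.standard_normal(n_samples)
coeffs = basis.dot(x)  # forward "transform"

# Reconstruction with the conjugate transpose (the filter^H approach).
x_adjoint = basis.conj().T.dot(coeffs)

# Reconstruction with the Moore-Penrose pseudo-inverse.
x_pinv = np.linalg.pinv(basis).dot(coeffs)

err_adjoint = np.linalg.norm(x - x_adjoint) / np.linalg.norm(x)
err_pinv = np.linalg.norm(x - x_pinv) / np.linalg.norm(x)
```

The conjugate transpose only inverts the transform when the basis rows form an orthonormal set (or a suitably scaled tight frame); for a redundant, non-orthogonal basis like this one, the pseudo-inverse recovers `x` to numerical precision while the adjoint does not.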

bmcfee · 2016-02-04T17:25:02Z

Final to-dos:

decide on new name for resolution
implement deprecation warning for real=True

bmcfee added the enhancement and functionality labels Nov 9, 2015

bmcfee self-assigned this Nov 9, 2015

bmcfee added this to the 0.4.2 milestone Nov 9, 2015

bmcfee force-pushed the complex-cqt branch 2 times, most recently from 5a96590 to bcbd151 Compare December 8, 2015 14:42

bmcfee force-pushed the complex-cqt branch 2 times, most recently from 9345878 to 894e002 Compare January 7, 2016 16:15

bmcfee force-pushed the complex-cqt branch from 894e002 to a683f58 Compare January 14, 2016 04:04

bmcfee force-pushed the complex-cqt branch from a683f58 to 9b5d482 Compare February 1, 2016 16:02

bmcfee added 4 commits February 2, 2016 16:18

modifications necessary to retain phase information in CQT

4e71a68

added dtype condition to utils.sync test

12d7bcf

reverted pseudo_cqt to force real-value

af3435e

updated cqt docstring

fb9527f

bmcfee force-pushed the complex-cqt branch from 9b5d482 to fb9527f Compare February 2, 2016 21:18

bmcfee mentioned this pull request Feb 4, 2016

inverse CQT #165

Closed

bmcfee added 2 commits February 4, 2016 17:24

renamed cqt resolution to filter_scale

aa4c67a

added repr string to deprecator object

2cd2fad

added a warning for cqt(real=True)

0ebb9f2

bmcfee force-pushed the complex-cqt branch from 7e46a83 to 0ebb9f2 Compare February 5, 2016 00:40

bmcfee added 2 commits February 4, 2016 20:02

hybrid_cqt now explicitly discards phase in cqt call

6f8da02

suppressing warnings in test suite

9770d06

bmcfee added a commit that referenced this pull request Feb 5, 2016

Merge pull request #279 from bmcfee/complex-cqt

ce00692

Further CQT enhancements

bmcfee merged commit ce00692 into master Feb 5, 2016

bmcfee deleted the complex-cqt branch February 5, 2016 01:52
