Interchangeability of cauchy kernel methods #81

tommybotch · 2022-12-24T02:19:44Z

Hi Albert,

I've been training an image generation model using the S4 module and the cauchy extension (compiled code on local machine). Within a model trained with the cauchy extension, would you expect performance differences if the naive implementation (slow kernel) was used for evaluation? Or is the naive implementation less robust than the extension?

My goal is to visually inspect a few things, but am experiencing problems using the extension in a jupyter notebook (even when placing all tensors/models on a GPU).

Thanks again for your time,
Tommy

albertfgu · 2023-01-04T17:09:48Z

Sorry for the late response. I have never tried this; I think it might depend on the problem characteristics. It's likely that there are small numerical differences between the implementations, and they might compound depend on how you're using them (e.g. if it's for autoregressive generation which involves repeated computations, the differences might compound).

I've never tried using the extension in a notebook. I think sometimes it can be tricky getting the notebook to use the same pip environment which has the extension, can that be related to the issue? Is it unable to find the extension at all, or is it trying to use the extension and giving errors?

tommybotch · 2023-01-04T17:23:29Z

Thank you for your reply! Yeah, this was also my guess for what is happening (and seems like a trade-off between training and inference speed). I managed to solve the problem in my notebook by pushing the model onto a GPU since the extension seems to require CUDA.

Small note - lines 78 and 102 of cauchy.py have a small syntax error where:

if not v.is_cuda and z.is_cuda and w.is_cuda: raise NotImplementedError(f'Only support CUDA tensors')

doesn't end up throwing an error unless it is changed to the following:

if not (v.is_cuda and z.is_cuda and w.is_cuda): raise NotImplementedError(f'Only support CUDA tensors')

I can open a pull request if that's helpful. Thanks again for the help!

albertfgu · 2023-01-05T19:57:18Z

Thanks for the catch! I've made the fix and it will be incorporated in the next release.

tommybotch changed the title ~~Interchangeability of cauchy kernels~~ Interchangeability of cauchy kernel methods Dec 24, 2022

tommybotch closed this as completed Jan 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interchangeability of cauchy kernel methods #81

Interchangeability of cauchy kernel methods #81

tommybotch commented Dec 24, 2022

albertfgu commented Jan 4, 2023

tommybotch commented Jan 4, 2023 •

edited

albertfgu commented Jan 5, 2023

Interchangeability of cauchy kernel methods #81

Interchangeability of cauchy kernel methods #81

Comments

tommybotch commented Dec 24, 2022

albertfgu commented Jan 4, 2023

tommybotch commented Jan 4, 2023 • edited

albertfgu commented Jan 5, 2023

tommybotch commented Jan 4, 2023 •

edited