v0.9.0 #702

sammlapp · 2023-04-26T05:09:10Z

This PR includes major new features and some breaking changes to OpenSoundscape, including

fully featured localization module for localizing sounds from a spatial array of synchronized recorders
class activation mapping for visualizing sample activation in deep learning models
refactoring of ml (formerly torch) modules including new sample module

merge hotfix from master to develop

578: tutorial download links. Resolves #578

datasets now return Sample class dataloaders should use opso.sample.collate_samples as collate_fn argument in order to properly collate data and labels for training/prediction AudioSample object is created by AudioFileDataset (or AudioClipDataset) and passed to the preprocessor. Each Action now recieves and returns the Sample object, which eliminatees the ugly _extra_args implementation. The Action() class retains a simple user-friendly idea of being able to accept a function that acts on data (not a sample) by changing the Action.go() method to run the action_fn on sample.data and update sample.data (then return the sample). Also, train and predict now expect batches from dataloaders that have dictionary keys "samples" and "labels" (instead of "X" and "y"). tutorials and tests will be broken, I havent modified them

next: consider removing dependency of external package and implementing the cam class in opso instead (it has cv2 dependency and manipulates the model in ways we might want to avoid/control ourselves)

added to docstring

I refactored the use of DataLoader by changing collate function to simply return the list of AudioSamples (rather than dictionary of batched tensors for 'samples' and 'labels'). This allows us to retain information about the AudioSamples (especially important if they are modified during preprocessing). The collate_samples function is now used after interating the dataloader (iterating the dataloader creates a list of AudioSamples). salieny_map now returns a list of AudioSamples as well. The returned samples have an attribute .activation_map (type ActivationMap) which can be plotted etc.

note that Actions now modify a sample in-place (updating it's .data and maybe other attributes) - this is now reflected in the tests

Spectrogram's setattr raising AttributeError means that copy.deepcopy() will fail. As a workaround, if AttributeError is raised when trying to copy sample.data in to saple.trace[], it assigns the original object instead of copying (if immutable this isnt an issue). However, this isn't a good solution, and change immutable class implementation when #671 is addressed it should be changed

watch() will log histograms of parameter and gradient values every n epochs for each module in the torch model

this works now, and avoid error on Save by removing all forward/backward hooks from saved model (unless user specifies save_hooks=True in CNN.save()).

Torch modules should simply be "called" ie Module(input) rather than Module.forward(input). The forward() function will bypass forward hooks.

Implement wandb.watch in CNN.train()

cam module now has CAM class which stores and plots base image, activation maps per class, and guided back propagation per class cnn now has method called generate_cams, which returns AudioSample objects with .cam as an instance of CAM next steps: add examples to tutorials, add tests

Added from_url method to load audio from downloaded url data (following SoundFile documentation) Also added methods to display interactive audio widget. The audio automatically displays as a widget now in Jupyter noteboks, using IPython.disply.Audio. The user can generate the widget (ie in a loop) by calling Audio.show_widget()

one test failing `test_generate_cams_num_workers` gives error about pickling when num_workers is 2, need to investigate

also test gillette for each receiver as reference and remove warning for centering in soundfinder

also asserts dims in (2,3) for soundfinder

Adds full localization module for localizing sounds from time-synchronized recording arrays Could use more test coverage and needs a demo notebook

these arguments got un-exposed during a refactor, but invert in particular is needed when loading models from old opso versions

also add sample module to __init__.py

for some reason, ReadTheDocs was failing (ModuleNotFoundError) when running the ribbit tutorial notebook. I just copied all the cells to a new notebook then renamed it to the same name as the previous one, and it builds fine locally for me now.

update docs to reflect supported python versions also update sentry-sdk to address #680 ran `poetry lock --no-update` because `poetry lock` hangs

docs/installation/mac_and_linux.md

adds tests for sample module

this function was totally broken, but didn't realize it because it was catching errors. I updated it to use AudioSample so that it is compatible with the current codebase. I added the flag raise_errors and added tests with raise_errors=True. I also changed the name of the `wandb` module to `logging` to avoid name conflicts with the wandb package.

syunkova · 2023-04-28T05:11:07Z

README.md

typo: don't need a comma after 3.8 in line 20... doesn't matter much

sammlapp and others added 30 commits January 12, 2023 15:46

not working, just ideas

2d4bf1d

Merge pull request #651 from kitzeslab/master

bcb4f45

merge hotfix from master to develop

Changed link in annotaqtion tutorial

361956b

Moved download links in tutorials box --> onedrive

aba8d40

moved download links to kitzeslab google drive

5db9689

Merge pull request #658 from kitzeslab/578_links

157ace6

578: tutorial download links. Resolves #578

move logic to create dataloder to intern. function

e6989c0

working prototype

c505bcd

next: consider removing dependency of external package and implementing the cam class in opso instead (it has cv2 dependency and manipulates the model in ways we might want to avoid/control ourselves)

add helpful repr method for AudioSample

bea5e4d

clarify purpose of AudioSplittingDataset

e02cc9c

added to docstring

update tests

a288a64

note that Actions now modify a sample in-place (updating it's .data and maybe other attributes) - this is now reflected in the tests

update tests and debug (2 tests still failing)

ff18ddf

update preprocessor comments/docstrings

7157e6b

fix behavior for predict() w zero samples

e32b9e6

add wandb.watch in CNN.train()

a276522

watch() will log histograms of parameter and gradient values every n epochs for each module in the torch model

change implementation of wandb.watch

35b27c3

this works now, and avoid error on Save by removing all forward/backward hooks from saved model (unless user specifies save_hooks=True in CNN.save()).

replace .forward() with __call__()

902d1b8

Torch modules should simply be "called" ie Module(input) rather than Module.forward(input). The forward() function will bypass forward hooks.

correct comment steps-> batches

f4eee16

Merge pull request #675 from kitzeslab/feat_wandb_watch

afa4fd7

Implement wandb.watch in CNN.train()

Merge branch 'develop' into feat_gradcam_sl

1af4dfc

working cam + gbp for multiple classes

0f02502

change bypass_augmentations default to True

c353968

add tests and default target layers

f5a5556

one test failing `test_generate_cams_num_workers` gives error about pickling when num_workers is 2, need to investigate

add identity helper function

f69b5e5

add pytorch_grad_cam dependency

73d06ce

sammlapp added 13 commits April 25, 2023 10:50

correct min-> min(abs())

6836dfb

also test gillette for each receiver as reference and remove warning for centering in soundfinder

full refactor of localization working

14050b7

update audio files for correct delays

0c79a8d

Merge branch 'feat_loca_events' into feat_refact_localizer

40f9c3a

add missing 'order' arg to bandpass

070e724

documentation of localization algorithms

b7b0975

also asserts dims in (2,3) for soundfinder

Merge pull request #697 from kitzeslab/feat_refact_localizer

5453a57

Adds full localization module for localizing sounds from time-synchronized recording arrays Could use more test coverage and needs a demo notebook

docs/comments updates for localization module

6bf9359

expose invert and colormap in SpectrogramToTensor

7cb4dcd

these arguments got un-exposed during a refactor, but invert in particular is needed when loading models from old opso versions

add pytorch_grad_cam to autodoc_mock_imports

876e87b

update tutorials for 0.9.0

10cec99

update version number

e164811

also add sample module to __init__.py

re-save notebooks

324c0ef

sammlapp added this to the 0.9.0 milestone Apr 26, 2023

sammlapp added 6 commits April 26, 2023 01:24

black formatting

a82706a

update ribbit notebook

8fae612

for some reason, ReadTheDocs was failing (ModuleNotFoundError) when running the ribbit tutorial notebook. I just copied all the cells to a new notebook then renamed it to the same name as the previous one, and it builds fine locally for me now.

update dependencies: no longer support python 3.7

cd170d2

update docs to reflect supported python versions also update sentry-sdk to address #680 ran `poetry lock --no-update` because `poetry lock` hangs

no longer support python 3.7

7ddb890

update version in suggested citation

7a0b09e

correct path to sample module in modules.rst

90a674c

louisfh reviewed Apr 27, 2023

View reviewed changes

docs/installation/mac_and_linux.md Outdated Show resolved Hide resolved

sammlapp added 5 commits April 27, 2023 13:29

add categorical_labels to sample, and add tests

6329cad

adds tests for sample module

python version support correction in docs

a5ac95e

add docs/tutorials/wandb to gitignore

f73fd7b

rerun cnn notebook

016d187

syunkova self-requested a review April 28, 2023 04:42

syunkova reviewed Apr 28, 2023

View reviewed changes

README.md

Copy link

Collaborator

syunkova Apr 28, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

typo: don't need a comma after 3.8 in line 20... doesn't matter much

syunkova approved these changes Apr 28, 2023

View reviewed changes

sammlapp merged commit 03e8661 into master Apr 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.9.0 #702

v0.9.0 #702

sammlapp commented Apr 26, 2023

syunkova Apr 28, 2023

v0.9.0 #702

v0.9.0 #702

Conversation

sammlapp commented Apr 26, 2023

syunkova Apr 28, 2023

Choose a reason for hiding this comment