Major updates! #44

mese79 · 2025-06-15T18:48:23Z

This is a major update over several parts (sorry for many changes)!
I needed to add an option for extracting image embeddings without patching, so I decided while adding this feature, make a major refactoring as well.

Added a checkbox for no_patching option (only works for images with height == width)
- With this option checked, the whole image (slice) will be treated as a single patch, and this speeds up the feature extraction and prediction (beneficial for images equal or smaller than 512x512).
Made a dataset class: FFImageDataset
- A torch IterableDataset that yields images / patches in batch.
- The input image can be a numpy array, or a (large) stack file, or a directory of images.
- Images will be lazy-loaded using pims (except for numpy array image, which already loaded).
- This help unifying gui and pipeline script using the same set of functions.
~~Now zarr is being used as the feature storage file (old storages are not compatible anymore, sorry).~~
- ~~Mainly because of appending features of zarr array, and more control over compression.~~
run_pipeline.py can be used for large stack prediction on hpc without having a temporary feature store (faster). Also, it has an option for only extracting features into a zarr storage now.
Default overlap is now patch_size // 4 as opposed to patch_size // 2 before. This will speed up extraction and prediction (I need to check the result stats with this change).
Added more testing and typing.

codecov-commenter · 2025-06-15T18:52:29Z

Codecov Report

Attention: Patch coverage is 63.68564% with 134 lines in your changes missing coverage. Please review.

Project coverage is 28.18%. Comparing base (5183db5) to head (d26ae91).

Files with missing lines	Patch %	Lines
src/featureforest/_segmentation_widget.py	36.02%	87 Missing ⚠️
src/featureforest/_feature_extractor_widget.py	17.39%	19 Missing ⚠️
src/featureforest/models/SAM/adapter.py	12.50%	7 Missing ⚠️
src/featureforest/models/MobileSAM/model.py	40.00%	3 Missing ⚠️
src/featureforest/models/SAM2/model.py	57.14%	3 Missing ⚠️
src/featureforest/utils/data.py	83.33%	3 Missing ⚠️
src/featureforest/utils/extract.py	93.18%	3 Missing ⚠️
src/featureforest/utils/pipeline_prediction.py	88.00%	3 Missing ⚠️
src/featureforest/utils/dataset.py	96.42%	2 Missing ⚠️
src/featureforest/models/Cellpose/adapter.py	50.00%	1 Missing ⚠️
... and 3 more

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #44      +/-   ##
==========================================
+ Coverage   24.85%   28.18%   +3.33%     
==========================================
  Files          40       40              
  Lines        2394     2469      +75     
==========================================
+ Hits          595      696     +101     
+ Misses       1799     1773      -26

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

mese79 · 2025-06-15T19:00:56Z

I need to update documentation as well.

mese79 · 2025-06-18T12:44:36Z

Using zarr data storage seems to make RF training slow. :(
I would probably revert it back to HDF5.

mese79 added 30 commits June 11, 2025 16:40

added No Patching checkbox to extractor widget

2891520

updated models adapters to support no_patching

ca67604

updated segmentation widget

cb3d544

removed unused imports

411bd9e

fixed some typings

b9ce247

added the dataset class

2050125

init using the new dataset for feature extraction

928cdb0

init using zarr storage

1ce1ea0

moved up no patching checkbox

123bfd9

updated extractor widget and no_patching option

2d8d91e

updated segmentation widget using zarr storage

b5e6edf

clean up & formatting

3b68f8c

fixed extraction progress info

2999421

updated dataset to handle images type inside

75f80e4

updated prediction pipeline over a large stack

7f5b78c

fixed run pipeline; file dialogs open in parent dialog

5bfe8c9

updated run_pipeline script

6393b2a

removed unused import in run_pipeline

e959bf1

fixed models params: image height & width as int

b580968

ignored some typing

b7261df

updated requirements

bc624a2

fixed image height & width as int

b3d6720

fixed dataset & get_model_ready_image image dimention problem

32f4a26

added dataset test

f2b04d6

fixed adapter: concat double output into one tensor

0104b10

updated mobilesam test

8b9cc93

updated dino adapter test

f4c6239

added sam2 adapter test

227a8d4

added pipeline_prediction test

b3bd256

bumped version

d3fce05

run_pipeline script can be used to only extract features

5eb9ed0

mese79 requested a review from jdeschamps June 15, 2025 18:48

mese79 added 2 commits June 18, 2025 11:19

updated dependencies in pyproject.toml

0cc7b37

fixed bug for calculating padding with no_patching

1118177

mese79 added 2 commits June 20, 2025 22:13

revert back storage to hdf5; improved get_train_data

10acde5

fixed embedding_extraction test

d26ae91

mese79 merged commit e1ed0dc into main Jun 22, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Major updates! #44

Major updates! #44

Uh oh!

mese79 commented Jun 15, 2025 •

edited

Loading

Uh oh!

codecov-commenter commented Jun 15, 2025 •

edited

Loading

Uh oh!

mese79 commented Jun 15, 2025

Uh oh!

mese79 commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Major updates! #44

Major updates! #44

Uh oh!

Conversation

mese79 commented Jun 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Jun 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mese79 commented Jun 15, 2025

Uh oh!

mese79 commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mese79 commented Jun 15, 2025 •

edited

Loading

codecov-commenter commented Jun 15, 2025 •

edited

Loading