Always use IDs in plots and brain results #2614

brimoor · 2023-02-05T22:12:10Z

Updates methods like scatterplot() and the in-App embeddings backend to always rely on sample/label IDs when pulling data for plots.

Previously, these implementations assumed that the view on which embeddings/etc were computed has not changed in any way. Now, these methods will gracefully handle added/deleted data.

New tests are added to tests/intensive/plot_tests.py to verify that the plotting methods function as expected.

codecov · 2023-02-05T22:20:38Z

Codecov Report

Base: 62.53% // Head: 62.46% // Decreases project coverage by -0.08% ⚠️

Coverage data is based on head (a6a0eeb) compared to base (ff3d48a).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #2614      +/-   ##
===========================================
- Coverage    62.53%   62.46%   -0.08%     
===========================================
  Files          249      249              
  Lines        42168    42262      +94     
  Branches       347      347              
===========================================
+ Hits         26371    26399      +28     
- Misses       15797    15863      +66

Flag	Coverage Δ
app	`50.03% <ø> (-0.10%)`	⬇️
python	`99.39% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
tests/unittests/import_export_tests.py	`99.83% <100.00%> (+<0.01%)`	⬆️
app/packages/state/src/recoil/skeletonFilter.ts	`11.85% <0.00%> (-7.87%)`	⬇️
app/packages/state/src/recoil/pathFilters/index.ts	`34.55% <0.00%> (-0.78%)`	⬇️
app/packages/looker/src/overlays/keypoint.ts	`18.77% <0.00%> (+1.84%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

brimoor · 2023-02-06T16:24:02Z

fiftyone/server/routes/embeddings.py

                    patches_field, leaf
                )

-            labels = curr_view.values(label_field, unwind=True)
-            field = curr_view.get_field(label_field)
+            labels = view._get_values_by_id(


Updates embeddings backend to always pull color-by data using IDs

brimoor · 2023-02-06T16:27:53Z

fiftyone/core/plots/utils.py

+            )
+
+        if ids is not None and not is_frames:
+            values = samples._get_values_by_id(


All of the refactoring in the plotting methods was to achieve this line: when the user has provided IDs and is asking to pull values from a dataset via path/expression, use _get_values_by_id() to ensure that the correct values are pulled, even if samples doesn't correspond 1-1 with other data that may have been provided.

brimoor · 2023-02-06T16:29:59Z

tests/intensive/plot_tests.py

+    plot = fo.scatterplot(
+        points=points,
+        samples=dataset,
+        ids=ids,


This is an example of something that wouldn't previously work. User provided ids + points corresponding to a subset of the samples=dataset argument that they provided, but they provided paths/expressions for labels and sizes arguments.

Previously this would fail because dataset.values() would naively be used (which would result in too many labels/sizes). Now the ids argument is used to lookup the correct labels/sizes in the correct order corresponding to points/ids.

ritch

Embeddings.py changes LGTM - Other plot changes look fine but I'm less familiar with those.

brimoor added 3 commits February 5, 2023 16:14

adding _get_values_by_id() util

3bb151a

adding robust ID support to plotting utils

cf1164d

using _get_values_by_id()

0692153

brimoor added the enhancement Code enhancement label Feb 5, 2023

brimoor requested a review from a team February 5, 2023 22:12

brimoor self-assigned this Feb 5, 2023

lint

57cbb83

brimoor added 5 commits February 6, 2023 00:45

bug fixes

0f1d2ae

bug fix

1cba484

bug fixes

35bfe13

tweaks

2c0ed22

adding plot tests

2153393

brimoor commented Feb 6, 2023

View reviewed changes

dont use cache if not requested

a6a0eeb

ritch approved these changes Feb 6, 2023

View reviewed changes

brimoor merged commit 85d2f59 into develop Feb 6, 2023

brimoor deleted the save-ids branch February 6, 2023 21:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Always use IDs in plots and brain results #2614

Always use IDs in plots and brain results #2614

brimoor commented Feb 5, 2023 •

edited

codecov bot commented Feb 5, 2023 •

edited

brimoor Feb 6, 2023

brimoor Feb 6, 2023

brimoor Feb 6, 2023

ritch left a comment

Always use IDs in plots and brain results #2614

Always use IDs in plots and brain results #2614

Conversation

brimoor commented Feb 5, 2023 • edited

codecov bot commented Feb 5, 2023 • edited

Codecov Report

brimoor Feb 6, 2023

Choose a reason for hiding this comment

brimoor Feb 6, 2023

Choose a reason for hiding this comment

brimoor Feb 6, 2023

Choose a reason for hiding this comment

ritch left a comment

Choose a reason for hiding this comment

brimoor commented Feb 5, 2023 •

edited

codecov bot commented Feb 5, 2023 •

edited