v10 #110

bamdadsabbagh · 2023-05-31T10:40:01Z

id: "20230609-SSE-meeting-v10"
aliases:
- ".notes/20230609: SSE Meeting v10"
tags:
- "sse"
- "dev"
- "pro"
- "sound scape explorer"
date: 20230609

.notes/20230609: SSE Meeting v10

[toc]

PR v10

#110

Major enhancements

Note

Timeline strategy

Different audio file lengths

Delivery

ETA: mid/end June
Still in progress as of mid-July
- Merging business requirements to avoid undoing v10 new features
- Implementing new business requirements
- Improve codebase

TODOs left over for v11

Processing: Investigate migrating to Python 3.10
- We observe 6x better performance on Nicolas' M1 MacBook with 3.10 compared
  to 3.8
Processing/Config: Path importer for configuration file
- Use Lana's sse-config-importer
Processing/Config: Investigate the use of a default range
Front/Histogram: Combine meta properties
Front/FIX: Unable to open in Chromium an h5 file created with Docker
- Reproduce with the help of Rémi
- Could not reproduce.
Front/FIX: Reproduce the Firefox error when loading h5wasm in a web worker
- Import at top module error
- Could not reproduce.
Front/FIX: Console shows render and filter in an infinite loop
- Could not reproduce.
Front/Queries: Migrate to plotly
- Are queries still used? No users seem to use them, so this can be
  postponed.
Front/Queries: Adapt to dynamic filters and color scales
Front/Histogram: Add new traces to current plot
- Need combination
Front/Metas: Combine multiple meta properties
- Requires all derived data (volumes, matrices, pairings) to be computed in the Front

Notes from v9 PR #97

Questions about timeline strategy

How would you prefer specifying the starting date for integration?

1. Selected range
2. New specific setting

I vote for 2.

`Timeline` Strategy recap

Instead of referring to files, we now refer to a timeframe
If an integrated portion of time is empty, we drop it
If an integrated portion of time is partially filled with audio, we keep it
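The three rules above can be sketched in a few lines. This is only an illustration under stated assumptions: files are represented as hypothetical `(start, end)` pairs in seconds, and `integrate_timeline` is an invented name, not a function from the processing module.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start, end) in seconds, hypothetical representation


def integrate_timeline(
    files: List[Interval],
    start: float,
    end: float,
    integration: float,
) -> List[Tuple[Interval, List[int]]]:
    """Split [start, end) into fixed windows and list the files overlapping each one."""
    groups = []
    window_start = start
    while window_start < end:
        window_end = min(window_start + integration, end)
        # A file belongs to a window as soon as the two intervals overlap.
        overlapping = [
            index
            for index, (file_start, file_end) in enumerate(files)
            if file_start < window_end and file_end > window_start
        ]
        # Empty windows are dropped; partially filled windows are kept.
        if overlapping:
            groups.append(((window_start, window_end), overlapping))
        window_start = window_end
    return groups


# Two files: one of 90 s starting at 0 s, one of 30 s starting at 200 s.
files = [(0, 90), (200, 230)]
groups = integrate_timeline(files, start=0, end=300, integration=60)
```

With these inputs, the windows `[120, 180)` and `[240, 300)` contain no audio and are dropped, while `[60, 120)` is only half filled by the first file and is kept.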

TODOs timeline

Processing/Groups: Add
Processing/Indicators: Adapt to timeline
Processing/Volumes: Adapt to timeline
Processing/Matrices: Adapt to timeline
Processing/Pairings: Adapt to timeline
Front: Show file site as reference instead of file name or file index
- A Group (integrated interval / aggregation) can now reference multiple files
Front/Export: Add indicators
Front/Export: Add volumes
Front/Histogram: Show indicators (for single points) over x (time, ...)

TODOs meeting

Read upcoming documentation

Traceback (most recent call last):
  File "/home/sf39231h/git/sound-scape-explorer/processing/processing/actions/_all.py", line 15, in <module>
    run_files(env)
  File "/home/sf39231h/git/sound-scape-explorer/processing/processing/actions/run_files.py", line 30, in run_files
    extractor.yield_and_store_features(
  File "/home/sf39231h/git/sound-scape-explorer/processing/processing/extractors/ConfigFilesExtractor.py", line 119, in yield_and_store_features
    storage.write_features(
  File "/home/sf39231h/git/sound-scape-explorer/processing/processing/storage/Storage.py", line 630, in write_features
    flat_features.append(features[f][s])
IndexError: list index out of range

Notes Rémi

Notes business

only 30 points over 1h with 60s integration? to be checked on my side...
odd timeline with 3600 (which misses 1 day out of 2) vs 7200,
probably due to the resolution of the blocks in the bar?
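As a sanity check for the first note, the expected number of points can be computed directly. The sketch below assumes the hour is fully covered by audio, so that no window is dropped; `expected_groups` is a hypothetical helper name.

```python
def expected_groups(duration_s: int, integration_s: int) -> int:
    """Number of integration windows needed to cover a duration (ceiling division)."""
    return -(-duration_s // integration_s)


# One hour of continuous audio with 60 s integration.
expected = expected_groups(3600, 60)
```

Under that assumption, 60 points would be expected, so seeing 30 suggests that half of the windows are empty or otherwise missing.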

TODOs from JR implementation

TODOs during implementation

Project delays

List of reasons for the late deliveries:

Project specifications changed relative to the existing codebase
- More refactoring was needed
- More time spent analysing all available options
Optimistic estimations
- With new grouping and site handling

Commits containing BREAKING CHANGEs

From most recent to oldest.

bamdadsabbagh · 2023-07-11T20:35:22Z

id: "20230711-SSE-meeting-implementing-new-JR-method"
aliases:
- "SSE Meeting: Implémentation nouvelle méthode JR"
tags:
- "sse"
- "dev"
- "pro"
- "sound scape explorer"
date: 20230711

SSE Meeting: Implementing the new JR method

[toc]

Useful links

Google Colab
Utils script
Google Drive Folder
Forked Google Colab
- 20230711 RESULTATS LANA 2.ipynb
- Colab link
Forked utils script
- utilsENESSubV3.py
Link to this document as GitHub comment

Jeremy's presentation

With the new implementation, the computation method changes for:

Silhouettes
Autocluster
Trajectories (new)

These changes are welcome, both to validate the method and to improve
computation performance.

Method

Starting from the data generated by VGGish, we generate n UMAPs with a random seed.

Note

For the paper, we will generate n = 100 UMAPs.

In practice, however, 50 iterations are enough to reach a difference rate < 1%.

The UMAPs converge fairly quickly (iterating reduces the randomness
of this algorithm).

The requested number of dimensions must be neither too high (above ~10)
nor too low (below ~3).

For Lana's campaign, a sweet spot of 5 dimensions was determined.

Note

Below 5 dimensions, we consider that too much data is lost.

From these n 5D UMAPs, we produce:

Generation of 1 trajectory
- By averaging the coordinates across all UMAPs, then tracing the trajectory.
- From this trajectory, we derive trajectory-specific indicators
  that make trajectories comparable with one another (the mean of the
  distances and the 95th percentile).
- The 5th percentile can be added and the result displayed as a candlestick-style plot.
Generation of n distance matrices, which are averaged into one single
distance matrix.
- This distance matrix is then used to compute the Silhouette and
  Autocluster indices.
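The trajectory branch can be sketched as follows. This is a minimal illustration, not the notebook's code: the function names are hypothetical, the embeddings are assumed to list the same points in the same chronological order, and "distances" is interpreted here as the distances between consecutive trajectory points, which the notes do not state explicitly. The toy data is 2D for readability; the notes use 5 dimensions.

```python
import math
import statistics
from typing import List, Sequence

Embedding = Sequence[Sequence[float]]  # one point per group, in time order


def mean_embedding(embeddings: Sequence[Embedding]) -> List[List[float]]:
    """Average each point's coordinates across all embeddings."""
    count = len(embeddings)
    return [
        [sum(axis) / count for axis in zip(*same_point)]
        for same_point in zip(*embeddings)
    ]


def step_distances(trajectory: Sequence[Sequence[float]]) -> List[float]:
    """Euclidean distance between each pair of consecutive points (assumption)."""
    return [math.dist(a, b) for a, b in zip(trajectory, trajectory[1:])]


def percentile(values: Sequence[float], q: float) -> float:
    """Percentile with linear interpolation, q in [0, 100]."""
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100
    lower, upper = math.floor(position), math.ceil(position)
    weight = position - lower
    return ordered[lower] * (1 - weight) + ordered[upper] * weight


# Two toy embeddings of the same three points.
runs = [
    [[0, 0], [3, 4], [9, 12]],
    [[2, 0], [5, 4], [11, 12]],
]
trajectory = mean_embedding(runs)
steps = step_distances(trajectory)
indicators = {
    "mean": statistics.mean(steps),
    "p05": percentile(steps, 5),
    "p95": percentile(steps, 95),
}
```

The `p05` and `p95` values are what a candlestick-style display would use as its lower and upper bounds around the mean.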

Data flow

Files (VGGish)
Groups (Integration)
Robust scale (built into the UMAP computation)
n UMAP 5D
- random_seed: None
- min_dist: 0
- distance_metric: manhattan
Distance matrix
- metric: euclidean
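The last step of this flow, from n embeddings to one averaged distance matrix, can be sketched with the standard library alone. The embeddings are taken as given here (in the notes they come from UMAP with the parameters listed above); the function names are hypothetical and the toy data is 2D rather than 5D.

```python
import math
from typing import List, Sequence

Embedding = Sequence[Sequence[float]]


def distance_matrix(points: Embedding) -> List[List[float]]:
    """Pairwise euclidean distances between the points of one embedding."""
    return [[math.dist(a, b) for b in points] for a in points]


def mean_distance_matrix(embeddings: Sequence[Embedding]) -> List[List[float]]:
    """Average the n per-embedding distance matrices element by element."""
    matrices = [distance_matrix(points) for points in embeddings]
    count = len(matrices)
    size = len(matrices[0])
    return [
        [sum(matrix[i][j] for matrix in matrices) / count for j in range(size)]
        for i in range(size)
    ]


# Two toy embeddings of the same two points.
runs = [
    [[0, 0], [3, 4]],   # distance 5
    [[0, 0], [6, 8]],   # distance 10
]
averaged = mean_distance_matrix(runs)
```

Averaging distances rather than coordinates does not require the embeddings to share an orientation, which may be one reason the notes feed this matrix, and not the averaged coordinates, to the Silhouette and Autocluster computations.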

Note

As a reminder, the UMAPs used for display (2D, 3D) use
the following parameters:

random_seed: 42000

min_dist: 0.1

distance_metric: manhattan

New configuration settings

UMAP dimensions: 5
UMAP iterations: 50
hdbscan
- min_cluster_size: 15
- min_samples: None (min_cluster_size)
- alpha: 1
- epsilon: 0.1
- algo: eom | leaf

See TODOs for new Autocluster panel

Publishing settings

UMAP dimensions: 5
UMAP iterations: 100
hdbscan
- min_cluster_size: 50 | 100
  - To be determined by Jeremy Rouch & Lana Minier
- min_samples: None
- alpha: 1
- epsilon: 0
- algo: eom & leaf
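One way to keep the default and publishing profiles side by side is a pair of frozen dataclasses. This is a sketch only: the class and field names are hypothetical and do not reflect the project's actual configuration format, the default algorithm `"eom"` is an assumption (the notes list `eom | leaf` without naming a default), and `min_cluster_size=50` is just one of the two candidates still to be decided.

```python
from dataclasses import dataclass, field, replace
from typing import Optional, Tuple


@dataclass(frozen=True)
class HdbscanSettings:
    min_cluster_size: int = 15
    min_samples: Optional[int] = None  # None falls back to min_cluster_size
    alpha: float = 1.0
    epsilon: float = 0.1
    algos: Tuple[str, ...] = ("eom",)  # assumed default; "eom" and/or "leaf"

    def effective_min_samples(self) -> int:
        """Resolve the None fallback described in the notes."""
        if self.min_samples is None:
            return self.min_cluster_size
        return self.min_samples


@dataclass(frozen=True)
class AnalysisSettings:
    umap_dimensions: int = 5
    umap_iterations: int = 50
    hdbscan: HdbscanSettings = field(default_factory=HdbscanSettings)


DEFAULT = AnalysisSettings()

# Publishing profile: more iterations, no epsilon, both selection methods.
PUBLISHING = replace(
    DEFAULT,
    umap_iterations=100,
    hdbscan=HdbscanSettings(
        min_cluster_size=50,  # 50 or 100, still to be determined
        epsilon=0.0,
        algos=("eom", "leaf"),
    ),
)
```

Deriving the publishing profile from the default with `replace` keeps the shared values (5 dimensions, `alpha` of 1) in a single place.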

TODOs

See [[20230609-SSE-meeting-v10]]


sonarcloud · 2023-08-31T20:10:29Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
11 Code Smells

No Coverage information
0.0% Duplication

bamdadsabbagh self-assigned this May 31, 2023

bamdadsabbagh added the enhancement New feature or request label May 31, 2023

bamdadsabbagh added this to the v10 milestone May 31, 2023

bamdadsabbagh force-pushed the next/v10 branch from cf50dde to 257bf4c Compare June 12, 2023 14:06

bamdadsabbagh force-pushed the next/v10 branch 3 times, most recently from 56f8b0e to 7c76c45 Compare July 8, 2023 14:23

bamdadsabbagh force-pushed the next/v10 branch 13 times, most recently from 1a55af9 to 6968128 Compare July 17, 2023 20:56

bamdadsabbagh force-pushed the next/v10 branch 9 times, most recently from f848b0e to 028caa5 Compare July 21, 2023 15:01

bamdadsabbagh added 27 commits August 31, 2023 02:42

feat(Front/Loading): Add reading details

77a2367

feat(Processing/Trajectories): Trace for specific label property/value + Store relative timestamps

1985653

perf(Front/Storage): Simplify storage load component

8945979

fix(Front/Selection): Make screenshotting available anytime

e48d93e

perf(Front/Import): Sort imports

e9d3e7a

feat(Front/Trajectories): Improve coloring and behaviour

6175d52

feat(Front/Trajectories): Improve coloring of fused trace

85d5dc8

feat(Processing/Trajectories): Allow monthly traces

a8fde81

fix(Processing/Actions): Remove site printing when refreshing configuration

8c6258c

fix(Processing/Trajectories): Adapt configuration and storage reconstructs to numbers and not strings

fd371d9

feat(Processing): Allow user to pass any yaml file

55f4132

feat(Processing): Add template sse.yaml file

79c23f3

fix(Front/Details): Display indicators

0b85ba0

perf(Front): Remove three from vite bundle splitting

57b99f5

perf(Front/Time): Remove old TODO

43dd666

feat(Front/Export): Add site

984aa9f

feat(Processing/Export): Add site

019931a

fix(Processing/Export): Better column naming

3a9a7d7

fix(Processing/Actions): Improve console outputs

b84f6fb

fix(Processing/Menu): Improve wording for digests

f45a52d

docs: Remove very old TODO file

5f5dfc1

docs: Update documentation

6ab3fd4

feat(Examples): Update template configuration file

04f6c8b

feat(Examples): Add coral reef campaign example

afac5aa

docs: Update README with coral reef light example

6ebf96a

Merge branch 'main' into next/v10

aa5e54e

perf(Processing): Remove merge artefacts

d2ff9c5

bamdadsabbagh merged commit a269d98 into main Aug 31, 2023
8 checks passed

bamdadsabbagh deleted the next/v10 branch August 31, 2023 20:11
