v2.4.0 — identify fan-out + colleague example notebooks

@borgi-s borgi-s released this 09 Jun 08:58

Install

pip install dfxm-geo==2.4.0

(conda-forge follows once the autotick-bot feedstock PR merges.)
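
After installing, the version recorded in the package metadata can be checked with the standard library. This is a minimal sketch: `installed_version` is a hypothetical helper name, and the only project-specific value is the distribution name `dfxm-geo` from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "dfxm-geo" is the distribution name used in the pip command above.
    print(installed_version("dfxm-geo"))
```

Returning None instead of raising keeps the check usable in environments where the package is not installed yet.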

What's new

Identification fan-out

  • dfxm-identify --seed INT — reproducible / shardable identification runs.
  • scripts/fanout.py --mode {forward,identify} — in-node launcher now drives
    identification as well as forward runs.
  • Fixed lsf/identify_array.bsub: it claimed a per-task seed offset but passed
    none, so all array tasks drew identical samples. Each array task now gets
    its own seed, derived from LSB_JOBINDEX.
  • __version__ now derives from installed package metadata (was a stale
    hardcoded string).
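
The per-task seeding fix can be illustrated with a short sketch. It assumes an additive offset (base seed plus array index); the actual scheme in lsf/identify_array.bsub is not shown in these notes. `task_seed` and `identify_command` are hypothetical helper names, and only the `--seed` flag and the `LSB_JOBINDEX` variable come from the release text.

```python
import os


def task_seed(base_seed, env=None):
    """Offset a base seed by the LSF array index so each task samples differently."""
    env = os.environ if env is None else env
    # Assumption: a missing LSB_JOBINDEX (run outside an array job) counts as 0.
    index = int(env.get("LSB_JOBINDEX", "0"))
    return base_seed + index


def identify_command(base_seed, env=None):
    """Build a dfxm-identify argument list; only --seed is taken from the notes."""
    return ["dfxm-identify", "--seed", str(task_seed(base_seed, env))]


if __name__ == "__main__":
    # Simulate three array tasks sharing one base seed.
    for i in (1, 2, 3):
        print(identify_command(1000, {"LSB_JOBINDEX": str(i)}))
```

With a shared base seed, every array task ends up with a distinct and reproducible seed, which is the behavior the fix restores. Passing the same seed to all tasks, as before the fix, would make every shard draw the same samples.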

Colleague-ready examples (examples/)

  • identification_ml_tutorial/ — self-contained tutorial: build kernel →
    dfxm-identify → HDF5 layout → per-image labels → images → scaling to 100k+.
    Regenerates its own inputs; runs end to end from a fresh clone.
  • cluster_showcase/ — visualizes a real cluster identify sweep.
  • examples/README.md cross-links both and documents run requirements.

No dependency / entry-point / requires-python changes from 2.3.x.