Skip to content

Started to work on extracting label multisets from BigCAT #1

@hanslovsky

Description

@hanslovsky

Hi @shrucis1,
we finally started to work on extracting the label multisets from BigCAT (long overdue). This issue is just a FYI for you so you know what we are doing.

These are the relevant repositories:

I also forked your

I modified your code to use the new dependencies (label-multisets, n5-label-multisets) and to adapt to changes in other dependencies, e.g. n5. I also replaced jcommander by picocli, a more modern command line argument parser for Java (so far I am really convinced by it) and changed the high level structure: Now, bdv.bigcat.spark.SparkDownsampler creates a whole scale pyramid (s0 ... sN) at once (as defined by parameters). In n5, this will look like this:

+- /group/dataset/
|  s0/
|  s1/
...
|  sN/

We do not have an input dataset anymore but s0 is required to be present (can be a symlink which is also a parameter now). s0 is LabelMutlisetType already (and not any kind of Integertype), which can be created through your hdf-n5-converter (I modified that as well, see link above). I will sparkify the hdf-n5-converter as well, so it can be used for larger datasets.

Eventually, we will rename this repository, because it is not specific to/does not depend on BigCAT (anymore). Something like label-multisets-downsampler or label-multisets-downsampler-spark would probably be a good choice.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions