spool with zero patches for binary data #381

ahmadtourei · 2024-05-29T19:53:06Z

Description

Indexing with sp = dc.spool(data_path).update() runs to 100% and creates the index file in the data directory (verified by index_path_1 == index_path_2 == data_path == "/mnt/DAS/data_1", as shown below). However, printing the spool (sp) unexpectedly triggers indexing again and results in a spool with zero patches:

DASCore DirectorySpool 🧵 (0 Patches) Path: /mnt/DAS/data_1

index_path_1 = dc.spool(data_path, index_path=data_path).indexer.index_path
index_path_2 = dc.spool(data_path).indexer.index_path

Data is in binary ".raw" format. Each data directory has an XML file with metadata. I see the BinaryReader class in dascore.io but I'm unsure if we currently support this raw format with XML metadata.

Versions

OS: Ubuntu 22.04.3 2023.10.17 LTS
DASCore Version: 0.1.1
Python Version: 3.12.3

d-chambers · 2024-05-29T20:10:54Z

Interesting.

Can you check if the index actually exists, e.g., with spool.indexer.index_path.exists()?

I think what is happening is DASCore can't read any of the files in the directory, so it goes through all of them (hence the progress bar) but ends up not creating an index because it doesn't have any recognizable contents. Then when you print the spool it sees there is no index file and tries to index again.
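One way to test this explanation is to count how many files in the directory can be identified at all. The following is a minimal sketch, not part of DASCore: count_recognized is a hypothetical helper, and it assumes the format-detection callable passed in raises an exception for files it cannot identify.

```python
from collections import Counter
from pathlib import Path


def count_recognized(directory, get_format):
    """Tally files by (status, suffix) using a format-detection callable.

    `get_format` is assumed to raise an exception when it cannot
    identify a file; this helper is hypothetical, not a DASCore API.
    """
    counts = Counter()
    for path in sorted(Path(directory).rglob("*")):
        if not path.is_file():
            continue
        try:
            get_format(path)
        except Exception:
            # Broad catch on purpose: this is a diagnostic, and any
            # failure means the file would contribute no patches.
            counts[("unrecognized", path.suffix)] += 1
        else:
            counts[("recognized", path.suffix)] += 1
    return counts
```

With DASCore installed, something like count_recognized("/mnt/DAS/data_1", dc.get_format) should work, assuming dc.get_format raises for unknown formats. If every entry comes back as "unrecognized", an empty spool is the expected outcome.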

We don't currently support this file format, but perhaps we could. We don't yet support reading formats which have multiple files but Madagascar does something similar so it may be worth looking into. Is there a spec/example file you can share?

The BinaryReader isn't for a specific format; it's just a way that FiberIO subclasses tell DASCore they need to read the file in binary mode (e.g., open(data_path, 'rb')) as opposed to using pytables or h5py.
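To illustrate the idea of a marker type that requests binary mode, here is a standard-library-only sketch. It is not DASCore's implementation: the BinaryReader class, read_header, and call_with_resource below are all hypothetical stand-ins written for this example.

```python
import inspect
from pathlib import Path


class BinaryReader:
    """Hypothetical marker: 'hand me a file opened with mode rb'."""


def read_header(resource: BinaryReader, nbytes: int = 4) -> bytes:
    """Example reader that wants a binary handle, not a path."""
    return resource.read(nbytes)


def call_with_resource(func, path):
    """Open `path` according to the annotation on func's first parameter."""
    first_param = next(iter(inspect.signature(func).parameters.values()))
    if first_param.annotation is BinaryReader:
        # The annotation asked for binary mode, so open the file that way.
        with open(path, "rb") as handle:
            return func(handle)
    # Otherwise pass the path through and let the function open it.
    return func(Path(path))
```

The point of the design is that the reader function declares what kind of resource it needs, and the caller decides how to open the file, so a format plugin never has to manage file handles itself.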

ahmadtourei · 2024-05-29T21:53:24Z

Can you check if the index actually exists?

Yes, it returns True. However, I cannot see it in the directory using "ls -a" in the terminal.

I'll update you about sharing an example file tomorrow. Thanks!

d-chambers · 2024-06-07T18:34:16Z

closed by #384

ahmadtourei added the bug label May 29, 2024

ahmadtourei mentioned this issue Jun 4, 2024 in "FiberIO Directory Support and XML Binary support" #384 (merged)

d-chambers closed this as completed Jun 7, 2024
