Adding reader for TVIPS datastream files #2780

din14970 · 2021-06-25T19:37:42Z

This issue serves mainly to notify about something I'm currently working on, and to have something to reference. TVIPS is a very small German company that sells CMOS cameras that are fast and have good dynamic range. They seem to be decently popular in Germany. We have these cameras installed our JEOL instruments and use them for in-situ studies and various 4D-STEM experiments. This kind of data is collected in a quite inconvenient stream form, starting with a main header and each image frame preceded by a small frame header. In addition, files are capped at a certain size, so datasets continue by creating and saving to additional files (file_000.tvips, file_001.tvips, ...). There is very limited metadata included in the files, so users must typically reconstruct the shape of the data hypercube themselves. To deal with this data, I've previously written a GUI based converter program to convert the data to blo, hspy or tiff. However, it is annoying to have to duplicate very large datasets before one can work with them, and besides the additional wasted disk space it can take 10-30 min for a conversion. I think at this point I have enough knowledge to implement a file reader directly in Hyperspy. I envision adding the following arguments; most of them are only relevant for 4D-STEM datasets:

scan_shape: in case the dataset is a 4D STEM dataset and the original shape can not be automatically detected
first_frame: index of the first frame to include in the dataset in case it can not be automatically detected
last_frame: index of the last frame in case it can not be automatically detected
winding_scan: boolean, whether the scan unit operated under flyback mode or "snake-scan" mode
hysteresis: scan point offset of even scan rows to correct for miss-aligned "snake-scan" mode

By default, the data would just be loaded as an image stack unless scan_shape is defined.
The original implementation in my GUI converter relies on a loop over the files. I hope I can do something a bit smarter with np.memmap, even though the array data is non contiguous and possibly split over multiple files.

For original_metadata I was planning to only include the main header. However there is the possibility to record additional information like temperature in the frame metadata, so it might be handy to be able to also optionally load and return this information.

The text was updated successfully, but these errors were encountered:

magnunor · 2021-06-28T15:43:46Z

More file formats is always nice!

With regards to lazy loading + scan_shape combined with np.memmap, there is probably some clever things you can do with regards to the chunking to get optimal performance. For example making sure the chunks doesn't extend over several files, to avoid read amplification.

ericpre · 2022-03-26T13:41:44Z

Done in #2781.

din14970 mentioned this issue Jun 26, 2021

Add .tvips file reader plug-in #2781

Merged

9 tasks

ericpre closed this as completed Mar 26, 2022

ericpre added this to the v1.7 milestone Mar 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding reader for TVIPS datastream files #2780

Adding reader for TVIPS datastream files #2780

din14970 commented Jun 25, 2021

magnunor commented Jun 28, 2021

ericpre commented Mar 26, 2022

Adding reader for TVIPS datastream files #2780

Adding reader for TVIPS datastream files #2780

Comments

din14970 commented Jun 25, 2021

magnunor commented Jun 28, 2021

ericpre commented Mar 26, 2022