Change main delay calculation from simple for-loop to parallel for-loop #32

jlmaurer · 2020-04-27T03:44:30Z

Code is tools/RAiDER/delayFcns.py:

Line 123 in 8b397b1

with h5py.File(pnts_file, 'r') as f:

The code is chunked but it processes each chunk serially. A potentially major speed-up would be to add a parallel implementation here. Only reads are required; no data is written, but I'm not sure what happens if you try to open an HDF5 file in read-only mode (see perhaps here for a start). My hope is that we can just pass the file name to different processes (and if the file is chunked properly so we never try to read the same chunk) we can just use multiprocessing and do it that way. I.e. something like:

zipped_args = zip([filename]*len(other_args), other_arg_1, other_arg_2,...)
for arg_set in zipped_args:
    open the file in read-only mode and read the appropriate chunk
    do the delay calculation

But I don't know if HDF5 will actually allow that to work.

The text was updated successfully, but these errors were encountered:

jlmaurer · 2020-06-17T00:57:33Z

PR #45 addresses this issue.

jlmaurer mentioned this issue Jun 17, 2020

hdf5 parallel along with a few clean-ups #45

Merged

dbekaert closed this as completed Jun 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change main delay calculation from simple for-loop to parallel for-loop #32

Change main delay calculation from simple for-loop to parallel for-loop #32

jlmaurer commented Apr 27, 2020 •

edited

Loading

jlmaurer commented Jun 17, 2020

Change main delay calculation from simple for-loop to parallel for-loop #32

Change main delay calculation from simple for-loop to parallel for-loop #32

Comments

jlmaurer commented Apr 27, 2020 • edited Loading

jlmaurer commented Jun 17, 2020

jlmaurer commented Apr 27, 2020 •

edited

Loading