Parsing local and cloud SEG-Y files with new I/O library #381

tasansal · 2024-04-04T15:18:24Z

No description provided.

Removed the list type check from the text_header setter in accessor.py. The application now expects a string input instead of a list, simplifying the validation process.

Simplify `header_scan_worker` and `trace_worker` in SEG-Y module by removing unused imports and streamlining parameter list. Update functions to work directly with `SegyFile` instances and clean up data handling logic for efficiency.

Refactor the parsing functions in `src/mdio/segy/parsers.py` to simplify the codebase and improve maintainability. Redundant functions such as `parse_binary_header`, `parse_text_header`, and `get_trace_count` have been removed, while imports have been condensed to only essential modules. The `NUM_CORES` logic is updated to count logical cores instead of just physical ones.

Removed unused imports and functions in the SEG-Y converter module to enhance code maintainability. Simplified the arguments for the `segy_to_mdio` function to increase ease of use and readability. Reduced complexity by utilizing `SegyFile` class for SEG-Y file operations.

The get_grid_plan function in utilities.py has been refactored to accept a SegyFile instance instead of individual parameters for the file path. Unused imports were eliminated, and type checking imports are now conditional, improving readability and modularity.

The changes involve major refactoring of the code base to use the 'segy' library instead of 'segyio'. Most notably, this included updating the handling of SEG-Y dtypes, byte order, and trace headers. Unused imports have been removed to clean up the code. A new multiprocessing chunk size has been introduced and set attributes to SegyFile instance instead of passing them as function arguments.

…y to make it work with 'segy'.

The segy package version has been updated from 0.0.13 to 0.0.14 in the pyproject.toml file. This upgrade was performed to update software dependencies and to integrate the latest bug fixes and features delivered with the new version.

The mdio_to_segy function in the MDIO converter has been updated with 'endianness' and 'sample_format' parameters. These parameters provide control over the endianness and sample format of the output SEG-Y file. Defaults for the parameters are set as Endianness.BIG and ScalarType.IBM32 respectively.

A new helper function, 'make_segy_factory', has been created to handle the generation of SegyFactory. This function accepts more parameters to provide better control over the creation of the SEG-Y based on the MDIO metadata. Changes also include updates in import declarations and reorganization of some code blocks in the 'mdio_spec_to_segy' function.

In the SegyFactory initialization within creation.py, the sample_interval parameter has been modified to be multiplied by 1000. This change ensures that the value is correctly represented in microseconds, aligning with the expected data format.

tasansal added 16 commits March 21, 2024 18:01

Refactor text_header setter type check

ba3747d

Removed the list type check from the text_header setter in accessor.py. The application now expects a string input instead of a list, simplifying the validation process.

Refactor text_header setter type check

15012af

Removed the list type check from the text_header setter in accessor.py. The application now expects a string input instead of a list, simplifying the validation process.

Refactor text_header setter type check

6cb88c1

Removed the list type check from the text_header setter in accessor.py. The application now expects a string input instead of a list, simplifying the validation process.

Refactor workers for SEG-Y parsing

e54e2a3

Simplify `header_scan_worker` and `trace_worker` in SEG-Y module by removing unused imports and streamlining parameter list. Update functions to work directly with `SegyFile` instances and clean up data handling logic for efficiency.

use NDArray typing since we now return struct

7ec54f6

refactor override tests to use ndarray headers instead of a dictionar…

2bff61e

…y to make it work with 'segy'.

Remove unit tests for IBM/IEEE conversions and text headers

238dcb5

Refactor and simplify 6D tests related to SEG-Y

51d241d

Refactor and simplify 6D tests related to SEG-Y

e488200

Upgrade segy package version

ab22810

The segy package version has been updated from 0.0.13 to 0.0.14 in the pyproject.toml file. This upgrade was performed to update software dependencies and to integrate the latest bug fixes and features delivered with the new version.

tasansal self-assigned this Apr 4, 2024

tasansal added breaking Breaking Changes enhancement New feature or request labels Apr 4, 2024

tasansal added 2 commits April 4, 2024 20:11

Update segy library version

409d2d9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parsing local and cloud SEG-Y files with new I/O library #381

Parsing local and cloud SEG-Y files with new I/O library #381

tasansal commented Apr 4, 2024

Parsing local and cloud SEG-Y files with new I/O library #381

Are you sure you want to change the base?

Parsing local and cloud SEG-Y files with new I/O library #381

Conversation

tasansal commented Apr 4, 2024