MSEED: Segfault reading truncated file #1728

krischer · 2017-03-23T20:00:45Z

While trying to work around a problem when reasding truncated files (in SDS client while reading files that are currently being appended to by a different program), I came across a segfault when reading truncated MiniSEED files:

import copy
from io import BytesIO
from obspy import read
from obspy.core.util import get_example_file

file_ = get_example_file('BW.BGLD.__.EHE.D.2008.001.first_10_records')

with open(file_, 'rb') as fh: 
    data = fh.read()

# for i in range(1, 1000):
for i in [257]:
    print(i)
    bio = BytesIO(copy.deepcopy(data[:-i]))
    read(bio, format='MSEED')

$ python read_mseed_truncated.py 
257
Segmentation fault

QuLogic · 2017-03-23T04:10:13Z

Seems to be crashing in our code, not libmseed:

0x00007fffdda816aa in readMSEEDBuffer (mseed=0x18ffc10 "763445D BGLD   EHEBW", <incomplete sequence \330>, buflen=4863, selections=0x0, 
    unpack_data=1 '\001', reclen=-1, verbose=0 '\000', details=0 '\000', header_byteorder=-1, allocData=0x7ffff7fae048, diag_print=0x7ffff7fae080, 
    log_print=0x7ffff7fae0b8) at obspy/io/mseed/src/obspy-readbuffer.c:472
472	        if ((unpack_data != 0) && (msr->fsdh->data_offset >= 48) &&

megies · 2017-03-23T10:22:00Z

Maybe @krischer can have a look when he's got some time, no hurry though..

krischer · 2017-03-23T20:05:41Z

This branch contains a fix: https://github.com/obspy/obspy/tree/mseed-fix-segfault-truncated-file

Not sure why I cannot convert this issue to a PR right now but I'll try again later tonight or tomorrow. Or maybe somebody else can try?

Some other types of record corruption where already caught by libmseed and correctly bubble up to the Python warnings. I'm not entirely sure why this one does not but maybe its just because its truncated fairly late in the file?

In any case: now works as expected and it raises a nice warning (but still reads all previous records).

krischer · 2017-03-23T21:59:42Z

Hmm...looks like one of my tries did convert it to a PR in the end? Or did someone else do it?

Anyways - IMHO good to go. Feel free to review and merge :)

megies · 2017-03-25T11:37:23Z

Thanks for the fix @krischer, checking again, there's still some truncation scenarios that end in segfaults though..

Can you maybe have a look at these two byte offset:

256
5066

These seem to be different issues.. the latter one I've seen in real live reading mseed files that currently also get appended to in other threads (checking data latency).

import copy
from io import BytesIO
from obspy import read
from obspy.core.util import get_example_file

file_ = get_example_file('BW.BGLD.__.EHE.D.2008.001.first_10_records')

with open(file_, 'rb') as fh: 
    data = fh.read()

for i in range(1, 10000):
    # this seems to be a different issue than the already covered one:
    if i == 256:
        continue
    # these seem to be the same issue as with 256, as there just offset by 512
    # bytes..
    if i % 512 == 256:
        continue
    # this is finally the issue I was looking after: :-)
    if i == 5066:
        continue
    print(i)
    bio = BytesIO(copy.deepcopy(data[:-i]))
    read(bio, format='MSEED')

We already caught a couple of other variants of this but not this particular one. Now works correctly and raises a proper warning.

… MiniSEED reading function.

krischer · 2017-03-27T15:34:05Z

All fixed, rebased and force pushed.

The 256 + 512 bytes offsets were just because I forgot the <= case. The larger truncation because you passed a file with less than 128 bytes - this now raises a much better error message.

megies · 2017-03-27T17:35:11Z

Thanks for the fix(es)! 🎉

krischer · 2017-03-27T20:11:26Z

IMHO ready to be merged.

megies

Works like a charm, thanks!
(somehow I can't 'approve' this PR, seems like there's a problem with the review button..)

too small file size also see #639 and #1728

megies added the .io.mseed label Mar 22, 2017

megies mentioned this pull request Mar 22, 2017

Segfault after obspy.read #1658

Merged

megies added the bug confirmed bug label Mar 23, 2017

megies added this to the 1.1.0 milestone Mar 23, 2017

megies assigned krischer Mar 23, 2017

krischer added 5 commits March 27, 2017 17:31

Adding segfaulting truncated mseed reading test.

841aca1

Fixing segfault when parsing truncated mini-SEED file.

031d3a1

We already caught a couple of other variants of this but not this particular one. Now works correctly and raises a proper warning.

Also handle the case...

de97634

Correctly raise an exception if less than 128 bytes are passed to the…

8e73c3b

… MiniSEED reading function.

changelog

52d109d

krischer force-pushed the mseed-fix-segfault-truncated-file branch from 5475060 to 52d109d Compare March 27, 2017 15:33

Cleaning up after rebase

83c06c1

megies reviewed Mar 28, 2017

View reviewed changes

megies merged commit 062b241 into master Mar 28, 2017

megies deleted the mseed-fix-segfault-truncated-file branch March 28, 2017 08:29

megies added a commit that referenced this pull request Mar 28, 2017

introduce proper canonical exception class when reading MSEED file with

a3c522e

too small file size also see #639 and #1728

megies added a commit that referenced this pull request Mar 28, 2017

introduce proper canonical exception class when reading MSEED file with

d457845

too small file size also see #639 and #1728

flixha mentioned this pull request Jul 28, 2022

Occasional Segmentation fault on threaded MSEED-reading from SDS-client #3114

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MSEED: Segfault reading truncated file #1728

MSEED: Segfault reading truncated file #1728

krischer commented Mar 23, 2017

QuLogic commented Mar 23, 2017 •

edited

Loading

megies commented Mar 23, 2017

krischer commented Mar 23, 2017

krischer commented Mar 23, 2017

megies commented Mar 25, 2017

krischer commented Mar 27, 2017

megies commented Mar 27, 2017

krischer commented Mar 27, 2017

megies left a comment •

edited

Loading

MSEED: Segfault reading truncated file #1728

MSEED: Segfault reading truncated file #1728

Conversation

krischer commented Mar 23, 2017

QuLogic commented Mar 23, 2017 • edited Loading

megies commented Mar 23, 2017

krischer commented Mar 23, 2017

krischer commented Mar 23, 2017

megies commented Mar 25, 2017

krischer commented Mar 27, 2017

megies commented Mar 27, 2017

krischer commented Mar 27, 2017

megies left a comment • edited Loading

Choose a reason for hiding this comment

QuLogic commented Mar 23, 2017 •

edited

Loading

megies left a comment •

edited

Loading