LUCENE-9409: Truncation can also cause IndexOutOfBoundsException. #1593

jpountz · 2020-06-18T07:29:57Z

Expect IndexOutOfBoundsException when opening indices with truncated files.

This changes terms and points to check the length of the index/data files before creating slices in these files. A side effect is that we can no longer verify the checksum of the meta file before checking the lengths of the other files, but this shouldn't be a problem. On the other hand, it makes sure that truncation produces a clear exception instead of a confusing IndexOutOfBoundsException, which leaves it unclear whether the cause is index corruption or a bug in Lucene.
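To illustrate the idea in the description, here is a minimal sketch in plain Java that does not use any Lucene API. The class `SliceLengthCheck`, the exception `TruncatedFileException` (a stand-in for a corruption-style exception) and both method names are hypothetical. It only shows why comparing the actual file length with the length recorded in metadata, before slicing, turns an out-of-bounds failure into a descriptive one.

```java
import java.io.IOException;
import java.nio.ByteBuffer;

class SliceLengthCheck {
  /** Hypothetical stand-in for a "this file was altered after being written" exception. */
  static class TruncatedFileException extends IOException {
    TruncatedFileException(String message) {
      super(message);
    }
  }

  /** Verifies the recorded file length first, then returns a view of data[offset, offset + length). */
  static ByteBuffer checkedSlice(byte[] data, long recordedLength, int offset, int length)
      throws IOException {
    if (data.length != recordedLength) {
      throw new TruncatedFileException(
          "file truncated? length=" + data.length + " but metadata recorded " + recordedLength);
    }
    return ByteBuffer.wrap(data, offset, length).slice();
  }

  /** No length check: slicing past the end of a truncated file fails with IndexOutOfBoundsException. */
  static ByteBuffer uncheckedSlice(byte[] data, int offset, int length) {
    return ByteBuffer.wrap(data, offset, length).slice();
  }

  public static void main(String[] args) {
    byte[] truncated = new byte[10]; // the writer recorded 16 bytes, only 10 survived
    try {
      checkedSlice(truncated, 16, 8, 8);
    } catch (IOException e) {
      System.out.println("checked: " + e.getMessage());
    }
    try {
      uncheckedSlice(truncated, 8, 8);
    } catch (IndexOutOfBoundsException e) {
      System.out.println("unchecked: " + e.getClass().getSimpleName());
    }
  }
}
```

The cost of the checked variant is that the recorded length has to be read from the metadata before its checksum has been verified, which is the trade-off debated in the rest of this conversation.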

mikemccand · 2020-06-18T18:29:03Z

lucene/core/src/java/org/apache/lucene/codecs/lucene86/Lucene86PointsReader.java

        } catch (Throwable t) {
          priorE = t;
        } finally {
          CodecUtil.checkFooter(metaIn, priorE);
        }
      }
-      // At this point, checksums of the meta file have been validated so we


Hmm are we losing this safety?

Oh, actually, maybe not, because in the finally clause above, where we check meta's footer, we will throw an exception if the checksum is bad, and the indexLength/dataLength failure, if there was one, is attached to it as a suppressed exception. So I think we do not lose any safety with this change.

We don't lose safety, but with a corrupt meta file the result might be slightly more confusing, in the sense that the suppressed exception will complain about a truncated index/data file.
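The try/catch/finally pattern discussed in this thread can be sketched in plain Java as follows. This is a simplified model, not Lucene code: `FooterCheckSketch`, `readMeta` and the boolean `checksumOk` parameter are hypothetical, and the two-argument `checkFooter` below only imitates the behaviour described above (a bad checksum becomes the primary exception and carries the earlier failure as suppressed).

```java
import java.io.IOException;

class FooterCheckSketch {
  /** Simplified stand-in for a footer check that is told about an earlier failure. */
  static void checkFooter(boolean checksumOk, Throwable priorE) throws IOException {
    if (!checksumOk) {
      IOException corrupt = new IOException("checksum failed: meta file is corrupt");
      if (priorE != null) {
        corrupt.addSuppressed(priorE); // the length complaint survives, but only as suppressed
      }
      throw corrupt;
    }
    if (priorE != null) {
      // The meta file is intact, so the earlier failure is the real problem.
      priorE.addSuppressed(new IOException("checksum passed for the meta file"));
      rethrow(priorE);
    }
  }

  private static void rethrow(Throwable t) throws IOException {
    if (t instanceof IOException) throw (IOException) t;
    if (t instanceof RuntimeException) throw (RuntimeException) t;
    if (t instanceof Error) throw (Error) t;
    throw new RuntimeException(t);
  }

  /** Reads "metadata": checks a recorded length first, validates the checksum last. */
  static void readMeta(boolean checksumOk, long recordedLength, long actualLength)
      throws IOException {
    Throwable priorE = null;
    try {
      if (recordedLength != actualLength) {
        throw new IOException(
            "truncated data file: expected " + recordedLength + " bytes, found " + actualLength);
      }
    } catch (Throwable t) {
      priorE = t;
    } finally {
      checkFooter(checksumOk, priorE);
    }
  }

  public static void main(String[] args) {
    try {
      readMeta(false, 16, 10);
    } catch (IOException e) {
      System.out.println(e.getMessage() + " / suppressed: " + e.getSuppressed()[0].getMessage());
    }
  }
}
```

In this model, a corrupt meta file that happens to record a wrong length still surfaces as a checksum failure, with the misleading truncation message attached as suppressed, which matches the slight confusion mentioned in the reply above.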

mikemccand · 2020-06-18T20:06:50Z

lucene/core/src/java/org/apache/lucene/codecs/lucene86/Lucene86PointsWriter.java

-          Lucene86PointsFormat.META_EXTENSION);
-      metaOut = writeState.directory.createOutput(metaFileName, writeState.context);
-      CodecUtil.writeIndexHeader(metaOut,
+      tempMetaOut = writeState.directory.createTempOutput(


Why are we switching to a temp file and copying to the real file after closing? Maybe add a comment explaining?

This is because we need to write the file lengths of the index/data files before any offsets/lengths of slices into these files. But since the index/data files have not been written yet at that point, we don't know their lengths. So I write the KD-tree metadata to a temp file first, and only then write the final metadata file, which contains first the lengths of the index/data files and then the metadata about the KD trees, including offsets into these index/data files. I'll add a comment.

As an alternative, I could buffer the metadata in memory like we do for terms. It will require changing some APIs to replace IndexOutput with DataOutputs but other than that it shouldn't be too hard.

OK thanks for the explanation @jpountz. I am OK with using temp files for this ...
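The temp-file approach explained in this thread can be sketched with `java.nio.file` alone. Everything named here is an assumption made for illustration: the class `TempMetaSketch`, the file names `points.data` and `points.meta`, and the record layout (a `long` offset plus an `int` length per block). It is not the layout of the Lucene points format.

```java
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;

class TempMetaSketch {
  /** Writes the blocks to a data file and a meta file that starts with the data file's length. */
  static void write(Path dir, byte[][] blocks) throws IOException {
    Path data = dir.resolve("points.data"); // hypothetical file names
    Path meta = dir.resolve("points.meta");
    Path tempMeta = Files.createTempFile(dir, "points", ".meta.tmp");
    try {
      long dataLength = 0;
      try (OutputStream dataOut = Files.newOutputStream(data);
          DataOutputStream tempOut = new DataOutputStream(Files.newOutputStream(tempMeta))) {
        for (byte[] block : blocks) {
          // Slice metadata goes to the temp file, because the final data length is not known yet.
          tempOut.writeLong(dataLength); // offset of this block in the data file
          tempOut.writeInt(block.length); // length of this block
          dataOut.write(block);
          dataLength += block.length;
        }
      }
      try (DataOutputStream metaOut = new DataOutputStream(Files.newOutputStream(meta))) {
        metaOut.writeLong(dataLength); // the file length comes first in the final meta file
        Files.copy(tempMeta, metaOut); // followed by the buffered slice metadata
      }
    } finally {
      Files.deleteIfExists(tempMeta);
    }
  }

  public static void main(String[] args) throws IOException {
    Path dir = Files.createTempDirectory("tempmeta");
    write(dir, new byte[][] {{1, 2, 3}, {4, 5, 6, 7, 8}});
    System.out.println("meta bytes: " + Files.size(dir.resolve("points.meta")));
  }
}
```

Buffering the slice metadata in memory, the alternative mentioned above, would replace the temp file with something like a `ByteArrayOutputStream` and leave the final write order unchanged.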

rmuir · 2020-06-21T10:39:04Z

Sorry, I'm against this change. The test is broken. It looks like we are willing to make bad tradeoffs in order to deliver CorruptIndexException and only CorruptIndexException if anything goes wrong. Fix the test instead!

> A side-effect of this is that we can no longer verify checksums of the meta file before checking the length of other files

This is seriously the wrong tradeoff: let's fix the test instead. If we unexpectedly hit EOF, EOFException is the correct exception. If an index is out of bounds, IndexOutOfBoundsException is the correct exception.

jpountz · 2020-06-22T13:14:34Z

I like the CorruptIndexException because it tells me that the problem is that the file got altered after being written, while I would otherwise wonder if there is a bug in Lucene. As an alternative, would it work better for you if we called retrieveChecksum(IndexInput) before the try block, and then again with the length (retrieveChecksum(IndexInput, long)) after the try block once the checksum of the meta file has been validated?

jpountz · 2020-08-11T17:03:41Z

I repurposed this PR to instead make the test expect out-of-bounds exceptions. Does it look better to you @rmuir @uschindler ?

uschindler · 2020-08-11T17:14:03Z

I am fine with fixing the test. Sure, you first have to figure out why the index is out of bounds, and the exact exception may be misleading, but that's actually what's happening here. If you want other exceptions, another fix would be to require the IO layer to throw a meaningful exception and to implement that for all directory implementations.
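A rough sketch of what "fixing the test" means, in plain Java without Lucene or a test framework: a truncation check that accepts either `EOFException` or `IndexOutOfBoundsException` as evidence that the truncated file was rejected. The class `TruncationTestSketch`, its toy file layout (two `int` header fields followed by a payload) and its method names are hypothetical. The actual test, named `TestAllFilesDetectTruncation` in the linked issue, is not reproduced here.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.util.Arrays;

class TruncationTestSketch {
  /** Builds a toy file: int payloadOffset, int payloadLength, then the payload bytes. */
  static byte[] sample() {
    return ByteBuffer.allocate(12).putInt(8).putInt(4).put(new byte[] {1, 2, 3, 4}).array();
  }

  /** Toy reader: stream reads for the header, direct indexing for the payload. */
  static int open(byte[] file) throws IOException {
    DataInputStream in = new DataInputStream(new ByteArrayInputStream(file));
    int payloadOffset = in.readInt(); // EOFException if the header is cut short
    int payloadLength = in.readInt();
    int sum = 0;
    for (int i = 0; i < payloadLength; i++) {
      sum += file[payloadOffset + i]; // ArrayIndexOutOfBoundsException if the payload is cut short
    }
    return sum;
  }

  /** Returns true if opening the file truncated to newLength fails with an accepted exception. */
  static boolean truncationDetected(byte[] file, int newLength) {
    try {
      open(Arrays.copyOf(file, newLength));
      return false; // opened without complaint
    } catch (EOFException | IndexOutOfBoundsException e) {
      return true; // both are legitimate symptoms of truncation
    } catch (IOException e) {
      throw new AssertionError("unexpected exception type", e);
    }
  }

  public static void main(String[] args) {
    byte[] file = sample();
    for (int length = 0; length <= file.length; length++) {
      System.out.println("length " + length + " detected=" + truncationDetected(file, length));
    }
  }
}
```

Which exception appears depends on how the reader happens to touch the missing bytes, which is why a test of this kind accepts more than one type.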

jpountz added 2 commits June 18, 2020 09:28

iter

18b90ae

mikemccand reviewed Jun 18, 2020

jpountz added 3 commits June 18, 2020 22:19

comment

3c39fc1

Merge branch 'master' into lucene9489

5d8205c

Reenable test.

a0d6a74

jpountz added 2 commits August 11, 2020 18:59

Merge branch 'master' into lucene9489

1044df4

Fix test instead.

de54aa3

Merge branch 'master' into lucene9489

978703f

jpountz changed the title ~~LUCENE-9409: Check file lengths before creating slices.~~ LUCENE-9409: Truncation can also cause IndexOutOfBoundsException. Nov 16, 2020

asfimport mentioned this pull request Jul 29, 2022

TestAllFilesDetectTruncation failures [LUCENE-9409] apache/lucene#10449

Closed
