Fix MANIFEST name assignment #6426

riversand963 · 2020-02-18T17:39:41Z

Summary:
Currently, a new MANIFEST file is assigned a new file number when 1) no
MANIFEST is open, or 2) current MANIFEST file size exceeds a threshold. This is
not sufficient. There are cases when the caller explicitly specifies that a new
MANIFEST be created. For example, if user sets options.write_dbid_to_manifest = true,
and there are WAL files, then RocksDB will run into an issue during recovery.
DBImpl::Recover() will call LogAndApply() to write dbid. At this point, the db being
recovered creates a new MANIFEST, say, MANIFEST-000003. Since there are WALs,
DBImpl::RecoverLogFiles will be called. Towards the end of this function, we call
LogAndApply(new_descriptor_log=true), which explicitly creates a new MANIFEST.
However, the manifest_file_number is wrong before this fix. Consequently, RocksDB
opens an existing, non-empty file for append, effectively truncating the file to zero.
If a crash occurs, then there will be data loss.

Test Plan (devserver):
make check

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-02-18T20:39:11Z

@riversand963 has updated the pull request. Re-import the pull request

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

siying

The change itself is OK. Can you explain more about whether it is a user facing bug or not? If it is a user facing bug, what will be the impact?

Is it a regression bug related to the out-of-space recovery fix? If that is the case, do we need to backport to 6.7 branch?

siying · 2020-02-20T00:17:21Z

HISTORY.md

@@ -12,6 +12,7 @@
 * BlobDB now ignores trivially moved files when updating the mapping between blob files and SSTs. This should mitigate issue #6338 where out of order flush/compaction notifications could trigger an assertion with the earlier code.
 * Batched MultiGet() ignores IO errors while reading data blocks, causing it to potentially continue looking for a key and returning stale results.
 * `WriteBatchWithIndex::DeleteRange` returns `Status::NotSupported`. Previously it returned success even though reads on the batch did not account for range tombstones. The corresponding language bindings now cannot be used. In C, that includes `rocksdb_writebatch_wi_delete_range`, `rocksdb_writebatch_wi_delete_range_cf`, `rocksdb_writebatch_wi_delete_rangev`, and `rocksdb_writebatch_wi_delete_rangev_cf`. In Java, that includes `WriteBatchWithIndex::deleteRange`.
+* Assign new MANIFEST file number when caller tries to create a new MANIFEST by calling LogAndApply(..., new_descriptor_log=true).


I don't think LogAndApply() is a public API. Can you explain from RocksDB users' point of the view, how the bug can be triggered?

siying · 2020-02-20T00:18:15Z

CC @anand1976

anand1976 · 2020-02-20T02:47:06Z

The change looks fine to me. I agree with @siying that the description needs to be updated to say when this can happen from the user perspective. I don't think its related to the out of space recovery though, as background ops call LogAndApply() with new_descriptor_log = false.

riversand963 · 2020-02-20T17:53:22Z

Thanks @siying and @anand1976 for the review. I have updated the PR description and HISTORY.md.

riversand963 · 2020-02-20T17:54:02Z

A backport may be necessary.

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: Currently, a new MANIFEST file is assigned a new file number when 1) no MANIFEST is open, or 2) current MANIFEST file size exceeds a threshold. This is not sufficient. There are cases when the caller explicitly specifies that a new MANIFEST be created. Test Plan: make check

facebook-github-bot · 2020-02-20T18:55:28Z

@riversand963 has updated the pull request. Re-import the pull request

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

riversand963 · 2020-02-20T20:55:57Z

The appveyor failure is irrelevant.

facebook-github-bot · 2020-02-21T02:41:42Z

@riversand963 merged this pull request in 362b8d4.

facebook-github-bot added the CLA Signed label Feb 18, 2020

facebook-github-bot reviewed Feb 18, 2020

View reviewed changes

riversand963 requested a review from siying February 18, 2020 18:37

riversand963 force-pushed the fix-new-manifest-name branch from c14ee98 to 586d87a Compare February 18, 2020 20:39

facebook-github-bot reviewed Feb 18, 2020

View reviewed changes

siying approved these changes Feb 20, 2020

View reviewed changes

riversand963 force-pushed the fix-new-manifest-name branch from 586d87a to 3cd6eb0 Compare February 20, 2020 17:52

facebook-github-bot reviewed Feb 20, 2020

View reviewed changes

riversand963 added 2 commits February 20, 2020 10:54

Update HISTORY

e600fba

riversand963 force-pushed the fix-new-manifest-name branch from 3cd6eb0 to e600fba Compare February 20, 2020 18:55

facebook-github-bot reviewed Feb 20, 2020

View reviewed changes

facebook-github-bot closed this in 362b8d4 Feb 20, 2020

riversand963 deleted the fix-new-manifest-name branch February 20, 2020 22:36

facebook-github-bot added the Merged label Feb 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix MANIFEST name assignment #6426

Fix MANIFEST name assignment #6426

riversand963 commented Feb 18, 2020 •

edited

Loading

facebook-github-bot left a comment

facebook-github-bot left a comment

facebook-github-bot commented Feb 18, 2020

facebook-github-bot left a comment

siying left a comment

siying Feb 20, 2020

siying commented Feb 20, 2020

anand1976 commented Feb 20, 2020

riversand963 commented Feb 20, 2020

riversand963 commented Feb 20, 2020

facebook-github-bot left a comment

facebook-github-bot commented Feb 20, 2020

facebook-github-bot left a comment

facebook-github-bot left a comment

riversand963 commented Feb 20, 2020

facebook-github-bot commented Feb 21, 2020

Fix MANIFEST name assignment #6426

Fix MANIFEST name assignment #6426

Conversation

riversand963 commented Feb 18, 2020 • edited Loading

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Feb 18, 2020

facebook-github-bot left a comment

Choose a reason for hiding this comment

siying left a comment

Choose a reason for hiding this comment

siying Feb 20, 2020

Choose a reason for hiding this comment

siying commented Feb 20, 2020

anand1976 commented Feb 20, 2020

riversand963 commented Feb 20, 2020

riversand963 commented Feb 20, 2020

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Feb 20, 2020

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot left a comment

Choose a reason for hiding this comment

riversand963 commented Feb 20, 2020

facebook-github-bot commented Feb 21, 2020

riversand963 commented Feb 18, 2020 •

edited

Loading