-
Notifications
You must be signed in to change notification settings - Fork 155
read-cache: make the index write buffer size 128K #877
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
read-cache: make the index write buffer size 128K #877
Conversation
Welcome to GitGitGadgetHi @neerajsi-msft, and welcome to GitGitGadget, the GitHub App to send patch series to the Git mailing list from GitHub Pull Requests. Please make sure that your Pull Request has a good description, as it will be used as cover letter. Also, it is a good idea to review the commit messages one last time, as the Git project expects them in a quite specific form:
It is in general a good idea to await the automated test ("Checks") in this Pull Request before contributing the patches, e.g. to avoid trivial issues such as unportable code. Contributing the patchesBefore you can contribute the patches, your GitHub username needs to be added to the list of permitted users. Any already-permitted user can do that, by adding a comment to your PR of the form Both the person who commented An alternative is the channel
Once on the list of permitted usernames, you can contribute the patches to the Git mailing list by adding a PR comment If you want to see what email(s) would be sent for a After you submit, GitGitGadget will respond with another comment that contains the link to the cover letter mail in the Git mailing list archive. Please make sure to monitor the discussion in that thread and to address comments and suggestions (while the comments and suggestions will be mirrored into the PR by GitGitGadget, you will still want to reply via mail). If you do not want to subscribe to the Git mailing list just to be able to respond to a mail, you can download the mbox from the Git mailing list archive (click the curl -g --user "<EMailAddress>:<Password>" \
--url "imaps://imap.gmail.com/INBOX" -T /path/to/raw.txt To iterate on your change, i.e. send a revised patch or patch series, you will first want to (force-)push to the same branch. You probably also want to modify your Pull Request description (or title). It is a good idea to summarize the revision by adding something like this to the cover letter (read: by editing the first comment on the PR, i.e. the PR description):
To send a new iteration, just add another PR comment with the contents: Need help?New contributors who want advice are encouraged to join git-mentoring@googlegroups.com, where volunteers who regularly contribute to Git are willing to answer newbie questions, give advice, or otherwise provide mentoring to interested contributors. You must join in order to post or view messages, but anyone can join. You may also be able to find help in real time in the developer IRC channel, |
/allow |
User neerajsi-msft is now allowed to use GitGitGadget. WARNING: neerajsi-msft has no public email address set on GitHub |
/preview |
Error: Could not determine public email of neerajsi-msft |
This means that the GitHub profile does not show your email address publicly. GitGitGadget needs this, though (at least for the moment) to be able to Cc: the cover letter to you. |
/preview |
Preview email sent as pull.877.git.1613613918861.gitgitgadget@gmail.com |
@dscho Thanks for helping me out! I'm going to submit. |
/submit |
Submitted as pull.877.git.1613616506949.gitgitgadget@gmail.com To fetch this version into
To fetch this version to local tag
|
Thanks, @dscho, for |
On the Git mailing list, Jeff Hostetler wrote (reply to this):
|
User |
4e20fdc
to
80055ec
Compare
On the Git mailing list, Junio C Hamano wrote (reply to this):
|
Writing an index 8K at a time invokes the OS filesystem and caching code very frequently, introducing noticeable overhead while writing large indexes. When experimenting with different write buffer sizes on Windows writing the Windows OS repo index (260MB), most of the benefit came by bumping the index write buffer size to 64K. I picked 128K to ensure that we're past the knee of the curve. With this change, the time under do_write_index for an index with 3M files goes from ~1.02s to ~0.72s. Signed-off-by: Neeraj Singh <neerajsi@microsoft.com> Signed-off-by: Jeff Hostetler <jeffhost@microsoft.com>
80055ec
to
b35eab9
Compare
On the Git mailing list, Neeraj Singh wrote (reply to this):
|
User |
On the Git mailing list, Junio C Hamano wrote (reply to this):
|
On the Git mailing list, Neeraj Singh wrote (reply to this):
|
This branch is now known as |
This patch series was integrated into seen via git@e8f7cbe. |
On the Git mailing list, Junio C Hamano wrote (reply to this):
|
On the Git mailing list, Chris Torek wrote (reply to this):
|
User |
On the Git mailing list, Junio C Hamano wrote (reply to this):
|
On the Git mailing list, Neeraj Singh wrote (reply to this):
|
On the Git mailing list, Chris Torek wrote (reply to this):
|
This patch series was integrated into seen via git@f577814. |
This patch series was integrated into seen via git@b55ea45. |
This patch series was integrated into next via git@8f43f67. |
This patch series was integrated into seen via git@07c0a2e. |
This patch series was integrated into seen via git@ada7c5f. |
This patch series was integrated into next via git@ada7c5f. |
This patch series was integrated into master via git@ada7c5f. |
Closed via ada7c5f. |
Writing an index 8K at a time invokes the OS filesystem and caching code
very frequently, introducing noticeable overhead while writing large
indexes. When experimenting with different write buffer sizes on Windows
writing the Windows OS repo index (260MB), most of the benefit came by
bumping the index write buffer size to 64K. I picked 128K to ensure that
we're past the knee of the curve.
With this change, the time under do_write_index for an index with 3M
files goes from ~1.02s to ~0.72s.
Signed-off-by: Neeraj Singh neerajsi@ntdev.microsoft.com
Note: This was previously discussed on the mailing list in 2016 at:
https://lore.kernel.org/git/1458350341-12276-1-git-send-email-dturner@twopensource.com/.
Since then, I believe we have a couple changes:
cc: Jeff Hostetler git@jeffhostetler.com
cc: Neeraj Singh nksingh85@gmail.com
cc: Chris Torek chris.torek@gmail.com