Fix revwalk limiting regression #4809

carlosmn · 2018-09-17T12:55:21Z

When porting, we overlooked that the difference between git's and our's time
representation and copied their way of getting the max value.

Unfortunately git was using unsigned integers, so ~0ll does correspond to
their max value, whereas for us it corresponds to -1. This means that we
always consider the last date to be smaller than the current commit's and always
think commits are interesting.

Change the initial value to the macro that gives us the maximum value on each
platform so we can accurately consider commits interesting or not.

The second commit is mostly just to reduce the drift from git, the actual perf difference should be negligible.

This fixes #4740 and we should backport it as it fixes a regression.

When porting, we overlooked that the difference between git's and our's time representation and copied their way of getting the max value. Unfortunately git was using unsigned integers, so `~0ll` does correspond to their max value, whereas for us it corresponds to `-1`. This means that we always consider the last date to be smaller than the current commit's and always think commits are interesting. Change the initial value to the macro that gives us the maximum value on each platform so we can accurately consider commits interesting or not.

…tamp This is not a big deal, but it does make us match git more closely by checking only the first. The lists are sorted already, so there should be no functional difference other than removing a possible check from every iteration in the loop.

neithernut · 2018-09-17T18:57:30Z

I've just ran and timed the reproducers from #4428 and a trimmed version of the reproducer in #4740 (single iteration) on the linux kernel repo.

The times reported for libgit2 built from master (bc34cb6):

#4428:

real	0m18.426s
user	0m18.187s
sys	0m0.227s

#4740:

real	0m36.638s
user	0m36.372s
sys	0m0.236s

For the current tip (12a1790)

#4428:

real	0m18.413s
user	0m18.156s
sys	0m0.246s

#4740:

real	0m0.007s
user	0m0.005s
sys	0m0.002s

Honestly, I thought I did something wrong when I performed the second measurement for the #4428 reproducer, but apparently this particular regression is not resolved. However, the performance is much better for #4740.

Edit: As expected, the performance of the #4428 reproducer also greatly increases if the commits are not sorted by time:

real	0m0.006s
user	0m0.003s
sys	0m0.003s

carlosmn · 2018-09-17T19:03:12Z

#4428 doesn't actually seem like a regression, but a fix and a misunderstanding of what different flags ought to do.

ethomson · 2018-09-18T01:58:03Z

Ouch! Thanks for the fix, @carlosmn.

pks-t · 2018-09-21T09:28:35Z

src/revwalk.c

@@ -405,7 +405,7 @@ static int still_interesting(git_commit_list *list, int64_t time, int slop)
 static int limit_list(git_commit_list **out, git_revwalk *walk, git_commit_list *commits)
 {
 	int error, slop = SLOP;
-	int64_t time = ~0ll;
+	int64_t time = INT64_MAX;


I'm surprised none of our tests catched this :(

It's just a performance regression, as all this does is make us think that we never went back far enough. If we had perf tests, maybe we would have caugh tit.

pks-t · 2018-09-24T06:09:14Z

On Sat, Sep 22, 2018 at 12:32:52PM -0700, Carlos Martín Nieto wrote: carlosmn commented on this pull request. > @@ -405,7 +405,7 @@ static int still_interesting(git_commit_list *list, int64_t time, int slop) static int limit_list(git_commit_list **out, git_revwalk *walk, git_commit_list *commits) { int error, slop = SLOP; - int64_t time = ~0ll; + int64_t time = INT64_MAX; It's just a performance regression, as all this does is make us think that we never went back far enough. If we had perf tests, maybe we would have caugh tit.

Yeah, I realized that at a later point, too. But thanks for clarifying!

carlosmn added 2 commits September 17, 2018 14:39

This was referenced Sep 17, 2018

Performance regression in revwalk API #4740

Closed

git_remote_fetch is slow #4736

Closed

ethomson merged commit e181a64 into master Sep 18, 2018

pks-t reviewed Sep 21, 2018

View reviewed changes

carlosmn deleted the cmn/revwalk-sign-regression branch September 22, 2018 19:32

pks-t added the backport-0.27.6 label Oct 11, 2018

battlmonstr mentioned this pull request Feb 16, 2019

git log -15 is slow libgit2/libgit2sharp#1558

Closed

This was referenced Mar 18, 2019

Flaky build: Azure DevOps macOS build sometimes failing at language-yaml tests atom/atom#18990

Closed

Try upgrading libgit2 to resolve renderer crashes atom/atom#19007

Closed

rafeca mentioned this pull request Mar 20, 2019

Upgrade libgit2 version atom/git-utils#91

Merged

snyk-bot mentioned this pull request Feb 23, 2020

[Snyk] Upgrade nodegit from 0.4.1 to 0.26.4 saurabharch/Breezeblocks#1

Open

snyk-bot mentioned this pull request Apr 22, 2020

[Snyk] Upgrade nodegit from 0.24.3 to 0.26.5 aminatakonate000/Graviton-App#4

Open

snyk-bot mentioned this pull request May 5, 2020

[Snyk] Upgrade nodegit from 0.24.3 to 0.26.5 Barnstorm-Online/ngp-openapi-generator#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix revwalk limiting regression #4809

Fix revwalk limiting regression #4809

carlosmn commented Sep 17, 2018

neithernut commented Sep 17, 2018 •

edited

Loading

carlosmn commented Sep 17, 2018

ethomson commented Sep 18, 2018

pks-t Sep 21, 2018

carlosmn Sep 22, 2018

pks-t commented Sep 24, 2018 via email

Fix revwalk limiting regression #4809

Fix revwalk limiting regression #4809

Conversation

carlosmn commented Sep 17, 2018

neithernut commented Sep 17, 2018 • edited Loading

carlosmn commented Sep 17, 2018

ethomson commented Sep 18, 2018

pks-t Sep 21, 2018

Choose a reason for hiding this comment

carlosmn Sep 22, 2018

Choose a reason for hiding this comment

pks-t commented Sep 24, 2018 via email

neithernut commented Sep 17, 2018 •

edited

Loading