-
Notifications
You must be signed in to change notification settings - Fork 10.6k
Fix SR-4590 compare_perf_tests.py fails when new benchmarks are added #8974
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@gottesmm Please review |
|
@palimondo Can you put in a warning for the new tests that were added? I think you would just do a set subtraction and do a simple print. |
|
Also, why is the digit test not necessary any more? |
|
Another small concern of mine is about the row column numbers. Are you sure you got those correct? @atrick is more familiar with this script, so he may be the right person to just quickly confirm that. |
|
I glanced at the diff but I don't understand how removing all that logic for finding the min/max scores pertains to this bug. Incidentally, I once had an earlier version of the script that handled added and removed tests. That shouldn't be hard. |
|
New tests that were added are coming in the fix for SR-4601 - separate PR, after we land the refactoring. That whole code around row[MIN].isdigit() must have been some legacy from who-knows-when. It looks like there was a time when Benchamark_Driver wasn't reporting aggregate stats, but repeated tests were printed out multiple time? Given the current output from Benchmark_Driver, this works just fine. I wasn't changing row column numbers… if you mean going from |
What?! Where, how? I don't understand. When is it invoked like that? |
|
The compare script needs to handle multiple invocations of the driver. That's literally the only way that I use the driver. 1/benchmark_driver > out1 2/benchmark_driver > out2 compare out1 out2 Also, I usually only rerun some subset of tests and concatenate those to the same output. |
|
Oh, my… I can see how's the |
|
Well, compare_perf_test was around long before the driver. I've always used a script similar to that. Mishal made it work with CI. |
Also fix phantom “number” test result parsed from Totals.
|
That was just great. 🙅♂️ |
|
@atrick Please go ahead and do the book-keeping in Jira, too. Thanks! |
|
@palimondo I think you misunderstood. I'm not fixing SR-4590. I'm saying I'm surprised that it's broken because I expected it to be able to handle added/removed benchmarks. The confusing thing is that your bug title and this PR title is unrelated to the functionality that you're removing. |
|
@atrick It wasn't you. This PR is broken out from #8793 by request from @gottesmm to land it per-partes. @moiseev just landed the fix for SR-4590 in #8923. And a logical change in sorting order. 👍 I'm upset as my changes in #8793 do that too, as a part of much bigger refactoring of Your quick fixes make this unnecessarily hard, as I must rebase on top of your changes. The legacy script contains a ton of little quirks I had to replicate, now I need to redo that again. Its quite upsetting. |
|
I thought that filing bugs, that nobody took and I did self assign and opening a PR with fixes is enough to coordinate the work on this part of code to prevent conflict. I was apparently mistaken. |
|
Ok. Thanks for working at this. Rebasing is always frustrating. But what you're experiencing is pretty normal. The reality is that I miss a lot of bug and commit activity, especially when I'm on vacation for a week then working on a deadline. |
Also fix phantom “number” test result parsed from Totals.
Resolves SR-4590.
Broken out of #8793.