LUCENE-9651: Make benchmarks run again, correct javadocs by dweiss · Pull Request #71 · apache/lucene

dweiss · 2021-04-07T19:23:11Z

No description provided.

dweiss · 2021-04-08T12:43:33Z

Thanks Robert. I'll go through these benchmark files and correct them so that they work. It is a bit worrying that nobody noticed they're broken. :) Anybody using these at all?

rmuir · 2021-04-08T13:08:00Z

Thanks Robert. I'll go through these benchmark files and correct them so that they work. It is a bit worrying that nobody noticed they're broken. :) Anybody using these at all?

I've not used this mechanism of the benchmark to do any performance benchmarking: It seems most performance benchmarking from contributors/committers is using https://github.com/mikemccand/luceneutil for this, or writing ad-hoc benchmarks.

Personally, I use this benchmarking package, but via QualityRun's main method, to measure relevance, and I always write my own parser (because every trec-like dataset differs oh-so-slightly and the generic TREC parser we supply never works), and I just hold it in a minimum way (generate submission.txt, then i run trec_eval etc from commandline myself).

The issue why it isn't used might be the dataset, I'm unfamiliar with this reuters dataset and maybe its not big enough for useful benchmarks? I think in general people tend to use these datasets more often for performance benchmarks, often ad-hoc:

wikipedia english
geonames
apache httpd logs
NYC Taxis
OpenStreetMap

Or maybe its just because perf issues are usually complicated? For example to reproduce LUCENE-9827 I downloaded geonames and wrote a simple standalone .java Indexer (attached to issue) that essentially changes IW's config (flush every doc, SerialMergeScheduler, LZ4 and DEFLATE codec compression) to keep it simple measuring using only a single thread. It ran so slow i had to limit the number of docs to the first N as well.

mikemccand

Thank you for fixing this @dweiss! Alas these benchmarks indeed do not get much love/attention.

… updates are distributed (apache#71) Fixes PerReplicaStatesIntegrationTest.testRestart()

Correct micro-standard.alg.

60b169f

rmuir approved these changes Apr 8, 2021

View reviewed changes

mikemccand approved these changes Apr 8, 2021

View reviewed changes

janhoy pushed a commit to cominvent/lucene that referenced this pull request May 12, 2021

SOLR-15288: fix DOWNNODE issue for PRS collections when cluster state…

ddf9dc7

… updates are distributed (apache#71) Fixes PerReplicaStatesIntegrationTest.testRestart()

mikemccand approved these changes May 14, 2021

View reviewed changes

mikemccand merged commit 0d05b21 into apache:main May 14, 2021

asfimport mentioned this pull request Mar 23, 2022

Correct javadoc for benchmarks [LUCENE-9651] #10690

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LUCENE-9651: Make benchmarks run again, correct javadocs#71

LUCENE-9651: Make benchmarks run again, correct javadocs#71
mikemccand merged 1 commit into
apache:mainfrom
dweiss:LUCENE-9651

dweiss commented Apr 7, 2021

Uh oh!

dweiss commented Apr 8, 2021

Uh oh!

rmuir commented Apr 8, 2021

Uh oh!

mikemccand left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dweiss commented Apr 7, 2021

Uh oh!

dweiss commented Apr 8, 2021

Uh oh!

rmuir commented Apr 8, 2021

Uh oh!

mikemccand left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants