Large quantity of facets causing degradation of Meilisearch performance #2349

shivaylamba · 2022-04-25T15:56:36Z

Describe the bug
I am building an e-commerce demo using an Amazon Dataset that contains more than 1 million items. It has 300k brands, 100k tags which are added as part of filterable attributes. I am using Instant Meilisearch and thus these are also part of the Refinement list.
The time required to calculate the facets will increase as the number of facets increases, hence degrading search performance. Since there are 300K brands and 100K tags, it can take a lot of time to load them initially.

To Reproduce
Steps to reproduce the behavior:

Visit https://medusajs-storefront.vercel.app/
Click on Inspect element to open Developer tools
Go to the networks section and click on the search request
Click on the Response tab
You can see the response.

Expected behavior
Performance should be quick.

Screenshots

Meilisearch version: v0.26.0

Additional context
Additional information that may be relevant to the issue.
[e.g. architecture, device, OS, browser]
Browser: Chrome
OS: MacOS
Device: Mac Book Pro

Kerollmops · 2022-04-25T16:23:33Z

Hey @shivaylamba,

I ran the query that was taking a lot of time and it is indeed related to the number of facets returned in the results. More specifically we are returning 172700 facet values associated with their counts, which is a lot.

You can run this jq query on the file I linked, here. Make sure to unzip it first.

output.json.zip

cat output.json | jq '. | .facetsDistribution.brand | length'

I remember a PR that I have done that removed the limit on the number of facets returned by the engine, this should probably be reintroduced.

shivaylamba · 2022-04-25T18:25:57Z

Thanks for your response!

So for now would you recommend reducing the number of facets value @Kerollmops

curquiza · 2022-04-26T09:38:09Z

Discussed with @gmourier and @Kerollmops -> this will be fixed in v0.28.0 by introducing an hard limit (will be customizable in the future, but not for v0.28.0)
I will open an issue regarding this next week

shivaylamba · 2022-04-26T09:40:26Z

Alright thank you @curquiza

535: Reintroduce the max values by facet limit r=ManyTheFish a=Kerollmops This PR reintroduces the max values by facet limit this is related to meilisearch/meilisearch#2349. ~I would like some help in deciding on whether I keep the default 100 max values in milli and set up the `FacetDistribution` settings in Meilisearch to use 1000 as the new value, I expose the `max_values_by_facet` for this purpose.~ I changed the default value to 1000 and the max to 10000, thank you `@ManyTheFish` for the help! Co-authored-by: Kerollmops <clement@meilisearch.com>

curquiza · 2022-06-08T15:11:59Z

Closed by #2468

curquiza added the performance Related to the performance in term of search/indexation speed or RAM/CPU/Disk consumption label Apr 25, 2022

curquiza added this to the v0.28.0 milestone Apr 26, 2022

curquiza mentioned this issue May 4, 2022

Add limit of facet value and a setting to let the users customize it #2368

Closed

3 tasks

curquiza added the bug Something isn't working as expected label May 17, 2022

Kerollmops changed the title ~~Large quantity of facets causing degradation of Meilisearch peformance.~~ Large quantity of facets causing degradation of Meilisearch performance May 18, 2022

Kerollmops mentioned this issue May 18, 2022

Reintroduce the max values by facet limit meilisearch/milli#535

Merged

Kerollmops added the milli Related to the milli workspace label Jun 2, 2022

curquiza closed this as completed Jun 8, 2022

curquiza added the v0.28.0 PRs/issues solved in v0.28.0 label Aug 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Large quantity of facets causing degradation of Meilisearch performance #2349

Large quantity of facets causing degradation of Meilisearch performance #2349

shivaylamba commented Apr 25, 2022 •

edited

Kerollmops commented Apr 25, 2022 •

edited

shivaylamba commented Apr 25, 2022

curquiza commented Apr 26, 2022

shivaylamba commented Apr 26, 2022

curquiza commented Jun 8, 2022

Large quantity of facets causing degradation of Meilisearch performance #2349

Large quantity of facets causing degradation of Meilisearch performance #2349

Comments

shivaylamba commented Apr 25, 2022 • edited

Kerollmops commented Apr 25, 2022 • edited

shivaylamba commented Apr 25, 2022

curquiza commented Apr 26, 2022

shivaylamba commented Apr 26, 2022

curquiza commented Jun 8, 2022

shivaylamba commented Apr 25, 2022 •

edited

Kerollmops commented Apr 25, 2022 •

edited