Description
Hey, first of all, thanks for starting this project!
Problem
I successfully loaded sample product data into the local qdrant db. In total, there are only 42 products in the json_with_embeddings/*.json
, three of which are <$20.
Then I queried for products below 20 dollars. The top 2 results are >$20 and only the third is <$20.
Also it hallucinated on the description:

while
{
"@context": "https://schema.org",
"@type": "Product",
"name": "Choker with Gold Pendant",
"offers": {
"@type": "Offer",
"priceCurrency": "USD",
"price": "29.99",
"availability": "https://schema.org/InStock"
}
}
Also other queries like "I am looking for jewelry for men" failed to find the one product in 42 that was categorized as male in description and PeopleAudience.suggestedGender
.
I tried all the queries multiple times, the system never got it correct.
Expectation
I would expect the out of the box set up to perform better on these very simple queries and set up.
Using text-embedding-3-small
, gpt-4o-mini
and qdrant_local
. And this is my data set
products_jsonld.json