Skip to content

Basic use case has poor results #198

Open
@johannespn

Description

@johannespn

Hey, first of all, thanks for starting this project!

Problem

I successfully loaded sample product data into the local qdrant db. In total, there are only 42 products in the json_with_embeddings/*.json, three of which are <$20.

Then I queried for products below 20 dollars. The top 2 results are >$20 and only the third is <$20.

Also it hallucinated on the description:

Image

while

{
	"@context": "https://schema.org",
	"@type": "Product",
	"name": "Choker with Gold Pendant",
	"offers": {
		"@type": "Offer",
		"priceCurrency": "USD",
		"price": "29.99",
		"availability": "https://schema.org/InStock"
	}
}

Also other queries like "I am looking for jewelry for men" failed to find the one product in 42 that was categorized as male in description and PeopleAudience.suggestedGender.

I tried all the queries multiple times, the system never got it correct.

Expectation

I would expect the out of the box set up to perform better on these very simple queries and set up.

Using text-embedding-3-small, gpt-4o-mini and qdrant_local. And this is my data set
products_jsonld.json

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions