Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing entries in tabular data #2540

Closed
yagmur-q opened this issue Oct 19, 2022 · 2 comments
Closed

Missing entries in tabular data #2540

yagmur-q opened this issue Oct 19, 2022 · 2 comments

Comments

@yagmur-q
Copy link

Hi,

I have tabular data with >100k data points and >20 attributes. The problem is when new user information enters to the system, faiss model should retrieve the most similar users to the query user. However, there are too many missing values in data. At this point, should I use standard imputation techniques(mean, median etc) to replace the NaN values in data when using faiss?

Thanks

@mdouze
Copy link
Contributor

mdouze commented Oct 30, 2022

This is not really in the scope of Faiss. Please check what is the best way of replacing missing values before using optimized search with faiss.

@asadoughi
Copy link
Contributor

faiss.METRIC_NaNEuclidean was implemented in #3414

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants