Join GitHub today
GitHub is home to over 36 million developers working together to host and review code, manage projects, and build software together.Sign up
Apparent bias toward concepts that start with "a" and "b" in the relationship list API results #200
If you call
That's a lot of a- and b- concepts! If you query the relationship endpoint with
Is this because there is an issue with paging stopping early? Or is there an issue with the data?
Good observation! That looks like an issue that's a lot like paging stopping early -- in particular, the database query stopping early to avoid slowdowns when there are too many results.
Something we're working on that uses machine learning to refine the "weight" value would cause the results to be ranked by something more meaningful than the order they were put in the database, although it would be biased toward certain data sources.
If you need to get all the edges for a common relation, I'd recommend downloading the data and filtering it.
Aha! Thank you, it didn't occur to me that downloading it all was an option!
What I'm trying to accomplish is getting a random edge for a relation. Right now, I think I'll download, convert to ndjson so there's one edge per line, then pick a random line from the file. I don't suppose there's a more direct way to do that?
I see that there's a method for that in the DB wrapper, but it does not appear to be exposed?