New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hot sorting pulling up 2 year old posts with no comments #3428
Comments
This screenshot is through the Jerboa android app, but it also occurs on PC in the browser (both Chrome and Firefox). |
Having this same issue on my instance. Also running 0.18.0. |
That could be the case, but it would mean that the hot rank calculation on lemmy_server launch has failed. Could anybody here who is seeing this issue restart lemmy_server with By the way, I'm not seeing any of these old posts crop up on the front page of lemm.ee at all after #3131, even though we are certainly pulling in old posts all the time. |
Yes, it seems to affect smaller servers more than larger ones. e.g. this looks fine on lemmy.world: https://lemmy.world/c/linuxmemes?dataType=Post&page=1&sort=Hot but on a smaller server, older posts are ranked high: https://lemmy.nz/c/linuxmemes@lemmy.world?dataType=Post&page=1&sort=Hot |
FWIW, I had the same problem on my very small instance, restarted Lemmy twice to take a look at the logs as @sunaurus suggested and now I don't get these old posts when sorting by "Hot" anymore 🥳 Edit: nevermind, they came back after some time :( although not as many as before. And, unlike before, there actually are recent posts below the age-old ones. |
Yes, we've seen something similar on our instance. The log lines for the initial hot rank update does appear. It seems like initially after restart there are no "stale" posts in the hot listing. However, over time posts start to get the hot_rank 1728 including posts older than a day (sometimes months old), and the ranking doesn't decrease. So it seems to me as if #3131 hasn't completely fixed this issue. |
This seems to be an issue with certain communities. For example programmerhumor@lemmy.ml. If I'm subscribed to that instance it overwhelms my Hot feed. Unsubbing makes my Hot feed normal. |
Here's what I believe is happening:
This explains why this problem is more apparent on smaller servers - on large servers, users will "discover" all the old posts, so there are less of them to find since the last restart. But on smaller servers, after a server restart there are still old posts that have yet to be synced, so over time these will start polluting the hot rankings. What I don't know is why some old posts seem to get stuck at 1728 and why some don't. It could be some transitory issue (e.g. deadlock), but then I would have thought a scheduled job would later correct the hot_rank? At any rate, there are enough old posts with a hot_rank of 1728 to effectively make the hot listing unusable on smaller servers about 12 hours after a restart. |
I found out why I wasn't able to reproduce this - I had an additional fix still active on lemm.ee. I should be able to disable that and reproduce it now, so I will try to submit a better fix soon! |
One other thing I'll note, that's an issue with our current sorts: The code order by is Small communities with few posts especially, will have a ton of posts with hot_rank = 0 , so it sorts them randomly. I'll create a PR for this shortly. |
Does #3618 actually fix the underlying issue? It seems like it just adds a published date to the aggregates table, so it will address the issue mentioned by @dessalines. But it won't fix the problem of old posts erroneously getting a hot_rank value of 1728 (unless I'm missing something). Edit: apologies, I did miss something, i.e. the earlier PR that fixes this. Thanks! |
Requirements
Summary
Browsing "All" (from the Vlemmy instance), sorting by "hot", is pulling up 2-year-old posts with no comments and maybe one upvote. Seems like an error in the "hot" sorting algorithm.
Steps to Reproduce
https://lemmy.ml/post/61856
Technical Details
Don't have access to logs. Using Lemmy as a client. However the sorting algorithm and article fetching I assume is on the backend.
Version
0.18.0
Lemmy Instance URL
Vlemmy.net
The text was updated successfully, but these errors were encountered: