Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some sentences don't seem to be getting indexed for searches #1865

Closed
ckjpn opened this Issue Apr 12, 2019 · 6 comments

Comments

Projects
None yet
4 participants
@ckjpn
Copy link

ckjpn commented Apr 12, 2019

Some sentences don't seem to be getting indexed for searches.
Some sentences are missing from searches, but some sentences newer than the missing ones show up.

See the Wall.
https://tatoeba.org/eng/wall/show_message/31658#message_31658

In addition to the note on the Wall, I noticed this one, recently, too.

Search for Amastan's recently-contributed English sentences with "separatist."

https://tatoeba.org/eng/sentences/search?from=eng&to=ber&user=Amastan&sort=created&query=separatist

The top number is: 7836358
Kabyles don't need a fascist racist separatist group to defend them.
I'm sick of the separatist propaganda.
Ferhat Mehenni, the leader of a dangerous separatist group, has no right to speak in the name of all the Kabyles.
Kabylie, a Berber-speaking region in Algeria, doesn't need separatist terrorists.
The separatist flag is unwanted in Algerian marches.
etc.

You can see that there are actually the following more recent ones showing on this page
https://tatoeba.org/eng/Sentences/of_user/Amastan/eng

The bottom number is this: 7841405.
Those separatist shitheads are real sociopaths.
The separatists are sociopaths.
I have received new threatening messages from the separatists.
I have received new threats from the separatists.
I received threats from the separatists.
Ferhat Mehenni, an ex-singer, is now the "president" of a ridiculous but dangerous separatist movement.
Kabyles are traditionally a tight community and they don't like traitors. Therefore, the separatists have no future in our region.
We are not afraid of the separatists.
The separatists have developed an extremely violent political discourse lately.

@soliloquist-tatoeba

This comment has been minimized.

Copy link

soliloquist-tatoeba commented Apr 14, 2019

I want to report a similar issue.

https://tatoeba.org/eng/sentences/show/2332801

There's something strange on this sentence's page. The names of the users on the comment and logs are not shown. I wonder how common this issue is and if it occurred after the code migration.

@ckjpn

This comment has been minimized.

Copy link
Author

ckjpn commented Apr 15, 2019

I think for an old sentences like this, it's not a current bug. In the past, I think when some members were banned from the site, and Sysko (maybe it was) deleted them, this happened.

@trang

This comment has been minimized.

Copy link
Member

trang commented Apr 17, 2019

@jiru jiru self-assigned this Apr 18, 2019

@jiru

This comment has been minimized.

Copy link
Member

jiru commented Apr 18, 2019

I think it’s related.

I found out that the old sphinxsearch package left some files even after removal. There was a cron file /etc/cron.d/sphinxsearch that was starting a full index refresh as root everyday at midnight, effectively messing up with our own index refresh scripts. I had to use apt-get remove --purge sphinxsearch to get rid of these leftovers. I’m rebuilding the main indexes right now to add back sentences that havn’t been indexed. Hopefully this will prevent this problem from happening again.

@jiru

This comment has been minimized.

Copy link
Member

jiru commented Apr 18, 2019

I think I solved the problem.

@soliloquist-tatoeba, the issue you mentioned is different. It happens because that user was deleted. I don’t think it’s a big problem but feel free to open a new issue if it bothers you too much.

@jiru jiru closed this Apr 18, 2019

@soliloquist-tatoeba

This comment has been minimized.

Copy link

soliloquist-tatoeba commented Apr 18, 2019

@jiru, I just wanted to know if it was a general bug. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.