Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Rich text indexing issues again #2066
There is code in plone.app.contenttypes/indexers.py that is getting changed back and forth. The code gets rich text for the SearchableText index like this:
Let's simplify this to make it easier to talk about:
It is not clear what
Then @petri filed plone/plone.app.contenttypes#357.
Both seem okay.
So he made pull requests choosing the first solution: output and output mimetype. This was included in 1.1.2 and 1.2.18.
Then @agitator noticed problems with this in #1844.
This was released in 1.2.19 (branch 1.2.x and master). It was not yet backported to branch 1.1.x, but it should be: I notice the same problem there.
It's been a while... I don't remember of any particular problems in Plone that I would have encountered.
After re-reading the sources and #1844 discussion, it seems to me that for indexing, we should use the raw RichText value for plaintext conversion, not the output value, because:
Thus it seems to me the above-mentioned possible alternative by @mauritsvanrees is what we should do: