-
Notifications
You must be signed in to change notification settings - Fork 2
Please scrap Mathjax from all posts #61
Comments
Remove completely, or convert it to text? The former only requires pairing How do you tell if a site supports mathjax? Hardcoding it is an option, but On Wed, Jan 28, 2015 at 6:40 PM, Mooseman notifications@github.com wrote:
|
Just some data of the post that prompted this. Rendered output: Log entry:
... which begs the question, do we want to classify these sorts of posts as LQ? If yes, then case closed. Otherwise, just let Pham do his thing and lower that term's weight for mathjax supporting sites (and optionally add another term for posts with a lower char count). Or...? |
If the question or answer doesn't have enough content besides the mathjax, I think it will generally be LQ. |
Seems LQ to me, but I'm not sure it needs our handling. The auto-whitelist On Wed, Jan 28, 2015 at 7:02 PM, Sam notifications@github.com wrote:
|
So... it looks like just a simple matter of adjusting the current terms. Should I continue to add mathjax scrapping then? |
Please do. Not sure if it's strictly necessary, but it should be helpful. On Wed, Jan 28, 2015 at 7:44 PM, Sam notifications@github.com wrote:
|
Sure, ok. Shall I just remove all mathjax or (somehow) convert it to plain text? |
I'd remove it so we don't match phone numbers or other filters. |
If you feel like parsing mathjax... sure, go ahead. Be sure to keep the On Wed, Jan 28, 2015 at 8:05 PM, Sam notifications@github.com wrote:
|
Alright, well I'm sure there's a library for that (I hope). Will do (I'm gonna put this as low priority until Pham's stable after the switch over to a CLI). |
Pham does not contain any filters specific to mathjax blocks. The only issue I can see this causing is false positives for some regexes such as
{0,80}
.The text was updated successfully, but these errors were encountered: