Google search results for mithril.js.org show html vomit (see picture) #2114

kylebakerio · 2018-04-03T01:25:07Z

(Image should be entirely self-explanatory, title and then some.)

pygy · 2018-04-03T06:52:05Z

Thanks, it looks like we don't serve valid HTML.

https://validator.w3.org/check?uri=https%3A%2F%2Fmithril.js.org&charset=%28detect+automatically%29&doctype=Inline&group=0

Adding a doctype would be a good start...

Indeed, there are far fewer errors when parsing in HTML5 mode: https://validator.w3.org/nu/?doc=https%3A%2F%2Fmithril.js.org%2F

codeclown · 2018-04-10T22:53:43Z

That's just the code example from the page, Google happened to pick it up as the page description. The way to affect that would be to add a <meta name="description" content="..."> tag on the page.
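For illustration, a minimal sketch of how a build step could inject that tag, assuming the page is available as an HTML string with a `<head>` element. The names `escapeAttribute` and `withMetaDescription` are hypothetical and are not part of the Mithril docs generator.

```javascript
// Hypothetical helpers, not part of the Mithril docs build.
// Escapes text so it is safe inside a double-quoted HTML attribute.
function escapeAttribute(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/"/g, "&quot;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
}

// Inserts a meta description right after the opening <head> tag.
// Returns the input unchanged if no <head> tag is found.
function withMetaDescription(html, description) {
  const tag = '<meta name="description" content="' + escapeAttribute(description) + '">'
  return html.replace(/<head(\s[^>]*)?>/i, (open) => open + "\n" + tag)
}
```

Escaping the description matters because the text would otherwise be able to break out of the `content` attribute.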

pygy · 2018-04-10T23:08:46Z

Good catch @codeclown!

kylebakerio · 2018-04-25T22:54:39Z

So, where is the repo for that file so we can add it? This bug is still present, just checked.

pygy · 2018-04-26T06:03:33Z

https://github.com/MithrilJS/mithril.js/blob/next/docs/layout.html

The trouble is that we may then get the same description for all pages (Google doesn't always honour meta tags). IIRC the home page is the only one that shows up in search results, so it may not matter much, but we'd need some systematic testing to be sure.

orbitbot · 2018-04-26T06:06:40Z

Currently, when searching for e.g. site:mithril.js.org keys, you get the start of the actual page body content. I don't know if the meta description would affect this. If not, then just making it reasonably generic would probably work.

kylebakerio · 2018-04-26T17:04:17Z

https://support.google.com/webmasters/answer/35624?hl=en

Make sure that every page on your site has a meta description.

Differentiate the descriptions for different pages.

Programmatically generate descriptions.

You can, alternatively, prevent snippets from being created and shown for your site in Search results. Use the tag to prevent Google from displaying a snippet for your page in Search results.

^ several good tips from Google here. We could programmatically generate the meta tags to use the opening content on documentation pages. We could also turn off snippets otherwise as an easier solution for a specific page.

I'm open to doing this if it's welcome.

tivac · 2018-04-27T01:11:51Z

Go for it, @finetype!

kylebakerio · 2018-05-06T01:40:53Z

Finally sat down to work on this today.

It would be "easy" to add snippets programmatically, except for one problem: how to grab the right piece of text? The doc files aren't sufficiently standardized, unfortunately. Some ideal descriptions are under the first ---, some are under the first ##, most are under ###, and some are directly underneath the navigation table, above the ---. Any attempt to programmatically extract a snippet from these docs will result in something pretty brittle and unwieldy.
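To make the brittleness concrete, here is a rough sketch of the kind of heuristic this would take. It assumes plain markdown input; the function name `guessDescription` and the 160-character default are illustrative assumptions, not anything from the docs build.

```javascript
// Hypothetical heuristic: return the first block that looks like plain prose.
function guessDescription(markdown, maxLength = 160) {
  const blocks = markdown.split(/\n\s*\n/)
  for (const block of blocks) {
    const text = block.trim()
    // Skip headings, horizontal rules, list items (e.g. navigation),
    // table rows, code fences and raw HTML.
    if (text === "" || /^(#|---|[-*] |\||`{3}|<)/.test(text)) continue
    // Collapse line breaks and truncate to a snippet-sized length.
    const flat = text.replace(/\s+/g, " ")
    return flat.length > maxLength ? flat.slice(0, maxLength - 1).trimEnd() + "…" : flat
  }
  return ""
}
```

Even this small version has gaps: a fenced code block that contains a blank line would have its second half mistaken for prose, and it has no way to know whether the "right" description sits under a later heading.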

I propose one of three solutions:

1. Add a "meta description" section to the bottom of all the files. (Proposal: I can just copy some meaningful chunk of text by hand into such a section and put it under a #### Meta Description header on each doc).
2. We could only add a custom snippet (or remove snippets) for that main index.md file that is causing the problem specifically mentioned in this bug report--that'd probably be the 80/20 solution.
3. Remove snippets from the docs altogether with <meta name="robots" content="nosnippet">. (This would fix the problem, would be nearly effortless, and would prevent this problem from happening on other docs as well, but then we don't get snippets at all, which are nice when they work.)

I'd prefer to go forward with (1), but would like some feedback before going that route.
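A minimal sketch of how proposal (1) could be consumed at build time, assuming each doc ends with a section under a `#### Meta Description` header. The function name `splitMetaDescription` and the returned shape are hypothetical.

```javascript
// Hypothetical helper for proposal (1): pull the description out of a
// "#### Meta Description" section and remove that section from the page body.
function splitMetaDescription(markdown) {
  const lines = markdown.split("\n")
  const start = lines.findIndex((line) => line.trim() === "#### Meta Description")
  if (start === -1) return {body: markdown, description: null}
  // The section runs until the next heading or the end of the file.
  let end = start + 1
  while (end < lines.length && !/^#{1,6} /.test(lines[end])) end++
  const description = lines.slice(start + 1, end).join(" ").replace(/\s+/g, " ").trim()
  const body = lines.slice(0, start).concat(lines.slice(end)).join("\n").trimEnd() + "\n"
  return {body, description}
}
```

Returning `null` when the section is missing lets the caller decide on a fallback, such as a generic site-wide description.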

leothorp · 2018-05-06T02:33:37Z

#1 sounds good to me (or if that's a bit time consuming, could just start by quickly doing #3 and do #1 at a more relaxed pace.)

On Sat, May 5, 2018, 20:41 Kyle Baker ***@***.***> wrote:

Finally sat down to work on this today.

It would be "easy" to add snippets programmatically, except for one problem: how to grab the right piece of text? The doc files aren't sufficiently standardized, unfortunately. Some ideal descriptions are under the first ---, some are under the first ##, most are under ###, and some are directly underneath the navigation table, above the ---. Any attempt to programmatically extract a snippet from these docs will result in something pretty brittle and unwieldy.

I propose one of three solutions:

1. Add a "meta description" section to the bottom of all the files. (Proposal: I can just copy some meaningful chunk of text by hand into such a section and put it under a #### Meta Description header on each doc.)
2. Remove snippets from the docs altogether with <meta name="robots" content="nosnippet">
3. We could only add a custom snippet for that main index.md file that is causing the problem specifically mentioned in this bug report--that'd probably be the 80/20 solution.

I'd prefer to go forward with (1), but would like some feedback before going that route.

tivac · 2018-05-06T02:45:26Z

I'm for 3, being that it's the most obvious win.

kylebakerio · 2018-05-06T20:27:07Z

Heh. I'll just wait for more responses, I guess? lol. Thanks for that tiebreaker, @pygy. ;P

kylebakerio · 2018-05-06T20:43:01Z

I re-ran the HTML validator--a lot of its error messages were because it was evaluating the page as if it were HTML 4.01, but if you switch it to the HTML5 mode, we get a much nicer bit of output: https://validator.w3.org/nu/?showsource=yes&doc=https%3A%2F%2Fmithril.js.org%2F

I have added a doctype and a lang="en" attribute to the html tag, as well as alt text to the logo, but the extra closing "p" tags are interesting... My best guess is that those are being inserted erroneously by the marked library, but there is some odd custom handling around code blocks that may be causing it. If you look at the source we generate, the incorrect closing </p> tags are all after code blocks on that page.

kylebakerio · 2018-05-06T20:47:31Z

Huh... maybe that's a bug in the validator (which is acknowledged as experimental). I see two <p> tags, one inside another. While that's odd, it seems correct to have two closing tags...

Looking at it now, I think the problem is in marked, though. Those inner p tags are written directly as p tags within the markdown, e.g. index.md, and the outer layer of p tags are probably added by marked around that.

It doesn't seem to cause any real issues, though, fwiw.
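For what it's worth, this is probably not a validator bug: in HTML a p element can't contain another p or a block such as pre, so the parser closes the open paragraph as soon as the inner tag starts, and the later </p> has nothing left to close. Below is a simplified sketch of that rule. The function name `findStrayClosingP` is hypothetical, and the tag list is only a subset of the elements that close a paragraph in the HTML spec.

```javascript
// Simplified model: opening one of these tags implicitly closes any open <p>.
// This is an illustrative subset, not the full list from the HTML spec.
const CLOSES_P = new Set([
  "p", "pre", "div", "ul", "ol", "table", "blockquote",
  "h1", "h2", "h3", "h4", "h5", "h6",
])

// Returns the character offsets of </p> tags that have no open <p> to close.
// Only start tags and </p> are tracked; comments, scripts and other end tags
// that can also close a paragraph are ignored in this sketch.
function findStrayClosingP(html) {
  const stray = []
  let openP = false
  const tagPattern = /<(\/?)([a-zA-Z][a-zA-Z0-9]*)\b[^>]*>/g
  let match
  while ((match = tagPattern.exec(html)) !== null) {
    const closing = match[1] === "/"
    const name = match[2].toLowerCase()
    if (closing) {
      if (name === "p") {
        if (openP) openP = false
        else stray.push(match.index)
      }
    } else if (CLOSES_P.has(name)) {
      // Any open paragraph ends here; a new one begins only if this tag is <p>.
      openP = name === "p"
    }
  }
  return stray
}
```

Running something like this over the generated pages would flag both the nested `<p><p>...</p></p>` case and a `</p>` that follows a code block wrapped in a paragraph.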

StephanHoyer · 2022-02-21T13:42:19Z

seems no issue anymore

pygy added the Type: Bug and Area: Documentation labels on Apr 3, 2018

kylebakerio mentioned this issue May 13, 2018

Meta description snippets for documentation pages #2149

Closed

11 tasks

dead-claudia added this to Needs triage in Triage/bugs via automation Oct 28, 2018

dead-claudia moved this from Needs triage to High priority in Triage/bugs Oct 28, 2018

StephanHoyer closed this as completed Feb 21, 2022

Triage/bugs automation moved this from High priority to Closed Feb 21, 2022
