Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finalize assignments: Chapter 10. SEO #12

Closed
3 tasks done
rviscomi opened this issue May 21, 2019 · 17 comments
Closed
3 tasks done

Finalize assignments: Chapter 10. SEO #12

rviscomi opened this issue May 21, 2019 · 17 comments

Comments

@rviscomi
Copy link
Member

rviscomi commented May 21, 2019

Section Chapter Authors Reviewers
II. User Experience 10. SEO @rachellcostello @ymschaap @AVGP @clarkeclark @andylimn @voltek62

Due date: To help us stay on schedule, please complete the action items in this issue by June 3.

To do:

  • Assign subject matter experts (coauthors)
  • Assign peer reviewers
  • Finalize metrics

Current list of metrics:

  • Structured data rich results eligibility (ratings, search, etc,)
  • Lang attribute usage and mistakes (lang='en')
  • <link> rel="amphtml" (AMP)
  • <link> hreflang="en-us" (localisation usage)
  • Breakdown of type of structured data served (ld+json, microformatting, schema.org + what @type)?
  • Indexability - looking at meta tags like <meta> noindex, <link> canonicals.
  • <meta> description + <title> (presence & length)
  • Status codes and whether pages are accessible - 200, 3xx, 4xx, 5xx.
  • Content - looking at word count, thin pages, header usage, alt attributes images
  • Linking - extract <a href> count per page (internal + external)
  • Linking - fragment URLs (together with SPAs to navigate content)
  • robots.txt (It is mentioned in Lighthouse, can we parse the content or only confirm its existence? E.g. check if has a sitemap reference - seems it does list the potential issues)
  • If the desktop site is responsive/mobile-ready, or a specific mobile site (redirect, UA)? (Can we find if these are different sites?)
  • Descriptive link text usage (available in Lighthouse data)
  • speed metrics (FCP, server response time)

👉 AI (coauthors): Finalize which metrics you might like to include in an annual "state of SEO" report powered by HTTP Archive. Community contributors have initially sketched out a few ideas to get the ball rolling, but it's up to you, the subject matter experts, to know exactly which metrics we should be looking at. You can use the brainstorming doc to explore ideas.

The metrics should paint a holistic, data-driven picture of the SEO landscape. The HTTP Archive does have its limitations and blind spots, so if there are metrics out of scope it's still good to identify them now during the brainstorming phase. We can make a note of them in the final report so readers understand why they're not discussed and the HTTP Archive team can make an effort to improve our telemetry for next year's Almanac.

Next steps: Over the next couple of months analysts will write the queries and generate the results, then hand everything off to you to write up your interpretation of the data.

Additional resources:

@rviscomi rviscomi transferred this issue from HTTPArchive/httparchive.org May 21, 2019
@rviscomi rviscomi added this to the Chapter planning complete milestone May 21, 2019
@rviscomi rviscomi added this to TODO in Web Almanac 2019 via automation May 21, 2019
@rviscomi rviscomi changed the title [Web Almanac] Finalize assignments: Chapter 10. SEO Finalize assignments: Chapter 10. SEO May 21, 2019
@rviscomi rviscomi moved this from TODO to In Progress in Web Almanac 2019 May 21, 2019
@clarkeclark
Copy link

Rick, love to help out as a peer reviewer

@rviscomi
Copy link
Member Author

Great, thanks for volunteering! Just added you as a reviewer and invited you to the @HTTPArchive/reviewers team.

@andylimn
Copy link

Hey Rick, I'd be keen to peer review too!

@rviscomi
Copy link
Member Author

@andylimn great, thanks! Just added you.

@ymschaap could you go to https://github.com/HTTPArchive and accept the invitation to the Authors team? That will enable me to assign this issue to you.

@ymschaap
Copy link
Contributor

ymschaap commented May 24, 2019

Re: SPA-only sites % (so which don’t support SSR)
So I tried multiple ways to expose SPA's without SSR but couldn't find a good query without too many false positives. So I would drop that (it would've been interesting).

Looking at the Lighthouse SEO data, there might be some other interesting metrics.

@rviscomi Where do I make changes to the final list of metrics? The Web Almanac Brainstorm Google doc or propose in this thread?

@rviscomi
Copy link
Member Author

This issue will be the canonical source of metrics, so feel free to use the doc to iterate then copy them here when you're ready.

@ymschaap
Copy link
Contributor

ymschaap commented May 25, 2019

Current list of metrics:

  • Structured data rich results eligibility (ratings, search, etc,)
  • Lang attribute usage and mistakes (lang='en')
  • <link> rel="amphtml" (AMP)
  • <link> hreflang="en-us" (localisation usage)
  • Breakdown of type of structured data served (ld+json, microformatting, schema.org + what @type)?
  • Indexability - looking at meta tags like <meta> noindex, <link> canonicals.
  • <meta> description + <title> (presence & length)
  • Status codes and whether pages are accessible - 200, 3xx, 4xx, 5xx.
  • Content - looking at word count, thin pages, header usage, alt attributes images
  • Linking - extract <a href> count per page (internal + external)
  • Linking - fragment URLs (together with SPAs to navigate content)
  • robots.txt (It is mentioned in Lighthouse, can we parse the content or only confirm its existence? E.g. check if has a sitemap reference - seems it does list the potential issues)
  • If the desktop site is responsive/mobile-ready, or a specific mobile site (redirect, UA)? (Can we find if these are different sites?)
  • Descriptive link text usage (available in Lighthouse data)
  • speed metrics (FCP, server response time)

@rachellcostello want to add/change anything missing?

@rachellcostello
Copy link
Contributor

Here's my feedback on the current list of metrics:

In response to your question @ymschaap - pagination won't usually be relevant for homepages, just sections of websites like category pages listing products or blog sections listing articles.

I wouldn't include the meta keywords tag as this isn't used by search engines anymore so it's become kind of obsolete. Page titles and meta descriptions should definitely be included though.

For canonical tags, it would be interesting to see if they are self-referencing, if it's possible to check if the URL of the page and the URL in the canonical tag are an exact match or not?

It might also be useful to add another level of detail to the links information by classifying them by type. E.g. a href links, onclick links, JavaScript links etc. Martin Splitt's slide on problematic links for SEO is a great example of the bad types to watch out for!

Screenshot 2019-05-28 at 18 15 21

I like the idea of having a speed metric. TTI or FCP would be useful from a UX perspective, and something like server response time would be useful from a search engine crawling perspective.

Everything else is looking good!

@ymschaap
Copy link
Contributor

Great, I updated the 'current list of metrics' with your remarks and will move it also in the 'brainstorm' doc.

@AVGP
Copy link

AVGP commented May 29, 2019

If there's still time / space, I'd be happy to review or help in whatever shape or form :)

@AVGP
Copy link

AVGP commented May 29, 2019

also @ymschaap could we flag fragment URLS ("#") that are used to load different content in SPAs as problematic?

@rviscomi
Copy link
Member Author

Yes would be great to have you Martin! Adding you as a coauthor, let me know if you'd prefer to review.

@AVGP
Copy link

AVGP commented May 29, 2019

Awesome, thanks @rviscomi - I'd love to help author it :)

@ymschaap
Copy link
Contributor

@AVGP initially SPAs and their implementation were on the list, but I couldn't figure out a reliant way to flag those. But yes, lets now add fragment URLs in there and maybe we can find a query to get the right data out.

@rviscomi
Copy link
Member Author

rviscomi commented Jun 4, 2019

@rachellcostello @ymschaap @AVGP @clarkeclark @andylimn @voltek62 if everyone's happy with the current list of metrics in #12 (comment) could you tick the final TODO checkbox item and close this issue? Thanks!

Web Almanac 2019 automation moved this from In Progress to Done Jun 4, 2019
@voltek62
Copy link

voltek62 commented Jun 4, 2019

The current list of metrics is perfect.

@rviscomi
Copy link
Member Author

rviscomi commented Jun 4, 2019

Thanks everyone!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Development

No branches or pull requests

7 participants