Query metrics: Chapter 10. SEO #91

rviscomi · 2019-07-23T19:00:51Z

Part	Chapter	Authors	Reviewers	Tracking Issue
II. User Experience	10. SEO	@rachellcostello @ymschaap @AVGP	@clarkeclark @andylimn @voltek62	#12

READ ME!

All of the metrics in the table below have been marked as Able To Query during the metrics triage. The analyst assigned to each metric is expected to write the corresponding query and submit a PR to have it reviewed and added to the repo.

In order to stay on schedule and have the data ready for authors, please have all metrics reviewed and merged by August 5.

Assignments

ID	Metric description	Analyst	Notes
10.01	Structured data rich results eligibility (ratings, search, etc,)	@ymschaap	html -> regex
10.02	Lang attribute usage and mistakes (lang='en')	@ymschaap	lighthouse -> html-lang-valid, resource: See: https://discuss.httparchive.org/t/what-are-the-invalid-uses-of-the-lang-attribute/1022
10.03	`<link>` rel="amphtml" (AMP)	@ymschaap	html -> regex
10.04	`<link>` hreflang="en-us" (localisation usage)	@ymschaap	html -> regex + lighthouse -> hreflang
10.05	Breakdown of type of structured data served (ld+json, microformatting, schema.org + what `@type`)?	@ymschaap	Custom Query Can we have this data: https://search.google.com/structured-data/testing-tool
10.06	Indexability - looking at meta tags like `<meta>` noindex, `<link>` canonicals.	@ymschaap	lighthouse -> is-crawlable, lighthouse -> canonical
10.07	`<meta>` description + `<title>` (presence & length)	@ymschaap	html -> regex
10.08	Status codes and whether pages are accessible - 200, 3xx, 4xx, 5xx.	@ymschaap	request -> response
10.09	Content - looking at word count, thin pages, header usage, alt attributes images	@ymschaap	lighthouse ->image-alt, Custom Query
10.10	Linking - extract `<a href>` count per page (internal + external)	@ymschaap	Custom Query
10.11	Linking - fragment URLs (together with SPAs to navigate content)	@ymschaap	we have react/vue as application type + a href Custom Query
10.12	robots.txt (It is mentioned in Lighthouse, can we parse the content or only confirm its existence? E.g. check if has a sitemap reference - seems it does list the potential issues)	@ymschaap	lighthouse -> robots-txt
10.13	If the desktop site is responsive/mobile-ready, or a specific mobile site (redirect, UA)? (Can we find if these are different sites?)	@ymschaap	compare mobile vs desktop crawl page -> _final_url + lighthouse -> seo-mobile
10.14	Descriptive link text usage (available in Lighthouse data)	@ymschaap	lighthouse -> link-text
10.15	speed metrics (FCP, server response time) would be nice for SEO as well given the recent focus on fast loading sites	@ymschaap	See: https://discuss.httparchive.org/t/measuring-cms-host-ttfb-in-crux/1676/1

Checklist of metrics to be merged

The text was updated successfully, but these errors were encountered:

rviscomi · 2019-08-03T18:32:15Z

10.01 Structured data rich results eligibility

I had assumed GoogleChrome/lighthouse#4359 was already added to Lighthouse, but it doesn't seem like it. This may be tricky to get right using only SQL. Since I was the one to add this metric, I think it's ok to change this to "Not Feasible". @ymschaap WDYT?

ymschaap · 2019-08-03T21:32:45Z

10.01 Structured data rich results eligibility

I had assumed GoogleChrome/lighthouse#4359 was already added to Lighthouse, but it doesn't seem like it. This may be tricky to get right using only SQL. Since I was the one to add this metric, I think it's ok to change this to "Not Feasible". @ymschaap WDYT?

What I did now was use the 10.05 metric (which grabs any json+ld, finds @type and @content) and looks at what @types triggers rich results.

We know the JSON is valid, and we know @context + @type is set, which is pretty close to what GoogleChrome/lighthouse#4359 does.

So imho we could keep it as long as we make clear what the limitations of this metric is in the webalmanac. On the other hand, 10.05 might already touch on this, and 10.01 would could be considered a duplicate.

rviscomi added the analysis Querying the dataset label Jul 23, 2019

rviscomi added this to the Content written milestone Jul 23, 2019

rviscomi assigned ymschaap Jul 23, 2019

rviscomi added this to TODO in Web Almanac 2019 via automation Jul 23, 2019

rviscomi mentioned this issue Jul 23, 2019

Assign analysts to chapters #71

Closed

ymschaap mentioned this issue Jul 25, 2019

Analyst SQL files chapter SEO #103

Merged

rviscomi moved this from TODO to In Progress in Web Almanac 2019 Aug 27, 2019

rviscomi added the ASAP This issue is blocking progress label Sep 4, 2019

rviscomi mentioned this issue Sep 15, 2019

SEO queries revisited #159

Merged

rviscomi closed this as completed in #159 Sep 17, 2019

Web Almanac 2019 automation moved this from In Progress to Done Sep 17, 2019

rviscomi mentioned this issue Sep 17, 2019

Write content: Chapter 10. SEO #163

Closed

3 tasks

rviscomi removed the ASAP This issue is blocking progress label Sep 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query metrics: Chapter 10. SEO #91

Query metrics: Chapter 10. SEO #91

rviscomi commented Jul 23, 2019 •

edited by ymschaap

Loading

rviscomi commented Aug 3, 2019

ymschaap commented Aug 3, 2019 •

edited

Loading

Query metrics: Chapter 10. SEO #91

Query metrics: Chapter 10. SEO #91

Comments

rviscomi commented Jul 23, 2019 • edited by ymschaap Loading

READ ME!

Assignments

Checklist of metrics to be merged

rviscomi commented Aug 3, 2019

ymschaap commented Aug 3, 2019 • edited Loading

rviscomi commented Jul 23, 2019 •

edited by ymschaap

Loading

ymschaap commented Aug 3, 2019 •

edited

Loading