Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Security 2021 #2150

Closed
6 tasks done
rviscomi opened this issue Apr 27, 2021 · 39 comments
Closed
6 tasks done

Security 2021 #2150

rviscomi opened this issue Apr 27, 2021 · 39 comments
Assignees
Labels
2021 chapter Tracking issue for a 2021 chapter
Projects

Comments

@rviscomi
Copy link
Member

rviscomi commented Apr 27, 2021

Part II Chapter 12: Security

Security illustration

If you're interested in contributing to the Security chapter of the 2021 Web Almanac, please reply to this issue and indicate which role or roles best fit your interest and availability: author, reviewer, analyst, and/or editor.

Content team

Lead Authors Reviewers Analysts Editors Coordinator
@SaptakS @SaptakS @tomvangoethem @nrllh @cqueern @edmondwwchan @awareseven @GJFR @tunetheweb @OBTo
Expand for more information about each role
  • The content team lead is the chapter owner and responsible for setting the scope of the chapter and managing contributors' day-to-day progress.
  • Authors are subject matter experts and lead the content direction for each chapter. Chapters typically have one or two authors. Authors are responsible for planning the outline of the chapter, analyzing stats and trends, and writing the annual report.
  • Reviewers are also subject matter experts and assist authors with technical reviews during the planning, analyzing, and writing phases.
  • Analysts are responsible for researching the stats and trends used throughout the Almanac. Analysts work closely with authors and reviewers during the planning phase to give direction on the types of stats that are possible from the dataset, and during the analyzing/writing phases to ensure that the stats are used correctly.
  • Editors are technical writers who have a penchant for both technical and non-technical content correctness. Editors have a mastery of the English language and work closely with authors to help wordsmith content and ensure that everything fits together as a cohesive unit.
  • The section coordinator is the overall owner for all chapters within a section like "User Experience" or "Page Content" and helps to keep each chapter on schedule.

Note: The time commitment for each role varies by the chapter's scope and complexity as well as the number of contributors.

For an overview of how the roles work together at each phase of the project, see the Chapter Lifecycle doc.

Milestone checklist

0. Form the content team

  • May 31: The content team has at least one author, reviewer, and analyst

1. Plan content

  • June 15 The content team has completed the chapter outline in the draft doc

2. Gather data

  • June 30: Analysts have added all necessary custom metrics and drafted a PR (example) to track query progress
  • July 1 - 31: HTTP Archive runs the July crawl

3. Validate results

  • September 30: Analysts have queried all metrics and saved the output to the results sheet

4. Draft content

  • October 31: The content team has written, reviewed, and edited the chapter in the doc

5. Publication

  • November 15: The completed chapter and all required metadata and figures are converted to markdown and submitted to GitHub
  • December 1: Target launch date 🚀

Chapter resources

Refer to these 2021 Security resources throughout the content creation process:

📄 Google Docs for outlining and drafting content
🔍 SQL files for committing the queries used during analysis
📊 Google Sheets for saving the results of queries
📝 Markdown file for publishing content and managing public metadata

@rviscomi rviscomi added 2021 chapter Tracking issue for a 2021 chapter help wanted Extra attention is needed labels Apr 27, 2021
@tomvangoethem
Copy link
Contributor

I'd like to join as an author!

@SaptakS
Copy link
Collaborator

SaptakS commented Apr 27, 2021

I would love to help as a co-author or reviewer

@cqueern
Copy link
Contributor

cqueern commented Apr 27, 2021

I'd like to support as a Peer Reviewer.

@nrllh
Copy link
Collaborator

nrllh commented Apr 27, 2021

I'd like to contribute as an author.

@rviscomi
Copy link
Member Author

Thanks @tomvangoethem @SaptakS @cqueern @nrllh! It's great to see so much interest. I'm going to tentatively put you all down as peer reviewers for now until we're ready to start selecting authors. FYI since @tomvangoethem and @nrllh were coauthors of the 2020 Security chapter, we're going to lean towards selecting new people. More context in #2165.

@edmondwwchan
Copy link

Hi team, I am interested and would like to support this chapter as a peer reviewer.

@rviscomi rviscomi added this to TODO in 2021 via automation Apr 27, 2021
@rviscomi
Copy link
Member Author

Welcome back @edmondwwchan!

@tomvangoethem @SaptakS @cqueern @nrllh @edmondwwchan I think there's critical mass for all of the interested contributors of this chapter to start brainstorming content in the doc. It'd be great to get an early start on sketching the outline and thinking about metrics. Here's the 2020 doc for reference if you wanted to bootstrap this chapter with 2020 topics as a starting point. Add your notes to the 2021 doc if you can think of anything especially interesting that is new or outdated since the last chapter.

@rviscomi
Copy link
Member Author

rviscomi commented May 4, 2021

@SaptakS thanks for your interest in authoring this chapter! As the content team lead, you'll be responsible for the scope and direction of the chapter and keeping it on schedule. We automatically monitor the staffing and progress of each chapter based on the state of the initial comment so please keep that updated as you add new contributors and meet each milestone.

Depending on the length/scope of the chapter, you may want to add additional coauthors to share the load. @tomvangoethem and @nrllh are both interested to coauthor the chapter again this year and would be great additions. It's up to you!

We've created a Google Doc for this chapter, which you're encouraged to use to collaborate with the content team on the initial outline, metrics, and ultimately the final draft.

Next steps for this chapter are:

@OBTo will be the section coordinator for this chapter, so they'll be periodically checking in with you directly to make sure the chapter is staying on schedule. Reach out to them here in this issue if you have any questions about the process.

More information about the content team lead and author roles and responsibilities are available for reference in the wiki if needed.

To anyone else interested in contributing to this chapter, please comment below to join the team!

@rviscomi rviscomi added the help wanted: analysts This chapter is looking for data analysts label May 4, 2021
@rviscomi rviscomi moved this from TODO to In Progress in 2021 May 4, 2021
@SaptakS
Copy link
Collaborator

SaptakS commented May 5, 2021

I would more than love to have @tomvangoethem and @nrllh as co-authors and get their valuable ideas from their experiences. Also, @tomvangoethem, you did analysis as well last year, would you be interested to do so this year as well?

@rviscomi
Copy link
Member Author

rviscomi commented May 5, 2021

📟 paging 2019/2020 contributors: @arturjanc @ScottHelme @paulcalvano @tunetheweb @ghedo @ndrnmnn @dotjs @jrharalson @AAgar

Would any of you be interested to contribute to the 2021 chapter? This chapter could use your help with reviewing and/or analyzing. It'd be great to have your support!

@rviscomi
Copy link
Member Author

rviscomi commented May 5, 2021

@awareseven were you interested in reviewing this chapter?

@ScottHelme
Copy link
Contributor

I'd be happy to review 👍

@SaptakS
Copy link
Collaborator

SaptakS commented May 6, 2021

@tomvangoethem @cqueern @nrllh @edmondwwchan @ScottHelme I have added the outline from last year in the docs, along with 2 more suggestions I think might make sense to add. More ideas are welcome that might make sense!!

@awareseven
Copy link

awareseven commented May 6, 2021

I can review the chapter @rviscomi and I am also happy to draft a few paragraphs as an author if you like

@rviscomi
Copy link
Member Author

rviscomi commented May 6, 2021

@awareseven ok great! I'll defer to the content team lead @SaptakS to loop you in.

@SaptakS
Copy link
Collaborator

SaptakS commented May 6, 2021

@awareseven sure! You can take a look at the 2021 docs. I have taken the basic outline from last year and added few other things that I feel might be interesting to see. We are currently brainstorming ideas for the chapter so suggestions are welcome!

@foxdavidj
Copy link
Contributor

Hey @SaptakS excited to work with you and the rest of the group this year on the Almanac. I'm your go to guy if you've got any questions or need help so don't hesitate to reach out to me on github, the Slack (@OBTo) or email (david@davidjfox.com).

Few first steps:

  1. @awareseven @ScottHelme should add themselves to the Google Doc in the format: Name (email@example.com).
  2. I've added links within the doc to the previous years Google doc in case you'd like to mine it for ideas.
  3. Would love to set up a 30 minute Zoom call in the next couple weeks to kick-start the chapter planning and brainstorming process, and put some faces to the names of the people we'll all be working with this year. I'll reach out again later this week to find a time that works.

Excited to work with you all this year.

@foxdavidj
Copy link
Contributor

foxdavidj commented May 19, 2021

How does Monday (May 24) at 12p ET / 9a PT / 5p BST (timezones here) work for the 30m chat?

@SaptakS @tomvangoethem @nrllh @cqueern @edmondwwchan @awareseven @ScottHelme

@edmondwwchan
Copy link

@OBTo Appreciate for setting up the call. The meeting will be a bit late in my local time (May 25 12am GMT+8). Anyway, I will try my best to attend.

@foxdavidj
Copy link
Contributor

@edmondwwchan So sorry about that. We'll try to find a more reasonable time moving forward.

Just sent the invite. If you didn't get it, please send me an email to david@davidjfox.com and I'll add you.

@SaptakS
Copy link
Collaborator

SaptakS commented May 20, 2021

Got it! Thanks!

@rviscomi rviscomi removed help wanted Extra attention is needed help wanted: analysts This chapter is looking for data analysts labels May 24, 2021
@foxdavidj
Copy link
Contributor

@SaptakS Can you tick the first milestone checkbox above 0. Form the content team? Keeping these milestones up to date helps to give me an overview on how the Almanac as a whole is coming together

@foxdavidj
Copy link
Contributor

@SaptakS @tomvangoethem @nrllh @cqueern @edmondwwchan @awareseven @ScottHelme @GJFR

Hey everyone, wanted to give a quick reminder that we need to have the chapter outline complete by June 15 so we have enough time to update our crawler with any additional metrics you need this year. Seeing some good ideas in the doc so far which is great.

Also the team has a channel on slack (#web-almanac-security), so feel free to join everyone there as well: https://join.slack.com/t/httparchive/shared_invite/zt-45sgwmnb-eDEatOhqssqNAKxxOSLAaA

If you have any other questions don't hesitate to reach out :)

@foxdavidj
Copy link
Contributor

@SaptakS reminder that we've got just over 1 week (June 15th) until the chapter outline is due. Please work together with your analyst (@GJFR) to make sure all metrics are feasible as well.

If there are any new custom metrics you require this year, have them decided by EOD June 23rd so your analyst has time to add them to our crawler.

@tunetheweb
Copy link
Member

FYI Lighthouse v8 has a new Content-Security-Policy audit, which we will have access to. Worth considering for this year: https://github.com/GoogleChrome/lighthouse/releases/tag/v8.0.0

@cqueern
Copy link
Contributor

cqueern commented Jun 7, 2021

FYI Lighthouse v8 has a new Content-Security-Policy audit, which we will have access to. Worth considering for this year: https://github.com/GoogleChrome/lighthouse/releases/tag/v8.0.0

This is great, thanks @tunetheweb .

@ScottHelme may have some tips on sound ways of analyzing the adoption of CSP capabilities at web-scale.

@foxdavidj
Copy link
Contributor

@SaptakS @GJFR

Hey everyone, wanted to give you a heads-up and reminder that the July website crawl has completed and chapters now need to:

  1. Analysts: Please write, test and publish the results of all the queries in the draft PR you should have created last month. We've got 3 chapters (PWA, Mobile Web, Accessibility) filled with every type of query you can imagine that you can refer to if you've ever got a question for how to grab the data you need.

  2. Chapter leads: Take a look at your Analysts draft PR where they have listed all the queries/data they'll be analyzing. You'll want to make sure all the ideas you discussed are listed and that nothing was lost in communication.

  3. Analysts: Once your queries are completed and data has been put into the spreadsheets (along with comments), set up a time to run through the data with the Chapter lead so they know exactly how to interpret the data

That's it! Really looking forward to seeing the chapter start to take form. And if you've ever got any questions just ping me

PWA: Queries, Results (has all their visualizations done as well)
Mobile Web: Queries, Results
A11Y: Queries, Results

@GJFR
Copy link
Member

GJFR commented Aug 30, 2021

Hi @OBTo. There have been some issues regarding the well-known custom metric. I have fixed the issue in this PR, but still, the current crawl data will represent an undercount due to JS errors.

Would it be possible to push the fix for the September crawl such that we could still use this metric for the chapter?

@foxdavidj
Copy link
Contributor

Hi @OBTo. There have been some issues regarding the well-known custom metric. I have fixed the issue in this PR, but still, the current crawl data will represent an undercount due to JS errors.

Would it be possible to push the fix for the September crawl such that we could still use this metric for the chapter?

How major is that metric to the chapter? We'd really prefer having the entire almanac using the July data.

If it's a more minor metric it'd be best to shelve it and save it for next year

@GJFR
Copy link
Member

GJFR commented Sep 1, 2021

Hi @OBTo. There have been some issues regarding the well-known custom metric. I have fixed the issue in this PR, but still, the current crawl data will represent an undercount due to JS errors.
Would it be possible to push the fix for the September crawl such that we could still use this metric for the chapter?

How major is that metric to the chapter? We'd really prefer having the entire almanac using the July data.

If it's a more minor metric it'd be best to shelve it and save it for next year

I'd say it's not that major, but I think @SaptakS has a better idea about that.

Because the presence of a page's /robots.txt endpoint was already being captured by another metric, I could check the amount of positives that were missed in the July crawl due to the errors:

Hosts with /robots.txt: Actual Measured Error
desktop_10k 7244 / 9990 7110 / 9990 134 / 9990 (1.3%)
mobile_10k 7439 / 9994 6916 / 9994 523 / 9994 (5.2%)

It seems the amount of /robots.txt endpoints that were missed in the July crawl is rather limited (at least for the sample data), also considering that not all errors can be prevented. Can we afford this error rate or should we disregard this data?

@SaptakS
Copy link
Collaborator

SaptakS commented Sep 1, 2021

I don't think it's necessarily a major metric. I think we were just trying to get some new metrics this time from security.txt and robots.txt, but not critical for the chapter. @tomvangoethem @nrllh what do you think?

It seems the amount of /robots.txt endpoints that were missed in the July crawl is rather limited (at least for the sample data), also considering that not all errors can be prevented. Can we afford this error rate or should we disregard this data?

I am also curious about this. Do we keep this if the error is not much? @OBTo thoughts?

@tomvangoethem
Copy link
Contributor

I think it's best to keep it for next year; we already have plenty of content & having to explain that there is an (additional) error rate in these measurements might complicate things for the reader.

@foxdavidj
Copy link
Contributor

@SaptakS We'd be ok with you including the metric if your team does the following

  1. Mention the possible error when presenting the data from this metric
  2. Re-run that one metric with the September data to see if there is a large difference in the results
  3. If there is a large difference, we'll have to exclude it from the chapter and wait till next year

How does that sound?

@SaptakS
Copy link
Collaborator

SaptakS commented Sep 12, 2021

I think rerunning on September data to check how large of a difference it is makes sense to me, if it's not too much of an effort @GJFR . I agree it makes sense otherwise to exclude it from the chapter this year.

@GJFR
Copy link
Member

GJFR commented Sep 14, 2021

@SaptakS Sure, no problem!

@rviscomi
Copy link
Member Author

@SaptakS @tomvangoethem @nrllh @cqueern @edmondwwchan @awareseven @GJFR

🎉 This chapter is fully written, reviewed, edited, and ready to be launched on Wednesday! Thank you to all of the contributors who put in the time and effort to make this a great chapter.

When you get 5 minutes, I'd really appreciate if you could fill out our contributor survey to tell us (the project leads) about your experience. It's super helpful to hear what went well or what could be improved for next time. 🙏

Congratulations and thank you all again. I'm excited for this to launch soon!

2021 automation moved this from In Progress to Done Nov 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2021 chapter Tracking issue for a 2021 chapter
Projects
2021
Done
Development

No branches or pull requests

10 participants