Infrastructure and design reboot #81
Comments
@pmeenan WDYT of the feasibility of adding first-class support for Google Storage to WPT, similar to S3? This would simplify the architecture of HA and would allow WPT to write directly to GS without the need for polling/transferring on HA's end.
It already can, but that is for general archiving of tests. The GS writes that happen right now are for artifacts that WPT doesn't normally store or archive on its own (traces and HARs). To eliminate the polling you probably want to use a few pub/sub queues for pushing things through the various stages. This is largely how the queue already works, since it is built on top of beanstalk, but it doesn't use beanstalk's support for retrying failed tasks. We'd still need to plumb the output side to do the post-processing/extraction (or even just the state management, if all processing is moved to BQ). Instead of the current "callback" support, the ability to post HARs (or references to HARs) directly from WPT into a pub/sub queue is probably more along the lines of what you'd need.
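A rough sketch of the staged, retrying flow described above, in case it helps the discussion. It uses Python's in-memory `queue.Queue` as a stand-in for a real pub/sub service, and `HarMessage`, `run_stage`, and `max_attempts` are hypothetical names made up for illustration; none of this is WPT's or HA's actual code.

```python
import queue
from dataclasses import dataclass


@dataclass
class HarMessage:
    """Hypothetical message: a reference to a HAR, not the HAR body itself."""
    test_id: str
    har_ref: str
    attempts: int = 0


def run_stage(inbox, outbox, handler, max_attempts=3):
    """Drain inbox through handler; retry failures, forward successes.

    Messages that fail max_attempts times are returned as a dead-letter list
    instead of being dropped silently.
    """
    dead = []
    while not inbox.empty():
        msg = inbox.get()
        try:
            handler(msg)
        except Exception:
            msg.attempts += 1
            if msg.attempts < max_attempts:
                inbox.put(msg)      # re-queue for another try
            else:
                dead.append(msg)    # give up on this message
        else:
            msg.attempts = 0        # reset the counter for the next stage
            outbox.put(msg)
    return dead
```

Chaining stages is then a matter of feeding one stage's `outbox` to the next stage as its `inbox`, for example an "uploaded" queue feeding a "post-processing" queue. A real deployment would swap the in-memory queues for a managed pub/sub service with acknowledgements.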
I don't think I ever dug deep enough into this part of the HA pipeline.. :) What would be the externally visible net win from the above? HARs would show up earlier in the GS storage bucket? Load decrease on the agents?
Not much. You may get HARs a few hours earlier (they are already uploaded as the crawls run) and it doesn't help the agents. It may lighten the load on the server and clean up some processing logic but that's about it.
Minor niggle: is it possible to use less ambiguous notation when referring to the dates of various crawls? "5/1" is 5th January (or even 5 to 1 odds) for most of the world. It's surprisingly confusing when reading some of the issues.
Thanks @Themanwithoutaplan. Tracking this in #106.
Been chatting with @igrigorik about rebooting HA's UI and backend. Opening this issue to organize thoughts/plans and invite wider discussion.
Incomplete list of high-level objectives, in no particular order:
We should use sub-issues to track each objective.