A gem for Google's PageSpeed API.
Leverages Google's generated APIs and in doing so can do batch queries of 20 requests at a time.
Can take any number of domains in a list to look up.
Utilizes a block to minimize memory footprint in query().
Uses exponential backoff to handle rate limit errors from Google.
Failed rate limit queries are automatically re-run.
Add to your Gemfile:
gem 'pagespeedhelper', '>=0.4.7'
Minor updates should no longer break functionality. v0.4.7 is the first updated, fully functional release. Any later version will remain compatible while adding features or fixing bugs.
Setup:
require 'pagespeedhelper'
ps = PagespeedHelper.new('YOUR_GOOGLE_PAGESPEED_API_KEY')
# With exponential backoff, the max time to wait can be set as the second argument
# (see the Errors section for more):
ps = PagespeedHelper.new('YOUR_GOOGLE_PAGESPEED_API_KEY', 32)
# Verbose debugging can be set by:
ps = PagespeedHelper.new('YOUR_GOOGLE_PAGESPEED_API_KEY', 32, true)
Query:
The query() method was rewritten in v0.5.0 to accept a block for immediate handling and parsing of results, which drastically reduces the memory footprint of this gem. The block is optional: if none is passed, the data is returned from the method.
# old examples
data = ps.query('www.example.com')
# OR can take any number of elements in a list
data = ps.query(['www.foo.com', 'www.bar.com'])
# the strategy parameter can be either "mobile" or "desktop"; the default is desktop
# the third parameter tells query() which scheme to prepend when a url has none:
# true prepends https, false (the default) prepends http
data = ps.query([LIST_OF_URLS], "mobile", true)
# with block
ps.query([LIST_OF_URLS]) do |p|
  results = PagespeedHelper.parse(p) # or other processing
end
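The optional-block behaviour and the batches of 20 can be pictured with a small stand-alone sketch. It does not use the gem and is not its implementation: with_scheme and batched are hypothetical helpers, and yielding once per batch is an assumption about how the block is called.

```ruby
BATCH_SIZE = 20 # batch size stated in the feature list

# Hypothetical helper: prepend a scheme only when the url has none.
def with_scheme(url, secure = false)
  return url if url.start_with?("http://", "https://")
  (secure ? "https://" : "http://") + url
end

# Hypothetical stand-in for query(): yields each batch when a block is
# given, otherwise collects every batch and returns the full list.
def batched(urls, secure = false)
  all = []
  Array(urls).map { |u| with_scheme(u, secure) }.each_slice(BATCH_SIZE) do |batch|
    if block_given?
      yield batch        # caller handles the batch right away
    else
      all.concat(batch)  # no block: keep everything in memory
    end
  end
  block_given? ? nil : all
end

urls = (1..45).map { |i| "www.site#{i}.com" }
batched(urls) { |batch| puts "handling #{batch.size} urls" }
```

With a block, only one batch is held at a time, which is where the memory saving would come from.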
Parse Results:
Parse can be run even if errors occurred, see the Errors section for more information.
results = PagespeedHelper.parse(data)
Getting Data from Results:
Each of the rule results from Google are parsed and set in the results hash.
This set of rules varies depending on which strategy is used. Mobile will also include "USABILITY" rule results.
Parsed results now need to be checked for an error. If a result has one, it carries no accompanying data, just the URL and the error.
Result for one site checked:
results[0]["url"] # url checked
results[0]["score"] # site overall pagespeed score
results[0]["results"][ONE_OF_GOOGLES_RULES]["name"] # localized name for printing
results[0]["results"][ONE_OF_GOOGLES_RULES]["impact"] # impact of rule on pagespeed result
results[0]["results"][ONE_OF_GOOGLES_RULES]["summary"] # text explanation of rule result or what could be improved
Note: make sure to check whether a result has an error; see the bulk results example and the Errors section.
To get the Page Stats:
List of Page Stats:
css_response_bytes, html_response_bytes, image_response_bytes, javascript_response_bytes,
number_css_resources, number_hosts, number_js_resources, number_resources,
number_static_resources, other_response_bytes, total_request_bytes
Each stat is now a hash holding its localized name and its value:
results[0]["stats"][STAT_FROM_ABOVE]["name"]
results[0]["stats"][STAT_FROM_ABOVE]["value"]
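As a rough illustration, the stats of one parsed result could be printed as follows. The hash here is made-up sample data shaped like the structure described above; the localized names and the numeric values are assumptions, not output from Google.

```ruby
# Made-up sample data; real names and values come from PagespeedHelper.parse.
result = {
  "url"   => "http://www.example.com",
  "stats" => {
    "number_resources"   => { "name" => "Number of Resources", "value" => 12 },
    "css_response_bytes" => { "name" => "CSS Response Bytes",  "value" => 20480 }
  }
}

# Build one printable line per stat from its "name" and "value" entries.
lines = result["stats"].map { |_key, stat| "#{stat['name']}: #{stat['value']}" }
puts lines
```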
Bulk results example:
results.each do |res|
  if !res.key?("error")
    # do something with valid result
  else
    # do something with error
  end
end
Errors:
As of v0.4.1 this gem uses exponential backoff: if a query returns a rate limit error (rateLimitExceeded or userRateLimitExceeded), it waits before sending the next batch request. The wait starts at one and grows up to the value set at initialization, or 32 by default. That value is the maximum time it will wait; if another rate limit error occurs after that, the error is added to the results hash along with the other errors and results.
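The general technique can be sketched in a few lines. This is a generic illustration of exponential backoff, not the gem's actual code: RateLimitError and with_backoff are hypothetical names, doubling the wait each time is the usual convention rather than something the text above confirms, and treating the wait as seconds is an assumption.

```ruby
# Hypothetical error class standing in for a rate limit response.
class RateLimitError < StandardError; end

# Retries the block, doubling the wait after each rate limit error.
# max_wait plays the role of the second constructor argument (32 by default).
# The sleeper is injectable so the sketch can run without real delays.
def with_backoff(max_wait = 32, sleeper: ->(secs) { sleep(secs) })
  wait = 1
  begin
    yield
  rescue RateLimitError
    raise if wait > max_wait # out of retries: let the caller record the error
    sleeper.call(wait)
    wait *= 2
    retry
  end
end

# Usage: fail twice, then succeed, recording the waits instead of sleeping.
waits = []
attempts = 0
with_backoff(32, sleeper: ->(secs) { waits << secs }) do
  attempts += 1
  raise RateLimitError if attempts < 3
  :ok
end
p waits # => [1, 2]
```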
As seen above, errors are now listed in the result hash, so each result must be checked manually to see whether the request for that site failed. The entry also contains the reason for the failure.
Errors are not changed when PagespeedHelper.parse() is run. In fact, it is advantageous to parse before checking for errors, because then you can simply follow the bulk results example above. Checking the raw return value of ps.query() instead requires examining hashes and objects, which is trickier.
The available error information is:
results[0]["error"] # the error that occurred, MainResource, etc.
results[0]["url"] # the url where the error occurred
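One possible way to separate failures from successes in parsed results is shown below. The results array is made-up sample data shaped like the hashes described in this README; the score and the url values are placeholders.

```ruby
# Made-up parsed results: one success, one failure.
results = [
  { "url" => "http://www.foo.com", "score" => 87, "results" => {}, "stats" => {} },
  { "url" => "http://www.bar.com", "error" => "MainResource" }
]

# partition returns [matching, non-matching] for the block condition.
failed, ok = results.partition { |res| res.key?("error") }

ok.each     { |res| puts "#{res['url']}: #{res['score']}" }
failed.each { |res| warn "#{res['url']} failed: #{res['error']}" }
```

The failed list could then be logged or fed back into another query.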
Tests are under spec/, run with rspec.
Tests no longer require an ENV['PAGESPEED_API_KEY']; instead, all test cases use the provided VCR cassettes.
rspec