Page Size Check
Page Size Check is an utility to check the size of pages from a sitemap and its resources parsering the HAR file of the request using Selenium and haralyzer. The execution of this utility produces some files to allow the user to make an analysis of the number of requests and its size. The execution use ThreadPoolExecutor to launch the browsers in parallel.
- Python 3.5 or higher, pip and virtualenvwrapper.
- System requeriments in requirements-sys.txt
Installation with PyPI
pip install page-size-check
- Firefox webdriver for selenium: geckodriver - Browsermob-proxy
page_size_check --sitemap_url="sitemap.url" [--help]
- Summary tables: tables with summary info for each url and the total amount of results - Resume urls file: a resume of the urls with the number of entries, the page size and the page load times - Resources list file: a list of the resources on every page with its mimetype, size and load time - Mimetype resources: a resume of the resources grouped by mimetype in each url of the sitemap
Installation with Docker
If you want to use Docker and avoid to install external dependencies just type:
docker run --rm apsl/page-size-check --sitemap_url=http://www.example.com/sitemap.xml
As was showned before, you can execute the script doing
python page_size_check.py --help to see all the
parameters that you can set:
|Browsermob server path.|
|Browsermob server port.|
|Firefox driver path.|
|Sitemap to get urls.|
|Number of threads.|
|If true displays the results summary to the stdout.|
|--help||Show this message and exit.|
Contributions are very welcome. Please open a pull request or file an issue. Tests will be ready as soon as posible, please ensure the coverage at least stays the same before you submit a pull request.
Distributed under the terms of the MIT license, "page-size-check" is free and open source software
If you encounter any problems, please file an issue along with a detailed description.