A utility to check the size of your pages and the resources they use

Page Size Check

Page Size Check is a utility that checks the size of the pages listed in a sitemap, and of the resources they load, by parsing the HAR file of each request using Selenium and haralyzer. Each run produces several files that let you analyze the number of requests and their sizes. It uses ThreadPoolExecutor to launch the browsers in parallel.
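The flow described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: `urls_from_sitemap`, `page_size`, `check_pages` and `fake_fetch_har` are hypothetical names, and the HAR data is a hand-written dict standing in for what a browser driven through Selenium and Browsermob would record. Only the standard HAR layout (`log.entries[].response.bodySize`) and the standard-library `ThreadPoolExecutor` are relied on.

```python
from concurrent.futures import ThreadPoolExecutor
import xml.etree.ElementTree as ET

# Namespace used by standard sitemap files.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.example.com/</loc></url>
  <url><loc>http://www.example.com/about</loc></url>
</urlset>"""


def urls_from_sitemap(xml_text):
    """Return the <loc> URLs of a sitemap document, in document order."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)]


def page_size(har):
    """Sum the response body sizes (bytes) recorded in a HAR dict."""
    entries = har["log"]["entries"]
    # HAR uses -1 when a size is unknown, so negative values count as 0.
    return sum(max(entry["response"]["bodySize"], 0) for entry in entries)


def check_pages(urls, fetch_har, threads=4):
    """Fetch one HAR per URL in parallel and return {url: total_bytes}."""
    with ThreadPoolExecutor(max_workers=threads) as pool:
        hars = pool.map(fetch_har, urls)  # results keep the order of `urls`
        return {url: page_size(har) for url, har in zip(urls, hars)}


def fake_fetch_har(url):
    """Hypothetical stand-in for the browser: returns a tiny HAR dict."""
    return {"log": {"entries": [
        {"response": {"bodySize": 1200}},
        {"response": {"bodySize": -1}},  # unknown size, ignored
        {"response": {"bodySize": len(url)}},
    ]}}


if __name__ == "__main__":
    urls = urls_from_sitemap(SAMPLE_SITEMAP)
    for url, size in check_pages(urls, fake_fetch_har, threads=2).items():
        print(url, size)
```

Passing the fetch function as a parameter keeps the parallel part independent of the browser setup, which is why a stub is enough to try it out.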

Dependencies

  • Git.
  • Python 3.5 or higher, pip and virtualenvwrapper.
  • System requirements in requirements-sys.txt.

Installation with PyPI

  1. Installation

    pip install page-size-check
    
  2. Download

    - Firefox WebDriver for Selenium: geckodriver
    - Browsermob-proxy
    
  3. Execution

    page_size_check --sitemap_url="sitemap.url" [--help]
    
  4. Output

    - Summary tables: tables with summary info for each URL and the overall totals
    - URL summary file: a summary of the URLs with the number of entries, the page size and the page load times
    - Resources list file: a list of the resources on every page with their mimetype, size and load time
    - Mimetype resources: a summary of the resources grouped by mimetype for each URL of the sitemap
    
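To give an idea of what the mimetype grouping involves, here is a minimal sketch that aggregates HAR entries by mimetype. It is an assumption about the approach, not the tool's own code: `group_by_mimetype` is a hypothetical helper working on plain dicts in the standard HAR layout (`response.content.mimeType`, `response.bodySize`) rather than on haralyzer objects, and the sample entries are made up.

```python
from collections import defaultdict


def group_by_mimetype(entries):
    """Return {mimetype: {"count": n, "size": total_bytes}} for HAR entries."""
    groups = defaultdict(lambda: {"count": 0, "size": 0})
    for entry in entries:
        response = entry["response"]
        raw_type = response["content"].get("mimeType", "unknown")
        # Drop parameters such as "; charset=utf-8" so variants group together.
        mimetype = raw_type.split(";")[0].strip()
        groups[mimetype]["count"] += 1
        # HAR uses -1 for an unknown size, so negative values count as 0.
        groups[mimetype]["size"] += max(response.get("bodySize", 0), 0)
    return dict(groups)


# Made-up entries for a single page.
SAMPLE_ENTRIES = [
    {"response": {"bodySize": 5000, "content": {"mimeType": "text/html; charset=utf-8"}}},
    {"response": {"bodySize": 2000, "content": {"mimeType": "image/png"}}},
    {"response": {"bodySize": 1000, "content": {"mimeType": "image/png"}}},
]

if __name__ == "__main__":
    for mimetype, stats in sorted(group_by_mimetype(SAMPLE_ENTRIES).items()):
        print(mimetype, stats["count"], stats["size"])
```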

Installation with Docker

If you want to use Docker and avoid installing external dependencies, just type:

docker run --rm apsl/page-size-check --sitemap_url=http://www.example.com/sitemap.xml

Usage

As shown above, you can run the script with python page_size_check.py --help to see all the parameters that you can set:

--browsermob_server_path TEXT
 Browsermob server path.
--browsermob_server_port INTEGER
 Browsermob server port.
--firefox_driver_path TEXT
 Firefox driver path.
--sitemap_url TEXT
 Sitemap to get urls.
--threads INTEGER
 Number of threads.
--display_summary BOOLEAN
 If true displays the results summary to the stdout.
--help Show this message and exit.
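If you drive the tool from another Python script, the options above can be assembled into a command line like this. It is a sketch under assumptions: `build_command` is a hypothetical helper, only flags listed above are used, and passing the boolean as `true`/`false` is a guess about the accepted values. Options left as `None` are omitted so the tool's own defaults apply.

```python
import shlex


def build_command(sitemap_url, threads=None, display_summary=None):
    """Build the argument list for a page_size_check run."""
    args = ["page_size_check", "--sitemap_url={}".format(sitemap_url)]
    if threads is not None:
        args.append("--threads={}".format(int(threads)))
    if display_summary is not None:
        # Assumption: the BOOLEAN option accepts "true" / "false".
        value = "true" if display_summary else "false"
        args.append("--display_summary={}".format(value))
    return args


if __name__ == "__main__":
    command = build_command(
        "http://www.example.com/sitemap.xml", threads=4, display_summary=True
    )
    # Print a shell-safe version of the command instead of running it.
    print(" ".join(shlex.quote(arg) for arg in command))
```

The returned list can be handed to `subprocess.run(command, check=True)` once geckodriver and Browsermob are installed.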

Contributing

Contributions are very welcome. Please open a pull request or file an issue. Tests will be ready as soon as possible. Please make sure the coverage at least stays the same before you submit a pull request.

License

Distributed under the terms of the MIT license, "page-size-check" is free and open source software.

Issues

If you encounter any problems, please file an issue along with a detailed description.