Python library of web-related functions

w3lib


Overview

This is a Python library of web-related functions, such as:

  • remove comments or tags from HTML snippets
  • extract the base URL from HTML snippets
  • translate entities in HTML strings
  • convert raw HTTP headers to dicts and vice versa
  • construct HTTP auth headers
  • convert HTML pages to unicode
  • sanitize URLs (like browsers do)
  • extract arguments from URLs

Requirements

Python 2.7 or Python 3.4+

Install

pip install w3lib

Documentation

See http://w3lib.readthedocs.org/

License

The w3lib library is licensed under the BSD license.
