Extract dates from webpages


dateminer is a Python port of John Muellerleile's dateminer Java library:

https://github.com/jrecursive/date_miner

It gives you a best guess at the creation date of an article (webpage) based on the URL and content of that page.
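
To make the URL half of that idea concrete, here is a minimal sketch of pulling a date out of a URL path. It is not dateminer's actual implementation: the helper name `date_from_url` and the `/YYYY/MM/DD/` pattern are assumptions chosen for illustration, and the real library may use different or additional heuristics.

```python
import re
from datetime import date

# Hypothetical pattern (an assumption, not taken from dateminer):
# matches a /YYYY/MM/DD segment run in a URL path.
_URL_DATE = re.compile(r"/(\d{4})/(\d{1,2})/(\d{1,2})(?:/|$)")


def date_from_url(url):
    """Return a datetime.date found in the URL path, or None if there is none."""
    match = _URL_DATE.search(url)
    if match is None:
        return None
    year, month, day = (int(part) for part in match.groups())
    try:
        return date(year, month, day)
    except ValueError:
        # The digits matched the pattern but are not a real calendar date.
        return None
```

A URL such as `http://example.com/2013/05/17/some-article` yields `date(2013, 5, 17)` with this sketch, while a URL with no date-like path segments yields `None`, at which point a tool would have to fall back on the page content.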

Usage

>>> from dateminer import guess_date
>>> # url is the page's address; html_content is the content of that page
>>> date = guess_date(url, html_content)