Skip to content


Subversion checkout URL

You can clone with
Download ZIP
a small library for extracting rich content from urls
Python HTML
Latest commit 764e7c9 @coleifer Merge pull request #57 from OlegGirko/master
Fix tests to run with newer Django.


A small library for extracting rich content from urls. Live demo.

what does it do?

micawber supplies a few methods for retrieving rich metadata about a variety of links, such as links to youtube videos. micawber also provides functions for parsing blocks of text and html and replacing links to videos with rich embedded content.


here is a quick example:

import micawber

# load up rules for some default providers, such as youtube and flickr
providers = micawber.bootstrap_basic()


# returns the following dictionary:
    'author_name': 'pascalbrax',
    'author_url': u''
    'height': 344,
    'html': u'<iframe width="459" height="344" src="" frameborder="0" allowfullscreen></iframe>',
    'provider_name': 'YouTube',
    'provider_url': '',
    'title': 'Future Crew - Second Reality demo - HD',
    'type': u'video',
    'thumbnail_height': 360,
    'thumbnail_url': u'',
    'thumbnail_width': 480,
    'url': '',
    'width': 459,
    'version': '1.0',

micawber.parse_text('this is a test:\n', providers)

# returns the following string:
this is a test:
<iframe width="459" height="344" src="" frameborder="0" allowfullscreen></iframe>

micawber.parse_html('<p></p>', providers)

# returns the following html:
<p><iframe width="459" height="344" src=";feature=oembed" frameborder="0" allowfullscreen="allowfullscreen"></iframe></p>
Something went wrong with that request. Please try again.