Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added the option to include images with relative urls, added/fixed up some tests. #5

Closed
wants to merge 20 commits into from

Commits on Jan 17, 2011

  1. Replaced keywords functionality with the Phrasie gem

    Ashley Williams committed Jan 17, 2011
    Configuration menu
    Copy the full SHA
    d154c00 View commit details
    Browse the repository at this point in the history

Commits on Mar 2, 2011

  1. Configuration menu
    Copy the full SHA
    75dcba5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3379ee8 View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2011

  1. add video embed,object scraping

    bborn committed Apr 24, 2011
    Configuration menu
    Copy the full SHA
    c47d11c View commit details
    Browse the repository at this point in the history

Commits on Apr 28, 2011

  1. Configuration menu
    Copy the full SHA
    38a3a97 View commit details
    Browse the repository at this point in the history
  2. bugfix

    bborn committed Apr 28, 2011
    Configuration menu
    Copy the full SHA
    162881d View commit details
    Browse the repository at this point in the history
  3. another logging bugfix

    bborn committed Apr 28, 2011
    Configuration menu
    Copy the full SHA
    529ed6a View commit details
    Browse the repository at this point in the history
  4. there

    bborn committed Apr 28, 2011
    Configuration menu
    Copy the full SHA
    4dfcb37 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    f415d35 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    250806a View commit details
    Browse the repository at this point in the history

Commits on Feb 29, 2012

  1. Configuration menu
    Copy the full SHA
    10e184a View commit details
    Browse the repository at this point in the history
  2. Merge branch 'ashleyw_phrasie'

    dparis committed Feb 29, 2012
    Configuration menu
    Copy the full SHA
    2f22638 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1afb407 View commit details
    Browse the repository at this point in the history
  4. Merge branch 'bborn_bugfixes'

    dparis committed Feb 29, 2012
    Configuration menu
    Copy the full SHA
    b8b6f68 View commit details
    Browse the repository at this point in the history
  5. Fixed bug where raw_html content was being processed before the encod…

    …ing was enforced, leading to an invalid UTF-8 encoding exception
    dparis committed Feb 29, 2012
    Configuration menu
    Copy the full SHA
    0bcb1e7 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    64cf7cf View commit details
    Browse the repository at this point in the history

Commits on Mar 17, 2012

  1. Configuration menu
    Copy the full SHA
    e7dd0da View commit details
    Browse the repository at this point in the history

Commits on Apr 7, 2012

  1. * Rewrote image_extractor to use more idiomatic Ruby

    * Rewrote huge parts of internal_document to be more DRY and produce less garbage
    * Integrated the htmlentities gem for generalized HTML entity decoding
    * Fixed HTML entity decoding so that it happens when content is extracted, rather than doing it on the source document, which can break parsing
    * Stubbed out the network calls in the test suite, resulting in dramatically faster tests
    * General garbage, speed, and style tweaks
    * Removed trailing whitespace from many files
    * Make the ImageExtractor logger customizable, or pass false for no logger
    * In the same vein, use default options and pass them along down to the various pieces of the parser
    cheald committed Apr 7, 2012
    Configuration menu
    Copy the full SHA
    bae9a0e View commit details
    Browse the repository at this point in the history

Commits on Oct 12, 2012

  1. add rake to gemspec

    bborn committed Oct 12, 2012
    Configuration menu
    Copy the full SHA
    da2c4c5 View commit details
    Browse the repository at this point in the history
  2. merge peterc and fix tests

    bborn committed Oct 12, 2012
    Configuration menu
    Copy the full SHA
    88527c2 View commit details
    Browse the repository at this point in the history