Permalink
Browse files

Work on why we have an empty <body/> tag

- Seems to come because the sanitizer ends up with two nodes, not one. The
first is an empty body, the second is the article div.
- Fix up the tabs so we can work with the file. Needs lots of pep8 love.
- Implement an initial hack that at least gets it working atm.
- Start to add test cases, sample html files we can test against, etc.
  • Loading branch information...
1 parent ab783b2 commit edccec5d3b4cecee3fdccff7667dd81bb3ed6258 @mitechie committed Apr 16, 2012
Showing with 1,286 additions and 479 deletions.
  1. +485 −479 readability/readability.py
  2. 0 tests/__init__.py
  3. +762 −0 tests/samples/si-game.sample.html
  4. +39 −0 tests/test_article_only.py
Oops, something went wrong.

0 comments on commit edccec5

Please sign in to comment.