Skip to content

Commit

Permalink
fix gemspec
Browse files Browse the repository at this point in the history
  • Loading branch information
Caged committed Aug 25, 2009
1 parent 17f596c commit c4e377a
Show file tree
Hide file tree
Showing 2 changed files with 13 additions and 13 deletions.
20 changes: 10 additions & 10 deletions README.markdown
Expand Up @@ -26,26 +26,26 @@ A basic overview of what this library does:
####Notable Troublesome URLS
[x] Readability doesn't parse correctly : [-] Readability parses correctly

* [-] http://www.politico.com/blogs/glennthrush/0809/Remains_of_the_day_August_24_2009.html
* [-] [http://www.politico.com/blogs/glennthrush/0809/Remains_of_the_day_August_24_2009.html](http://www.politico.com/blogs/glennthrush/0809/Remains_of_the_day_August_24_2009.html)
Short content, invalid lines.
* [-] http://www.tulsaworld.com/news/article.aspx?subjectid=298&articleid=20090824_298_0_TheTul317502&rss_lnk=1
* [-] [http://www.tulsaworld.com/news/article.aspx?subjectid=298&articleid=20090824_298_0_TheTul317502&rss_lnk=1](http://www.tulsaworld.com/news/article.aspx?subjectid=298&articleid=20090824_298_0_TheTul317502&rss_lnk=1)
No content paragraphs, uses spans and double <br /><br /> to signify paragraphs.
* [-] http://article.nationalreview.com/?q=YTMxNGQyNmYyNjljYmE0NDVhZTdlMjlkZTM1Y2NiOTU=
* [-] [http://article.nationalreview.com/?q=YTMxNGQyNmYyNjljYmE0NDVhZTdlMjlkZTM1Y2NiOTU=](http://article.nationalreview.com/?q=YTMxNGQyNmYyNjljYmE0NDVhZTdlMjlkZTM1Y2NiOTU=)
Multiple page article
* [x] http://politicalwire.com/archives/2009/08/24/bonus_quote_of_the_day.html
* [x] [http://politicalwire.com/archives/2009/08/24/bonus_quote_of_the_day.html](http://politicalwire.com/archives/2009/08/24/bonus_quote_of_the_day.html)
Tiny amount of content with no paragraphs and a huge footer with paragraph content.
* [x] http://briefingroom.thehill.com/2009/08/24/doctor-to-be-named-in-jackson-homicide-donated-to-republican-party-in-2004/
* [x] [http://briefingroom.thehill.com/2009/08/24/doctor-to-be-named-in-jackson-homicide-donated-to-republican-party-in-2004/](http://briefingroom.thehill.com/2009/08/24/doctor-to-be-named-in-jackson-homicide-donated-to-republican-party-in-2004/)
Paragraphs embedded in spans
* [-] http://www.nytimes.com/2009/08/25/us/politics/25detain.html?_r=1&partner=rss&emc=rss
* [-] [http://www.nytimes.com/2009/08/25/us/politics/25detain.html?_r=1&partner=rss&emc=rss](http://www.nytimes.com/2009/08/25/us/politics/25detain.html?_r=1&partner=rss&emc=rss)
Not sure what the problem is here. It doesn't look like anything should be wrong here
at first glance.
* [-] http://www.weeklystandard.com/weblogs/TWSFP/2009/08/kristol_gratitude_obamastyle.asp
* [-] [http://www.weeklystandard.com/weblogs/TWSFP/2009/08/kristol_gratitude_obamastyle.asp](http://www.weeklystandard.com/weblogs/TWSFP/2009/08/kristol_gratitude_obamastyle.asp)
Everything looks fine. Not sure why we're getting a blank block of content back
* [x] http://www.cbsnews.com/stories/2009/08/24/entertainment/michaeljackson/main5262822.shtml
* [x] [http://www.cbsnews.com/stories/2009/08/24/entertainment/michaeljackson/main5262822.shtml](http://www.cbsnews.com/stories/2009/08/24/entertainment/michaeljackson/main5262822.shtml)
Comments are included as readable content
* [x] http://www.whitehouse.gov/blog/Diligence-on-H1N1/
* [x] [http://www.whitehouse.gov/blog/Diligence-on-H1N1/](http://www.whitehouse.gov/blog/Diligence-on-H1N1/)
Uses divs for content, but contains paragraphs elsewhere.
* [-] http://www.msnbc.msn.com/id/32518842/ns/meet_the_press/
* [-] [http://www.msnbc.msn.com/id/32518842/ns/meet_the_press/](http://www.msnbc.msn.com/id/32518842/ns/meet_the_press/)
Parsing a bunch of unwanted, non readable content. I assume because it shares the same
parent as the other content.

Expand Down
6 changes: 3 additions & 3 deletions dragnet.gemspec
Expand Up @@ -10,8 +10,8 @@ Gem::Specification.new do |s|
s.required_rubygems_version = Gem::Requirement.new(">= 0") if s.respond_to? :required_rubygems_version=
s.authors = ["Caged"]
s.date = %q{2009-08-23}
s.description = %q{Given a url Dragnet will attempt to analyze and extract the intended readable content and embedded links from a page}
s.email = %q{justin@activereload.net}
s.description = %q{Given a URL Dragnet will attempt to analyze and extract the intended readable content and embedded links from a page}
s.email = %q{justin@labratrevenge.com}
s.extra_rdoc_files = [
"LICENSE",
"README.rdoc"
Expand All @@ -20,7 +20,7 @@ Gem::Specification.new do |s|
".document",
".gitignore",
"LICENSE",
"README.rdoc",
"README.markdown",
"Rakefile",
"VERSION",
"lib/dragnet.rb",
Expand Down

0 comments on commit c4e377a

Please sign in to comment.