
Docsplit 0.5.1

Commit 6fbb1b3310ff1c6fe6f1163631495c71db051b8e (1 parent: 4a5568e), committed by jashkenas on Apr 26, 2011
Showing with 10 additions and 4 deletions.
  1. +2 −2 docsplit.gemspec
  2. +7 −1 index.html
  3. +1 −1 lib/docsplit.rb
docsplit.gemspec
@@ -1,7 +1,7 @@
Gem::Specification.new do |s|
s.name = 'docsplit'
- s.version = '0.5.0' # Keep version in sync with docsplit.rb
- s.date = '2010-10-18'
+ s.version = '0.5.1' # Keep version in sync with docsplit.rb
+ s.date = '2011-04-26'
s.homepage = "http://documentcloud.github.com/docsplit/"
s.summary = "Break Apart Documents into Images, Text, Pages and PDFs"
index.html
@@ -98,7 +98,7 @@
(title, author, number of pages...)
</p>
- <p>Docsplit is currently at <a href="http://rubygems.org/gems/docsplit">version 0.5.0</a>.</p>
+ <p>Docsplit is currently at <a href="http://rubygems.org/gems/docsplit">version 0.5.1</a>.</p>
<p>
<i>Docsplit is an open-source component of <a href="http://documentcloud.org/">DocumentCloud</a>.</i>
@@ -282,6 +282,12 @@ <h2 id="internals">Internals</h2>
<h2 id="changes">Change Log</h2>
<p>
+ <b class="header">0.5.1</b><small> &ndash; April 26, 2011</small><br />
+ Minor tweaks to the <tt>TextCleaner</tt> to be more lenient about acronyms
+ with hyphens, and words with four vowels in a row.
+ </p>
+
+ <p>
<b class="header">0.5.0</b><br />
Added a <tt>Docsplit::TextCleaner</tt> class which is used to post-process
OCR'd text, and remove garbage characters that are created when Tesseract
lib/docsplit.rb
@@ -1,7 +1,7 @@
# The Docsplit module delegates to the Java PDF extractors.
module Docsplit
- VERSION = '0.5.0' # Keep in sync with gemspec.
+ VERSION = '0.5.1' # Keep in sync with gemspec.
ROOT = File.expand_path(File.dirname(__FILE__) + '/..')
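
Both the gemspec and lib/docsplit.rb carry a "keep in sync" comment, so a release bump like this one has to touch the version string in two places. A minimal sketch of how that invariant could be checked automatically is below. It is not part of Docsplit: the helper names (extract_version, versions_in_sync?) and both regular expressions are hypothetical and assume the version is written as a quoted string literal, as it is in this diff.

```ruby
# Hypothetical helper (not part of Docsplit): returns the first quoted
# version string matched by the given pattern, or nil if none is found.
def extract_version(source, pattern)
  match = source.match(pattern)
  match && match[1]
end

# Assumed patterns, based on how the two files in this commit spell the version.
GEMSPEC_VERSION_PATTERN = /s\.version\s*=\s*['"]([^'"]+)['"]/
MODULE_VERSION_PATTERN  = /VERSION\s*=\s*['"]([^'"]+)['"]/

# Returns true only when both sources declare a version and the two match.
def versions_in_sync?(gemspec_source, module_source)
  gemspec_version = extract_version(gemspec_source, GEMSPEC_VERSION_PATTERN)
  module_version  = extract_version(module_source, MODULE_VERSION_PATTERN)
  !gemspec_version.nil? && gemspec_version == module_version
end

# The two lines this commit changes, used here as inline sample input.
gemspec_source = "s.version = '0.5.1' # Keep version in sync with docsplit.rb"
module_source  = "VERSION = '0.5.1' # Keep in sync with gemspec."

puts versions_in_sync?(gemspec_source, module_source)
```

In a real checkout the two strings would presumably come from File.read('docsplit.gemspec') and File.read('lib/docsplit.rb'), and the check could run as part of a release task so that a one-sided bump fails before the gem is built.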
