Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP

Loading…

lzw.rb / decode : can't convert nil into String #90

Open
pinpudding opened this Issue · 1 comment

2 participants

@pinpudding

When trying to extract text of document https://dl.dropbox.com/u/39311630/JFE-June%2B79.pdf I get
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/lzw.rb:91:in decode'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/filter/lzw.rb:14:in
filter'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/stream.rb:63:in block in unfiltered_data'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/stream.rb:62:in
each'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/stream.rb:62:in each_with_index'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/stream.rb:62:in
unfiltered_data'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/page.rb:116:in block in raw_content'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/page.rb:113:in
map'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/page.rb:113:in raw_content'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/page.rb:105:in
walk'
vendor/bundle/ruby/1.9.1/bundler/gems/pdf-reader-65b417aea904/lib/pdf/reader/page.rb:75:in `text'

@yob
Owner
yob commented

Thanks for the report, I can reproduce the error on my system.

I'm unfamiliar with the finer details of the LZW compression spec and I'm pressed for time at the moment, so I doubt I can look into this for a while.

Do you have any time to investigate it? We probably need some extra specs for PDF::Reader::LZW

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Something went wrong with that request. Please try again.