slow parsing #36

martianinteractive opened this Issue Nov 11, 2011 · 4 comments


None yet
2 participants

FYI, A 14 page pdf took 27 seconds to extract the text.

[12] pry(main)> Benchmark.realtime do
[12] pry(main)*{ |p| p.text }
[12] pry(main)* end
=> 27.4952540397644

Intel 2.2 GHz i7, 8GB RAM


yob commented Nov 12, 2011

I'd definitely like things to be faster, so far I've mainly focused on features and API improvements over optimisations.

What version are you testing with? I recently merged some optimisation work from another user into master, so I'd be interested to see how it changes parsing of your test file.

version: 1.0.0.beta1
ruby 1.8.7 (2011-06-30 patchlevel 352)

Awesome library by the way.


yob commented Nov 16, 2011

Thanks. The parsing speed is definitely something I'd like to improve, so I'll leave this ticket open until I can find time to look into it better.


yob commented Jan 3, 2017

Thanks to the passage of time and hard work of the ruby core team, pdf-reader is now significantly faster on modern ruby versions.

There'll always be improvements we can make, but I'm going to close this for now. If specific performance issues come up, I'm happy to investigate further.

@yob yob closed this Jan 3, 2017

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment