An extension gem for format_parser that uses pdf-reader to detect page count in PDF files.
In your Gemfile, in addition to the format_parser gem add the format_parser_pdf:
source :rubygems
# ...
gem 'format_parser'
gem 'format_parser_pdf'If you only use Bundler.setup in your code and require gems manually, you need to explicitly require the library so that the built-in
PDF parser in the format_parser gem gets replaced with the extended version. If you use Bundler.require it will happen automatically.
Anywhere in your code use the standard FormatParser.parse(io) calls and related methods, for PDFs the extended version will be used.
After checking out the repo, run bin/setup to install dependencies. Then, run rake spec to run the tests. You can also run bin/console for an interactive prompt that will allow you to experiment.
To install this gem onto your local machine, run bundle exec rake install. To release a new version, update the version number in version.rb, and then run bundle exec rake release, which will create a git tag for the version, push git commits and tags, and push the .gem file to rubygems.org.
Bug reports and pull requests are welcome on GitHub at https://github.com/WeTransfer/format_parser_pdf