Grails plugin: TikaParser
This plugin wraps Apache Tika, a content analysis toolkit which can extract all kinds of content and metadata from files you provide.
Add a line
to the plugins section in BuildConfig.groovy of your application.
You can now use the tikaService to extract a file's content as XML.
What this plugin does
It adds the Tika parsers as a dependency and provides a TikaService for easy content extraction.
This plugin downloads "all the things", meaning it may increase the size of you application's war file by 20-30 MByte. But then, it provides so much functionality like parsing of Excel and MS Word files etc which are hard to come by with other libraries.
The version numbers are derived from the original library's version scheme: this plugin is version 1.12, meaning it is based on Tika 1.12.