Extract the contents of MS Docx to TXT

Docx2TXT

Extracts the simplest plain text (that I could imagine) from an MS Docx file. It makes a best effort to preserve paragraphs.
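
To give a rough idea of what "preserving paragraphs" can mean for a .docx, here is a minimal sketch. It is not the gem's actual implementation. It assumes the XML of word/document.xml has already been read out of the .docx zip archive, and it relies on the fact that WordprocessingML wraps each paragraph in a w:p element and each piece of text in a w:t element. The helper names document_xml_to_txt and unescape_xml are hypothetical, and the regular expressions ignore edge cases such as self-closing paragraphs, tabs, and nested text boxes.

```ruby
# Hypothetical helpers for illustration; not part of the docx2txt gem.

# Replace the five predefined XML entities with their characters.
# '&amp;' is handled last so that '&amp;lt;' becomes '&lt;', not '<'.
def unescape_xml(text)
  text.gsub('&lt;', '<').gsub('&gt;', '>').gsub('&quot;', '"')
      .gsub('&apos;', "'").gsub('&amp;', '&')
end

# Takes the raw XML of word/document.xml and returns plain text with
# one line per paragraph.
def document_xml_to_txt(document_xml)
  # Each <w:p ...>...</w:p> is one paragraph (<w:pPr> does not match).
  paragraphs = document_xml.scan(%r{<w:p(?:\s[^>]*)?>(.*?)</w:p>}m).flatten
  lines = paragraphs.map do |paragraph|
    # Concatenate the text of every <w:t> run inside the paragraph.
    runs = paragraph.scan(%r{<w:t(?:\s[^>]*)?>(.*?)</w:t>}m).flatten
    unescape_xml(runs.join)
  end
  lines.join("\n")
end

sample = '<w:body>' \
         '<w:p><w:pPr/><w:r><w:t>Hello, </w:t></w:r>' \
         '<w:r><w:t xml:space="preserve">world &amp; all</w:t></w:r></w:p>' \
         '<w:p><w:r><w:t>Second paragraph</w:t></w:r></w:p>' \
         '</w:body>'
puts document_xml_to_txt(sample)
```

A real extractor would more likely use a proper XML parser than regular expressions; the point here is only that one w:p maps to one line of output.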

How

Instantiate the docx with the path to the file:

doc = Docx2TXT::Docx.new file_path

Then ask for the text:

doc.to_txt

Simple executable

docx2txt <docxfilepath>

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request