extracts the text from a Microsoft Word docx file
lib
settings
src
tools/NUnit
.gitignore
Build.bat
InstallGems.bat
License.txt
Provenance.txt
README.md
RakeFile
Ruby_Not_Installed.txt
go.bat

DocxToText ReadMe

Description

A simple library for extracting text from Microsoft Word docx files.

Usage:

var text = new Reader().GetTextFromFile(docxFileName);
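For readers curious about what this kind of extraction involves, here is a minimal sketch that is independent of this library and is not its implementation. It assumes a .NET version that provides `System.IO.Compression.ZipFile` (4.5 or later), and it assumes the main document part lives at the conventional `word/document.xml` path inside the docx package. The class name `DocxTextSketch` and method name `ExtractText` are hypothetical, chosen only for illustration.

```csharp
using System;
using System.IO;
using System.IO.Compression;
using System.Linq;
using System.Xml.Linq;

// Hypothetical helper, not part of DocxToText: a docx file is a zip package
// whose body text is stored as XML, so plain text can be recovered by
// reading the <w:t> elements of each <w:p> paragraph.
public static class DocxTextSketch
{
    // WordprocessingML main namespace used by the w: prefix.
    private static readonly XNamespace W =
        "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    public static string ExtractText(string docxFileName)
    {
        using (ZipArchive archive = ZipFile.OpenRead(docxFileName))
        {
            // Assumption: the main part sits at the conventional path.
            ZipArchiveEntry entry = archive.GetEntry("word/document.xml");
            if (entry == null)
            {
                throw new InvalidDataException(
                    "word/document.xml not found; is this a Word docx file?");
            }

            using (Stream stream = entry.Open())
            {
                XDocument document = XDocument.Load(stream);

                // Join the text runs of each paragraph, one paragraph per line.
                var paragraphs = document
                    .Descendants(W + "p")
                    .Select(p => string.Concat(
                        p.Descendants(W + "t").Select(t => t.Value)));

                return string.Join(Environment.NewLine, paragraphs);
            }
        }
    }
}
```

The sketch ignores headers, footers, footnotes, and tabs, and text inside nested paragraphs (for example in text boxes) would appear more than once. The library's `Reader` may handle these cases differently.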

How To Build:

The build script requires Ruby with rake installed.

  1. Run InstallGems.bat to install the Ruby dependencies (this only needs to be done once per computer).
  2. Open a command prompt in the root folder and type rake to run the rakefile.

If you do not have Ruby:

  1. Create a src\CommonAssemblyInfo.cs file. Running go.bat does this by copying src\CommonAssemblyInfo.cs.default to src\CommonAssemblyInfo.cs.
  2. Open src\DocxToText.sln with Visual Studio and build the solution.

License

MIT License

This project is part of MVBA's Open Source Projects.