This class can convert MS Word Docx files to text. It can extract the files from compressed Microsoft Word file in docx format. The class can parse the document XML file and extract the text that it contains.
DocX is a format used by the latest versions of Microsoft Word to save a document to a file.
Rendering a Microsoft Word file as it is presented in the program is not an easy task.
This class can perform a simpler but useful task of extracting the text used in the document file saved in DocX format.
This class convert doc, docx to txt. It reads the content of doc, docx file and return the content in soimple txt format to search.
Innovation Award: February 2013 Winner