Refactored version of https://github.com/shirdrn/document-processor.git
Processes documents to prepare training and test data for the 'libsvm' tool. By default, the CHI (chi-square) statistic is used to select the terms that make up the feature vector, and TF-IDF is then used to compute the weight of each selected term.
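
As a rough illustration of that pipeline, the following is a minimal Python sketch and not this project's own code. The function names (`chi_square`, `tf_idf`, `to_libsvm_line`) are hypothetical, and the formulas are the common textbook forms: the 2x2 contingency-table chi-square, and length-normalized term frequency multiplied by `log(N / df)`. The project may use different smoothing or normalization.

```python
import math


def chi_square(a, b, c, d):
    """Chi-square score of one (term, class) pair from a 2x2 table.

    a: docs in the class that contain the term
    b: docs outside the class that contain the term
    c: docs in the class that lack the term
    d: docs outside the class that lack the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    # A zero denominator means the term or class is degenerate; score it 0.
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0


def tf_idf(term_count, doc_length, num_docs, doc_freq):
    """Textbook TF-IDF (assumed form): (count / length) * log(N / df)."""
    tf = term_count / doc_length
    idf = math.log(num_docs / doc_freq)
    return tf * idf


def to_libsvm_line(label, weights):
    """Format one document as 'label index:value ...'.

    weights maps 1-based feature indices to values; indices are written
    in ascending order and zero values are skipped (sparse format).
    """
    feats = " ".join(
        f"{i}:{w:.6f}" for i, w in sorted(weights.items()) if w != 0
    )
    return f"{label} {feats}"


# A term that appears in every doc of one class and nowhere else
# separates the classes perfectly and gets a high score.
score = chi_square(10, 0, 0, 10)

# One document with two selected terms, written as a libsvm line.
line = to_libsvm_line(1, {2: tf_idf(1, 2, 4, 1), 1: tf_idf(1, 2, 4, 2)})
```

In this sketch, terms would be ranked by `chi_square` per class, the top-scoring ones kept as features and given 1-based indices, and each document then written out with `to_libsvm_line`.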
- Website: www.shiyanjun.cn
- Email: shirdrn@gmail.com