This is a ChupaText decomposer plugin for to extract text and meta-data from HTML.
You can use
Install chupa-text-decomposer-html gem:
% gem install chupa-text-decomposer-html
Now, you can extract text and meta-data from HTML:
% chupa-text index.html
- Kouhei Sutou
LGPL 2.1 or later.
(Kouhei Sutou has a right to change the license including contributed patches.)