Commit

fix sp
charliejuggler@gmail.com committed Feb 11, 2010
1 parent c84140c commit 138f681
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions flax_filters/README
@@ -8,14 +8,14 @@ Charlie Hull/Paul Rudin
Flax Filters can be used for extracting plain-text content and some metadata
from various file formats. This data is returned as field/value pairs.

-extract_to_json.py shows how the filters can be used to process all the files
-in a directory and create result files in JSON format.
+extract_to_json.py shows how the filters can be used to process all the files
+in a directory and create result files in JSON format.

extract.py contains the actual extraction code, which runs either some
internal procedures (for the simpler formats) or external procedures to
extract the content and metadata. Of particular interest is the way a
'headless' Open Office instance can be used to extract data. utils.py
-contains the framework for spawing Open Office translators in 'killable'
+contains the framework for spawning Open Office filters in 'killable'
external processes - note that the Python code doesn't start Open Office
itself. The extraction code can also be run in multiple processes, with
multiple instances of Open Office running on different ports, although
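
The README text in this diff describes extract_to_json.py as processing every file in a directory and writing result files in JSON format. The sketch below illustrates that pattern only; it is not the code from the repository. The names extract_fields and extract_directory_to_json are hypothetical, and the stand-in extractor simply reads the file as text rather than calling any real Flax Filters function.

```python
import json
from pathlib import Path


def extract_fields(path):
    """Hypothetical stand-in for a filter: returns field/value pairs.

    This is NOT the Flax Filters API; it just reads the file as text.
    """
    text = Path(path).read_text(encoding="utf-8", errors="replace")
    return {"filename": Path(path).name, "content": text}


def extract_directory_to_json(src_dir, out_dir, extractor=extract_fields):
    """Run `extractor` on each file in src_dir and write one JSON file per input."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for path in sorted(Path(src_dir).iterdir()):
        if not path.is_file():
            continue  # skip subdirectories
        fields = extractor(path)
        target = out / (path.name + ".json")
        target.write_text(json.dumps(fields, indent=2), encoding="utf-8")
        written.append(target)
    return written
```

Passing the extractor in as a parameter keeps the directory walk separate from the format-specific extraction, so a real filter could be swapped in without touching the loop.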

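The README also mentions running filters in 'killable' external processes. One common way to get that behaviour in Python is to run the work as a child process with a timeout and kill it if it overruns. The sketch below shows that general technique under stated assumptions: run_killable is a hypothetical helper, not the framework in utils.py, and it does not start or talk to Open Office.

```python
import subprocess
import sys


def run_killable(cmd, timeout):
    """Run cmd as a child process; kill it if it exceeds `timeout` seconds.

    Returns (finished, stdout). finished is False when the child was killed.
    Hypothetical helper for illustration, not part of Flax Filters.
    """
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE, text=True)
    try:
        out, _ = proc.communicate(timeout=timeout)
        return True, out
    except subprocess.TimeoutExpired:
        proc.kill()
        proc.communicate()  # reap the child and drain its pipes
        return False, None


if __name__ == "__main__":
    # A child that sleeps longer than the timeout gets killed.
    finished, _ = run_killable(
        [sys.executable, "-c", "import time; time.sleep(60)"], timeout=1
    )
    print("finished:", finished)
```

Isolating each conversion in its own process means a hung or crashed converter can be stopped without taking down the caller.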