Index your local files (PDF/Word) into Elasticsearch 2.X

esdocs

Prerequisites

Run

To run, do the following:

  • Ensure Elasticsearch is listening and the plugin is loaded; check the _nodes endpoint
  • Install the prerequisites via pip: pip install -r requirements.txt
  • Run the script
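
The first step above can be checked from Python as well as from a browser. The following is a minimal sketch, not part of esdocs.py: it assumes Elasticsearch is reachable at http://localhost:9200 (adjust for your setup), and the helper names plugin_names and fetch_nodes_info are made up for illustration. It reads the _nodes endpoint and lists the plugin names each node reports.

```python
import json
from urllib.request import urlopen


def plugin_names(nodes_info):
    """Collect the plugin names reported by all nodes in a parsed _nodes response."""
    names = set()
    for node in nodes_info.get("nodes", {}).values():
        for plugin in node.get("plugins", []):
            if "name" in plugin:
                names.add(plugin["name"])
    return names


def fetch_nodes_info(base_url="http://localhost:9200"):
    """Fetch and parse the _nodes endpoint.

    base_url is an assumed default, not taken from esdocs.py.
    """
    with urlopen(base_url + "/_nodes", timeout=5) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the plugins seen across the cluster; an empty list means none are loaded.
    print(sorted(plugin_names(fetch_nodes_info())))
```

If the request fails, Elasticsearch is not listening on that address. If the plugin you need is missing from the printed list, install it before running the script.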

Script usage

Running the main script with Python and no arguments falls back to the default options:

python esdocs.py

It indexes the files in the \files_to_index folder, relative to the script. You can use custom parameters as follows:

  • -h <host>: the ES host to connect to
  • -t <type>: the ES type name under which documents are stored
  • -i <index_name>: the ES index name in which documents are stored
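
How esdocs.py parses these options is not shown here. As a rough sketch, the same three flags could be handled with argparse as below. The default values and the build_parser name are assumptions for illustration only, not the script's real defaults. Note that argparse reserves -h for help, so the built-in help flag has to be disabled before -h can mean the host.

```python
import argparse


def build_parser():
    """Build a parser for the three flags listed above (hypothetical defaults)."""
    # add_help=False frees up -h, which argparse otherwise reserves for --help.
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("-h", dest="host", default="localhost:9200",
                        help="ES host (assumed default)")
    parser.add_argument("-t", dest="doc_type", default="document",
                        help="ES type name (assumed default)")
    parser.add_argument("-i", dest="index_name", default="files",
                        help="ES index name (assumed default)")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.host, args.doc_type, args.index_name)
```

With the real script, a call that uses all three parameters would look like python esdocs.py -h <host> -t <type> -i <index_name>, with each placeholder replaced by your own value.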