Skip to content

This project makes use of powerful python libraries to extract text from files of several formats (pdf,doc etc).

Notifications You must be signed in to change notification settings

1nvictus/elasticsearchPython

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

elasticsearchPython

This project makes use of powerful python library of tika to extract text from files of several formats (pdf,doc etc).

Install tika library using command in linux:

pip install tika

It then connects with elasticsearh,creates an index and inserts data into the index.

It uses elasticsearch python library to connect and perform operations on elasticsearch.

Install elasticsearch library using below command in linux:

pip install elasticsearch

About

This project makes use of powerful python libraries to extract text from files of several formats (pdf,doc etc).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages