Elasticsearch File System Crawler (FS Crawler)
Clone or download
Permalink
Failed to load latest commit information.
.mvn Move to .mvn folder all needed settings to build/test FSCrawler Jan 20, 2017
beans prepare for next development iteration Aug 4, 2018
cli prepare for next development iteration Aug 4, 2018
core Add a Noop Parser Sep 24, 2018
crawler prepare for next development iteration Aug 4, 2018
distribution prepare for next development iteration Aug 4, 2018
docs Fix documentation example for REST Sep 24, 2018
elasticsearch-client Update to Elasticsearch 6.4.1 Sep 19, 2018
framework prepare for next development iteration Aug 4, 2018
integration-tests Update to Elasticsearch 6.4.1 Sep 19, 2018
rest Update to Elasticsearch 6.4.1 Sep 19, 2018
settings Make default root dir Windows compatible Sep 19, 2018
src/main/resources/org/apache/maven/plugin/announcement Fix announcement link and documentation Aug 6, 2018
test-documents Support XML reoccurring structures Aug 23, 2018
test-framework prepare for next development iteration Aug 4, 2018
tika Update to Tika 1.19 Sep 19, 2018
.gitignore Split project into modules Oct 5, 2017
.travis.yml Update to Tika 1.19 Sep 19, 2018
CODE_OF_CONDUCT.md Add Code Of Conduct Jul 6, 2017
CONTRIBUTING.md Add tests on OSS image as well Feb 22, 2018
LICENSE Add Apache License. Fix #4 Aug 9, 2012
NOTICE Add Apache License. Fix #4 Aug 9, 2012
README.md Add LGTM.com code quality badges Sep 3, 2018
deploy-settings.xml Automatically deploy SNAPSHOT Aug 10, 2016
pom.xml Update to Tika 1.19 Sep 19, 2018
release.sh We need to generate documentation files Aug 4, 2018
travis.sh Update to Tika 1.19 Sep 19, 2018

README.md

File System Crawler for Elasticsearch

Welcome to the FS Crawler for Elasticsearch

This crawler helps to index binary documents such as PDF, Open Office, MS Office.

Main features:

  • Local file system (or a mounted drive) crawling and index new files, update existing ones and removes old ones.
  • Remote file system over SSH crawling.
  • REST interface to let you "upload" your binary documents to elasticsearch.

You need to install a version matching your Elasticsearch version:

Elasticsearch FS Crawler Released Docs
2.x, 5.x, 6.x 2.6-SNAPSHOT 2.6-SNAPSHOT
2.x, 5.x, 6.x 2.5 2018-08-04 2.5
2.x, 5.x, 6.x 2.4 2017-08-11 2.4
2.x, 5.x, 6.x 2.3 2017-07-10 2.3
1.x, 2.x, 5.x 2.2 2017-02-03 2.2
1.x, 2.x, 5.x 2.1 2016-07-26 2.1
es-2.0 2.0.0 2015-10-30 2.0.0

Build and Quality Status

Maven Central Travis Documentation Status Code Quality: Java Total Alerts

Lines Duplicated Lines Maintainability Technical Debt Reliability

Vulnerabilities Bugs Quality Gate Code Smells Coverage

The guide has been moved to ReadTheDocs.

License

Read more about the License.