Welcome to the FS Crawler for Elasticsearch
This crawler helps to index binary documents such as PDF, Open Office, MS Office.
Main features:
- Local file system (or a mounted drive) crawling and index new files, update existing ones and removes old ones.
- Remote file system over SSH/FTP crawling.
- REST interface to let you "upload" your binary documents to elasticsearch.
Current "most stable" versions are:
Elasticsearch | FS Crawler | Released | Docs |
---|---|---|---|
6.x, 7.x, 8.x | 2.10-SNAPSHOT | 2.10-SNAPSHOT |
The guide has been moved to ReadTheDocs.
Works on my machine - and yours ! Spin up pre-configured, standardized dev environments of this repository, by clicking on the button below.
Read more about the Apache2 License.
Thanks to JetBrains for the IntelliJ IDEA License!
Thanks to SonarCloud for the free analysis!