This project includes the tools required to perform metadata analysis. Specifically, it uses Apache Tika to parse metadata from various files and then builds an inverted index using Solr.
- Install Docker for your operating system, if it is not already installed: https://www.docker.com/products/docker
- Build a Docker image. This can be done by running:
docker build . -f docker/Dockerfile -t imagecat2
- Start a container from the image and start the services inside it:
docker run -it imagecat2 # or use the image ID of your build
# starts Solr
/deploy/solr4/bin/solr start
# invokes the parser indexer; for more information, see https://github.com/USCDataScience/parser-indexer
java -jar /deploy/parser-indexer/parser-indexer*.jar
# Example
java -jar /deploy/parser-indexer/parser-indexer-1.0-SNAPSHOT.jar postdump -solr http://localhost:8983/solr/imagecatdev -in /etc/hosts
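After running the example above, it can be useful to confirm that Solr is reachable and that documents were indexed. The following is a minimal sketch, not part of this project: the helper names `solr_count_url` and `wait_for_solr` are hypothetical, and it assumes that `curl` is available inside the container and that the `imagecatdev` core exposes Solr's standard `/select` handler.

```shell
#!/bin/sh
# Sketch: helpers for checking the Solr core used in the example above.
# SOLR_CORE_URL matches the -solr argument from the example; the helper
# names below are hypothetical and not provided by this project.
SOLR_CORE_URL="${SOLR_CORE_URL:-http://localhost:8983/solr/imagecatdev}"

# Build a query URL that matches all documents but returns no rows,
# so the response carries only the total count (numFound).
solr_count_url() {
    printf '%s/select?q=*:*&rows=0&wt=json\n' "$1"
}

# Poll the core until it answers, up to $2 attempts (default 30),
# waiting one second between attempts. Returns 0 on success, 1 otherwise.
wait_for_solr() {
    attempts="${2:-30}"
    while [ "$attempts" -gt 0 ]; do
        if curl -sf "$(solr_count_url "$1")" >/dev/null 2>&1; then
            return 0
        fi
        attempts=$((attempts - 1))
        sleep 1
    done
    return 1
}
```

With these defined, `wait_for_solr "$SOLR_CORE_URL" && curl -s "$(solr_count_url "$SOLR_CORE_URL")"` waits for the core and then prints a JSON response whose `numFound` field gives the number of indexed documents.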
Please create issues at https://github.com/uscdataScience/imagecat2/issues
- Thamme Gowda
- Chris Mattmann