heritrix-docker

Docker container for the Heritrix 3.2.0 web archiving crawler.

run

  • build

      docker build -t atomotic/heritrix github.com/atomotic/heritrix-docker
    
  • create a persistent job directory on the host

      mkdir jobs
    
  • run the container in the background, publishing the web interface on host port 8443 and mounting the job directory

      docker run -d -p 8443:8443 -v $(pwd)/jobs:/opt/heritrix-3.2.0/jobs atomotic/heritrix heritrix-start.sh
    
  • access the web interface (`open` is the macOS launcher; on other systems, visit the URL in a browser)

      open https://localhost:8443
    
    • user: heritrix
    • password: heritrix
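
The run steps above can be sketched as a small shell helper. This is only an illustration, not part of the image: the function names and the `HOST_PORT`, `JOBS_DIR`, and `DRY_RUN` variables are hypothetical. The `check_heritrix` function also assumes that Heritrix serves its interface under `/engine` with a self-signed certificate and HTTP authentication, which matches the usual Heritrix 3 defaults but is not stated in this README.

```shell
#!/bin/sh
# Hypothetical helper wrapping the README steps. HOST_PORT, JOBS_DIR and
# DRY_RUN are illustrative names, not options of the image or of Heritrix.
HOST_PORT="${HOST_PORT:-8443}"
JOBS_DIR="${JOBS_DIR:-$(pwd)/jobs}"

# Execute a command, or only print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Create the persistent job directory, then start the container in the background.
start_heritrix() {
    run mkdir -p "$JOBS_DIR" &&
    run docker run -d -p "${HOST_PORT}:8443" \
        -v "${JOBS_DIR}:/opt/heritrix-3.2.0/jobs" \
        atomotic/heritrix heritrix-start.sh
}

# Print the HTTP status code returned by the web interface.
# Assumptions: self-signed certificate (-k), auth scheme negotiated by curl
# (--anyauth), and the /engine path.
check_heritrix() {
    run curl -k -s -o /dev/null -w '%{http_code}\n' \
        --anyauth -u heritrix:heritrix "https://localhost:${HOST_PORT}/engine"
}
```

After sourcing the file, `DRY_RUN=1 start_heritrix` prints the commands without running them, which is a quick way to review the port and volume mapping before starting the container.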

TODO