Skip to content

bufistov/plagiat-detector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Find similar documents using MinHash similarity

RUN

Docker and python3 are required to run this program.

Build Elasticsearch image

docker build -t es-with-minhash ./ 

Start Elasticsearch

./start_es.sh

Install elasticsearch client library

pip install elasticsearch

List documents sorted by similarity

To specify which column in csv file should be checked for similarity use '--column' option which defaults to 'Resposta'

python3 main.py respostes.csv

To get more help

python3 main.py -h

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published