Dup Hunter emerged from a work developed in the discipline of Software Design and Analysis taught by Professor Gusvato Pinto at UFPA. The students participating in the development are: Helder Matos, Lívia Carrera, Pedro Rocha and Thiago Calado.
The initial objective was fulfilled with the creation. This is an analysis of plagiarism among a set of pdf files. After such analysis, a report with comparative indexes is generated.
The technologies used include:
- Django: MVT architecture
- Dash: interactive applications
- Plotly: interactive graphics
- NLTK: natural language processing
- PdfMiner: reading pdfs
- Weasyprint: generation of pdfs from html