Skip to content

Anshuman1249/Plagarism_Checker_Using_Python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation

Plagiarism Checker

Description

The Plagiarism Checker is a Python-based tool designed to detect plagiarism in text documents. It utilizes natural language processing (NLP) techniques to compare the similarity between different documents and identify potential instances of plagiarism.

Key Features

  1. Document Comparison: The tool compares the content of multiple documents to determine the level of similarity between them.

  2. Plagiarism Detection: It identifies potential instances of plagiarism by analyzing the text similarities and patterns.

  3. Customizable Thresholds: Users can set thresholds to define the level of similarity at which plagiarism is flagged.

  4. Multiple File Formats: The tool supports various file formats, including plain text files (.txt), Microsoft Word documents (.docx), and more.

How It Works

1.Document Processing: The tool takes input documents and preprocesses them, removing irrelevant information like stop words and punctuation.

2.Text Comparison: It applies NLP techniques, such as stemming, and vectorization,it also uses Levenshtein's Algorithm to convert the documents into numerical representations.

3.Similarity Measurement: The tool calculates the similarity scores between the documents using algorithms like cosine similarity or Jaccard index.

4.Threshold Evaluation: The calculated similarity scores are compared against the user-defined thresholds to determine if plagiarism is present.

Clone the repository:

https://github.com/Anshuman1249/Plagarism_Checker_Using_Python.git

Run the plagiarism checker:

python3 Plagarism_Checker_Using_Python.py

Contributing

Contributions to the Plagiarism Checker are welcome! If you have any ideas, suggestions, or bug reports, please open an issue or submit a pull request on the GitHub repository.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages