File content comparing to find the percentage of plagiarism
- Add the contents of the main file in the corpus
- Transform the main file into the form of vector
- Find the similarity between the two files by using python inbuilt library function cosine_similarity by comparing the content of the new file dat a with the main data
- print the accuracy(amount of plagiarism)