Evaluating Performance of Semi-Supervised Self Training in Identifying Fake Reviews.

The main objective of this project is to build classifiers using semi-supervised learning methods and to use them to identify “fake” restaurant reviews posted on Yelp. Yelp is a website that publishes crowd-sourced reviews of local businesses, including restaurants. Yelp uses its own proprietary algorithm to filter “fake” reviews. For the purpose of this project, we treat Yelp's classification as pseudo ground truth.

Semi-supervised learning is a class of machine learning tasks and techniques that make use of both labeled and unlabeled data for training, typically a small amount of labeled data with a large amount of unlabeled data. Supervised learning methods are effective when there are enough labeled instances to construct classifiers. However, labeled instances are often difficult, expensive, or time-consuming to obtain, because they require empirical research. For restaurant reviews, we have a large supply of unlabeled data. Semi-supervised learning can often achieve better accuracy than supervised learning trained only on the labeled data.
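The self-training idea named in the title can be sketched in a few lines. This is a minimal illustration, not the project's actual implementation: the choice of a multinomial naive Bayes base classifier, the function names (`train_nb`, `predict_proba`, `self_train`), the confidence threshold of 0.8, and the toy reviews are all assumptions made for the example. The loop trains on the labeled reviews, pseudo-labels the unlabeled reviews it is confident about, adds them to the training set, and retrains.

```python
import math
from collections import Counter

def train_nb(docs, labels):
    """Fit a multinomial naive Bayes model (hypothetical base classifier)."""
    vocab = {w for d in docs for w in d.split()}
    counts = {c: Counter() for c in sorted(set(labels))}
    priors = Counter(labels)
    for d, c in zip(docs, labels):
        counts[c].update(d.split())
    return vocab, counts, priors

def predict_proba(model, doc):
    """Return {label: probability} for one review, using add-one smoothing."""
    vocab, counts, priors = model
    total = sum(priors.values())
    log_scores = {}
    for c in counts:
        denom = sum(counts[c].values()) + len(vocab)
        score = math.log(priors[c] / total)
        for w in doc.split():
            if w in vocab:  # words never seen in training are ignored
                score += math.log((counts[c][w] + 1) / denom)
        log_scores[c] = score
    # Normalise the log scores into probabilities in a numerically stable way.
    top = max(log_scores.values())
    exp = {c: math.exp(s - top) for c, s in log_scores.items()}
    z = sum(exp.values())
    return {c: v / z for c, v in exp.items()}

def self_train(labeled_docs, labels, unlabeled_docs, threshold=0.8, max_rounds=10):
    """Self-training loop: the threshold and round limit are assumed values."""
    docs, labs = list(labeled_docs), list(labels)
    pool = list(unlabeled_docs)
    model = train_nb(docs, labs)
    for _ in range(max_rounds):
        if not pool:
            break
        confident, remaining = [], []
        for d in pool:
            probs = predict_proba(model, d)
            label = max(probs, key=probs.get)
            if probs[label] >= threshold:
                confident.append((d, label))  # accept the pseudo-label
            else:
                remaining.append(d)
        if not confident:
            break  # nothing new to learn from, so stop early
        for d, label in confident:
            docs.append(d)
            labs.append(label)
        pool = remaining
        model = train_nb(docs, labs)  # retrain on labeled + pseudo-labeled data
    return model, pool

# Toy data invented for illustration only; it is not from the Yelp dataset.
labeled = ["amazing amazing best place ever", "tasty soup and friendly staff"]
labels = ["fake", "genuine"]
unlabeled = ["best place ever amazing", "friendly staff tasty soup"]

model, leftover = self_train(labeled, labels, unlabeled)
print(predict_proba(model, "amazing place"))
```

The threshold controls the trade-off at the heart of self-training: a high value adds fewer pseudo-labels per round but keeps labeling errors from being reinforced, while a low value grows the training set faster at the risk of amplifying early mistakes.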

Repository contents:

Crawler
Data
ML Model
FinalProjectReport.pdf
README.md

ssrivas/fake-review-detection
