Skip to content

kadabengarann/useful-review-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


Useful App Review Classification


This machine learning project utilizes the IndoBERT-BiLSTM model to automate the classification of app reviews into "Useful" or "Not Useful" categories. A "Useful Review" is defined as one that provides constructive feedback and valuable insights to aid developers in improving their applications. Experiment results demonstrate that the IndoBERT-BiLSTM model achieves the best performance with an accuracy rate of 95.49%, representing a 1.16% improvement over the fine-tuned IndoBERT model.

View Demo

About The Project

Useful App Review Classification aims to address the challenge of evaluating the quality and performance of applications based on user reviews. In today's world, where the volume of app usage and the number of reviews are increasing rapidly, not all reviews are constructive. Some contain only praise or criticism without offering valuable feedback.

What is useful review

Manually sorting through thousands of reviews is a time-consuming and labor-intensive task. To overcome this challenge, we employ an efficient and effective method for identifying useful reviews. This research project utilizes the IndoBERT word embedding method and a BiLSTM classifier for review classification..

(back to top)

Model Architecture

Model Architecture

The approach involves using the IndoBERT-BiLSTM model. The input sentences, tokenized for processing, are fed into the IndoBERT model to obtain output, which is the last hidden state of IndoBERT for each token in sequence. These sequences are then passed through a BiLSTM layer to capture sequential information. The final hidden state from the BiLSTM layer undergoes dropout layers to prevent overfitting and enhance generalization. Finally, the output layer performs a linear transformation to generate logits, the model's output for the classification task.

(back to top)

Technologies Used

This section should list any major frameworks/libraries used to bootstrap your project. Leave any add-ons/plugins for the acknowledgements section. Here are a few examples.

  • Pytorch
  • Huggingface Transformers
  • Streamlit

(back to top)

Technologies Used

The website application, deployed using Streamlit and Hugging Face Space, offers several powerful features:

  • Single Input Classification: Easily classify a single text review to determine whether it falls into the category of "Useful Review" or not. This feature simplifies the evaluation of individual reviews.

Single Input Classification

  • Multiple Input Classification: Streamline the review classification process by uploading multiple text reviews in a CSV format. The application will efficiently classify these reviews and present the results in a user-friendly table format for easy analysis and interpretation.

Multiple Input Classification

(back to top)