Sentires-Guide

A quick guide to Sentires: Phrase-level Sentiment Analysis toolkit

Zhang, Yongfeng, et al. Do users rate or review? boost phrase-level sentiment labeling with review-level sentiment classification. SIGIR'14.

To obtain this tool, please check Sentires.

Motivation

The tool is very meaningful to the IR community, as many research works are built on top of its results. However, it may be difficult to obtain (feature, opinion, sentence, sentiment) quadruples from user reviews, since it did not have such a function in the first place. Moreover, nowadays people are usually more familiar with Python rather than Java on which this tool was developed. Therefore, here we present our data processing steps (mostly in Python) in the following paper to help researchers quickly obtain the aforementioned quadruples from user reviews.

Lei Li, Yongfeng Zhang, Li Chen. Generate Neural Template Explanations for Recommendation. CIKM'20. [Paper] [Code]

A small ecosystem for Recommender Systems-based Natural Language Generation is available at NLG4RS!

Creation Steps

Place the folder lei in the tool's folder "English-Jar" as shown above.
Modify 0.format.py, including data path (line 5), the keys (line 12, 13, 16, 20, 21, 23) and how you load the data and iterate over each review (line 9), so that your datasets can be processed in the right format of the tool's input. Remove line 12-15, if your dataset has no summary or tip, which is meant to include as much textual data as possible.
Update the absolute paths (line 65, 78, 94, 95) in 4.lexicon.linux accordingly.
Run the commands in run_lei.sh one by one (do not run this script, otherwise it may throw errors)

Friendly reminder

Memory error might be fixed by specifying the memory allocation of JVM, e.g., java -jar -Xmx200G thuir-sentires.jar -t lexicon -c lei/4.lexicon.linux >> log.txt
The POS step could really take a lot of time, e.g., days.

Results

You will find a file "reviews.pickle" in lei/output/, which is a python list, where each element is a python dict with the following keys:

'user',
'item',
'rating',
'text',
'sentence' # a list of tuples and each tuple looks like (feature, adjective, sentence, score). It could be empty, because the tool may fail to identify feature-opinion pairs from the review.

Datasets to download

TripAdvisor Hong Kong
Amazon Movies & TV
Yelp 2019

Citations

@inproceedings{CIKM20-NETE,
	title={Generate Neural Template Explanations for Recommendation},
	author={Li, Lei and Zhang, Yongfeng and Chen, Li},
	booktitle={CIKM},
	year={2020}
}
@inproceedings{WWW20-NETE,
	title={Towards Controllable Explanation Generation for Recommender Systems via Neural Template},
	author={Li, Lei and Chen, Li and Zhang, Yongfeng},
	booktitle={WWW Demo},
	year={2020}
}
@inproceedings{SIGIR14-Sentires,
	title={Do users rate or review? Boost phrase-level sentiment labeling with review-level sentiment classification},
	author={Zhang, Yongfeng and Zhang, Haochen and Zhang, Min and Liu, Yiqun and Ma, Shaoping},
	booktitle={SIGIR},
	year={2014}
}

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
lei		lei
README.md		README.md
folder-hierarchy.png		folder-hierarchy.png
run_lei.sh		run_lei.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lei

lei

README.md

README.md

folder-hierarchy.png

folder-hierarchy.png

run_lei.sh

run_lei.sh

Repository files navigation

Sentires-Guide

Motivation

Creation Steps

Friendly reminder

Results

Datasets to download

Citations

About

Releases

Packages

Languages

lileipisces/Sentires-Guide

Folders and files

Latest commit

History

Repository files navigation

Sentires-Guide

Motivation

Creation Steps

Friendly reminder

Results

Datasets to download

Citations

About

Resources

Stars

Watchers

Forks

Languages