Skip to content

The SentiAspect-pt comprises 180 product reviews annotated according to implicit and explicit fine-grained opinions, which were hierarchically organized for aspect-based sentiment analysis and opinion summarization applications.

Notifications You must be signed in to change notification settings

franciellevargas/SentiAspect-pt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

Aspect-Based Annotated Corpus of Web Consumer Reviews in Portuguese

The SentiAspect-pt corpus comprises 180 annotated product reviews at fine-grained opinion level. We manually annotated implicit and explicit aspects and hierarchically organized them for aspect-based sentiment analysis and opinion summarization applications in Portuguese.

SentiAspect-pt Corpus

Natural languages are very rich and allow us to express subjectivity in different ways. Consumers, for example, may use implicit or explicit aspects to refer to the same product property. e.g., the sentences “she got calls at the São Francisco river” and “working anywhere” have been employed in actual reviews to evaluate the (implicit) “signal” aspect of a smartphone. Therefore, not every opinion is directly expressed and not every aspect appears explicitly in the text. For example, "The camera is expensive", in this case, the evaluated aspect is “price”, but it is implicit, not being explicitly said in the sentence and, therefore, must be inferred from the context.

Annotation Process

We annotated 180 reviews according to implicit and explicit fine-grained opinions (aspect-based sentiment analysis). For the identification of implicit aspects, we also annotated the "clue terms" that indicated the aspects. For example, in "This camera is expensive", the aspect that was evaluated here is "price", but it is implicit. The term “expensive” is, therephore, the "clue term" to identify the aspect "price". The identification of explicit aspects has directly annotated. For example, "The history of the book is bad". In this case, “history” is an explicit aspect. In the last stage, the aspects were gruped according to their similar meanings. For example, the “cost”, “value”, “price” and “investment” aspects are aspects of the same group.

Screenshot from 2019-09-16 12-10-13


CITING

Vargas, F.A. and Pardo, T.A.S. (2017). Aspect clustering methods for sentiment analysis. Proceedings of the 13th International Conference on the Computational Processing of Portuguese (PROPOR 2018), pp. 365–374. Canela, Brazil

.

BIBTEX

@inproceedings{Vargasetal2018, author = {Vargas, F. and Pardo, T. A. S.}, title = {Aspect clustering methods for sentiment analysis}, booktitle = {Proceedings of the 13th International Conference on the Computational Processing of Portuguese (PROPOR 2018)}, year = {2018}, pages = {365–374}, address = {Canela, Brazil}, url = {https://link.springer.com/chapter/10.1007/978-3-319-99722-3_37}}


About

The SentiAspect-pt comprises 180 product reviews annotated according to implicit and explicit fine-grained opinions, which were hierarchically organized for aspect-based sentiment analysis and opinion summarization applications.

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published