AttributeError with Tokenizer #36

adriguerra · 2020-12-26T18:08:40Z

I'm trying to reproduce the example in the README.

name = 'absa/classifier-rest-0.2'
model = absa.BertABSClassifier.from_pretrained(name)
tokenizer = absa.BertTokenizer.from_pretrained(name)
professor = absa.Professor()     # Explained in detail later on.
text_splitter = absa.sentencizer()  # The English CNN model from SpaCy.
nlp = absa.Pipeline(model, tokenizer, professor, text_splitter)

But I get an AttributeError with the tokenizer.

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-9-c6e986c7be44> in <module>
      1 name = 'absa/classifier-rest-0.2'
      2 model = absa.BertABSClassifier.from_pretrained(name)
----> 3 tokenizer = absa.BertTokenizer.from_pretrained(name)
      4 professor = absa.Professor()     # Explained in detail later on.
      5 text_splitter = absa.sentencizer()  # The English CNN model from SpaCy.

AttributeError: module 'aspect_based_sentiment_analysis' has no attribute 'BertTokenizer'

Could you also clarify how the professor works? The README is missing the hyperlink to the article: "In the article [here], we discuss in detail how the model and the professor work"

Thanks in advance.


adriguerra · 2020-12-30T11:48:04Z

This fixes the first issue:

from transformers import BertTokenizer
# `name` is the checkpoint name from the README snippet ('absa/classifier-rest-0.2').
tokenizer = BertTokenizer.from_pretrained(name)

pepi99 · 2022-03-02T08:08:16Z

Thank you for your solution. Now I get another error:

Traceback (most recent call last):
  File "/Users/petar.ulev/Documents/prepare_sentiment_data/spacystuff.py", line 29, in <module>
    task = nlp.preprocess(text=text, aspects=aspects)
  File "/Users/petar.ulev/Documents/prepare_sentiment_data/venv/lib/python3.8/site-packages/aspect_based_sentiment_analysis/pipelines.py", line 213, in preprocess
    spans = self.text_splitter(text) if self.text_splitter else [text]
  File "/Users/petar.ulev/Documents/prepare_sentiment_data/venv/lib/python3.8/site-packages/aspect_based_sentiment_analysis/text_splitters.py", line 17, in wrapper
    sentences = [sent.string.strip() for sent in doc.sents]
  File "/Users/petar.ulev/Documents/prepare_sentiment_data/venv/lib/python3.8/site-packages/aspect_based_sentiment_analysis/text_splitters.py", line 17, in <listcomp>
    sentences = [sent.string.strip() for sent in doc.sents]
AttributeError: 'spacy.tokens.span.Span' object has no attribute 'string'

Did you experience it as well?

hoangthangta · 2023-02-09T05:14:27Z

Can you try this?

sentences = [sent.text.strip() for sent in doc.sents]
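
As far as I know, `Span.string` was removed in spaCy v3, while `Span.text` exists in both older and newer releases, so the one-line change above should be enough. If the same code has to tolerate either attribute, a small helper along these lines might work. This is only a sketch: `span_text` and `split_sentences` are hypothetical names (not part of the package or of spaCy), and the `SimpleNamespace` objects stand in for real spaCy spans so the snippet runs without spaCy installed.

```python
from types import SimpleNamespace


def span_text(span):
    """Return the stripped text of a span-like object.

    Prefers `.text` and falls back to the older `.string` attribute.
    Hypothetical helper, not part of spaCy or aspect_based_sentiment_analysis.
    """
    text = getattr(span, "text", None)
    if text is None:
        # Only reached for objects that lack `.text` entirely.
        text = span.string
    return text.strip()


def split_sentences(doc):
    """Collect the stripped sentence texts from a doc-like object with `.sents`."""
    return [span_text(sent) for sent in doc.sents]


# Stand-ins for spaCy objects (assumption: real spans expose the same attributes).
new_style = SimpleNamespace(text=" The food was great. ")
old_style = SimpleNamespace(string=" The service was slow. ")
doc = SimpleNamespace(sents=[new_style, old_style])

sentences = split_sentences(doc)
```

With a real pipeline, `doc` would be the object returned by the spaCy model inside `text_splitters.py`; patching that file in site-packages is a local workaround rather than a proper fix.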
