Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Where does the training data come from? #6

Closed
polm opened this issue Jun 30, 2020 · 5 comments · Fixed by #16
Closed

Where does the training data come from? #6

polm opened this issue Jun 30, 2020 · 5 comments · Fixed by #16

Comments

@polm
Copy link

polm commented Jun 30, 2020

What does the model here use as training data? Didn't see an explanation anywhere...

@othmane-ab
Copy link

It appears he didn't add the training data in the repository

@HeroadZ
Copy link

HeroadZ commented Aug 21, 2020

hi, there. @polm See you again.
I'm a beginner of NLP and interested in japanese. But I can't find dataset with labels in japanese for sentiment analysis.
Could you give me some advices? 😄

@polm
Copy link
Author

polm commented Aug 21, 2020

Sorry, I am not aware of any good training set, which is why I was curious what was used for this model. pymlask is rule based but seems solid.

Since you are affiliated with an academic institution you may have access to this Rakuten dataset.

https://rit.rakuten.co.jp/data_release_ja/

@HeroadZ
Copy link

HeroadZ commented Aug 21, 2020

Sorry, I am not aware of any good training set, which is why I was curious what was used for this model. pymlask is rule based but seems solid.

I just saw an article which compare the current japanese sentiment analysis tools. It seems like that there exist problems for the pymlask. If test sentence doesn't contain any word in dictionary, the polarity is None. If you are interested, check this.

@ikegami-yukino how do you think about the article above? Thank you for your work.

Since you are affiliated with an academic institution you may have access to this Rakuten dataset. https://rit.rakuten.co.jp/data_release_ja/

Thank you for your advices. But maybe I cannot access it because the research I'm doing now is from a company where I work as an intern. 😢

@polm
Copy link
Author

polm commented Oct 18, 2022

If you want to close this that's fine, but I still don't see an explanation of the training data anywhere. Did I miss it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants