Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about Classical Japanese #10

Closed
kylepjohnson opened this issue Jul 19, 2016 · 2 comments
Closed

Question about Classical Japanese #10

kylepjohnson opened this issue Jul 19, 2016 · 2 comments

Comments

@kylepjohnson
Copy link

Hello,

Thank you for your wonderful library.

I run an open source project, the Classical Language Toolkit, which helps researchers do NLP in ancient and classical languages.

One of our contributors found your software and is interested in porting some of it for our users.

But because I do not know Japanese, I am interested to learn whether jProcessing is suitable for old Japanese texts (say, up until the year AD 1600).

Thanks again for sharing your software with the world. Feel free to be in touch with me directly at kyle@kyle-p-johnson.com if you prefer!

@kevincobain2000
Copy link
Owner

  • Basic algorithms can of course be used. As WSD is not applied there, (optionally can be applied)
    • Similarity between two sentences
    • Longest common string etc.
  • Morphological Analyzer
    • jProcessing uses Cabocha, if your target is ancient Japanese text, then you should be able to separately train cabocha with your own training data.
      and call it as it is via this python lib.
  • Finding parallel example sentences from Edict
  • Sentiment Classification via WSD on Japanese text.
    • Senses in Japanese text are NOT disambiguated at this moment anyways.
    • Just using the Japanese word's baseform and getting score from SentiwordNet is enough for a baseline classifier.

@kylepjohnson
Copy link
Author

Thank you this is really helpful.

We cannot use the sentiment analysis work (though it does look interesting). Cabocha interesting, however do you know of any treebanks for Classical Japanese?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants