New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not all of the non-English text is German #11

Open
Conal-Tuohy opened this Issue Dec 2, 2016 · 7 comments

Comments

Projects
None yet
2 participants

@Conal-Tuohy Conal-Tuohy changed the title from Not all of the non-German text is English to Not all of the non-English text is German Dec 10, 2016

@Conal-Tuohy

This comment has been minimized.

Owner

Conal-Tuohy commented Dec 10, 2016

What other languages are present in the corpus? Just French, German, and English?

@LucasHorseshoeBend

This comment has been minimized.

Collaborator

LucasHorseshoeBend commented Dec 13, 2016

There is some Russian and some Spanish.
I have not looked at the corpus to try to find examples. I think theremight also be some Italian, and possible Portuguese, in certificates of election to learned societies in those countries.

Ler me now if you need me to identify somme examples.

@Conal-Tuohy

This comment has been minimized.

Owner

Conal-Tuohy commented Dec 13, 2016

Thanks! An example of each would be handy

@Conal-Tuohy Conal-Tuohy self-assigned this Dec 15, 2016

@Conal-Tuohy

This comment has been minimized.

Owner

Conal-Tuohy commented Dec 15, 2016

Great! These examples will be very helpful for automating the language classification.

By the way, if you come across any more, please add an additional comment. If this issue is closed in the meantime, you can re-open it or create a new one.

@LucasHorseshoeBend

This comment has been minimized.

Collaborator

LucasHorseshoeBend commented Jul 16, 2017

Another language:
I came across a letter in Hungarian from Mueller today. (Probably originally in English, but we only have a translation in a Hungarian botany journal). It has not yet been transcribed, but when it is it will need to be detectable.

@LucasHorseshoeBend

This comment has been minimized.

Collaborator

LucasHorseshoeBend commented Jul 16, 2017

Sorry, hit the wrong button!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment