Announcement: Textstat organisation #167
Comments
Great idea! If there will be a language layer, we could try to use the CMU dictionary (imported from …). We could also use the migration to improve the documentation.
Sounds good, maybe more details about the API would be interesting. It would also be interesting to make textstat compatible with spaCy Docs, so that the generated analyses are extra properties of those objects.
Update: I had initially been working on getting the template repository in good order. There were a lot of community and helper files I wanted to add: code of conduct, issue templates etc. So, I felt this was a good place to start, as it would be a pain to update multiple repositories if any of these files in the template ever needed to change. However, it seems the best place for them is the .github repo. This frees up the template repo, which means I can use it to generate the first language repo: English. Once generated, the English formulas from Textstat will need to be reimplemented in textstat-en. I will try and get some issues written up, and something basic implemented, ASAP.
Hello! I am very pleased to announce the creation of the Textstat GitHub organisation: https://github.com/textstat.
I have been reviewing the current state of Textstat, and recently there has been a lot of interest in additional language support. We've accepted a number of contributions to add Spanish, German, Italian, and Arabic!
However, as more languages are getting added, the core of Textstat is becoming harder to maintain. With the addition of more languages, the base calculations are being pulled in more directions, some of which expect conflicting results.
With the introduction of the organisation, I am beginning the work to separate each language implementation into its own module. This will give each language the space to deviate from the core when applicable, but still be able to default back to solid base calculations.
Let me know your thoughts, either below, or via email: alxwrd@googlemail.com.
Details
shivam5992/textstat ➜ textstat/textstat soon.
Planned architecture
The following diagram shows the planned "architecture" of Textstat.
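As a rough illustration of the split described in the announcement (language modules that can deviate from the core but still default back to the base calculations), here is a minimal Python sketch. The names `BaseStats` and `EnglishStats`, their methods, and the syllable heuristics are all hypothetical and made up for this example; none of this is Textstat's actual API or planned design.

```python
import re


class BaseStats:
    """Hypothetical language-agnostic core with default calculations."""

    def word_count(self, text):
        # Default: whitespace-separated tokens.
        return len(text.split())

    def sentence_count(self, text):
        # Default: split on runs of ., ! or ? and ignore empty pieces.
        parts = re.split(r"[.!?]+", text)
        return max(1, sum(1 for part in parts if part.strip()))

    def syllable_count(self, word):
        # Naive default (an assumption for this sketch): count vowel groups.
        return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

    def avg_sentence_length(self, text):
        # Built only from the overridable pieces above.
        return self.word_count(text) / self.sentence_count(text)


class EnglishStats(BaseStats):
    """Hypothetical English module: overrides only what differs."""

    def syllable_count(self, word):
        word = word.lower()
        count = len(re.findall(r"[aeiouy]+", word))
        # Illustrative rule: a trailing silent "e" (but not "-le")
        # does not add a syllable.
        if word.endswith("e") and not word.endswith("le") and count > 1:
            count -= 1
        return max(1, count)


base = BaseStats()
english = EnglishStats()
print(base.syllable_count("make"), english.syllable_count("make"))
print(english.avg_sentence_length("The cat sat. It purred."))
```

In this sketch the English class changes only the syllable rule, while word counting, sentence counting, and the average sentence length are inherited from the base class unchanged.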