NNLP-IL (National Natural Language Processing plan of Israel)

NNLP-IL is a national initiative for the creation of infrastructure, research and development of advanced capabilities for the advancement of the field of NLP in Hebrew and Arabic.

We know what you're thinking... (Why in English? 🤦‍♀️) For now, we have decided that English will work best for the NNLP-IL open source community. For more information, see the NNLP-IL Homepage.

Why Is There a Need for a National Plan?

NLP in Hebrew (and, to a lesser extent, in Arabic) has been left behind. The major breakthrough that would allow significant use has not yet been made, and the cost of fitting and customizing each use case on its own is very high.

The Core Reasons

  • Hebrew and Arabic are difficult languages (rich in morphology), while most technological development targets morphologically poor languages.
  • Modern language models require vast datasets. The accessible data in Hebrew is very limited.
  • The industry's economic interest in investing in Hebrew NLP (and, to some extent, in Arabic NLP) is limited compared to more widely used languages, since it is a relatively small market.

Guiding Principles

  • A generic framework that allows fitting and customizing solutions to various applications (without focusing on specific use cases).
  • Open source (as much as possible) - everyone can take part, contribute, and use.
  • Break through the data barrier - creating tagged and untagged datasets and making them accessible to the general public.
  • Usability - distributing capabilities through manuals, convenient packaging of code and more.

Who's taking part?

  • You!
  • The Israeli Ministry of Defense, Directorate of Defense Research and Development (DDR&D).
  • Israel Innovation Authority.
  • The Ministry of Innovation, Science & Technology.

Active Projects

⭐ Contributing

The main purpose of this repository is to advance development in Hebrew and Arabic NLP, making it more relevant and easier to use. Read below to learn how you can take part in improving NNLP-IL.

Code of Conduct

Read our Code of Conduct, which we expect project participants to adhere to. Please read the full text so that you understand which actions will and will not be tolerated.

Contributing Guide

Read our Contributing Guide to learn about our development process, how to propose bugfixes and improvements, and how to build and test your changes to NNLP-IL.

License

NNLP-IL is Apache 2.0 licensed.
