Interesting links to Slovak NLP tools, utilities, corpora, and other resources. Feel free to add more links via a pull request.
- https://nlp.kemt.fei.tuke.sk/speech/tedx
- https://catalog.ldc.upenn.edu/LDC2018S08
- https://github.com/facebookresearch/voxpopuli
- https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-AADD-3
- https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-AADC-5
- https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-AADB-7
- https://github.com/stopwords-iso/stopwords-sk
- http://text.fiit.stuba.sk/zoznam_stop_slov.php#focus
- https://github.com/to-mas-re/stopwords-sk
- https://github.com/Ardevop-sk/stopwords-sk
- https://nlp.web.tuke.sk/
- http://nlp.bednarik.top/
- https://github.com/drndos/nlp-tools
- http://text.fiit.stuba.sk
- https://korpus.sk/tools.html
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- https://github.com/adobe/NLP-Cube
- https://sentigrade.fiit.stuba.sk/analyzer
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- https://github.com/SamuelPecar/Slovak-sentiment-analysis
- https://nlp.web.tuke.sk/nlpform
- https://github.com/hladek/dagger
- http://morpholyzer.fiit.stuba.sk:8080/PosTagger/
- https://github.com/dalibor-meszaros/SkCrfPosTagger
- http://nlp.bednarik.top/tagger/
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- https://nlp.fi.muni.cz/slovak-morphology-analyser/
- http://ufal.mff.cuni.cz/morphodita (SK version by korpus.sk)
- https://morphodita.juls.savba.sk (improved model compared to the one above)
- https://www.juls.savba.sk/bezdiak/ (model for Slovak without diacritics)
- https://github.com/ufal/udpipe (the Slovak models are hosted separately on LINDAT)
- http://try.ui.sav.sk/lemmatag/
- https://universaldependencies.org/tagset-conversion/sk-snk-uposf.html
- https://nlp.web.tuke.sk/pages/tokenizer
- http://nlp.bednarik.top/tokenizer/
- http://nlp.bednarik.top/ssplit/
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- https://github.com/hladek/slovak-lexer
- https://slovak-text-checker.herokuapp.com/
- http://text.fiit.stuba.sk:8081/
- https://www.juls.savba.sk/diakritik.html
- https://diakritika.brm.sk/
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- https://www.umberto.sk/
- http://www.forma.sk/spell.aspx
- http://www.forma.sk/prod-jm-ms.aspx
- https://korektor.sk/korektor
- http://sk-spell.sk.cx/
- http://ufal.mff.cuni.cz/korektor (TBD: release of Slovak models)
- http://try.ui.sav.sk/diacritics/
- https://github.com/essential-data/lucene-fst-lemmatizer
- https://github.com/essential-data/stemmer-sk
- http://nlp.bednarik.top/lemmatizer/
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- http://text.fiit.stuba.sk/lemmatizer/
- http://www.forma.sk/prod-jm-is.aspx
- http://www.forma.sk/prod-jm-us.aspx
- https://github.com/mrshu/stemm-sk
- https://github.com/mrshu/lemm-sk
- https://github.com/michmech/lemmatization-lists/
- http://nlp.kiv.zcu.cz/projects/hps
- http://try.ui.sav.sk/lemmatag/
- http://sk-spell.sk.cx/
- https://korpus.sk/dicts.html
- http://slex.sk/
- https://github.com/Kroid/multext-data
- TBD http://nlp.bednarik.top/
- https://github.com/essential-data/word2vec-sk
- https://nlp.h-its.org/bpemb/sk/
- http://vectors.nlpl.eu/repository/
- https://www.juls.savba.sk/semä.html
- http://nlp.bednarik.top/ner/
- http://arl6.library.sk/nlp4sk/nlpservices/demo
- https://github.com/Ardevop-sk/sk-bert-ner
- https://www.juls.savba.sk/nerd/