A training project from Hyperskill (JetBrains) -- Python Core. It demonstrates working with NumPy, XML basics, scikit-learn (building a TF-IDF matrix with a vectorizer), and NLTK (stemming and lemmatizing text, removing stopwords and punctuation marks).
The script is in ./Key Terms Extraction/task.
The other directories contain the individual stages of the project, each with its task description.