Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
-
Updated
Nov 14, 2024 - Python
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Automatic Speech Recognition (ASR) - German
Calculate your taxes from cryptocurrency gains
A tokenizer and sentence splitter for German and English web and social media texts.
A lemmatizer for German language text
Ten Thousand German News Articles Dataset for Topic Classification
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English
An easy to use python package for deep learning-based german sentiment classification.
Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages
📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF
Finetuning instruct-LLaMA on german datasets.
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
product recommendation text generation using OpenCCG
a decent German Diceware word list to generate memorable passphrases
Vocab-based profanity checking tool for English, Spanish, Portuguese, German, and Turkish.
Add a description, image, and links to the german topic page so that developers can more easily learn about it.
To associate your repository with the german topic, visit your repo's landing page and select "manage topics."