The most accurate natural language detection library for Python, suitable for short text and mixed-language text
-
Updated
Apr 3, 2024 - Python
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
End-to-end spoken language identification out of the box.
A TensorFlow-based spoken language identification
Faster, modernized fork of the language identification tool langid.py
GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа
Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features
End to End Dialect Identification using Convolutional Neural Network
Suite of Python modules to recognise the language of a file
🎏🎌 language recognition script implemented using basic algorithms and spaghetti code
The European Parliament Proceedings Parallel Corpus (1996-2011) (https://www.statmt.org/europarl/) is a well-known dataset in Natural Language Processing tasks, it contains proceedings of the European Parliament in 21 European languages. In this project we will only extract data from 6 languages (German, French, Spanish, Italian, Polish and Eng…
A single-layer neural network written from scratch that predicts the language of the text.
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
too Simple Language Recognition
Add a description, image, and links to the language-recognition topic page so that developers can more easily learn about it.
To associate your repository with the language-recognition topic, visit your repo's landing page and select "manage topics."