Skip to content
View manyeyes's full-sized avatar

Block or report manyeyes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python Vietnamese Core NLP Toolkit

Jupyter Notebook 258 50 Updated Sep 26, 2024

Thai natural language processing in Python

Python 1,015 275 Updated Mar 10, 2025

Kiwi(지능형 한국어 형태소 분석기)

C++ 565 50 Updated Mar 8, 2025

The official AWS SDK for .NET. For more information on the AWS SDK for .NET, see our web site:

C# 2,114 863 Updated Mar 12, 2025

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

C# 902 108 Updated Mar 13, 2025

Prism is a framework for building loosely coupled, maintainable, and testable XAML applications in WPF, Xamarin Forms, and Uno / Win UI Applications..

C# 6,462 1,652 Updated Feb 21, 2025

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,776 147 Updated Mar 12, 2025

SOTA Open Source TTS

Python 19,952 1,549 Updated Mar 3, 2025

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,627 137 Updated Feb 26, 2025

Official inference framework for 1-bit LLMs

C++ 12,796 902 Updated Feb 18, 2025

A recreation of the classic Visual Basic 6 IDE and language in C# with Avalonia

C# 1,383 83 Updated Nov 17, 2024

Interface for OuteTTS models.

Python 947 83 Updated Feb 14, 2025

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 891 107 Updated Aug 7, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,283 1,405 Updated Mar 12, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,025 843 Updated Mar 6, 2025

The .NET MAUI Community Toolkit is a community-created library that contains .NET MAUI Extensions, Advanced UI/UX Controls, and Behaviors to help make your life as a .NET MAUI developer easier

C# 2,413 423 Updated Mar 13, 2025

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 1,815 195 Updated Mar 13, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,767 627 Updated Mar 13, 2025

A C# library for extract audio features in speech recognition (ASR) task, support kaldi fbank

C# 3 Updated Aug 29, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,870 417 Updated Mar 5, 2025

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,071 766 Updated Oct 16, 2024

A C# library for decoding the Wenet ASR model

C# 2 Updated Aug 6, 2024

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 54 4 Updated Feb 3, 2025

Source code for Consistent ensemble distillation for audio tagging

Python 26 4 Updated Jul 16, 2024

SharpToken is a C# library for tokenizing natural language text. It's based on the tiktoken Python library and designed to be fast and accurate.

C# 228 16 Updated May 17, 2024

This project implements token calculation for OpenAI's gpt-4 and gpt-3.5-turbo model, specifically using `cl100k_base` encoding.

C# 67 4 Updated Mar 3, 2025

Multilingual Voice Understanding Model

Python 4,900 443 Updated Jan 8, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,863 1,180 Updated Mar 13, 2025

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 5,170 330 Updated Oct 18, 2023

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,779 315 Updated Jan 8, 2025
Next
Showing results