Stars
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.
A modular graph-based Retrieval-Augmented Generation (RAG) system
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Official website for "Empowering Time Series Analysis with Large Language Models: A Survey"
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"
An up-to-date list of time-series related papers in AI venues.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
A toolkit for the Spoken Language Understanding Evaluation (SLUE) benchmark. Refer to the paper at https://arxiv.org/abs/2111.10367 for more details. Official website: https://asappresearch.github.io/slue-toolkit/
OpenMMLab Pre-training Toolbox and Benchmark
Dataset for lightly supervised training using LibriVox audiobook recordings (https://librivox.org/).
Command line utility for forced alignment using Kaldi
Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syllables
Vietnamese text normalization library
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
GitHub Pages template based on HTML and Markdown for personal, portfolio-based websites.
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
Keeps track of large models in the audio domain, including speech, singing, music, etc.
Data science interview questions and answers
OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Python sample codes and textbook for robotics algorithms.
Vietnamese song lyric alignment framework