
NLP-related resources and tutorials of all kinds (Chinese natural language processing materials)

Jupyter Notebook 8 Updated Oct 9, 2020

FinRL®-Meta: Dynamic datasets and market environments for FinRL.

Python 1,486 646 Updated Apr 1, 2025

Repo for BenTsao [original name: HuaTuo (华驼)]: instruction-tuning large language models with Chinese medical knowledge.

Python 4,743 477 Updated Feb 21, 2025

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Python 4,274 623 Updated Nov 21, 2022

SuperCLUE: A comprehensive benchmark for general-purpose Chinese large models | A Benchmark for Foundation Models in Chinese

3,145 104 Updated May 23, 2024

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,289 960 Updated Feb 25, 2022

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

Python 2,943 199 Updated Nov 26, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,890 2,227 Updated Jul 29, 2024

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python 18,795 1,889 Updated Apr 30, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM

Python 15,754 1,850 Updated Jun 27, 2024

Chinese pre-trained RoBERTa models: RoBERTa for Chinese

Python 2,693 413 Updated Jul 22, 2024

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,585 139 Updated Mar 25, 2025

AI chat for any model.

TypeScript 30,888 8,663 Updated Aug 3, 2024

A ChatGPT demo web page built with Express and Vue3

Vue 31,935 11,212 Updated Aug 16, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,293 6,498 Updated Jan 9, 2025

Large-scale Pre-training Corpus for Chinese (100G Chinese pre-training corpus)

951 81 Updated Oct 17, 2022

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,891 4,333 Updated Apr 12, 2025

A collection of large question answering datasets

377 38 Updated Jul 1, 2024

PubMedQA: A Dataset for Biomedical Research Question Answering

Python 310 42 Updated Apr 18, 2023

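A dataset of 3,000,000+ examples for semantic understanding and matching. It can be used for unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for Chinese.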

Python 294 41 Updated Oct 11, 2022

Representation learning on large graphs using stochastic graph convolutions.

Python 3,506 849 Updated Aug 4, 2024

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,111 547 Updated May 23, 2024

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction-tuning datasets

Python 3,048 234 Updated Apr 14, 2024

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

669 58 Updated Mar 29, 2023

Transformer related optimization, including BERT, GPT

C++ 6,118 901 Updated Mar 27, 2024

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 6,652 802 Updated Apr 9, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,608 1,071 Updated Apr 11, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 105,523 17,133 Updated Apr 13, 2025

GO Simple Tunnel - a simple tunnel written in golang

Go 16,616 2,538 Updated Dec 31, 2024

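An OpenAI management interface that brings together all OpenAI APIs for operation through a UI (all models, images, audio, fine-tuning, files, and so on), with support for Markdown formatting (formulas, charts, tables, and so on). More OpenAI APIs will be integrated step by step. Please show your support by clicking Star in the upper-right corner.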

Vue 2,978 664 Updated Jan 18, 2025