Dataset
The Resources for "Natural Language to Logical Form" ; "自然语言转逻辑形式"研究资料收集。
A large annotated semantic parsing corpus for developing natural language interfaces.
📝 An Awesome Collection of Chinese Legal Dataset and Relevant Resources. 致力于收集全面的中文法律数据源
PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems.
A Chinese National Medical Licensing Examination dataset and large languge model benchmarks
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
A benchmark for few-shot evaluation of foundation models for electronic health records (EHRs)
A Large-Scale In-the-wild Dataset for Plant Disease Segmentation
A reading list on LLM based Synthetic Data Generation 🔥
VizNet is a repository providing real-world datasets that enable, among other things, (re)running empirical studies with higher ecological validity
This repo includes introduction, code and dataset of our paper Deep Sequence Learning with Auxiliary Information for Traffic Prediction (KDD 2018).