This repository provides the official implementation of Data Agent, an end-to-end dynamic data selection framework that learns to select training data adaptively during model optimization.
- 2026/03: Paper released on arXiv.
- 2026/04: Accepted by ICML 2026.
- 2026/05: Code released.
The overall framework of Data Agent is shown below.
Figure 1. Overview of Data Agent. A PPO-based agent learns to dynamically select informative training data during model optimization.
Clone this repository:
git clone https://github.com/Jackbrocp/Data-Agent.git
cd Data-AgentPython >= 3.8
PyTorch
Torchvision
NumPy
Please install the PyTorch version that matches your CUDA environment.
Create the data directory:
mkdir -p dataFor CIFAR experiments, please download the provided files and place them under ./data/.
If you place the data in another directory, please modify the corresponding dataset path argument.
Train ResNet-18 on CIFAR-10 with a 50% selection ratio:
python train.py --dataset cifar10 --model r18 --ratio 0.5Train ResNet-50 on CIFAR-100 with a 30% selection ratio:
python train.py --dataset cifar100 --model r50 --ratio 0.3We also provide training logs using both datasets with a 50% selection ratio under 'logs/' for reference.
If you find this repository useful in your research, please consider citing our paper:
@misc{yang2026dataagentlearningselect,
title={Data Agent: Learning to Select Data via End-to-End Dynamic Optimization},
author={Suorong Yang and Fangjian Su and Hai Gan and Ziqi Ye and Jie Li and Baile Xu and Furao Shen and Soujanya Poria},
year={2026},
eprint={2603.07433},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2603.07433},
}For questions, issues, or collaboration, please feel free to contact:
sryang@smail.nju.edu.cn
This codebase builds upon InfoBatch. We sincerely thank the authors for their excellent open-source implementation.
