Rohan Asthana, Joschua Conrad, Youssef Dawoud, Maurits Ortmanns, Vasileios Belagiannis
This repository contains the code for the paper titled "Multi-conditioned Graph Diffusion for Neural Architecture Search" [link].
Neural architecture search automates the design of neural network architectures usually by exploring a large and thus complex architecture search space. To advance the architecture search, we present a graph diffusion-based NAS approach that uses discrete conditional graph diffusion processes to generate high-performing neural network architectures. We then propose a multi-conditioned classifier-free guidance approach applied to graph diffusion networks to jointly impose constraints such as high accuracy and low hardware latency. Unlike the related work, our method is completely differentiable and requires only a single model training. In our evaluations, we show promising results on six standard benchmarks, yielding novel and unique architectures at a fast speed, i.e. less than 0.2 seconds per architecture. Furthermore, we demonstrate the generalisability and efficiency of our method through experiments on ImageNet dataset.
nasbench101
: for the NAS-Bench-101 benchmarknasbench201
: for the NAS-Bench-201 benchmarknasbench301
: for the NAS-Bench-301 benchmarknasbenchNLP
: for the NAS-Bench-NLP benchmarknasbenchHW
: for the NAS-Bench-HW benchmark
To get started with the DiNAS project, follow these steps:
- Clone the repository:
git clone https://github.com/rohanasthana/DiNAS.git
- Load the conda environment 'environment.yml' using the command
conda env create -f environment.yml
- Run the training process:
python main_reg_free.py --dataset nasbench101
The pre-trained models for the NAS-Bench-101, NAS-Bench-201, NAS-Bench-301 and NAS-Bench-NLP benchmarks can be found in Google Drive: https://drive.google.com/drive/folders/1a9CpJDWAe5MMe1hU-JvKf9njhWyUT3yw?usp=sharing
@article{
asthana2024multiconditioned,
title={Multi-conditioned Graph Diffusion for Neural Architecture Search},
author={Rohan Asthana and Joschua Conrad and Youssef Dawoud and Maurits Ortmanns and Vasileios Belagiannis},
journal={Transactions on Machine Learning Research},
issn={2835-8856},
year={2024},
url={https://openreview.net/forum?id=5VotySkajV},
note={}
}
This project is licensed under the MIT License.