# Coding Project: Differentiable NAS

* ### Based on the paper: H. Liu, K. Simonyan and Y. Yang, “DARTS: Differentiable Architecture Search,” International Conference on Learning Representations (ICLR), 2019

* ### Assignment

  1. Find a codebase for this paper (the original DARTS implementation is available, and you can find a few variants) and download the CIFAR10 and CIFAR100 datasets

  **The datasets and codebase have already been uploaded to OBS on the Huawei Cloud Platform, so you can use them directly in ModelArts.**

  1. Run the basic code on the server, with the standard configuration of the selected paper on CIFAR10 (take the computational costs into consideration)
  
  2. Finish the required task and one of the optional tasks (see the following slides); of course, you can do more than one optional task if you wish (bonus points)
  3. If you have more ideas, please specify a new task by yourself (bonus points)
  4. Remember: integrate your results into your reading report
  5. Date assigned: Nov. 19, 2019;    Date Due: Dec 14, 2020


# Required Task

* The basic training and testing pipeline
    * Run a complete search process with DARTS or any of its variants on CIFAR10 (PC-DARTS is preferred due to its low cost)
    * Note: because computational resources are limited, you may not have enough to perform the re-training process
    * Pay attention to the hyper-parameters (config, epochs, etc.)
* Questions that should be answered in the report
    * Paste complete training and testing curves and the final architecture
    * Report the training and validation accuracy throughout the process
    * How does performance change with the number of iterations?
    * Any other significant features that can be recognized in the curves?
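The curves asked for above can be built from the log that `train_search.py` writes. The following is a minimal sketch, assuming log lines of the form `epoch 0 lr 1.000000e-01` and `train_acc 51.804000` as they appear in the search output later in this notebook. The `valid_acc` key is an assumption made by analogy with `train_acc`, and `parse_search_log` is a hypothetical helper, not part of the codebase.

```python
import re

# Patterns for the log lines. `valid_acc` is an assumption made by analogy
# with the `train_acc` lines visible in the search log.
EPOCH_RE = re.compile(r"epoch (\d+) lr ([0-9.eE+-]+)")
ACC_RE = re.compile(r"(train_acc|valid_acc) ([0-9.]+)")


def parse_search_log(lines):
    """Collect the learning rate and accuracies reported for each epoch."""
    history = []
    for line in lines:
        epoch_match = EPOCH_RE.search(line)
        if epoch_match:
            # A new "epoch N lr X" line starts a new record.
            history.append({"epoch": int(epoch_match.group(1)),
                            "lr": float(epoch_match.group(2))})
            continue
        acc_match = ACC_RE.search(line)
        if acc_match and history:
            # Accuracy lines are attached to the most recent epoch record.
            history[-1][acc_match.group(1)] = float(acc_match.group(2))
    return history


# Lines copied from the search log in this notebook.
sample = [
    "03/01 01:37:41 AM epoch 0 lr 1.000000e-01",
    "03/01 01:40:07 AM train 000 1.326397e+00 53.906250 93.750000",
    "03/01 01:41:16 AM train_acc 51.804000",
    "03/01 01:41:16 AM epoch 3 lr 9.978032e-02",
]
history = parse_search_log(sample)
print(history)
```

The resulting list of dictionaries can be passed to any plotting library to draw accuracy against epoch. In practice the lines would come from the log file in the experiment directory rather than from a hard-coded sample.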

## Preparation
One-time installation of the required libraries from requirement.txt and creation of the data path

In [3]:
# !pip3 install torch
!mkdir data

mkdir: cannot create directory ‘data’: File exists
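The message above is harmless: it only says the directory already exists. If the cell is meant to be re-run without the error, one option is to create the directory from Python instead. This is a small sketch, not something the codebase requires (`mkdir -p data` in the shell has the same effect):

```python
import os

# exist_ok=True makes the call a no-op when the directory is already there,
# so the cell can be executed repeatedly without an error.
data_dir = "data"
os.makedirs(data_dir, exist_ok=True)
os.makedirs(data_dir, exist_ok=True)  # safe to repeat
print(os.path.isdir(data_dir))
```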


Downloading CIFAR10

In [4]:
from dataset.dataset_dowloader_ import *

cifar10_dowloader()

Successfully download file cv-course-public/coding-1/cifar-10-python.tar.gz from OBS to local ./data/cifar-10-python.tar.gz


Let's start!

We are going to search for a few genotypes. We use the following combinations of initial hyperparameters:

  * `N3-E50-CS6-BS256-CT10-BT96` - classic
     - `N3` - number of nodes (3) in each cell during search (`--nodes=3`)
     - `E50` - number of epochs (50) used to search for the final genotype
     - `CS6` - number of cells (6) stacked as layers during search (`--layers=6`)
     - `BS256` - batch size (256) on CIFAR10 during search (the training portion of the data is 0.5, i.e. 25k images)
     - `CT10` - nodes number (4) in each cell during eval
     - `BT96` - batch size (256) from CIFAR10 (training portion of data is 0.5 - 25k) during eval
     
  * `N3-E50-CS6-BS128-CT10-BT96` - batch size (128)
     
  * `N3-E100-CS6-BS256-CT10-BT96` - epochs number (100)
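Since every experiment directory is named after its tag, it can be handy to turn a tag back into numbers when comparing runs. The sketch below takes the number after each prefix at face value. `parse_run_tag` and the key names in `PREFIXES` are hypothetical and chosen for illustration only; `CT` and `BT` are kept under neutral names because they describe the eval stage.

```python
import re

# Hypothetical mapping from tag prefixes to readable key names.
# N, E, CS and BS describe the search stage; CT and BT the eval stage.
PREFIXES = {
    "N": "search_nodes",
    "E": "search_epochs",
    "CS": "search_layers",
    "BS": "search_batch_size",
    "CT": "eval_ct",
    "BT": "eval_bt",
}


def parse_run_tag(tag):
    """Split a tag such as 'N3-E50-CS6-BS256-CT10-BT96' into a dict of ints."""
    config = {}
    for part in tag.split("-"):
        match = re.fullmatch(r"([A-Z]+)(\d+)", part)
        if match is None or match.group(1) not in PREFIXES:
            raise ValueError(f"unrecognised tag component: {part!r}")
        config[PREFIXES[match.group(1)]] = int(match.group(2))
    return config


print(parse_run_tag("N3-E100-CS6-BS256-CT10-BT96"))
```

Raising on an unknown component, rather than skipping it, makes a mistyped tag visible immediately.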

In [None]:
!python train_search.py --data='./data' --save='N3-E50-CS6-BS256-CT10-BT96' --nodes=3 --multiplier=3 --layers=6

In [None]:
!python train_search.py --data='./data' --save='N3-E50-CS6-BS128-CT10-BT96' --nodes=3 --multiplier=3 --layers=6 --batch_size=128

In [None]:
!python train_search.py --data='./data' --save='N3-E100-CS6-BS256-CT10-BT96' --nodes=3 --multiplier=3 --layers=6 --epochs=100

Experiment dir : search-N3-E100-CS6-BS256-CT10-BT96-20200301-013736
03/01 01:37:36 AM gpu device = 0
03/01 01:37:36 AM args = Namespace(arch_learning_rate=0.0006, arch_weight_decay=0.001, batch_size=256, cutout=False, cutout_length=16, data='./data', drop_path_prob=0.3, epochs=100, gpu=0, grad_clip=5, init_channels=16, layers=6, learning_rate=0.1, learning_rate_min=0.001, model_path='saved_models', momentum=0.9, multiplier=3, nodes=3, report_freq=50, save='search-N3-E100-CS6-BS256-CT10-BT96-20200301-013736', seed=2, set='cifar10', train_portion=0.5, unrolled=False, weight_decay=0.0003)
03/01 01:37:39 AM param size = 0.134410MB
Using downloaded and verified file: ./data/cifar-10-python.tar.gz
03/01 01:37:41 AM epoch 0 lr 1.000000e-01
03/01 01:37:41 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'avg_pool_3x3', 0), ('avg_pool_3x3', 'ma

tensor([[0.1250, 0.1249, 0.1251, 0.1249, 0.1251, 0.1251, 0.1250, 0.1248],
        [0.1251, 0.1250, 0.1249, 0.1250, 0.1249, 0.1251, 0.1251, 0.1249],
        [0.1250, 0.1248, 0.1249, 0.1252, 0.1251, 0.1252, 0.1250, 0.1249],
        [0.1250, 0.1251, 0.1251, 0.1250, 0.1249, 0.1250, 0.1249, 0.1249],
        [0.1249, 0.1251, 0.1251, 0.1249, 0.1250, 0.1250, 0.1250, 0.1249],
        [0.1250, 0.1252, 0.1251, 0.1250, 0.1250, 0.1249, 0.1250, 0.1249],
        [0.1250, 0.1251, 0.1251, 0.1248, 0.1250, 0.1250, 0.1250, 0.1249],
        [0.1252, 0.1251, 0.1249, 0.1249, 0.1249, 0.1249, 0.1250, 0.1251],
        [0.1251, 0.1250, 0.1251, 0.1252, 0.1249, 0.1250, 0.1248, 0.1249]],
       device='cuda:0', grad_fn=<SoftmaxBackward>)
tensor([0.3330, 0.3336, 0.3334], device='cuda:0', grad_fn=<SoftmaxBackward>)
03/01 01:40:07 AM train 000 1.326397e+00 53.906250 93.750000
03/01 01:40:43 AM train 050 1.360932e+00 50.375306 93.642770
03/01 01:41:16 AM train_acc 51.804000
03/01 01:41:16 AM epoch 3 lr 9.978032e-02
03/
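The learning rates printed in the log (`1.000000e-01` at epoch 0 and `9.978032e-02` at epoch 3) are consistent with a cosine annealing schedule between `learning_rate=0.1` and `learning_rate_min=0.001` over `epochs=100`, the values shown in `args`. That the script uses exactly this scheduler is an assumption here, since the log does not name it. The sketch below only evaluates the closed-form schedule; `cosine_lr` is a hypothetical helper.

```python
import math


def cosine_lr(epoch, total_epochs, lr_max, lr_min):
    """Closed-form cosine annealing from lr_max (epoch 0) down to lr_min."""
    cosine = 0.5 * (1.0 + math.cos(math.pi * epoch / total_epochs))
    return lr_min + (lr_max - lr_min) * cosine


# Hyperparameters taken from the `args` line of the log above.
for epoch in (0, 3):
    print(f"epoch {epoch} lr {cosine_lr(epoch, 100, 0.1, 0.001):e}")
```

Because the decay over the first few epochs is this small, the early part of the accuracy curve is effectively trained at the initial learning rate.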

In [None]:
!python train.py --auxiliary --cutout --arch='' --data='./data' --save='N4-E50-CS8-BS256'

In [None]:
!python train.py --auxiliary --cutout --arch='N4-E50-CS8-BS128-20200118-105259' --data='./data' --save='N4-E50-CS8-BS128'

In [None]:
!python train.py --auxiliary --cutout --arch='N4-E50-CS4-BS256-20200118-105518' --data='./data' --save='N4-E50-CS4-BS256'

In [None]:
!python train.py --auxiliary --cutout --arch='N4-E20-CS8-BS256-20200118-105659' --data='./data' --save='N4-E20-CS8-BS256'