# Coding Project: Differentiable NAS

* ### Based on the paper: H. Liu, K. Simonyan, and Y. Yang, “DARTS: Differentiable Architecture Search,” International Conference on Learning Representations (ICLR), 2019

* ### Assignment

  1. Find a codebase for this paper (the original DARTS implementation is available, and you can find a few variants) and download the CIFAR10 and CIFAR100 datasets

  **The dataset and codebase have already been uploaded to OBS on the Huawei Cloud Platform, so you can use them directly in ModelArts.**

  2. Run the basic code on the server, with the standard configuration of the selected paper on CIFAR10 (take the computational costs into consideration)
  3. Finish the required task and one of the optional tasks (see the following slides); of course, you can do more than one optional task if you wish (bonus points)
  4. If you have more ideas, please specify a new task by yourself (bonus points)
  5. Remember: integrate your results into your reading report
  6. Date assigned: Nov. 19, 2019;    Date Due: Dec 14, 2020


# Required Task

* The basic training and testing pipeline
    * Run a complete search process with DARTS or any of its variants on CIFAR10 (PC-DARTS is preferred due to its low cost)
    * Note: because computational resources are limited, you may not have enough of them to perform the re-training process
    * Pay attention to the hyper-parameters (config, epochs, etc.)
* Questions that should be answered in the report
    * Paste complete training and testing curves and the final architecture
    * Report the training and validation accuracy throughout the process
    * How is performance changing with the number of iterations?
    * Any other significant features that can be recognized in the curves?
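
The curves asked for above can be recovered from the text log that the search script writes. Below is a minimal sketch, assuming log lines in the format `<date> <time> epoch N lr X` and `<date> <time> train_acc Y`. The `valid_acc` key is an assumption about how validation accuracy is logged, and the sample lines are made-up values in that format, not results.

```python
import re

# Matches e.g. "03/01 12:44:41 AM epoch 3 lr 9.912322e-02"
EPOCH_RE = re.compile(r"\bepoch (\d+) lr ([0-9.eE+-]+)")
# Matches e.g. "03/01 12:44:40 AM train_acc 51.528000"
# ("valid_acc" is assumed to follow the same pattern)
ACC_RE = re.compile(r"\b(train_acc|valid_acc) ([0-9.]+)")


def parse_log(lines):
    """Collect (epoch, value) pairs for the learning rate and accuracies."""
    curves = {"lr": [], "train_acc": [], "valid_acc": []}
    epoch = None  # most recent "epoch N" header seen so far
    for line in lines:
        header = EPOCH_RE.search(line)
        if header:
            epoch = int(header.group(1))
            curves["lr"].append((epoch, float(header.group(2))))
            continue
        acc = ACC_RE.search(line)
        if acc and epoch is not None:
            curves[acc.group(1)].append((epoch, float(acc.group(2))))
    return curves


# Hypothetical sample in the log format (values are illustrative only).
sample = [
    "01/01 10:00:00 AM epoch 0 lr 1.000000e-01",
    "01/01 10:01:00 AM train 000 2.000000e+00 20.000000 60.000000",
    "01/01 10:02:00 AM train_acc 50.000000",
    "01/01 10:03:00 AM epoch 1 lr 9.000000e-02",
    "01/01 10:05:00 AM train_acc 60.000000",
]
curves = parse_log(sample)
print(curves["train_acc"])
```

The resulting lists of pairs can be passed to any plotting library to draw accuracy against epoch.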

## Preparation
One-time installation of the required libraries from requirement.txt and creation of the data path

In [1]:
# !pip3 install torch
!mkdir -p data

Downloading CIFAR10

In [2]:
from dataset.dataset_dowloader_ import *

cifar10_dowloader()

Successfully download file cv-course-public/coding-1/cifar-10-python.tar.gz from OBS to local ./data/cifar-10-python.tar.gz


Let's start!

We are going to search for a couple of genotypes. We chose the following combinations of initial hyperparameters:

  * `N3-E50-CS6-BS256-CT10-BT96` - classic
     - `N3` - number of nodes (3) in each cell during search
     - `E50` - number of epochs (50) used to search for the final genotype
     - `CS6` - number of cells (6) stacked as "layers" during search
     - `BS256` - batch size (256) on CIFAR10 during search (the training portion of the data is 0.5, i.e. 25k images)
     - `CT10` - number of cells (10) stacked as "layers" during eval
     - `BT96` - batch size (96) on CIFAR10 during eval
     
  * `N3-E50-CS6-BS128-CT10-BT96` - batch size (128)
     
  * `N3-E100-CS6-BS256-CT10-BT96` - epochs number (100)
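
To keep the experiment tags and the command-line flags in sync, the tag can be split into its fields programmatically. This is a minimal sketch: `parse_tag` and `search_command` are hypothetical helpers, not part of the codebase. The mapping of `N`, `CS`, `BS` and `E` to `--nodes`, `--layers`, `--batch_size` and `--epochs` is assumed from the commands used in this notebook. `--multiplier` is passed explicitly because the tag does not encode it.

```python
import re


def parse_tag(tag):
    """Split a tag such as 'N3-E50-CS6-BS256-CT10-BT96' into {prefix: int}."""
    fields = {}
    for part in tag.split("-"):
        match = re.fullmatch(r"([A-Z]+)(\d+)", part)
        if match is None:
            raise ValueError(f"unrecognised field: {part!r}")
        fields[match.group(1)] = int(match.group(2))
    return fields


def search_command(tag, multiplier, data="./data"):
    """Build the argument list for a search run (assumed flag mapping)."""
    cfg = parse_tag(tag)
    return [
        "python", "train_search.py",
        f"--data={data}",
        f"--save={tag}",
        f"--nodes={cfg['N']}",
        f"--multiplier={multiplier}",
        f"--layers={cfg['CS']}",
        f"--batch_size={cfg['BS']}",
        f"--epochs={cfg['E']}",
    ]


print(parse_tag("N3-E50-CS6-BS256-CT10-BT96"))
print(" ".join(search_command("N3-E100-CS6-BS256-CT10-BT96", multiplier=3)))
```

Generating the command from the tag avoids a run whose directory name says one thing while its flags say another.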

In [None]:
!python train_search.py --data='./data' --save='N3-E50-CS6-BS256-CT10-BT96' --nodes=3 --multiplier=3 --layers=6

Experiment dir : search-N3-E50-CS6-BS256-CT10-BT96-20200301-004058
03/01 12:40:58 AM gpu device = 0
03/01 12:40:58 AM args = Namespace(arch_learning_rate=0.0006, arch_weight_decay=0.001, batch_size=256, cutout=False, cutout_length=16, data='./data', drop_path_prob=0.3, epochs=50, gpu=0, grad_clip=5, init_channels=16, layers=6, learning_rate=0.1, learning_rate_min=0.001, model_path='saved_models', momentum=0.9, multiplier=3, nodes=3, report_freq=50, save='search-N3-E50-CS6-BS256-CT10-BT96-20200301-004058', seed=2, set='cifar10', train_portion=0.5, unrolled=False, weight_decay=0.0003)
03/01 12:41:04 AM param size = 0.134410MB
Using downloaded and verified file: ./data/cifar-10-python.tar.gz
03/01 12:41:06 AM epoch 0 lr 1.000000e-01
03/01 12:41:06 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'avg_pool_3x3', 0), ('avg_pool_3x3', 'max_p

        [0.1251, 0.1250, 0.1249, 0.1250, 0.1249, 0.1251, 0.1251, 0.1249],
        [0.1250, 0.1248, 0.1249, 0.1252, 0.1251, 0.1252, 0.1250, 0.1249],
        [0.1250, 0.1251, 0.1251, 0.1250, 0.1249, 0.1250, 0.1249, 0.1249],
        [0.1249, 0.1251, 0.1251, 0.1249, 0.1250, 0.1250, 0.1250, 0.1249],
        [0.1250, 0.1252, 0.1251, 0.1250, 0.1250, 0.1249, 0.1250, 0.1249],
        [0.1250, 0.1251, 0.1251, 0.1248, 0.1250, 0.1250, 0.1250, 0.1249],
        [0.1252, 0.1251, 0.1249, 0.1249, 0.1249, 0.1249, 0.1250, 0.1251],
        [0.1251, 0.1250, 0.1251, 0.1252, 0.1249, 0.1250, 0.1248, 0.1249]],
       device='cuda:0', grad_fn=<SoftmaxBackward>)
tensor([0.3330, 0.3336, 0.3334], device='cuda:0', grad_fn=<SoftmaxBackward>)
03/01 12:43:31 AM train 000 1.269483e+00 50.781250 93.750000
03/01 12:44:07 AM train 050 1.351383e+00 50.321691 94.125306
03/01 12:44:40 AM train_acc 51.528000
03/01 12:44:41 AM epoch 3 lr 9.912322e-02
03/01 12:44:41 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_poo

03/01 12:47:02 AM train 000 1.047716e+00 62.109375 97.656250
03/01 12:47:38 AM train 050 1.088510e+00 60.860907 96.155025
03/01 12:48:12 AM train_acc 61.988000
03/01 12:48:12 AM epoch 6 lr 9.652394e-02
03/01 12:48:12 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'avg_pool_3x3', 0), ('avg_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 2), ('dil_conv_3x3', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'skip_connect', 0), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 3)], reduce_concat=range(2, 5))
03/01 12:48:12 A

03/01 12:50:33 AM train 000 1.049808e+00 61.718750 98.046875
03/01 12:51:09 AM train 050 9.550819e-01 65.456495 97.326900
03/01 12:51:42 AM train_acc 65.680000
03/01 12:51:43 AM epoch 9 lr 9.229423e-02
03/01 12:51:43 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'avg_pool_3x3', 0), ('avg_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 2), ('dil_conv_3x3', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'skip_connect', 0), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 3)], reduce_concat=range(2, 5))
03/01 12:51:43 A

03/01 12:54:04 AM train 000 8.622259e-01 71.875000 96.484375
03/01 12:54:40 AM train 050 8.741702e-01 68.550858 97.771140
03/01 12:55:13 AM train_acc 68.980000
03/01 12:55:13 AM epoch 12 lr 8.658395e-02
03/01 12:55:13 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'avg_pool_3x3', 0), ('avg_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 2), ('dil_conv_3x3', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'skip_connect', 0), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 3)], reduce_concat=range(2, 5))
03/01 12:55:13 

03/01 12:57:35 AM train 000 8.212977e-01 71.093750 98.437500
03/01 12:58:11 AM train 050 8.143055e-01 71.239277 98.085172
03/01 12:58:45 AM train_acc 72.252000
03/01 12:58:45 AM epoch 15 lr 7.959537e-02
03/01 12:58:45 AM genotype_debug = Genotype(normal=[('dil_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'avg_pool_3x3', 0), ('avg_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 2), ('dil_conv_3x3', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'skip_connect', 0), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 0), ('skip_connect', 'avg_pool_3x3', 3)], reduce_concat=range(2, 5))
03/01 12:58:45 

03/01 01:05:17 AM train 000 6.087936e-01 80.859375 99.218750
03/01 01:06:23 AM train 050 6.891733e-01 75.628064 98.766850
03/01 01:07:25 AM train_acc 75.584000
03/01 01:07:26 AM epoch 19 lr 6.872217e-02
03/01 01:07:26 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('sep_conv_5x5', 'sep_conv_3x3', 2), ('dil_conv_5x5', 'sep_conv_5x5', 1), ('sep_conv_3x3', 'skip_connect', 0), ('sep_conv_5x5', 'sep_conv_3x3', 2), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('sep_conv_5x5', 'sep_conv_3x3', 3), ('sep_conv_5x5', 'max_pool_3x3', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 0)], reduce_concat=range(2, 5))
03/01 01:07:26 

03/01 01:11:48 AM train 000 6.389222e-01 78.515625 99.218750
03/01 01:12:54 AM train 050 6.455403e-01 76.999081 98.935355
03/01 01:13:56 AM train_acc 77.572000
03/01 01:13:56 AM epoch 22 lr 5.977538e-02
03/01 01:13:56 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'max_pool_3x3', 1), ('sep_conv_5x5', 'sep_conv_3x3', 2), ('dil_conv_5x5', 'sep_conv_3x3', 0), ('dil_conv_5x5', 'sep_conv_5x5', 2), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 3), ('dil_conv_5x5', 'sep_conv_5x5', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 0)], reduce_concat=range(2, 5))
03/01 01:13:56 

03/01 01:18:18 AM train 000 5.307762e-01 80.078125 99.609375
03/01 01:19:24 AM train 050 5.801601e-01 79.335172 99.119179
03/01 01:20:27 AM train_acc 79.028000
03/01 01:20:27 AM epoch 25 lr 5.050000e-02
03/01 01:20:27 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'max_pool_3x3', 1), ('sep_conv_5x5', 'sep_conv_3x3', 2), ('dil_conv_5x5', 'sep_conv_3x3', 0), ('dil_conv_5x5', 'sep_conv_5x5', 2), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 3), ('dil_conv_5x5', 'sep_conv_5x5', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 2), ('sep_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 0)], reduce_concat=range(2, 5))
03/01 01:20:27 

03/01 01:24:49 AM train 000 4.476742e-01 83.984375 100.000000
03/01 01:25:55 AM train 050 5.350664e-01 81.104473 99.203431
03/01 01:26:57 AM train_acc 80.912000
03/01 01:26:57 AM epoch 28 lr 4.122462e-02
03/01 01:26:57 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'sep_conv_5x5', 1), ('sep_conv_5x5', 'sep_conv_3x3', 2), ('dil_conv_5x5', 'sep_conv_3x3', 0), ('dil_conv_5x5', 'sep_conv_5x5', 2), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 3), ('sep_conv_5x5', 'sep_conv_3x3', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'max_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 2), ('sep_conv_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 0)], reduce_concat=range(2, 5))
03/01 01:26:57

03/01 01:31:20 AM train 000 5.481396e-01 81.250000 98.437500
03/01 01:32:26 AM train 050 4.962912e-01 82.682292 99.356618
03/01 01:33:28 AM train_acc 82.400000
03/01 01:33:28 AM epoch 31 lr 3.227783e-02
03/01 01:33:28 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'sep_conv_5x5', 1), ('sep_conv_5x5', 'sep_conv_3x3', 2), ('dil_conv_5x5', 'sep_conv_3x3', 0), ('dil_conv_5x5', 'sep_conv_5x5', 2), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 3), ('sep_conv_5x5', 'sep_conv_3x3', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'max_pool_3x3', 1)], reduce_concat=range(2, 5))
03/01 01:33:28 

03/01 01:37:50 AM train 000 3.960838e-01 85.546875 99.218750
03/01 01:38:57 AM train 050 4.552195e-01 83.999694 99.440870
03/01 01:39:59 AM train_acc 83.768000
03/01 01:39:59 AM epoch 34 lr 2.397657e-02
03/01 01:39:59 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'sep_conv_5x5', 1), ('sep_conv_5x5', 'skip_connect', 2), ('sep_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_5x5', 'sep_conv_5x5', 2), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('sep_conv_5x5', 'skip_connect', 3), ('sep_conv_5x5', 'sep_conv_3x3', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'max_pool_3x3', 1)], reduce_concat=range(2, 5))
03/01 01:39:59 

03/01 01:44:21 AM train 000 4.584221e-01 83.984375 100.000000
03/01 01:45:27 AM train 050 4.337088e-01 84.788603 99.555760
03/01 01:46:29 AM train_acc 84.680000
03/01 01:46:30 AM epoch 37 lr 1.661492e-02
03/01 01:46:30 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('dil_conv_5x5', 'sep_conv_5x5', 1), ('sep_conv_5x5', 'skip_connect', 2), ('sep_conv_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('dil_conv_5x5', 'sep_conv_5x5', 2), ('sep_conv_5x5', 'skip_connect', 3), ('sep_conv_5x5', 'sep_conv_3x3', 0)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'max_pool_3x3', 1)], reduce_concat=range(2, 5))
03/01 01:46:30

03/01 01:50:52 AM train 000 4.359577e-01 85.156250 99.218750
03/01 01:51:58 AM train 050 3.971666e-01 86.098346 99.609375
03/01 01:53:00 AM train_acc 86.064000
03/01 01:53:00 AM epoch 40 lr 1.045366e-02
03/01 01:53:00 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 1), ('sep_conv_5x5', 'skip_connect', 2), ('sep_conv_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('dil_conv_5x5', 'dil_conv_3x3', 2), ('sep_conv_5x5', 'sep_conv_3x3', 0), ('skip_connect', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'skip_connect', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_5x5', 'sep_conv_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1)], reduce_concat=range(2, 5))
03/01 01:53:00 

03/01 01:57:22 AM train 000 3.876121e-01 86.718750 100.000000
03/01 01:58:28 AM train 050 3.656934e-01 87.385110 99.632353
03/01 01:59:30 AM train_acc 87.296000
03/01 01:59:31 AM epoch 43 lr 5.711061e-03
03/01 01:59:31 AM genotype_debug = Genotype(normal=[('sep_conv_5x5', 'sep_conv_3x3', 0), ('sep_conv_5x5', 'sep_conv_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('dil_conv_5x5', 'dil_conv_3x3', 2), ('sep_conv_5x5', 'sep_conv_3x3', 0), ('skip_connect', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_3x3', 'max_pool_3x3', 0), ('dil_conv_5x5', 'sep_conv_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'max_pool_3x3', 1)], reduce_concat=range(2, 5))
03/01 01:59:31

03/01 02:03:53 AM train 000 2.883026e-01 90.234375 100.000000
03/01 02:04:59 AM train 050 3.416548e-01 87.944240 99.701287
03/01 02:06:01 AM train_acc 88.052000
03/01 02:06:02 AM epoch 46 lr 2.555134e-03
03/01 02:06:02 AM genotype_debug = Genotype(normal=[('sep_conv_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'max_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 1), ('skip_connect', 'max_pool_3x3', 2), ('sep_conv_3x3', 'max_pool_3x3', 0), ('sep_conv_3x3', 'avg_pool_3x3', 1), ('dil_conv_5x5', 'dil_conv_3x3', 2), ('sep_conv_5x5', 'sep_conv_3x3', 0), ('skip_connect', 'max_pool_3x3', 3)], normal_concat=range(2, 5), reduce=[('avg_pool_3x3', 'max_pool_3x3', 0), ('max_pool_3x3', 'max_pool_3x3', 1), ('sep_conv_5x5', 'max_pool_3x3', 2), ('max_pool_3x3', 'max_pool_3x3', 1), ('max_pool_3x3', 'max_pool_3x3', 0), ('dil_conv_5x5', 'sep_conv_3x3', 2), ('sep_conv_3x3', 'max_pool_3x3', 3), ('max_pool_3x3', 'max_pool_3x3', 0), ('sep_conv_5x5', 'max_pool_3x3', 1)], reduce_concat=range(2, 5))
03/01 02:06:02

03/01 02:10:24 AM train 000 3.391409e-01 88.281250 100.000000
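
The learning rates in the log above are consistent with a cosine annealing schedule from `learning_rate=0.1` down to `learning_rate_min=0.001` over `epochs=50`, the values printed in `args`. The sketch below checks that numerically against a few logged values. `cosine_lr` is a hypothetical helper written for this check, and the scheduler actually used by the script is not shown in this notebook.

```python
import math


def cosine_lr(epoch, lr_max=0.1, lr_min=0.001, total_epochs=50):
    """Cosine annealing: lr_max at epoch 0, decaying towards lr_min."""
    cosine = math.cos(math.pi * epoch / total_epochs)
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + cosine)


# (epoch, lr) pairs copied from the "epoch N lr X" lines of the log.
logged = {
    0: 1.000000e-01,
    3: 9.912322e-02,
    25: 5.050000e-02,
    46: 2.555134e-03,
}

for epoch, lr in logged.items():
    predicted = cosine_lr(epoch)
    # The log prints 7 significant digits, so allow a small relative error.
    assert math.isclose(predicted, lr, rel_tol=1e-5), (epoch, predicted, lr)
    print(f"epoch {epoch:2d}: logged {lr:.6e}, cosine {predicted:.6e}")
```

This matters when reading the curves: the learning rate drops slowly at first and fastest around epoch 25, so changes in the slope of the accuracy curve partly reflect the schedule.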


In [None]:
!python train_search.py --data='./data' --save='N3-E50-CS6-BS128-CT10-BT96' --nodes=3 --multiplier=3 --layers=6 --batch_size=128

In [None]:
!python train_search.py --data='./data' --save='N3-E100-CS6-BS256-CT10-BT96' --nodes=3 --multiplier=3 --layers=6 --epochs=100

In [None]:
!python train.py --auxiliary --cutout --arch='' --data='./data' --save='N4-E50-CS8-BS256'

In [None]:
!python train.py --auxiliary --cutout --arch='N4-E50-CS8-BS128-20200118-105259' --data='./data' --save='N4-E50-CS8-BS128'

In [None]:
!python train.py --auxiliary --cutout --arch='N4-E50-CS4-BS256-20200118-105518' --data='./data' --save='N4-E50-CS4-BS256'

In [None]:
!python train.py --auxiliary --cutout --arch='N4-E20-CS8-BS256-20200118-105659' --data='./data' --save='N4-E20-CS8-BS256'