Neural architecture search (NAS) automates the design of deep neural networks. One of the main challenges in searching over complex, non-continuous architectures is comparing the similarity of networks, which the conventional Euclidean metric may fail to capture. Optimal transport (OT) is robust to such complex structures, as it considers the minimal cost of transporting one network into another. However, OT is generally not negative definite, which may limit its ability to build the positive-definite kernels required by many kernel-dependent frameworks. Building upon tree-Wasserstein (TW), a negative definite variant of OT, we develop a novel discrepancy for neural architectures and demonstrate it within a Gaussian process (GP) surrogate model for the sequential NAS setting. Furthermore, we derive a novel parallel NAS method that uses a quality k-determinantal point process (k-DPP) on the GP posterior to select diverse, high-performing architectures from a discrete set of candidates. Empirically, we demonstrate that our TW-based approaches outperform other baselines in both sequential and parallel NAS.
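For intuition about the kernel at the core of the method, below is a minimal, self-contained sketch of the tree-Wasserstein distance between two distributions supported on a toy tree, and the exponential kernel exp(-TW / lengthscale) built from it (positive definite because TW is negative definite). The toy tree, the node ordering, and the names tree_wasserstein and tw_kernel are illustrative assumptions, not the repo's implementation, which operates on neural architectures.

```python
import numpy as np

# Minimal sketch, not the repo's implementation: tree-Wasserstein (TW) between
# two distributions supported on the nodes of a toy tree, and the exponential
# kernel exp(-TW / lengthscale). The tree below and all names are illustrative.
# Nodes are ordered so that every child has a larger index than its parent.
parent = [-1, 0, 0, 1, 1]           # parent[i]; -1 marks the root
weight = [0.0, 1.0, 0.5, 2.0, 1.0]  # length of edge (i -> parent[i]); unused at root

def tree_wasserstein(mu, nu, parent, weight):
    """Closed form: TW(mu, nu) = sum over edges e of w_e * |mu(T_e) - nu(T_e)|,
    where T_e is the subtree hanging below edge e."""
    diff = np.asarray(mu, float) - np.asarray(nu, float)
    cost = 0.0
    for v in range(len(parent) - 1, 0, -1):  # bottom-up over non-root nodes
        cost += weight[v] * abs(diff[v])     # diff[v] is the subtree mass gap at v
        diff[parent[v]] += diff[v]           # push the subtree mass up to the parent
    return cost

def tw_kernel(mu, nu, parent, weight, lengthscale=1.0):
    return np.exp(-tree_wasserstein(mu, nu, parent, weight) / lengthscale)

mu = [0.2, 0.3, 0.1, 0.4, 0.0]
nu = [0.1, 0.1, 0.4, 0.1, 0.3]
print(tree_wasserstein(mu, nu, parent, weight))  # 1.25 for this toy tree
print(tw_kernel(mu, nu, parent, weight))
```

The closed form runs in time linear in the number of tree nodes, one reason TW is attractive inside a GP surrogate.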
- tensorflow == 1.14.0
- pytorch == 1.2.0, torchvision == 0.4.0
- pot == 0.7 (POT: Python Optimal Transport, https://pythonot.github.io/)
- cyDPP (already included in the package)
- matplotlib, jupyter
- nasbench101 (follow the installation instructions in the NASBench101 repository)
- nasbench201 (follow the installation instructions in the NAS-Bench-201 repository)
To run on NASBench101, download nasbench_only108.tfrecord (see the NASBench101 repository for the download link) and place it in the top-level folder of this repo.
To run on NASBench201, download NAS-Bench-201-v1_1-096897.pth (see the NAS-Bench-201 repository for the download link) and place it in the top-level folder of this repo.
To customize your sequential experiment (the algorithms to run, the number of iterations, and the number of initial observations), open params.py and edit algo_params_seq; this is where you change the hyperparameters and select the algorithms to run. A purely hypothetical illustration of this kind of configuration is sketched below.
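The following is a hypothetical illustration only: the actual keys and algorithm names in params.py may differ, so inspect the file for the real structure.

```python
# Hypothetical illustration only -- the actual keys and algorithm names in
# params.py may differ; inspect the file for the real structure.
algo_params_seq = [
    {'algo_name': 'gp_tw_bo',       # e.g., BO with the GP + tree-Wasserstein kernel
     'num_init': 10,                # number of initial random observations
     'num_iterations': 150},        # total sequential iterations
    {'algo_name': 'random_search',  # a baseline to compare against
     'num_init': 10,
     'num_iterations': 150},
]
```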
Run a sequential experiment from the top-level folder, selecting a search space with the --search_space flag:

python run_experiments/run_experiments_sequential.py --search_space nasbench
python run_experiments/run_experiments_sequential.py --search_space nasbench201_cifar10
python run_experiments/run_experiments_sequential.py --search_space nasbench201_cifar100
python run_experiments/run_experiments_sequential.py --search_space nasbench201_ImageNet16-120
This runs the sequential NAS setting, comparing the BO algorithm against several other sequential NAS algorithms on the selected search space (NASBench101 or NASBench201).
To customize your batch experiment (the algorithms to run, the number of iterations, and the number of initial observations), open params.py and edit algo_params_batch; this is where you change the hyperparameters and select the algorithms to run (see the hypothetical sketch below).
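Again purely hypothetical; check params.py for the real keys. The batch setting typically adds a batch size on top of the sequential settings:

```python
# Hypothetical illustration only -- check params.py for the real structure.
algo_params_batch = [
    {'algo_name': 'gp_tw_kdpp',  # e.g., k-DPP quality selection on the GP posterior
     'batch_size': 5,            # architectures evaluated per iteration (assumed key)
     'num_init': 10,             # number of initial random observations
     'num_iterations': 30},      # total batch iterations
]
```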
Run a batch experiment from the top-level folder, selecting a search space with the --search_space flag:

python run_experiments/run_experiments_batch.py --search_space nasbench
python run_experiments/run_experiments_batch.py --search_space nasbench201_cifar10
python run_experiments/run_experiments_batch.py --search_space nasbench201_cifar100
python run_experiments/run_experiments_batch.py --search_space nasbench201_ImageNet16-120
This runs the batch NAS setting, comparing the k-DPP quality algorithm against several other batch baseline algorithms on the selected search space.
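For intuition, here is a minimal sketch of how a quality-weighted k-DPP can select a batch that trades off predicted quality against diversity: item qualities q_i (e.g., derived from the GP posterior mean) scale a positive semi-definite similarity kernel S into L_ij = q_i * S_ij * q_j, and a greedy MAP selection picks items that maximize the determinant of the selected submatrix. This is an illustrative sketch, not the repo's implementation (which relies on cyDPP); greedy_kdpp_map and the toy data are assumptions.

```python
import numpy as np

def greedy_kdpp_map(quality, similarity, k):
    """Greedily pick k items maximizing det(L[idx, idx]) for the
    quality-weighted DPP kernel L_ij = q_i * S_ij * q_j."""
    L = similarity * np.outer(quality, quality)
    selected = []
    for _ in range(k):
        best_i, best_logdet = None, -np.inf
        for i in range(L.shape[0]):
            if i in selected:
                continue
            idx = selected + [i]
            sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
            if sign > 0 and logdet > best_logdet:
                best_i, best_logdet = i, logdet
        if best_i is None:  # numerically singular; stop early
            break
        selected.append(best_i)
    return selected

# Toy usage: in the NAS setting, q could be exp(GP posterior mean) and S a
# TW-based similarity; here both are synthetic stand-ins.
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))
S = np.exp(-((X[:, None, :] - X[None, :, :]) ** 2).sum(-1))  # RBF similarity
q = np.exp(rng.normal(size=8))                               # item "qualities"
print(greedy_kdpp_map(q, S, k=3))
```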
To enable a fair comparison with BANANAS and other baselines, we adapt the source code from BANANAS (https://github.com/naszilla/bananas).
To plot the results:

python plot_result_sequential.py
python plot_result_batch.py
Example results over 5 independent runs.
Vu Nguyen*, Tam Le*, Makoto Yamada, Michael A. Osborne. "Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search." International Conference on Machine Learning (ICML), 2021.
@inproceedings{nguyen2021optimal,
title={Optimal transport kernels for sequential and parallel neural architecture search},
author={Nguyen, Vu and Le, Tam and Yamada, Makoto and Osborne, Michael A},
booktitle={International Conference on Machine Learning},
pages={8084--8095},
year={2021},
organization={PMLR}
}