Exploiting Operation Importance for Differentiable Neural Architecture Search

Yuan Zhou, Xukai Xie, Sun Yuan Kung

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

Recently, differentiable neural architecture search (NAS) methods have made significant progress in reducing the computational costs of NASs. Existing methods search for the best architecture by choosing candidate operations with higher architecture weights. However, architecture weights cannot accurately reflect the importance of each operation, that is, the operation with the highest weight might not be related to the best performance. To circumvent this deficiency, we propose a novel indicator that can fully represent the operation importance and, thus, serve as an effective metric to guide the model search. Based on this indicator, we further develop a NAS scheme for 'exploiting operation importance for effective NAS' (EoiNAS). More precisely, we propose a high-order Markov chain-based strategy to slim the search space to further improve search efficiency and accuracy. To evaluate the effectiveness of the proposed EoiNAS, we applied our method to two tasks: image classification and semantic segmentation. Extensive experiments on both tasks provided strong evidence that our method is capable of discovering high-performance architectures while guaranteeing the requisite efficiency during searching.

Original languageEnglish (US)
Pages (from-to)6235-6248
Number of pages14
JournalIEEE Transactions on Neural Networks and Learning Systems
Volume33
Issue number11
DOIs
StatePublished - Nov 1 2022

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications

Keywords

  • High-order Markov chain
  • image classification
  • neural architecture search (NAS)
  • semantic segmentation

Fingerprint

Dive into the research topics of 'Exploiting Operation Importance for Differentiable Neural Architecture Search'. Together they form a unique fingerprint.

Cite this