Skip to main content
To KTH's start page To KTH's start page

Bayesian Optimization for Neural Architecture Search using Graph Kernels

Time: Tue 2021-02-02 15.00

Location: https://kth-se.zoom.us/j/2884945301

Respondent: Bharathwaj Krishnaswami Sreedhar

Opponent: Nishant Joshi

Supervisor: Magnus Boman (Examiner: Amir H. Payberah)

Export to calendar

Neural architecture search is a popular method for automating architecture design. Bayesian optimization is a widely used approach for hyper-parameter optimization and can estimate a function with limited samples. However, Bayesian optimization methods are not preferred for architecture search as it expects vector inputs while graphs are high dimensional data. This thesis presents a Bayesian approach with Gaussian priors that use graph kernels specifically targeted to work in the higher-dimensional graph space. We implemented three different graph kernels and show that on the NAS-Bench-101 dataset, even an untrained GCN kernel outperforms previous methods significantly in terms of the best network found and the number of samples required to find it. We follow the NAS guidelines to make this work reproducible.