Timezone: »
We propose a locally hierarchical auto-regressive model with multiple resolutions of discrete codes. In the first stage of our algorithm, we represent an image with a pyramid of codes using Hierarchically Quantized Variational AutoEncoder (HQ-VAE), which disentangles the information contained in the multi-level codes. For an example of two-level codes, we create two separate pathways to carry high-level coarse structures of input images using top codes while compensating for missing fine details by constructing a residual connection for bottom codes. An appropriate selection of resizing operations for code embedding maps enables top codes to capture maximal information within images and the first stage algorithm achieves better performance on both vector quantization and image generation. The second stage adopts Hierarchically Quantized Transformer (HQ-Transformer) to process a sequence of local pyramids, which consist of a single top code and its corresponding bottom codes. Contrary to other hierarchical models, we sample bottom codes in parallel by exploiting the conditional independence assumption on the bottom codes. This assumption is naturally harvested from our first-stage model, HQ-VAE, where the bottom code learns to describe local details. On class-conditional and text-conditional generation benchmarks, our model shows competitive performance to previous AR models in terms of fidelity of generated images while enjoying lighter computational budgets.
Author Information
Tackgeun You (POSTECH)
Saehoon Kim (Kakao Brain)
Chiheon Kim (Kakao Brain)
Doyup Lee (Kakao Brain)
Bohyung Han (Seoul National University)
More from the Same Authors
-
2022 Poster: MCL-GAN: Generative Adversarial Networks with Multiple Specialized Discriminators »
Jinyoung Choi · Bohyung Han -
2022 Poster: Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer »
Doyup Lee · Chiheon Kim · Saehoon Kim · Minsu Cho · WOOK SHIN HAN -
2022 Poster: Information-Theoretic GAN Compression with Variational Energy-based Model »
Minsoo Kang · Hyewon Yoo · Eunhee Kang · Sehwan Ki · Hyong Euk Lee · Bohyung Han -
2021 Poster: Learning Student-Friendly Teacher Networks for Knowledge Distillation »
Dae Young Park · Moon-Hyun Cha · changwook jeong · Daesin Kim · Bohyung Han -
2021 Poster: Learning Debiased and Disentangled Representations for Semantic Segmentation »
Sanghyeok Chu · Dongwan Kim · Bohyung Han -
2020 Poster: Rotation-Invariant Local-to-Global Representation Learning for 3D Point Cloud »
SEOHYUN KIM · JaeYoo Park · Bohyung Han -
2019 Poster: Combinatorial Inference against Label Noise »
Paul Hongsuck Seo · Geeho Kim · Bohyung Han -
2019 Poster: Mining GOLD Samples for Conditional GANs »
Sangwoo Mo · Chiheon Kim · Sungwoong Kim · Minsu Cho · Jinwoo Shin -
2019 Poster: Fast AutoAugment »
Sungbin Lim · Ildoo Kim · Taesup Kim · Chiheon Kim · Sungwoong Kim -
2018 Poster: Uncertainty-Aware Attention for Reliable Interpretation and Prediction »
Jay Heo · Hae Beom Lee · Saehoon Kim · Juho Lee · Kwang Joon Kim · Eunho Yang · Sung Ju Hwang -
2018 Poster: Learning to Specialize with Knowledge Distillation for Visual Question Answering »
Jonghwan Mun · Kimin Lee · Jinwoo Shin · Bohyung Han -
2018 Poster: DropMax: Adaptive Variational Softmax »
Hae Beom Lee · Juho Lee · Saehoon Kim · Eunho Yang · Sung Ju Hwang -
2017 : Learning to Transfer Initializations for Bayesian Hyperparameter Optimization »
Saehoon Kim -
2017 Poster: Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization »
Hyeonwoo Noh · Tackgeun You · Jonghwan Mun · Bohyung Han -
2017 Poster: Visual Reference Resolution using Attention Memory for Visual Dialog »
Paul Hongsuck Seo · Andreas Lehrmann · Bohyung Han · Leonid Sigal -
2015 Poster: Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation »
Seunghoon Hong · Hyeonwoo Noh · Bohyung Han -
2015 Spotlight: Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation »
Seunghoon Hong · Hyeonwoo Noh · Bohyung Han -
2014 Poster: Object Localization based on Structural SVM using Privileged Information »
Jan Feyereisl · Suha Kwak · Jeany Son · Bohyung Han