Timezone: »
Language and image understanding are two major goals of artificial intelligence which can both be conceptually formulated in terms of parsing the input signal into a hierarchical representation. Natural language researchers have made great progress by exploiting the 1D structure of language to design efficient polynomial-time parsing algorithms. By contrast, the two-dimensional nature of images makes it much harder to design efficient image parsers and the form of the hierarchical representations is also unclear. Attempts to adapt representations and algorithms from natural language have only been partially successful. In this paper, we propose a Hierarchical Image Model (HIM) for 2D image parsing which outputs image segmentation and object recognition. This HIM has multiple layers (five in this paper) and has advantages for representation, inference, and learning. Firstly, the HIM has a coarse-to-fine representation which is capable of capturing long-range dependency and exploiting different levels of contextual information. Secondly, the structure of the HIM allows us to design a rapid inference algorithm, based on dynamic programming, which enables us to parse the image rapidly in polynomial time. Thirdly, we can learn the HIM efficiently in a discriminative manner from a labeled dataset. We demonstrate that HIM outperforms other state-of-the-art methods by evaluation on the challenging public MSRC image dataset. Finally, we sketch how the HIM architecture can be extended to model more complex image phenomena.
Author Information
Long Zhu (Massachusetts Institute of Technology)
Yuanhao Chen (University of California, Los Angeles)
Yuan Lin (SJTU)
Alan Yuille (JHU)
Related Events (a corresponding poster, oral, or spotlight)
-
2008 Spotlight: A Hierarchical Image Model for Polynomial-Time 2D Parsing »
Wed. Dec 10th 07:51 -- 07:52 PM Room
More from the Same Authors
-
2021 : Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge »
Jiyang Qi · Yan Gao · Yao Hu · Xinggang Wang · Xiaoyu Liu · Xiang Bai · Serge Belongie · Alan Yuille · Philip Torr · Song Bai -
2021 : Understanding Catastrophic Forgetting and Remembering in Continual Learning with Optimal Relevance Mapping »
prakhar kaushik · Adam Kortylewski · Alex Gain · Alan Yuille -
2022 : Volumetric Neural Human for Robust Pose Optimization via Analysis-by-synthesis »
Pengliang Ji · Angtian Wang · Yi Zhang · Adam Kortylewski · Alan Yuille -
2022 : Synthetic Tumors Make AI Segment Tumors Better »
Qixin Hu · Junfei Xiao · Alan Yuille · Zongwei Zhou -
2022 : Assembling Existing Labels from Public Datasets to\\Diagnose Novel Diseases: COVID-19 in Late 2019 »
Zengle Zhu · Mintong Kang · Alan Yuille · Zongwei Zhou -
2022 : Making Your First Choice: To Address Cold Start Problem in Vision Active Learning »
Liangyu Chen · Yutong Bai · Siyu Huang · Yongyi Lu · Bihan Wen · Alan Yuille · Zongwei Zhou -
2023 Poster: 3D-Aware Visual Question Answering about Parts, Poses and Occlusions »
XINGRUI WANG · Zhuowan Li · Wufei Ma · Adam Kortylewski · Alan Yuille -
2023 Poster: Annotating 8,000 Abdominal CT Volumes for Multi-Organ Segmentation in Three Weeks »
Chongyu Qu · Tiezheng Zhang · Hualin Qiao · jie liu · Yucheng Tang · Alan Yuille · Zongwei Zhou -
2021 Poster: Glance-and-Gaze Vision Transformer »
Qihang Yu · Yingda Xia · Yutong Bai · Yongyi Lu · Alan Yuille · Wei Shen -
2021 Poster: Are Transformers more robust than CNNs? »
Yutong Bai · Jieru Mei · Alan Yuille · Cihang Xie -
2021 Poster: Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose »
Angtian Wang · Shenxiao Mei · Alan Yuille · Adam Kortylewski -
2017 Poster: Label Distribution Learning Forests »
Wei Shen · KAI ZHAO · Yilu Guo · Alan Yuille -
2016 Poster: SURGE: Surface Regularized Geometry Estimation from a Single Image »
Peng Wang · Xiaohui Shen · Bryan Russell · Scott Cohen · Brian Price · Alan Yuille -
2016 Poster: Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images »
Junhua Mao · Jiajing Xu · Kevin Jing · Alan Yuille -
2014 Workshop: Modern Nonparametrics 3: Automating the Learning Pipeline »
Eric Xing · Mladen Kolar · Arthur Gretton · Samory Kpotufe · Han Liu · Zoltán Szabó · Alan Yuille · Andrew G Wilson · Ryan Tibshirani · Sasha Rakhlin · Damian Kozbur · Bharath Sriperumbudur · David Lopez-Paz · Kirthevasan Kandasamy · Francesco Orabona · Andreas Damianou · Wacha Bounliphone · Yanshuai Cao · Arijit Das · Yingzhen Yang · Giulia DeSalvo · Dmitry Storcheus · Roberto Valerio -
2014 Poster: Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations »
Xianjie Chen · Alan Yuille -
2014 Poster: Learning From Weakly Supervised Data by The Expectation Loss SVM (e-SVM) algorithm »
Jun Zhu · Junhua Mao · Alan Yuille -
2010 Poster: Gaussian sampling by local perturbations »
George Papandreou · Alan Yuille -
2010 Poster: Functional form of motion priors in human motion perception »
HongJing Lu · Tungyou Lin · Alan L Lee · Luminita Vese · Alan Yuille -
2010 Poster: A unified model of short-range and long-range motion perception »
Shuang Wu · Xuming He · HongJing Lu · Alan Yuille -
2009 Poster: Modeling the spacing effect in sequential category learning »
HongJing Lu · Matthew Weiden · Alan Yuille -
2008 Poster: Model selection and velocity estimation using novel priors for motion patterns »
Alan Yuille · Shuang Wu · HongJing Lu -
2008 Oral: Model selection and velocity estimation using novel priors for motion patterns »
Alan Yuille · Shuang Wu · HongJing Lu -
2007 Workshop: The Grammar of Vision: Probabilistic Grammar-Based Models for Visual Scene Understanding and Object Categorization »
Virginia Savova · Josh Tenenbaum · Leslie Kaelbling · Alan Yuille -
2007 Poster: The Noisy-Logical Distribution and its Application to Causal Inference »
Alan Yuille · HongJing Lu -
2007 Poster: Rapid Inference on a novel AND/OR graph: Detection, Segmentation and Parsing of Articulated Deformable Objects in Cluttered Backgrounds »
Yuanhao Chen · Long Zhu · Chenxi Lin · Alan Yuille · Hongjiang Zhang -
2006 Talk: Unsupervised Learning of a Probabilistic Grammar for Object Detection and Parsing »
Long Zhu · Yuanhao Chen · Alan Yuille -
2006 Poster: Unsupervised Learning of a Probabilistic Grammar for Object Detection and Parsing »
Long Zhu · Yuanhao Chen · Alan Yuille