Timezone: »
We propose a method that learns a discriminative yet semantic space for object categorization, where we also embed auxiliary semantic entities such as supercategories and attributes. Contrary to prior work which only utilized them as side information, we explicitly embed the semantic entities into the same space where we embed categories, which enables us to represent a category as their linear combination. By exploiting such a unified model for semantics, we enforce each category to be generated as a sparse combination of a supercategory + attributes, with an additional exclusive regularization to learn discriminative composition. The proposed reconstructive regularization guides the discriminative learning process to learn a better generalizing model, as well as generates compact semantic description of each category, which enables humans to analyze what has been learned.
Author Information
Sung Ju Hwang (Disney Research)
Leonid Sigal (University of British Columbia)
More from the Same Authors
-
2017 Poster: Non-parametric Structured Output Networks »
Andreas Lehrmann · Leonid Sigal -
2017 Poster: Visual Reference Resolution using Attention Memory for Visual Dialog »
Paul Hongsuck Seo · Andreas Lehrmann · Bohyung Han · Leonid Sigal -
2013 Poster: Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization »
Nataliya Shapovalova · Michalis Raptis · Leonid Sigal · Greg Mori -
2011 Poster: Facial Expression Transfer with Input-Output Temporal Restricted Boltzmann Machines »
Matthew D Zeiler · Graham Taylor · Leonid Sigal · Iain Matthews · Rob Fergus