Can we learn to locate objects in images only from the list of objects those images contain? Or the sentiment of a phrase in a review from the overall score? Can we tell who voted for Obama in 2012? Or which population strata are more likely to be infected by Ebola, looking only at geographical incidence and census data? Are large corporations able to infer sensitive traits of their customers, such as sexual preferences, unemployment or ethnicity, based only on state-level statistics?
Conversely, how can we publicly release data containing personal information to the research community, while guaranteeing that individuals’ sensitive information will not be compromised? How realistic is the idea of outsourcing machine-learning tasks by sharing not the datasets themselves but only a few statistics sufficient for training?
Despite their diversity, solutions to those problems can be surprisingly alike, as they all play with the same elements: variables without a clear one-to-one mapping, and the search for, or the protection against, models and statistics sufficient to recover the relevant variables.
Aggregate statistics and obfuscated data are abundant, as they are released much more frequently than plain individual-level information; the latter is often too sensitive because of privacy constraints or business value, or too expensive to collect. Learning in those scenarios has been conceptualized, for example, as multiple instance learning, learning from label proportions, and learning from noisy labels. It is common in a variety of application fields, such as computer vision, sentiment analysis and bioinformatics, whenever labels for single image patches, sentences or proteins are unknown but higher-level supervision is available.
This problem is not limited to computer science, though. In fact, natural, social and medical disciplines have long studied the problem of inference from aggregates, including the so-called ecological inference in political science, econometrics and epidemiology, and the modifiable areal unit problem in spatial statistics.
But as those approaches are shown to be effective in practice, to the point that the available statistics reveal sensitive attributes with high accuracy, the question is turned around into a search for privacy guarantees. Traditional statistics has studied the problem of confidential data release. Research in k-anonymity, l-diversity and, more recently, differential privacy has proposed procedures to mask data so that one can trade off protection against usefulness for statistical analysis.
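As a minimal illustration of the trade-off between protection and usefulness, the sketch below applies the Laplace mechanism, a standard construction from the differential privacy literature, to a counting query. The function names `laplace_noise` and `private_count` and the toy records are hypothetical; they are not taken from any of the workshop contributions.

```python
import random


def laplace_noise(scale, rng):
    """Draw one sample from a Laplace(0, scale) distribution.

    The difference of two i.i.d. exponential variables with mean `scale`
    follows a Laplace distribution with that scale.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(records, predicate, epsilon, rng=None):
    """Release a noisy count of the records matching `predicate`.

    A counting query has sensitivity 1 (adding or removing one record
    changes the count by at most 1), so Laplace noise with scale
    1 / epsilon gives epsilon-differential privacy for this single release.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if rng is None:
        rng = random.Random()
    true_count = sum(1 for record in records if predicate(record))
    return true_count + laplace_noise(1.0 / epsilon, rng)


if __name__ == "__main__":
    # Hypothetical toy data: 100 records, 25 of them flagged.
    records = [{"flagged": i % 4 == 0} for i in range(100)]
    rng = random.Random(0)
    for epsilon in (0.1, 1.0, 10.0):
        noisy = private_count(records, lambda r: r["flagged"], epsilon, rng)
        print(f"epsilon={epsilon}: noisy count = {noisy:.2f}")
```

A smaller `epsilon` gives stronger protection but a noisier, less useful count; a larger `epsilon` does the opposite. Repeated releases on the same data consume additional privacy budget, which this sketch does not track.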
Sat 6:10 a.m. - 7:00 a.m. | Kamalika Chaudhuri (Invited speaker) | Kamalika Chaudhuri
Sat 7:30 a.m. - 8:10 a.m. | Nando de Freitas (Invited speaker) | Nando de Freitas
Sat 8:10 a.m. - 8:30 a.m. | Kentaro Minami: $(\varepsilon, \delta)$-differential privacy of Gibbs posteriors (Talk) | Kentaro Minami
Sat 8:30 a.m. - 8:50 a.m. | Yu-Xiang Wang: Learning with differential privacy: stability, learnability and the sufficiency and necessity of ERM principle (Talk) | Yu-Xiang Wang
Sat 8:50 a.m. - 9:10 a.m. | Vishesh Karwa: Private posterior distributions from variational approximations (Talk) | Vishesh Karwa
Sat 11:30 a.m. - 12:20 p.m. | Wendy Cho: Information Theoretic Research Designs (Invited speaker) | Wendy K. Tam Cho
Sat 12:20 p.m. - 12:40 p.m. | Fan Li: Alter-CNN: an approach for learning from label proportions with the application to ice-water Classification (Talk) | Fan Li
Sat 12:40 p.m. - 1:00 p.m. | Tao Sun: Message passing for collective graphical models (Talk) | Tao Sun
Sat 1:30 p.m. - 2:15 p.m. | Max Ott: Confidential Computing (Invited speaker) | Max Ott
Author Information
Giorgio Patrini (Australian National University / NICTA)
Tony Jebara (Columbia University)
Richard Nock (Data61, The Australian National University & The University of Sydney)
Dimitrios Kotzias (University of California Irvine)
Felix Xinnan Yu (Google Research)