Timezone: »
Differential Privacy is a popular and well-studied notion of privacy. In the era ofbig data that we are in, privacy concerns are becoming ever more prevalent and thusdifferential privacy is being turned to as one such solution. A popular method forensuring differential privacy of a classifier is known as subsample-and-aggregate,in which the dataset is divided into distinct chunks and a model is learned on eachchunk, after which it is aggregated. This approach allows for easy analysis of themodel on the data and thus differential privacy can be easily applied. In this paper,we extend this approach by dividing the data several times (rather than just once)and learning models on each chunk within each division. The first benefit of thisapproach is the natural improvement of utility by aggregating models trained ona more diverse range of subsets of the data (as demonstrated by the well-knownbagging technique). The second benefit is that, through analysis that we provide inthe paper, we can derive tighter differential privacy guarantees when several queriesare made to this mechanism. In order to derive these guarantees, we introducethe upwards and downwards moments accountants and derive bounds for thesemoments accountants in a data-driven fashion. We demonstrate the improvementsour model makes over standard subsample-and-aggregate in two datasets (HeartFailure (private) and UCI Adult (public)).
Author Information
James Jordon (University of Oxford)
Jinsung Yoon (University of California, Los Angeles)
I am a research scientist at Google Cloud AI. I am currently working on diverse machine learning research topics such as generative models, self- and semi-supervised learning, model interpretation, data imputation, and synthetic data generation. Previously, I worked on machine learning for medicine with Professor Mihaela van der Schaar as a graduate student researcher in UCLA Electrical and Computer Engineering Department. I received my Ph.D. and M.S. in Electrical and Computer Engineering Department at UCLA, and B.S. in Electrical and Computer Engineering at Seoul National University (SNU).
Mihaela van der Schaar (University of Cambridge, Alan Turing Institute and UCLA)
More from the Same Authors
-
2022 : Provable Re-Identification Privacy »
Zachary Izzo · Jinsung Yoon · Sercan Arik · James Zou -
2022 : Closing Remarks »
Cheng Zhang · Mihaela van der Schaar -
2022 : Panel Discussion »
Cheng Zhang · Mihaela van der Schaar · Ilya Shpitser · Aapo Hyvarinen · Yoshua Bengio · Bernhard Schölkopf -
2022 : Opening Remarks »
Cheng Zhang · Mihaela van der Schaar -
2021 : Invited talk #5: Mihaela van der Schaar »
Mihaela van der Schaar -
2020 : Closing remarks »
James Jordon -
2020 : What we learned from the Hide-and-Seek privacy challenge »
James Jordon -
2020 : Synthetic data in the healthcare setting »
James Jordon -
2020 : The importance of synthetic data »
James Jordon -
2020 : Introducing the Hide-and-Seek privacy challenge »
James Jordon -
2020 Poster: Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks »
Ioana Bica · James Jordon · Mihaela van der Schaar -
2020 Poster: OrganITE: Optimal transplant donor organ offering using an individual treatment effect »
Jeroen Berrevoets · James Jordon · Ioana Bica · alexander gimson · Mihaela van der Schaar -
2020 : Q&A for invited speaker, Mihaela van der Schaar »
Mihaela van der Schaar -
2020 : Interpretable AutoML: Powering the machine learning revolution in healthcare in the era of Covid-19 and beyond »
Mihaela van der Schaar -
2020 Poster: VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain »
Jinsung Yoon · Yao Zhang · James Jordon · Mihaela van der Schaar -
2019 Poster: Attentive State-Space Modeling of Disease Progression »
Ahmed Alaa · Mihaela van der Schaar -
2019 Poster: Demystifying Black-box Models with Symbolic Metamodels »
Ahmed Alaa · Mihaela van der Schaar -
2019 Poster: Time-series Generative Adversarial Networks »
Jinsung Yoon · Daniel Jarrett · Mihaela van der Schaar -
2019 Poster: Conditional Independence Testing using Generative Adversarial Networks »
Alexis Bellot · Mihaela van der Schaar -
2019 Spotlight: Conditional Independence Testing using Generative Adversarial Networks »
Alexis Bellot · Mihaela van der Schaar -
2018 Poster: Forecasting Treatment Responses Over Time Using Recurrent Marginal Structural Networks »
Bryan Lim · Ahmed Alaa · Mihaela van der Schaar -
2017 : Coffee break and Poster Session II »
Mohamed Kane · Albert Haque · Vagelis Papalexakis · John Guibas · Peter Li · Carlos Arias · Eric Nalisnick · Padhraic Smyth · Frank Rudzicz · Xia Zhu · Theodore Willke · Noemie Elhadad · Hans Raffauf · Harini Suresh · Paroma Varma · Yisong Yue · Ognjen (Oggi) Rudovic · Luca Foschini · Syed Rameel Ahmad · Hasham ul Haq · Valerio Maggio · Giuseppe Jurman · Sonali Parbhoo · Pouya Bashivan · Jyoti Islam · Mirco Musolesi · Chris Wu · Alexander Ratner · Jared Dunnmon · Cristóbal Esteban · Aram Galstyan · Greg Ver Steeg · Hrant Khachatrian · Marc Górriz · Mihaela van der Schaar · Anton Nemchenko · Manasi Patwardhan · Tanay Tandon -
2017 Poster: DPSCREEN: Dynamic Personalized Screening »
Kartik Ahuja · William Zame · Mihaela van der Schaar -
2017 Poster: Deep Multi-task Gaussian Processes for Survival Analysis with Competing Risks »
Ahmed Alaa · Mihaela van der Schaar -
2017 Spotlight: Deep Multi-task Gaussian Processes for Survival Analysis with Competing Risks »
Ahmed Alaa · Mihaela van der Schaar -
2017 Poster: Bayesian Inference of Individualized Treatment Effects using Multi-task Gaussian Processes »
Ahmed Alaa · Mihaela van der Schaar