

Contributed talk in Workshop: Privacy in Machine Learning (PriML) 2021

Simple Baselines Are Strong Performers for Differentially Private Natural Language Processing

Xuechen (Chen) Li · Florian Tramer · Percy Liang · Tatsunori Hashimoto


Abstract:

Differentially private learning has seen limited success for deep learning models of text, resulting in a perception that differential privacy may be incompatible with the language model fine-tuning paradigm. We demonstrate that this perception is inaccurate and that, with the right setup, high-performing private models can be learned on moderately sized corpora by directly fine-tuning with differentially private optimization. Our work highlights the important role of hyperparameters, task formulations, and pretrained models. Our analyses also show that the low performance of naive differentially private baselines in prior work is attributable to suboptimal choices in these factors. Empirical results reveal that differentially private optimization does not suffer from dimension-dependent performance degradation with pretrained models and achieves performance on par with state-of-the-art private training procedures and strong non-private baselines.
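The abstract refers to directly fine-tuning with differentially private optimization, i.e., DP-SGD-style training. The sketch below illustrates the core mechanics of one such update step in plain PyTorch (per-example gradient clipping followed by calibrated Gaussian noise). The toy linear model, the `dp_sgd_step` helper, and the `max_grad_norm` and `noise_multiplier` values are illustrative assumptions, not the authors' implementation or the paper's hyperparameters.

```python
# Minimal sketch of one DP-SGD step (illustrative only; not the paper's codebase).
import torch
import torch.nn as nn

def dp_sgd_step(model, loss_fn, xb, yb, optimizer,
                max_grad_norm=1.0, noise_multiplier=1.0):
    """One DP-SGD update: clip each per-example gradient to `max_grad_norm`,
    sum, add Gaussian noise with std `noise_multiplier * max_grad_norm`,
    average over the batch, then take an optimizer step."""
    params = [p for p in model.parameters() if p.requires_grad]
    summed_grads = [torch.zeros_like(p) for p in params]
    batch_size = xb.shape[0]

    for i in range(batch_size):  # microbatches of size 1 give per-example gradients
        model.zero_grad()
        loss = loss_fn(model(xb[i:i+1]), yb[i:i+1])
        loss.backward()
        # Scale this example's gradient so its global norm is at most max_grad_norm.
        per_example_norm = torch.sqrt(sum(p.grad.pow(2).sum() for p in params))
        clip_factor = (max_grad_norm / (per_example_norm + 1e-6)).clamp(max=1.0)
        for g_sum, p in zip(summed_grads, params):
            g_sum += clip_factor * p.grad

    # Add noise calibrated to the clipping norm, average, and update.
    model.zero_grad()
    for p, g_sum in zip(params, summed_grads):
        noise = torch.normal(0.0, noise_multiplier * max_grad_norm, size=p.shape)
        p.grad = (g_sum + noise) / batch_size
    optimizer.step()

# Toy usage on placeholder data (a Linear layer standing in for a fine-tuned model).
model = nn.Linear(16, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
xb, yb = torch.randn(8, 16), torch.randint(0, 2, (8,))
dp_sgd_step(model, nn.CrossEntropyLoss(), xb, yb, optimizer)
```

In practice, the per-example loop would be replaced by vectorized per-sample gradients and the noise scale chosen via a privacy accountant for a target (epsilon, delta); this sketch only shows the clipping-and-noising structure the abstract alludes to.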
