

Contributed talk in Workshop: Privacy in Machine Learning (PriML) 2021

Simple Baselines Are Strong Performers for Differentially Private Natural Language Processing

Xuechen (Chen) Li · Florian Tramer · Percy Liang · Tatsunori Hashimoto


Abstract:

Differentially private learning has seen limited success for deep learning models of text, resulting in a perception that differential privacy may be incompatible with the language model fine-tuning paradigm. We demonstrate that this perception is inaccurate and that, with the right setup, high-performing private models can be learned on moderately sized corpora by directly fine-tuning with differentially private optimization. Our work highlights the important role of hyperparameters, task formulations, and pretrained models. Our analyses also show that the low performance of naive differentially private baselines in prior work is attributable to suboptimal choices in these factors. Empirical results reveal that differentially private optimization does not suffer from dimension-dependent performance degradation with pretrained models and achieves performance on par with state-of-the-art private training procedures and strong non-private baselines.
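The abstract refers to directly fine-tuning with differentially private optimization, i.e., DP-SGD-style training. The sketch below illustrates the core mechanics of one such update step in plain PyTorch (per-example gradient clipping followed by calibrated Gaussian noise). The toy linear model, the `dp_sgd_step` helper, and the `max_grad_norm` and `noise_multiplier` values are illustrative assumptions, not the authors' implementation or the paper's hyperparameters.

```python
# Minimal sketch of one DP-SGD step (illustrative only; not the paper's codebase).
import torch
import torch.nn as nn

def dp_sgd_step(model, loss_fn, xb, yb, optimizer,
                max_grad_norm=1.0, noise_multiplier=1.0):
    """One DP-SGD update: clip each per-example gradient to `max_grad_norm`,
    sum, add Gaussian noise with std `noise_multiplier * max_grad_norm`,
    average over the batch, then take an optimizer step."""
    params = [p for p in model.parameters() if p.requires_grad]
    summed_grads = [torch.zeros_like(p) for p in params]
    batch_size = xb.shape[0]

    for i in range(batch_size):  # microbatches of size 1 give per-example gradients
        model.zero_grad()
        loss = loss_fn(model(xb[i:i+1]), yb[i:i+1])
        loss.backward()
        # Scale this example's gradient so its global norm is at most max_grad_norm.
        per_example_norm = torch.sqrt(sum(p.grad.pow(2).sum() for p in params))
        clip_factor = (max_grad_norm / (per_example_norm + 1e-6)).clamp(max=1.0)
        for g_sum, p in zip(summed_grads, params):
            g_sum += clip_factor * p.grad

    # Add noise calibrated to the clipping norm, average, and update.
    model.zero_grad()
    for p, g_sum in zip(params, summed_grads):
        noise = torch.normal(0.0, noise_multiplier * max_grad_norm, size=p.shape)
        p.grad = (g_sum + noise) / batch_size
    optimizer.step()

# Toy usage on placeholder data (a Linear layer standing in for a fine-tuned model).
model = nn.Linear(16, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
xb, yb = torch.randn(8, 16), torch.randint(0, 2, (8,))
dp_sgd_step(model, nn.CrossEntropyLoss(), xb, yb, optimizer)
```

In practice, the per-example loop would be replaced by vectorized per-sample gradients and the noise scale chosen via a privacy accountant for a target (epsilon, delta); this sketch only shows the clipping-and-noising structure the abstract alludes to.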
