FedML-HE: An Efficient Homomorphic-Encryption-Based Privacy-Preserving Federated Learning System
Abstract
Federated Learning trains machine learning models on distributed devices by aggregating local model updates instead of local data. However, privacy concerns arise as the aggregated local models on the server may reveal sensitive personal information by inversion attacks. Privacy-preserving methods, such as Homomorphic Encryption (HE), then become necessary for FL training. Despite HE's post-quantum security advantages, its applications suffer from impractical overheads, especially for foundation models. In this paper, we present FedHE, the first practical federated learning system with efficient HE-based secure model aggregation. FedHE proposes to selectively encrypt sensitive parameters, significantly reducing both computation and communication overheads during training while providing customizable privacy preservation. Our optimized system demonstrates considerable overhead reduction, particularly for large foundation models (e.g., ~10x reduction for HE-federated training of ResNet-50, and ~40x reduction for BERT), demonstrating the potential for scalable HE-based FL deployment.