Lancelot: Towards Efficient and Privacy-Preserving Byzantine-Robust Federated Learning within Fully Homomorphic Encryption

USENIX Security Symposium

Local Model Poisoning Attacks to Byzantine-Robust Federated Learning

Minghong Fang, Xiaoyu Cao, Jinyuan Jia, Neil Zhenqiang Gong

Published: 2019.11.27

In federated learning, multiple client devices jointly learn a machine learning model: each client device maintains a local model for its local training dataset, while a master device maintains a global model via aggregating the local models from the client devices. The machine learning community recently proposed several federated learning methods that were claimed to be robust against Byzantine failures (e.g., system failures, adversarial manipulations) of certain client devices. In this work, we perform the first systematic study on local model poisoning attacks to federated learning. We assume an attacker has compromised some client devices, and the attacker manipulates the local model parameters on the compromised client devices during the learning process such that the global model has a large testing error rate. We formulate our attacks as optimization problems and apply our attacks to four recent Byzantine-robust federated learning methods. Our empirical results on four real-world datasets show that our attacks can substantially increase the error rates of the models learnt by the federated learning methods that were claimed to be robust against Byzantine failures of some client devices. We generalize two defenses for data poisoning attacks to defend against our local model poisoning attacks. Our evaluation results show that one defense can effectively defend against our attacks in some cases, but the defenses are not effective enough in other cases, highlighting the need for new defenses against our local model poisoning attacks to federated learning.

Tags: poisoning, attack types, model performance evaluation

International Conference on Machine Learning (ICML)

Cited by: 8

Analyzing Federated Learning through an Adversarial Lens

Arjun Nitin Bhagoji, Supriyo Chakraborty, Prateek Mittal, Seraphin Calo

Published: 2018.11.30

Federated learning distributes model training among a multitude of agents, who, guided by privacy concerns, perform training using their local data but share only model parameter updates, for iterative aggregation at the server. In this work, we explore the threat of model poisoning attacks on federated learning initiated by a single, non-colluding malicious agent where the adversarial objective is to cause the model to misclassify a set of chosen inputs with high confidence. We explore a number of strategies to carry out this attack, starting with simple boosting of the malicious agent's update to overcome the effects of other agents' updates. To increase attack stealth, we propose an alternating minimization strategy, which alternately optimizes for the training loss and the adversarial objective. We follow up by using parameter estimation for the benign agents' updates to improve on attack success. Finally, we use a suite of interpretability techniques to generate visual explanations of model decisions for both benign and malicious models and show that the explanations are nearly visually indistinguishable. Our results indicate that even a highly constrained adversary can carry out model poisoning attacks while simultaneously maintaining stealth, thus highlighting the vulnerability of the federated learning setting and the need to develop effective defense strategies.

Tags: poisoning, weight update methods, federated learning
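The boosting strategy named in the abstract of Bhagoji et al. above can be illustrated with a small sketch. It assumes the server averages updates with equal weight 1/n, so a malicious agent that scales its update by n makes the averaged result carry its full intended change. The function names, the boost factor, and the toy vectors are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def aggregate(updates):
    """Server-side equal-weight averaging of agent updates (an assumed rule)."""
    return np.mean(updates, axis=0)

def boosted_update(malicious_delta, num_agents):
    """Scale the malicious update so it survives the 1/num_agents averaging."""
    return num_agents * malicious_delta

# Toy round: 4 benign agents send zero updates, 1 malicious agent boosts.
benign = [np.zeros(3) for _ in range(4)]
target_delta = np.array([0.5, -1.0, 2.0])  # hypothetical adversarial direction
updates = benign + [boosted_update(target_delta, num_agents=5)]
global_delta = aggregate(updates)
```

In this toy round the benign updates are zero, so the averaged update equals the adversary's intended change. With non-zero benign updates the boosted term would be added to their average instead.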

CoRR

How To Backdoor Federated Learning

Eugene Bagdasaryan, Andreas Veit, Yiqing Hua, Deborah Estrin, Vitaly Shmatikov

Published: 2018

International Conference on Learning Representations

DBA: Distributed backdoor attacks against federated learning

C. Xie, K. Huang, P.-Y. Chen, B. Li

Published: 2020

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI)

Suppressing Poisoning Attacks on Federated Learning for Medical Imaging

Naif Alkhunaizi, Dmitry Kamzolov, Martin Takáč, Karthik Nandakumar

Published: 2022.7.15

Collaboration among multiple data-owning entities (e.g., hospitals) can accelerate the training process and yield better machine learning models due to the availability and diversity of data. However, privacy concerns make it challenging to exchange data while preserving confidentiality. Federated Learning (FL) is a promising solution that enables collaborative training through exchange of model parameters instead of raw data. However, most existing FL solutions work under the assumption that participating clients are honest and thus can fail against poisoning attacks from malicious parties, whose goal is to deteriorate the global model performance. In this work, we propose a robust aggregation rule called Distance-based Outlier Suppression (DOS) that is resilient to Byzantine failures. The proposed method computes the distance between local parameter updates of different clients and obtains an outlier score for each client using Copula-based Outlier Detection (COPOD). The resulting outlier scores are converted into normalized weights using a softmax function, and a weighted average of the local parameters is used for updating the global model. DOS aggregation can effectively suppress parameter updates from malicious clients without the need for any hyperparameter selection, even when the data distributions are heterogeneous. Evaluation on two medical imaging datasets (CheXpert and HAM10000) demonstrates the higher robustness of the DOS method against a variety of poisoning attacks in comparison to other state-of-the-art methods. The code is available at https://github.com/Naiftt/SPAFD.

Tags: poisoning attacks, Byzantine resilience, computational efficiency
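The pipeline described in the DOS abstract above (pairwise distances, per-client outlier scores, softmax weights, weighted average) can be sketched roughly as follows. Two parts are assumptions made for illustration. First, the scorer is a simple stand-in (mean Euclidean distance to the other clients), not COPOD. Second, the softmax is applied to negated scores so that larger scores receive smaller weights. All function names are hypothetical.

```python
import numpy as np

def pairwise_distances(updates):
    """Euclidean distance between every pair of client updates."""
    diff = updates[:, None, :] - updates[None, :, :]
    return np.linalg.norm(diff, axis=-1)

def outlier_scores(dist):
    """Stand-in scorer (NOT COPOD): mean distance to the other clients."""
    n = dist.shape[0]
    return dist.sum(axis=1) / (n - 1)

def softmax_weights(scores):
    """Softmax over negated scores (assumed sign convention)."""
    z = -scores
    z = z - z.max()  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def dos_like_aggregate(updates):
    """Weighted average of client updates using outlier-derived weights."""
    weights = softmax_weights(outlier_scores(pairwise_distances(updates)))
    return weights @ updates, weights

# Three similar clients and one far-away (hypothetical) poisoned update.
updates = np.array([[1.0, 1.0], [1.1, 0.9], [0.9, 1.1], [50.0, -50.0]])
aggregate, weights = dos_like_aggregate(updates)
```

In this toy input the fourth client receives a weight close to zero, so the aggregate stays near the three similar updates.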

Advances in neural information processing systems

Machine learning with adversaries: Byzantine tolerant gradient descent

Blanchard, P., El Mhamdi, E. M., Guerraoui, R., Stainer, J.

Published: 2017

Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent

Cited by: 8

Yudong Chen, Lili Su, Jiaming Xu

Published: 2017.5.16

We consider the problem of distributed statistical machine learning in adversarial settings, where some unknown and time-varying subset of working machines may be compromised and behave arbitrarily to prevent an accurate model from being learned. This setting captures the potential adversarial attacks faced by Federated Learning -- a modern machine learning paradigm that is proposed by Google researchers and has been intensively studied for ensuring user privacy. Formally, we focus on a distributed system consisting of a parameter server and $m$ working machines. Each working machine keeps $N/m$ data samples, where $N$ is the total number of samples. The goal is to collectively learn the underlying true model parameter of dimension $d$. In classical batch gradient descent methods, the gradients reported to the server by the working machines are aggregated via simple averaging, which is vulnerable to a single Byzantine failure. In this paper, we propose a Byzantine gradient descent method based on the geometric median of means of the gradients. We show that our method can tolerate $q \le (m-1)/2$ Byzantine failures, and the parameter estimate converges in $O(\log N)$ rounds with an estimation error of $\sqrt{d(2q+1)/N}$, hence approaching the optimal error rate $\sqrt{d/N}$ in the centralized and failure-free setting. The total computational complexity of our algorithm is of $O((Nd/m) \log N)$ at each working machine and $O(md + kd \log^3 N)$ at the central server, and the total communication cost is of $O(m d \log N)$. We further provide an application of our general results to the linear regression problem. A key challenge in the above problem is that Byzantine failures create arbitrary and unspecified dependency among the iterations and the aggregated gradients. We prove that the aggregated gradient converges uniformly to the true gradient function.

Tags: model performance evaluation, robustness, distributed learning
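The aggregation rule named in the abstract of Chen, Su, and Xu above, the geometric median of means, can be sketched as follows. This is a minimal illustration under assumptions: the worker gradients are split into `k` batches in index order, and the geometric median is approximated with Weiszfeld iterations. The batch count, the iteration limits, and the toy data are choices made here, not taken from the paper.

```python
import numpy as np

def geometric_median(points, iters=100, eps=1e-8):
    """Approximate the geometric median with Weiszfeld iterations."""
    y = points.mean(axis=0)
    for _ in range(iters):
        d = np.linalg.norm(points - y, axis=1)
        w = 1.0 / np.maximum(d, eps)  # guard against division by zero
        y_new = (w[:, None] * points).sum(axis=0) / w.sum()
        if np.linalg.norm(y_new - y) < eps:
            return y_new
        y = y_new
    return y

def median_of_means(gradients, k):
    """Split gradients into k batches, average each, take the geometric median."""
    batches = np.array_split(gradients, k)
    means = np.stack([b.mean(axis=0) for b in batches])
    return geometric_median(means)

rng = np.random.default_rng(0)
true_grad = np.array([1.0, -2.0])
grads = true_grad + 0.01 * rng.standard_normal((10, 2))
grads[0] = [1e6, 1e6]  # one Byzantine worker reports an arbitrary vector
robust = median_of_means(grads, k=5)
naive = grads.mean(axis=0)
```

Simple averaging (`naive`) is dragged far from the true gradient by the single corrupted report, while the median-of-means estimate stays close to it because only one of the five batch means is contaminated.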

International Conference on Machine Learning (ICML)

The Hidden Vulnerability of Distributed Learning in Byzantium

El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault

Published: 2018.2.22

While machine learning is going through an era of celebrated success, concerns have been raised about the vulnerability of its backbone: stochastic gradient descent (SGD). Recent approaches have been proposed to ensure the robustness of distributed SGD against adversarial (Byzantine) workers sending poisoned gradients during the training phase. Some of these approaches have been proven Byzantine-resilient: they ensure the convergence of SGD despite the presence of a minority of adversarial workers. We show in this paper that convergence is not enough. In high dimension $d \gg 1$, an adversary can build on the loss function's non-convexity to make SGD converge to ineffective models. More precisely, we bring to light that existing Byzantine-resilient schemes leave a margin of poisoning of $\Omega(f(d))$, where $f(d)$ increases at least like $\sqrt{d}$. Based on this leeway, we build a simple attack, and experimentally show its strong to utmost effectivity on CIFAR-10 and MNIST. We introduce Bulyan, and prove it significantly reduces the attacker's leeway to a narrow $O(1/\sqrt{d})$ bound. We empirically show that Bulyan does not suffer the fragility of existing aggregation rules and, at a reasonable cost in terms of required batch size, achieves convergence as if only non-Byzantine gradients had been used to update the model.

Tags: machine learning methods, poisoning, adversarial attacks

Byzantine-Resilient Stochastic Gradient Descent for Distributed Learning: A Lipschitz-Inspired Coordinate-wise Median Approach

Haibo Yang, Xin Zhang, Minghong Fang, Jia Liu

Published: 2019.9.10

In this work, we consider the resilience of distributed algorithms based on stochastic gradient descent (SGD) in distributed learning with potentially Byzantine attackers, who could send arbitrary information to the parameter server to disrupt the training process. Toward this end, we propose a new Lipschitz-inspired coordinate-wise median approach (LICM-SGD) to mitigate Byzantine attacks. We show that our LICM-SGD algorithm can resist up to half of the workers being Byzantine attackers, while still converging almost surely to a stationary region in non-convex settings. Also, our LICM-SGD method does not require any information about the number of attackers and the Lipschitz constant, which makes it attractive for practical implementations. Moreover, our LICM-SGD method enjoys the optimal $O(md)$ computational time-complexity in the sense that the time-complexity is the same as that of the standard SGD under no attacks. We conduct extensive experiments to show that our LICM-SGD algorithm consistently outperforms existing methods in training multi-class logistic regression and convolutional neural networks with MNIST and CIFAR-10 datasets. In our experiments, LICM-SGD also achieves a much faster running time thanks to its low computational time-complexity.

Tags: Byzantine attack countermeasures, convergence guarantees, computational efficiency
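The coordinate-wise median that LICM-SGD builds on can be illustrated in a few lines. The sketch below shows only the plain coordinate-wise median aggregation. The Lipschitz-inspired part of the method described in the abstract above is not reproduced, and the toy gradients are made up for illustration.

```python
import numpy as np

def coordinate_wise_median(gradients):
    """Aggregate worker gradients by taking the median of each coordinate."""
    return np.median(gradients, axis=0)

# Five workers, two of which (hypothetically) send arbitrary values.
gradients = np.array([
    [0.9, -2.1],
    [1.0, -2.0],
    [1.1, -1.9],
    [100.0, 100.0],
    [-100.0, 100.0],
])
aggregated = coordinate_wise_median(gradients)
mean_aggregated = gradients.mean(axis=0)
```

Each coordinate of `aggregated` is a value reported by an honest worker, whereas the plain mean is pulled far from the honest gradients by the two corrupted rows.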

International conference on machine learning

Byzantine-robust distributed learning: Towards optimal statistical rates

Yin, D., Chen, Y., Kannan, R., Bartlett, P.

Published: 2018

Deep Leakage from Gradients

Federated Learning

Ligeng Zhu, Zhijian Liu, Song Han

Published: 2019.6.21

Exchanging gradients is a widely used method in modern multi-node machine learning systems (e.g., distributed training, collaborative learning). For a long time, people believed that gradients are safe to share: i.e., the training data will not be leaked by gradient exchange. However, we show that it is possible to obtain the private training data from the publicly shared gradients. We name this leakage as Deep Leakage from Gradient and empirically validate the effectiveness on both computer vision and natural language processing tasks. Experimental results show that our attack is much stronger than previous approaches: the recovery is pixel-wise accurate for images and token-wise matching for texts. We want to raise people's awareness to rethink the gradient's safety. Finally, we discuss several possible strategies to prevent such deep leakage. The most effective defense method is gradient pruning.

Tags: privacy protection, defensive deception, adversarial attacks

iDLG: Improved deep leakage from gradients

Bo Zhao, Konda Reddy Mopuri, Hakan Bilen

IEEE Transactions on Information Forensics and Security

Federated learning with differential privacy: Algorithms and performance analysis

Kang Wei, Jun Li, Ming Ding, Chuan Ma, Howard H. Yang, Farhad Farokhi, Shi Jin, Tony Q. S. Quek, H. Vincent Poor

2017 ACM SIGSAC Conference on Computer and Communications Security

Practical secure aggregation for privacy-preserving machine learning

K. Bonawitz

Published: 2017

Advances in Cryptology – ASIACRYPT 2017

Homomorphic encryption for arithmetic of approximate numbers

Jung Hee Cheon, Andrey Kim, Miran Kim, Yongsoo Song

Published: 2017

2020 USENIX annual technical conference (USENIX ATC 20)

BatchCrypt: Efficient homomorphic encryption for Cross-Silo federated learning

C. Zhang

Published: 2020

15th USENIX Symposium on Operating Systems Design and Implementation (OSDI 21)

Oort: efficient federated learning via guided participant selection

Fan Lai, Xiangfeng Zhu, Harsha V Madhyastha, Mosharaf Chowdhury

Published: 2021

CoRR

Flower: A friendly federated learning research framework

Daniel J. Beutel, Taner Topal, Akhil Mathur, Xinchi Qiu, Titouan Parcollet, Nicholas D. Lane

Multi-digit number recognition from street view imagery using deep convolutional neural networks

IEEE Data Eng. Bull.

NVIDIA FLARE: Federated learning from simulation to real-world

H. R. Roth

Published: 2022

arXiv

FedML: A research library and benchmark for federated machine learning

C. He, S. Li, J. So, M. Zhang, H. Wang, X. Wang, P. Vepakomma, A. Singh, H. Qiu, L. Shen, P. Zhao, Y. Kang, Y. Liu, R. Raskar, Q. Yang, M. Annavaram, S. Avestimehr

Published: 2020

Proceedings of the 10th Workshop on Encrypted Computing & Applied Homomorphic Cryptography

OpenFHE: Open-source fully homomorphic encryption library

A. Al Badawi

Published: 2022

Proceedings of the IEEE

Gradient-based learning applied to document recognition

Y. LeCun, L. Bottou, Y. Bengio, P. Haffner

Published: 1998

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Deep residual learning for image recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun

Published: 2016

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

MPAF: Model poisoning attacks to federated learning based on fake clients

Xiaoyu Cao, Neil Zhenqiang Gong

Published: 2022

Cryptol. ePrint Arch.

Implementing and benchmarking word-wise homomorphic encryption schemes on GPU

H. Yang

Published: 2023

IEEE Signal Processing Magazine

The MNIST database of handwritten digit images for machine learning research

Li Deng

Published: 2012

Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms

H. Xiao, K. Rasul, R. Vollgraf

Published: 2017

Pacific-Asia Conference on Knowledge Discovery and Data Mining

PGADA: perturbation-guided adversarial alignment for few-shot learning under the support-query shift

S. Jiang, W. Ding, H.-W. Chen, M.-S. Chen

Published: 2022

CoRR

I. J. Goodfellow, Y. Bulatov, J. Ibarz, S. Arnoud, V. D. Shet

Published: 2013

2022 IEEE 38th Int. Conf. on Data Eng. (ICDE)

Federated learning on non-IID data silos: An experimental study

Q. Li, Y. Diao, Q. Chen, B. He

Published: 2021

2021 IEEE 18th Int. Symp. on Biomed. Imaging (ISBI)

MedMNIST classification decathlon: A lightweight AutoML benchmark for medical image analysis

J. Yang, R. Shi, B. Ni

Published: 2020

Sci. Data

MedMNIST v2 - a large-scale lightweight benchmark for 2D and 3D biomedical image classification

J. Yang

Published: 2021

arXiv

Cited by: 1

Communication-Efficient Learning of Deep Networks from Decentralized Data

H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas

Published: 2016.2.18

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks based on iterative model averaging, and conduct an extensive empirical evaluation, considering five different model architectures and four datasets. These experiments demonstrate the approach is robust to the unbalanced and non-IID data distributions that are a defining characteristic of this setting. Communication costs are the principal constraint, and we show a reduction in required communication rounds by 10-100x as compared to synchronized stochastic gradient descent.

Tags: deep learning methods, federated learning, communication cost reduction
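The iterative model averaging described in the abstract of McMahan et al. above can be sketched on a toy problem. The example assumes a noise-free least-squares model, full participation of three clients in every round, and averaging weighted by each client's number of local samples. The model, learning rate, epoch count, and data are illustrative choices rather than the paper's experimental setup.

```python
import numpy as np

def local_update(weights, X, y, lr=0.1, epochs=5):
    """A few epochs of gradient descent on a least-squares loss (toy local training)."""
    w = weights.copy()
    for _ in range(epochs):
        grad = X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w

def federated_average(client_weights, client_sizes):
    """Average client models, weighting each by its number of local samples."""
    sizes = np.asarray(client_sizes, dtype=float)
    return np.average(np.stack(client_weights), axis=0, weights=sizes)

rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])
clients = []
for n in (20, 50, 80):  # unbalanced client datasets
    X = rng.standard_normal((n, 2))
    clients.append((X, X @ true_w))

global_w = np.zeros(2)
for _ in range(50):  # communication rounds
    local = [local_update(global_w, X, y) for X, y in clients]
    global_w = federated_average(local, [len(y) for _, y in clients])
```

Running several local epochs before each averaging step is what reduces the number of communication rounds compared with sending one gradient per round. In this toy setting all clients share the same underlying model, so the averaged model approaches `true_w`.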

IEEE Transactions on Information Forensics and Security

Drynx: Decentralized, Secure, Verifiable System for Statistical Queries and Machine Learning on Distributed Datasets

David Froelicher, Juan Ramón Troncoso-Pastoriza, Joao Sa Sousa, Jean-Pierre Hubaux

Published: 2020

J. medical Internet research

Web-based privacy-preserving multicenter medical data analysis tools via threshold homomorphic encryption: design and development study

Y. Lu, T. Zhou, Y. Tian, S. Zhu, J. Li

Published: 2020

Secure Human Action Recognition by Encrypted Neural Network Inference

Miran Kim, Xiaoqian Jiang, Kristin Lauter, Elkhan Ismayilzada, Shayan Shams

Published: 2021.4.19

Advanced computer vision technology can provide near real-time home monitoring to support "aging in place" by detecting falls and symptoms related to seizures and stroke. Affordable webcams, together with cloud computing services (to run machine learning algorithms), can potentially bring significant social benefits. However, it has not been deployed in practice because of privacy concerns. In this paper, we propose a strategy that uses homomorphic encryption to resolve this dilemma, which guarantees information confidentiality while retaining action detection. Our protocol for secure inference can distinguish falls from activities of daily living with 86.21% sensitivity and 99.14% specificity, with an average inference latency of 1.2 seconds and 2.4 seconds on real-world test datasets using small and large neural nets, respectively. We show that our method enables a 613x speedup over the latency-optimized LoLa and achieves an average of 3.1x throughput increase in secure inference compared to the throughput-optimized nGraph-HE2.

Tags: encryption techniques, data protection methods, data management systems

2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)

BalanceFL: Addressing class imbalance in long-tail federated learning

X. Shuai

Published: 2022

ACM Transactions on Sens. Networks

ClusterFL: A clustering-based federated learning system for human activity recognition

X. Ouyang, Z. Xie, J. Zhou, G. Xing, J. Huang

Published: 2022

IACR Cryptology ePrint Archive

Semi-parallel logistic regression for GWAS on encrypted data

Kim, M., Song, Y., Li, B., Micciancio, D.

Published: 2019

Cryptol. ePrint Arch.

Xnet: A real-time unified secure inference framework using homomorphic encryption

H. Yang

Published: 2023

Annual International Conference on the Theory and Applications of Cryptographic Techniques

Fully homomorphic encryption with polylog overhead

C. Gentry, S. Halevi, N. P. Smart

Published: 2012

Advances in Cryptology - CRYPTO 2018

Faster homomorphic linear transformations in HElib

S. Halevi, V. Shoup

Published: 2018

The computer journal

A simplex method for function minimization

J. A. Nelder, R. Mead

Published: 1965

Scientific data

The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions

P. Tschandl, C. Rosendahl, H. Kittler

Published: 2018

bioRxiv

In situ classification of cell types in human kidney tissue using 3D nuclear staining

A. Woloshuk

Doklady Akademii Nauk

Multiplication of many-digital numbers by automatic computers

A. A. Karatsuba, Y. P. Ofman

Published: 1962

IEEE Transactions on Signal Process.

Robust aggregation for federated learning

K. Pillutla, S. M. Kakade, Z. Harchaoui

Published: 2022

Computing Research Repository (CoRR)

Combining Differential Privacy and Byzantine Resilience in Distributed SGD

Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, Sebastien Rouault, John Stephan

Published: 2021.10.8

Privacy and Byzantine resilience (BR) are two crucial requirements of modern-day distributed machine learning. The two concepts have been extensively studied individually but the question of how to combine them effectively remains unanswered. This paper contributes to addressing this question by studying the extent to which the distributed SGD algorithm, in the standard parameter-server architecture, can learn an accurate model despite (a) a fraction of the workers being malicious (Byzantine), and (b) the other fraction, whilst being honest, providing noisy information to the server to ensure differential privacy (DP). We first observe that the integration of standard practices in DP and BR is not straightforward. In fact, we show that many existing results on the convergence of distributed SGD under Byzantine faults, especially those relying on $(\alpha,f)$-Byzantine resilience, are rendered invalid when honest workers enforce DP. To circumvent this shortcoming, we revisit the theory of $(\alpha,f)$-BR to obtain an approximate convergence guarantee. Our analysis provides key insights on how to improve this guarantee through hyperparameter optimization. Essentially, our theoretical and empirical results show that (1) an imprudent combination of standard approaches to DP and BR might be fruitless, but (2) by carefully re-tuning the learning algorithm, we can obtain reasonable learning accuracy while simultaneously guaranteeing DP and BR.

Tags: DP-SGD, algorithm design, distributed learning

Proc. 2021 ACM Symp. on Princ. Distributed Comput.

Differential privacy and Byzantine resilience in SGD: Do they add up?

R. Guerraoui, N. Gupta, R. Pinot, S. Rouault, J. Stephan

Published: 2021

arXiv

Bridging differential privacy and Byzantine-robustness via model aggregation

H. Zhu, Q. Ling

Published: 2022

USENIX

FLAME: taming backdoors in federated learning

Thien Duc Nguyen, Phillip Rieger, Huili Chen, Hossein Yalame, Helen Mollering, Hossein Fereidooni, Samuel Marchal, Markus Miettinen, Azalia Mirhoseini, Shaza Zeitouni, Farinaz Koushanfar, Ahmad-Reza Sadeghi, Thomas Schneider

Published: 2022

2023 IEEE Symposium on Security and Privacy (SP)

ELSA: Secure aggregation for federated learning with malicious actors

M. Rathee, C. Shen, S. Wagh, R. A. Popa

Published: 2023

Cryptol. ePrint Arch.

SafeFL: MPC-friendly framework for private and robust federated learning

T. Gehlhar

Published: 2023

Computing Research Repository (CoRR)

Cited by: 2

Secure Byzantine-Robust Machine Learning

Lie He, Sai Praneeth Karimireddy, Martin Jaggi

Published: 2020.6.9

Increasingly, machine learning systems are being deployed to edge servers and devices (e.g. mobile phones) and trained in a collaborative manner. Such distributed/federated/decentralized training raises a number of concerns about the robustness, privacy, and security of the procedure. While extensive work has been done in tackling robustness, privacy, or security individually, their combination has rarely been studied. In this paper, we propose a secure two-server protocol that offers both input privacy and Byzantine-robustness. In addition, this protocol is communication-efficient, fault-tolerant and enjoys local differential privacy.

Tags: federated learning, privacy evaluation, MPC algorithms

IEEE Journal on Selected Areas in Communications

Byzantine-resilient secure federated learning

J. So, B. Guler, A. S. Avestimehr

Published: 2020

IEEE Transactions on Information Forensics and Security

ShieldFL: Mitigating model poisoning attacks in privacy-preserving federated learning

Zhuoran Ma, Jianfeng Ma, Yinbin Miao, Yingjiu Li, Robert H Deng

Published: 2022

IACR Cryptology ePrint Archive

FLOD: Oblivious defender for private Byzantine-robust federated learning with dishonest-majority

Y. Dong, X. Chen, K. Li, D. Wang, S. Zeng

Published: 2021

IEEE Transactions on Information Forensics and Security

Privacy-enhanced federated learning against poisoning adversaries

X. Liu, H. Li, G. Xu, Z. Chen, X. Huang, R. Lu

Published: 2021