Federated learning (FL) has become one of the standard approaches for
deploying machine learning models on edge devices, where private training data
are distributed across clients, and a shared model is learned by aggregating
locally computed updates from each client. While this paradigm enhances
communication efficiency by only requiring updates at the end of each training
epoch, the transmitted model updates remain vulnerable to malicious tampering,
posing risks to the integrity of the global model. Although current digital
signature algorithms can protect these communicated model updates, they fail to
ensure quantum security in the era of large-scale quantum computing.
Fortunately, various post-quantum cryptography algorithms have been developed
to address this vulnerability, especially the three NIST-standardized
algorithms - Dilithium, FALCON, and SPHINCS+. In this work, we empirically
investigate the impact of these three NIST-standardized PQC algorithms for
digital signatures within the FL procedure, covering a wide range of models,
tasks, and FL settings. Our results indicate that Dilithium stands out as the
most efficient PQC algorithm for digital signature in federated learning.
Additionally, we offer an in-depth discussion of the implications of our
findings and potential directions for future research.
Risc-v galois field isa extension for non-binary error-correction codes and classical and post-quantum cryptography
Y.-M. Kuo, F. Garćıa-Herrero, O. Ruano, J. A. Maestro
Published: 2023
2021 IEEE/ACM Symposium on Edge Computing (SEC)
Exploring system performance of continual learning for mobile and embedded sensing applications
Y. D. Kwon, J. Chauhan, A. Kumar, P. H. HKUST, C. Mascolo
Published: 2021
2018 IEEE 8th Annual Computing and Communication Workshop and Conference (CCWC)
Next-generation of virtual personal assistants (microsoft cortana, apple siri, amazon alexa and google home)
V. Kepuska, G. Bohouta
Published: 2018
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Personalized speech recognition on mobile devices
I. McGraw, R. Prabhavalkar, R. Alvarez, M. G. Arenas, K. Rao, D. Ryback, O. Alsharif, H. Sak, A. Gruenstein, F. Beaufays, C. Parada
Published: 2016
arxiv
被引用数 1
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas
Published: 2016.2.18
Modern mobile devices have access to a wealth of data suitable for learning
models, which in turn can greatly improve the user experience on the device.
For example, language models can improve speech recognition and text entry, and
image models can automatically select good photos. However, this rich data is
often privacy sensitive, large in quantity, or both, which may preclude logging
to the data center and training there using conventional approaches. We
advocate an alternative that leaves the training data distributed on the mobile
devices, and learns a shared model by aggregating locally-computed updates. We
term this decentralized approach Federated Learning.
We present a practical method for the federated learning of deep networks
based on iterative model averaging, and conduct an extensive empirical
evaluation, considering five different model architectures and four datasets.
These experiments demonstrate the approach is robust to the unbalanced and
non-IID data distributions that are a defining characteristic of this setting.
Communication costs are the principal constraint, and we show a reduction in
required communication rounds by 10-100x as compared to synchronized stochastic
gradient descent.
International Conference on Learning Representations (ICLR)
Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data
Nicolas Papernot, Martín Abadi, Úlfar Erlingsson, Ian Goodfellow, Kunal Talwar
Published: 2016.10.19
Some machine learning applications involve training data that is sensitive,
such as the medical histories of patients in a clinical trial. A model may
inadvertently and implicitly store some of its training data; careful analysis
of the model may therefore reveal sensitive information.
To address this problem, we demonstrate a generally applicable approach to
providing strong privacy guarantees for training data: Private Aggregation of
Teacher Ensembles (PATE). The approach combines, in a black-box fashion,
multiple models trained with disjoint datasets, such as records from different
subsets of users. Because they rely directly on sensitive data, these models
are not published, but instead used as "teachers" for a "student" model. The
student learns to predict an output chosen by noisy voting among all of the
teachers, and cannot directly access an individual teacher or the underlying
data or parameters. The student's privacy properties can be understood both
intuitively (since no single teacher and thus no single dataset dictates the
student's training) and formally, in terms of differential privacy. These
properties hold even if an adversary can not only query the student but also
inspect its internal workings.
Compared with previous work, the approach imposes only weak assumptions on
how teachers are trained: it applies to any model, including non-convex models
like DNNs. We achieve state-of-the-art privacy/utility trade-offs on MNIST and
SVHN thanks to an improved privacy analysis and semi-supervised learning.