Federated Learning on Transcriptomic Data: Model Quality and Performance Trade-Offs
Abstract
Machine learning on large-scale genomic or transcriptomic data is important for many novel health applications. For example, precision medicine tailors medical treatments to patients on the basis of individual biomarkers, cellular and molecular states, and similar characteristics. However, the required data is sensitive, voluminous, heterogeneous, and typically distributed across locations where dedicated machine learning hardware is not available. For privacy and regulatory reasons, it is also problematic to aggregate all data at a trusted third party.

Federated learning is a promising solution to this dilemma because it enables decentralized, collaborative machine learning without exchanging raw data. In this paper, we perform comparative experiments with the federated learning frameworks TensorFlow Federated and Flower. Our test cases are the training of disease prognosis and cell type classification models. We train the models on distributed transcriptomic data, considering both data heterogeneity and architectural heterogeneity. We measure model quality, robustness against privacy-enhancing noise, computational performance, and resource overhead. The two frameworks have different strengths. However, our experiments confirm that both can readily build models on transcriptomic data without transferring personal raw data to a third party with abundant computational resources.
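To make the idea of training "without exchanging raw data" concrete, the following is a minimal sketch of federated averaging in plain Python. It is an illustration under stated assumptions, not the paper's experimental setup and not the API of TensorFlow Federated or Flower: the one-dimensional linear model, the synthetic client data, and the names `local_update`, `federated_average`, `make_client_data`, and `train` are all hypothetical. Each client fits the shared model on its own data, and only the resulting parameters are sent to the server, which combines them weighted by local sample count.

```python
import random


def local_update(weights, data, lr=0.1, epochs=5):
    """Run a few gradient-descent epochs for y = w*x + b on one client's data.

    Only the updated parameters are returned; the raw (x, y) pairs stay local.
    """
    w, b = weights
    n = len(data)
    for _ in range(epochs):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return (w, b)


def federated_average(client_weights, client_sizes):
    """Combine client parameters, weighting each client by its sample count."""
    total = sum(client_sizes)
    w = sum(cw[0] * n for cw, n in zip(client_weights, client_sizes)) / total
    b = sum(cw[1] * n for cw, n in zip(client_weights, client_sizes)) / total
    return (w, b)


def make_client_data(rng, n, x_low, x_high):
    """Hypothetical synthetic data: y = 2x + 1 plus small Gaussian noise."""
    xs = [rng.uniform(x_low, x_high) for _ in range(n)]
    return [(x, 2.0 * x + 1.0 + rng.gauss(0.0, 0.01)) for x in xs]


def train(rounds=200, seed=0):
    """Simulate a server and three clients with differently distributed inputs."""
    rng = random.Random(seed)
    # Clients differ in size and input range, a simple form of data heterogeneity.
    clients = [
        make_client_data(rng, 20, -1.0, 0.0),
        make_client_data(rng, 40, 0.0, 1.0),
        make_client_data(rng, 10, -1.0, 1.0),
    ]
    global_weights = (0.0, 0.0)
    for _ in range(rounds):
        # Each client starts from the current global model and trains locally.
        updates = [local_update(global_weights, data) for data in clients]
        # The server sees only parameters and sample counts, never the data.
        global_weights = federated_average(updates, [len(d) for d in clients])
    return global_weights
```

Under these assumptions, calling `train()` should return a slope and intercept close to the values used to generate the synthetic data (2 and 1). Real frameworks add client selection, communication, and optional privacy-enhancing noise on top of this basic loop.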