Data fabric is an automated and AI-driven data fusion approach to accomplish
data management unification without moving data to a centralized location for
solving complex data problems. In a Federated learning architecture, the global
model is trained based on the learned parameters of several local models that
eliminate the necessity of moving data to a centralized repository for machine
learning. This paper introduces a secure approach for medical image analysis
using federated learning and partially homomorphic encryption within a
distributed data fabric architecture. With this method, multiple parties can
collaborate in training a machine-learning model without exchanging raw data
but using the learned or fused features. The approach complies with laws and
regulations such as HIPAA and GDPR, ensuring the privacy and security of the
data. The study demonstrates the method's effectiveness through a case study on
pituitary tumor classification, achieving a significant level of accuracy.
However, the primary focus of the study is on the development and evaluation of
federated learning and partially homomorphic encryption as tools for secure
medical image analysis. The results highlight the potential of these techniques
to be applied to other privacy-sensitive domains and contribute to the growing
body of research on secure and privacy-preserving machine learning.
Efficient privacy-preserving access control scheme in electronic health records system
Y. Ming, T. Zhang
Published: 2018
2020 International Conferences on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData) and IEEE Congress on Cybermatics (Cybermatics)
Research and design of square kilometer array astronomical data management model based on fabric
J. Fu, J. Xu, S. Zhang, C. Zhang
Published: 2020
Advances in Intelligent Systems and Computing
Preserving privacy of data in distributed systems using homomorphic encryption
P. Kalyani, M. Masooda, P. Namrata
Published: 2021
Complex intell. syst.
A systematic review of homomorphic encryption and its contributions in healthcare industry
in 2021 Reconciling Data Analytics, Automation, Privacy, and Security: A Big Data Challenge (RDAAPS)
A secure federated learning framework using homomorphic encryption and verifiable computing
A. Madi, O. Stan, A. Mayoue, A. Grivet-Sebert, C. Gouy-Pailler, R. Sirdey
Published: 2021
Computer Science and Information Systems
A homomorphic-encryption-based vertical federated learning scheme for risk management
W. Ou, J. Zeng, Z. Guo, W. Yan, D. Liu, S. Fuentes
Published: 2020
in 2018 Second World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4)
Decentralized e-health architecture for boosting healthcare analytics
I. Kotsiuba, A. Velvkzhanin, Y. Yanovich, I. S. Bandurova, Y. Dyachenko, V. Zhygulin
Published: 2018
IEEE Access
Optimizing the electronic health records through big data analytics: A knowledge-based view
C. Zhang, R. Ma, S. Sun, Y. Li, Y. Wang, Z. Yan
Published: 2019
Sensors (Basel)
Imtidad: A reference architecture and a case study on developing distributed AI services for skin disease diagnosis over cloud, fog and edge
N. Janbi, R. Mehmood, I. Katib, A. Albeshri, J. M. Corchado, T. Yigitcanlar
Published: 2022
Technol. Forecast. Soc. Change
Distributed, decentralized, and democratized artificial intelligence
G. A. Montes, B. Goertzel
Published: 2018
2016 IEEE International Conference on Big Data (Big Data)
A novel bigdata processing framework for healthcare applications: Big-data-healthcare-in-a-box
F. Rahman, M. Slepian, A. Mitra
Published: 2016
in 2018 Fifth HCT Information Technology Trends (ITT)
Analysis of big data cloud computing environment on healthcare organizations by implementing hadoop clusters
M. J. Kaur, V. P. Mishra
Published: 2018
Int Res J Eng Technol
Impact of cloud computing on health care
V. K. Nigam, S. Bhatia
Published: 2016
Rethinking the meaning of cloud computing for health care: A taxonomic perspective and future research directions
F. Gao, S. Thiebes, A. Sunyaev
Published: 2018
IEEE Access
A security model for preserving the privacy of medical big data in a healthcare cloud using a fog computing facility with pairing-based cryptography
H. A. Al Hamid, S. M. M. Rahman, M. S. Hossain, A. Almogren, A. Alamri
Published: 2017
IEEE trans. serv. comput.
Trustbased scheduling framework for big data processing with MapReduce
T. D. Dang, D. Hoang, D. N. Nguyen
Published: 2022
Brain tumor mri dataset
Msoud Nickparvar
Published: 2021
A Practical Guide, 1st Ed., Cham: Springer International Publishing
The EU general data protection regulation (GDPR)
Paul Voigt, Axel Von dem Bussche
Published: 2017
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Published: 2016
3rd International Conference on Learning Representations
Very deep convolutional networks for large-scale image recognition
K. Simonyan, A. Zisserman
Published: 2015
arxiv
被引用数 1
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas
Published: 2016.2.18
Modern mobile devices have access to a wealth of data suitable for learning
models, which in turn can greatly improve the user experience on the device.
For example, language models can improve speech recognition and text entry, and
image models can automatically select good photos. However, this rich data is
often privacy sensitive, large in quantity, or both, which may preclude logging
to the data center and training there using conventional approaches. We
advocate an alternative that leaves the training data distributed on the mobile
devices, and learns a shared model by aggregating locally-computed updates. We
term this decentralized approach Federated Learning.
We present a practical method for the federated learning of deep networks
based on iterative model averaging, and conduct an extensive empirical
evaluation, considering five different model architectures and four datasets.
These experiments demonstrate the approach is robust to the unbalanced and
non-IID data distributions that are a defining characteristic of this setting.
Communication costs are the principal constraint, and we show a reduction in
required communication rounds by 10-100x as compared to synchronized stochastic
gradient descent.