AIセキュリティポータル K Program
martFL: Enabling Utility-Driven Data Marketplace with a Robust and Verifiable Federated Learning Architecture
Share
Abstract
The development of machine learning models requires a large amount of training data. Data marketplaces are essential for trading high-quality, private-domain data not publicly available online. However, due to growing data privacy concerns, direct data exchange is inappropriate. Federated Learning (FL) is a distributed machine learning paradigm that exchanges data utilities (in form of local models or gradients) among multiple parties without directly sharing the raw data. However, several challenges exist when applying existing FL architectures to construct a data marketplace: (i) In existing FL architectures, Data Acquirers (DAs) cannot privately evaluate local models from Data Providers (DPs) prior to trading; (ii) Model aggregation protocols in existing FL designs struggle to exclude malicious DPs without "overfitting" to the DA's (possibly biased) root dataset; (iii) Prior FL designs lack a proper billing mechanism to enforce the DA to fairly allocate the reward according to contributions made by different DPs. To address above challenges, we propose martFL, the first federated learning architecture that is specifically designed to enable a secure utility-driven data marketplace. At a high level, martFL is powered by two innovative designs: (i) a quality-aware model aggregation protocol that achieves robust local model aggregation even when the DA's root dataset is biased; (ii) a verifiable data transaction protocol that enables the DA to prove, both succinctly and in zero-knowledge, that it has faithfully aggregates the local models submitted by different DPs according to the committed aggregation weights, based on which the DPs can unambiguously claim the corresponding reward. We implement a prototype of martFL and evaluate it extensively over various tasks. The results show that martFL can improve the model accuracy by up to 25% while saving up to 64% data acquisition cost.
Byzantine-resilient non-convex stochastic gradient descent
Allen-Zhu, Z., Ebrahimianghazani, F., Li, J., Alistarh, D.
Published: 2021
How to Backdoor Federated Learning
Bagdasaryan, E., Veit, A., Hua, Y., Estrin, D., Shmatikov, V.
Published: 2020
Data Markets in the Cloud: An Opportunity for the Database Community
Balazinska, M., Howe, B., Suciu, D.
Published: 2011
signsgd with majority vote is communication efficient and fault tolerant
J. Bernstein, J. Zhao, K. Azizzadenesheli, A. Anandkumar
Published: 2019
Machine learning with adversaries: Byzantine tolerant gradient descent
Blanchard, P., El Mhamdi, E. M., Guerraoui, R., Stainer, J.
Published: 2017
Verifiable Delay Functions
Boneh, D., Bonneau, J., Bünz, B., Fisch, B.
Published: 2018
Recursive Proof Composition without a Trusted Setup
Bowe, S., Grigg, J., Hopwood, D.
Published: 2019
Bulletproofs: Short Proofs for Confidential Transactions and More
Bünz, B., Bootle, J., Boneh, D., Poelstra, A., Wuille, P., Maxwell, G.
Published: 2018
Transparent SNARKs from DARK Compilers
Bünz, B., Fisch, B., Szepieniec, A.
Published: 2020
Citizens’s Data Privacy in China: The State of the Art of the Personal Information Protection Law (PIPL)
Calzada, I.
Published: 2022
Fltrust: Byzantine-robust federated learning via trust bootstrapping
X. Cao, M. Fang, J. Liu, N. Z. Gong
Published: 2021
Robust Blockchained Federated Learning with Model Validation and Proof-of-Stake Inspired Consensus
Chen, H., Asif, S. A., Park, J., Shen, C.-C., Bennis, M.
Published: 2021
Ekiden: A Platform for Confidentiality-preserving, Trustworthy, and Performant Smart Contracts
Cheng, R., Zhang, F., Kos, J., He, W., Hynes, N., Johnson, N., Juels, A., Miller, A., Song, D.
Published: 2019
Homomorphic encryption for arithmetic of approximate numbers
Jung Hee Cheon, Andrey Kim, Miran Kim, Yongsoo Song
Published: 2017
Marlin: Preprocessing zkSNARKs with Universal and Updatable SRS
Chiesa, A., Hu, Y., Maller, M., Mishra, P., Vesely, N., Ward, N.
Published: 2020
A coefficient of agreement for nominal scales
Cohen, J.
Published: 1960
Betrayal, Distrust, and Rationality: Smart Counter-Collusion Contracts for Verifiable Cloud Computing
Dong, C., Wang, Y., Aldweesh, A., McCorry, P., van Moorsel, A.
Published: 2017
Fairswap: How to Fairly Exchange Digital Goods
Dziembowski, S., Eckey, L., Faust, S.
Published: 2018
ZoKrates - Scalable Privacy-Preserving Off-Chain Computations
Eberhardt, J., Tai, S.
Published: 2018
Local Model Poisoning Attacks to Byzantine-Robust Federated Learning
Minghong Fang, Xiaoyu Cao, Jinyuan Jia, Neil Zhenqiang Gong
Published: 2019.11.27
iSyn: Semi-automated Smart Contract Synthesis from Legal Financial Agreements
Fang, P., Zou, Z., Xiao, X., Liu, Z.
Published: 2023
ZEN: An Optimizing Compiler for Verifiable, Zero-knowledge Neural Network Inferences
Feng, B., Qin, L., Zhang, Z., Ding, Y., Chu, S.
Published: 2021
Data Market Platforms: Trading Data Assets to Solve Data Problems
Fernandez, R. C., Subramaniam, P., Franklin, M. J.
Published: 2020
The limitations of federated learning in sybil settings
C. Fung, C. J. M. Yoon, I. Beschastnikh
Published: 2020
PLONK: Permutations over Lagrange-bases for Oecumenical Noninteractive arguments of Knowledge
Gabizon, A., Williamson, Z. J., Ciobotaru, O.
Published: 2019
The Bitcoin Backbone Protocol with Chains of Variable Difficulty
Garay, J., Kiayias, A., Leonardos, N.
Published: 2017
SafetyNets: Verifiable Execution of Deep Neural Networks on an Untrusted Cloud
Ghodsi, Z., Gu, T., Garg, S.
Published: 2017
Data shapley: Equitable valuation of data for machine learning
A. Ghorbani, J. Zou
Published: 2019
The Knowledge Complexity of Interactive Proof-systems
Goldwasser, S., Micali, S., Rackoff, C.
Published: 1989
Poseidon: A New Hash Function for Zero-Knowledge Proof Systems
Grassi, L., Khovratovich, D., Rechberger, C., Roy, A., Schofnegger, M.
Published: 2021
On the size of pairing-based non-interactive arguments
J. Groth
Published: 2016
Quantization and training of neural networks for efficient integer-arithmetic-only inference
B. Jacob, S. Kligys, B. Chen, M. Zhu, M. Tang, A. Howard, H. Adam, D. Kalenichenko
Published: 2018
Towards efficient data valuation based on the shapley value
Ruoxi Jia
Hawk: The Blockchain Model of Cryptography and Privacy-preserving Smart Contracts
Kosba, A., Miller, A., Shi, E., Wen, Z., Papamanthou, C.
Published: 2016
Nova: Recursive zero-knowledge arguments from folding schemes
A. Kothapalli, S. Setty, I. Tzialla
Published: 2022
Toward Practical Query Pricing with Querymarket
Koutris, P., Upadhyaya, P., Balazinska, M., Howe, B., Suciu, D.
Published: 2013
Agora: A Privacy-aware Data Marketplace
Koutsos, V., Papadopoulos, D., Chatzopoulos, D., Tarkoma, S., Hui, P.
Published: 2020
I3: An IoT Marketplace for Smart Communities
Krishnamachari, B., Power, J., Kim, S. H., Shahabi, C.
Published: 2018
Learning multiple layers of features from tiny images
Alex Krizhevsky, Geoffrey Hinton
Published: 2009
Backpropagation applied to handwritten zip code recognition
Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel
Published: 1989
Verifying the Quality of Outsourced Training on Clouds
Li, P., Wang, Y., Liu, Z., Xu, K., Wang, Q., Shen, C., Li, Q.
Published: 2022
martFL: Enabling Utility-Driven Data Marketplace with a Robust and Verifiable Federated Learning Architecture
Qi Li, Zhuotao Liu, Qi Li, Ke Xu
Published: 2023.9.3
Learning Question Classifiers
Li, X., Roth, D.
Published: 2002
OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning
Jiacheng Liang, Songze Li, Bochuan Cao, Wensi Jiang, Chaoyang He
Published: 2021.7.12
ZkCNN: Zero knowledge proofs for convolutional neural network predictions and accuracy.
Tianyi Liu, Xiang Xie, Yupeng Zhang
Published: 2021
Hyperservice: Interoperability and Programmability across Heterogeneous Blockchains
Liu, Z., Xiang, Y., Shi, J., Gao, P., Wang, H., Xiao, X., Wen, B., Hu, Y.-C.
Published: 2019
Make Web3. 0 Connected
Liu, Z., Xiang, Y., Shi, J., Gao, P., Wang, H., Xiao, X., Wen, B., Li, Q., Hu, Y.-C.
Published: 2022
Least Squares Quantization in PCM
Lloyd, S.
Published: 1982
Collaborative Fairness in Federated Learning
Lyu, L., Xu, X., Wang, Q., Yu, H.
Published: 2020
Towards Fair and Privacy-preserving Federated Deep Models
Lyu, L., Yu, J., Nandakumar, K., Li, Y., Ma, X., Jin, J., Yu, H., Ng, K. S.
Published: 2020
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas
Published: 2016.2.18
Verifiable Random Functions
Micali, S., Rabin, M., Vadhan, S.
Published: 1999
Achieving Data Truthfulness and Privacy Preservation in Data Markets
Niu, C., Zheng, Z., Wu, F., Gao, X., Chen, G.
Published: 2018
Defending against Backdoors in Federated Learning with Robust Learning Rate
Mustafa Safa Ozdayi, Murat Kantarcioglu, Yulia R. Gel
Published: 2020.7.8
Transaction Protection by Beacons
Rabin, M. O.
Published: 1983
Who Belongs in the Family?
Thorndike, R.
Published: 1953
Estimating the Number of Clusters in a Data Set Via the Gap Statistic
Tibshirani, R., Walther, G., Hastie, T.
Published: 2001
The EU General Data Protection Regulation (GDPR): A Practical Guide
Voigt, P., Von dem Bussche, A.
Doubly-Efficient zkSNARKs Without Trusted Setup
Wahby, R. S., Tzialla, I., Shelat, A., Thaler, J., Walfish, M.
Published: 2018
Ethereum: A secure decentralised generalised transaction ledger.
Gavin Wood et al.
Published: 2014
Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms
H. Xiao, K. Rasul, R. Vollgraf
Published: 2017
zkBridge: Trustless Cross-chain Bridges Made Practical
Xie, T., Zhang, J., Cheng, Z., Zhang, F., Zhang, Y., Jia, Y., Boneh, D., Song, D.
Published: 2022
A Reputation Mechanism Is All You Need: Collaborative Fairness and Adversarial Robustness in Federated Learning
Xu, X., Lyu, L.
Published: 2021
Byzantine-robust distributed learning: Towards optimal statistical rates
Yin, D., Chen, Y., Kannan, R., Bartlett, P.
Published: 2018
Convolutional Neural Networks for Sentence Classification
Yoon, K.
Published: 2014
Xclaim: Trustless, Interoperable, Cryptocurrency-Backed Assets
Zamyatin, A., Harz, D., Lind, J., Panayiotou, P., Gervais, A., Knottenbelt, W.
Published: 2019
Zero knowledge proofs for decision tree predictions and accuracy.
Jiaheng Zhang, Zhiyong Fang, Yupeng Zhang, Dawn Song
Published: 2020
Veriml: Enabling integrity assurances and fair payments for machine learning as a service
Zhao, L., Wang, Q., Wang, C., Li, Q., Shen, C., Feng, B.
Published: 2021
Advanced free-rider attacks in federated learning
Z. Zhu, J. Shu, X. Zou, X. Jia
Published: 2021
Share