AIセキュリティポータル K Program
Privacy-Preserving UCB Decision Process Verification via zk-SNARKs
Share
Abstract
With the increasingly widespread application of machine learning, how to strike a balance between protecting the privacy of data and algorithm parameters and ensuring the verifiability of machine learning has always been a challenge. This study explores the intersection of reinforcement learning and data privacy, specifically addressing the Multi-Armed Bandit (MAB) problem with the Upper Confidence Bound (UCB) algorithm. We introduce zkUCB, an innovative algorithm that employs the Zero-Knowledge Succinct Non-Interactive Argument of Knowledge (zk-SNARKs) to enhance UCB. zkUCB is carefully designed to safeguard the confidentiality of training data and algorithmic parameters, ensuring transparent UCB decision-making. Experiments highlight zkUCB's superior performance, attributing its enhanced reward to judicious quantization bit usage that reduces information entropy in the decision-making process. zkUCB's proof size and verification time scale linearly with the execution steps of zkUCB. This showcases zkUCB's adept balance between data security and operational efficiency. This approach contributes significantly to the ongoing discourse on reinforcing data privacy in complex decision-making processes, offering a promising solution for privacy-sensitive applications.
Reinforcement learning based recommender systems: A survey
M Mehdi Afsar, Trafford Crump, Behrouz Far
Published: 2022
Offline contextual multi-armed bandits for mobile health interventions: A case study on emotion regulation
Mawulolo K Ameko, Miranda L Beltzer, Lihua Cai, Mehdi Boukhechba, Bethany A Teachman, Laura E Barnes
Published: 2020
On multi-armed bandit designs for dose-finding clinical trials
Maryam Aziz, Emilie Kaufmann, Marie-Karelle Riviere
Published: 2021
When privacy meets partial information: A refined analysis of differentially private bandits
Achraf Azize, Debabrota Basu
Published: 2022
An interactive prover for protocol verification in the computational model
David Baelde, Stephanie Delaune, Charlie Jacomme, Adrien Koutsos, Solene Moreau
Published: 2021
Combination of auction theory and multi-armed bandits: Model, algorithm, and application
Guoju Gao, Sijie Huang, He Huang, Mingjun Xiao, Jie Wu, Yu-E Sun, Sheng Zhang
Published: 2022
Safetynets: Verifiable execution of deep neural networks on an untrusted cloud
Zahra Ghodsi, Tianyu Gu, Siddharth Garg
Published: 2017
On the size of pairing-based non-interactive arguments
J. Groth
Published: 2016
Zero-knowledge using garbled circuits: how to prove non-algebraic statements efficiently
Marek Jawurek, Florian Kerschbaum, Claudio Orlandi
Published: 2013
Asymptotically faster multi-key homomorphic encryption from homomorphic gadget decomposition
Taechan Kim, Hyesun Kwak, Dongwon Lee, Jinyeong Seo, Yongsoo Song
Published: 2023
Deep reinforcement learning for autonomous driving: A survey
B Ravi Kiran, Ibrahim Sobh, Victor Talpaert, Patrick Mannion, Ahmad A Al Sallab, Senthil Yogamani, Patrick Perez
Published: 2021
vcnn: Verifiable convolutional neural network based on zk-snarks
Seunghwa Lee, Hankyung Ko, Jihye Kim, Hyunok Oh
Published: 2024
Privacy preservation for machine learning training and classification based on homomorphic encryption schemes
Jing Li, Xiaohui Kuang, Shujie Lin, Xu Ma, Yi Tang
Published: 2020
ZkCNN: Zero knowledge proofs for convolutional neural network predictions and accuracy.
Tianyi Liu, Xiang Xie, Yupeng Zhang
Published: 2021
Zilch: A framework for deploying transparent zero-knowledge proofs
Dimitris Mouris, Nektarios Georgios Tsoutsos
Published: 2021
Achieving fairness in the stochastic multi-armed bandit problem
Vishakha Patil, Ganesh Ghalme, Vineet Nair, Yadati Narahari
Published: 2021
An adaptive authenticated data structure with privacy-preserving for big data stream in cloud
Yi Sun, Qian Liu, Xingyuan Chen, Xuehui Du
Published: 2020
Doubly-efficient zksnarks without trusted setup
Riad S Wahby, Ioanna Tzialla, Abhi Shelat, Justin Thaler, Michael Walfish
Published: 2018
Mystique: Efficient conversions for zero-knowledge proofs with applications to machine learning
Chenkai Weng, Kang Yang, Xiang Xie, Jonathan Katz, Xiao Wang
Published: 2021
Zero-knowledge Proof Meets Machine Learning in Verifiability: A Survey
Zhibo Xing, Zijian Zhang, Jiamou Liu, Ziang Zhang, Meng Li, Liehuang Zhu, Giovanni Russello
Published: 10.23.2023
Non-interactive zero-knowledge proofs to multiple verifiers
Kang Yang, Xiao Wang
Published: 2022
Reinforcement learning in healthcare: A survey
Chao Yu, Jiming Liu, Shamim Nemati, Guosheng Yin
Published: 2021
Zero knowledge proofs for decision tree predictions and accuracy.
Jiaheng Zhang, Zhiyong Fang, Yupeng Zhang, Dawn Song
Published: 2020
Differentially private unknown worker recruitment for mobile crowdsensing using multi-armed bandits
Hui Zhao, Mingjun Xiao, Jie Wu, Yun Xu, He Huang, Sheng Zhang
Published: 2020
Veriml: Enabling integrity assurances and fair payments for machine learning as a service
Zhao, L., Wang, Q., Wang, C., Li, Q., Shen, C., Feng, B.
Published: 2021
Share