These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
With the increasingly widespread application of machine learning, how to
strike a balance between protecting the privacy of data and algorithm
parameters and ensuring the verifiability of machine learning has always been a
challenge. This study explores the intersection of reinforcement learning and
data privacy, specifically addressing the Multi-Armed Bandit (MAB) problem with
the Upper Confidence Bound (UCB) algorithm. We introduce zkUCB, an innovative
algorithm that employs the Zero-Knowledge Succinct Non-Interactive Argument of
Knowledge (zk-SNARKs) to enhance UCB. zkUCB is carefully designed to safeguard
the confidentiality of training data and algorithmic parameters, ensuring
transparent UCB decision-making. Experiments highlight zkUCB's superior
performance, attributing its enhanced reward to judicious quantization bit
usage that reduces information entropy in the decision-making process. zkUCB's
proof size and verification time scale linearly with the execution steps of
zkUCB. This showcases zkUCB's adept balance between data security and
operational efficiency. This approach contributes significantly to the ongoing
discourse on reinforcing data privacy in complex decision-making processes,
offering a promising solution for privacy-sensitive applications.
References
ACM Computing Surveys
Reinforcement learning based recommender systems: A survey
M Mehdi Afsar, Trafford Crump, Behrouz Far
Published: 2022
Proceedings of the 14th ACM Conference on Recommender Systems
Offline contextual multi-armed bandits for mobile health interventions: A case study on emotion regulation
Mawulolo K Ameko, Miranda L Beltzer, Lihua Cai, Mehdi Boukhechba, Bethany A Teachman, Laura E Barnes
Published: 2020
The Journal of Machine Learning Research
On multi-armed bandit designs for dose-finding clinical trials
Combination of auction theory and multi-armed bandits: Model, algorithm, and application
Guoju Gao, Sijie Huang, He Huang, Mingjun Xiao, Jie Wu, Yu-E Sun, Sheng Zhang
Published: 2022
Advances in Neural Information Processing Systems
Safetynets: Verifiable execution of deep neural networks on an untrusted cloud
Zahra Ghodsi, Tianyu Gu, Siddharth Garg
Published: 2017
Advances in Cryptology–EUROCRYPT 2016: 35th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Vienna, Austria
On the size of pairing-based non-interactive arguments
J. Groth
Published: 2016
Proceedings of the 2013 ACM SIGSAC conference on Computer & communications security
Zero-knowledge using garbled circuits: how to prove non-algebraic statements efficiently
Marek Jawurek, Florian Kerschbaum, Claudio Orlandi
Published: 2013
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security
Asymptotically faster multi-key homomorphic encryption from homomorphic gadget decomposition
Taechan Kim, Hyesun Kwak, Dongwon Lee, Jinyeong Seo, Yongsoo Song
Published: 2023
IEEE Transactions on Intelligent Transportation Systems
Deep reinforcement learning for autonomous driving: A survey
B Ravi Kiran, Ibrahim Sobh, Victor Talpaert, Patrick Mannion, Ahmad A Al Sallab, Senthil Yogamani, Patrick Perez
Published: 2021
IEEE Transactions on Dependable and Secure Computing
vcnn: Verifiable convolutional neural network based on zk-snarks
Seunghwa Lee, Hankyung Ko, Jihye Kim, Hyunok Oh
Published: 2024
Information Sciences
Privacy preservation for machine learning training and classification based on homomorphic encryption schemes
Jing Li, Xiaohui Kuang, Shujie Lin, Xu Ma, Yi Tang
Published: 2020
Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security
ZkCNN: Zero knowledge proofs for convolutional neural network predictions and accuracy.
Tianyi Liu, Xiang Xie, Yupeng Zhang
Published: 2021
IEEE Transactions on Information Forensics and Security
Zilch: A framework for deploying transparent zero-knowledge proofs
Dimitris Mouris, Nektarios Georgios Tsoutsos
Published: 2021
The Journal of Machine Learning Research
Achieving fairness in the stochastic multi-armed bandit problem