zkLLM: Zero Knowledge Proofs for Large Language Models

Source

https://arxiv.org/abs/2404.16109

PDF

https://arxiv.org/pdf/2404.16109

Paper Information

Authors: Haochen Sun; Jason Li; Hongyang Zhang
Published: 4-25-2024
Affiliation: University of Waterloo
Country: Canada

Labels Estimated by AI

Watermark Robustness Prompt Injection Computational Efficiency

These labels were automatically added by AI and may be inaccurate.

Abstract

The recent surge in artificial intelligence (AI), characterized by the prominence of large language models (LLMs), has ushered in fundamental transformations across the globe. However, alongside these advancements, concerns surrounding the legitimacy of LLMs have grown, posing legal challenges to their extensive applications. Compounding these concerns, the parameters of LLMs are often treated as intellectual property, restricting direct investigations. In this study, we address a fundamental challenge within the realm of AI legislation: the need to establish the authenticity of outputs generated by LLMs. To tackle this issue, we present zkLLM, which stands as the inaugural specialized zero-knowledge proof tailored for LLMs to the best of our knowledge. Addressing the persistent challenge of non-arithmetic operations in deep learning, we introduce tlookup, a parallelized lookup argument designed for non-arithmetic tensor operations in deep learning, offering a solution with no asymptotic overhead. Furthermore, leveraging the foundation of tlookup, we introduce zkAttn, a specialized zero-knowledge proof crafted for the attention mechanism, carefully balancing considerations of running time, memory usage, and accuracy. Empowered by our fully parallelized CUDA implementation, zkLLM emerges as a significant stride towards achieving efficient zero-knowledge verifiable computations over LLMs. Remarkably, for LLMs boasting 13 billion parameters, our approach enables the generation of a correctness proof for the entire inference process in under 15 minutes. The resulting proof, compactly sized at less than 200 kB, is designed to uphold the privacy of the model parameters, ensuring no inadvertent information leakage.
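
To make the notion of a lookup argument for non-arithmetic operations more concrete, the following is a minimal toy sketch in Python. It is not the paper's tlookup protocol: it has no commitments, no zero-knowledge, and no parallelism. It only checks the well-known log-derivative identity that many lookup arguments build on, namely that every claimed entry s_i lies in a table of distinct entries t_j exactly when sum_i 1/(X + s_i) = sum_j m_j/(X + t_j) for some multiplicities m_j, tested here at a single point. The field modulus P, the pair encoding, the 4-bit ReLU table, and all function names are assumptions made for illustration.

```python
from collections import Counter

P = 2**61 - 1  # toy prime field modulus (an assumption for this sketch)


def inv(x):
    """Modular inverse in the field via Fermat's little theorem."""
    return pow(x % P, P - 2, P)


def encode(x, y):
    """Pack an (input, output) pair into a single field element."""
    return (x + 16 * y) % P


def lookup_check(claims, table, beta):
    """Check sum 1/(beta + s) == sum m_t/(beta + t) over the field."""
    counts = Counter(claims)  # multiplicity of each claimed entry
    lhs = sum(inv(beta + s) for s in claims) % P
    rhs = sum(counts[t] * inv(beta + t) for t in table) % P
    return lhs == rhs


# Table of every valid (x, ReLU(x)) pair for 4-bit signed inputs.
TABLE = [encode(x, max(x, 0)) for x in range(-8, 8)]

beta = 123456789  # in a real protocol the verifier samples this at random
honest = [encode(x, max(x, 0)) for x in (-3, 5, 5, 0, -8)]
cheating = honest + [encode(-3, 3)]  # falsely claims ReLU(-3) = 3

print(lookup_check(honest, TABLE, beta))    # True
print(lookup_check(cheating, TABLE, beta))  # False
```

The point of the design is that a non-arithmetic operation such as ReLU never has to be expressed as field arithmetic: the prover only shows that each (input, output) pair appears in a precomputed table. In an actual proof system the two sums would be verified over committed data with a randomly sampled challenge rather than recomputed in the clear.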
