AIセキュリティポータル K Program
The poison of dimensionality
Share
Abstract
This paper advances the understanding of how the size of a machine learning model affects its vulnerability to poisoning, despite state-of-the-art defenses. Given isotropic random honest feature vectors and the geometric median (or clipped mean) as the robust gradient aggregator rule, we essentially prove that, perhaps surprisingly, linear and logistic regressions with $D \geq 169 H^2/P^2$ parameters are subject to arbitrary model manipulation by poisoners, where $H$ and $P$ are the numbers of honestly labeled and poisoned data points used for training. Our experiments go on exposing a fundamental tradeoff between augmenting model expressivity and increasing the poisoners' attack surface, on both synthetic data, and on MNIST & FashionMNIST data for linear classifiers with random features. We also discuss potential implications for source-based learning and neural nets.
Robust distributed learning: tight error bounds and breakdown point under data heterogeneity
Allouah, Y., Guerraoui, R., Gupta, N., Pinot, R., Rizk, G.
Published: 2024
Robust training in high dimensions via block coordinate geometric median descent
Anish Acharya, Abolfazl Hashemi, Prateek Jain, Sujay Sanghavi, Inderjit S Dhillon, Ufuk Topcu
Published: 2022
“team jorge”: In the heart of a global disinformation machine
C´ecile Andrzejewski
Published: 2023
Strong data augmentation sanitizes poisoning and backdoor attacks without an accuracy tradeoff
Eitan Borgnia, Valeriia Cherepanova, Liam Fowl, Amin Ghiasi, Jonas Geiping, Micah Goldblum, Tom Goldstein, Arjun Gupta
Published: 2021
On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, Shmargaret Shmitchell
Reconciling modern machine-learning practice and the classical bias–variance trade-off
M. Belkin, D. Hsu, S. Ma, S. Mandal
Published: 2019
Two models of double descent for weak features
M. Belkin, D. Hsu, J. Xu
Published: 2020
Machine learning with adversaries: Byzantine tolerant gradient descent
Blanchard, P., El Mhamdi, E. M., Guerraoui, R., Stainer, J.
Published: 2017
Language models are few-shot learners
T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei
Published: 2020
Poisoning attacks against support vector machines
Battista Biggio, Blaine Nelson, Pavel Laskov
Published: 2012
Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning
Battista Biggio, Fabio Roli
Published: 12.9.2017
Poisoning qos-aware cloud API recommender system with generative adversarial network attack
Zhen Chen, Taiyu Bao, Wenchao Qi, Dianlong You, Linlin Liu, Limin Shen
Published: 2024
Geometric median in nearly linear time
Michael B Cohen, Yin Tat Lee, Gary Miller, Jakub Pachocki, Aaron Sidford
Published: 2016
Clean-image backdoor: Attacking multi-label models with poisoned labels only
Chen, K., Lou, X., Xu, G., Li, J., Zhang, T.
Published: 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
Published: 2022
Effective backdoor defense by exploiting sensitivity of poisoned samples
W. Chen, B. Wu, H. Wang
Published: 2022
Collaborative learning in the jungle (decentralized, byzantine, heterogeneous, asynchronous and nonconvex learning)
El-Mhamdi, E. M., Farhadkhani, S., Guerraoui, R., Guirguis, A., Hoang, L.-N., Rouault, S.
Published: 2021
On the strategyproofness of the geometric median
El-Mahdi El-Mhamdi, Sadegh Farhadkhani, Rachid Guerraoui, Lˆe-Nguyˆen Hoang
Published: 2023
Is out-of-distribution detection learnable?
Zhen Fang, Yixuan Li, Jie Lu, Jiahua Dong, Bo Han, Feng Liu
Published: 2022
A survey on data poisoning attacks and defenses
Jiaxin Fan, Qi Yan, Mohan Li, Guanqun Qu, Yang Xiao
Published: 2022
Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity
William Fedus, Barret Zoph, Noam Shazeer
Published: 2022
Neural networks and the bias/variance dilemma
Stuart Geman, Elie Bienenstock, Ren´e Doursat
Published: 1992
Planting undetectable backdoors in machine learning models
Shafi Goldwasser, Michael P Kim, Vinod Vaikuntanathan, Or Zamir
Published: 2022
A survey of outlier detection methodologies
Victoria J. Hodge, Jim Austin
Published: 2004
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Evan Hubinger, Carson Denison, Jesse Mu, Mike Lambert, Meg Tong, Monte MacDiarmid, Tamera Lanham, Daniel M. Ziegler, Tim Maxwell, Newton Cheng, Adam Jermyn, Amanda Askell, Ansh Radhakrishnan, Cem Anil, David Duvenaud, Deep Ganguli, Fazl Barez, Jack Clark, Kamal Ndousse, Kshitij Sachan, Michael Sellitto, Mrinank Sharma, Nova DasSarma, Roger Grosse, Shauna Kravec, Yuntao Bai, Zachary Witten, Marina Favaro, Jan Brauner, Holden Karnofsky, Paul Christiano, Samuel R. Bowman, Logan Graham, Jared Kaplan, Sören Mindermann, Ryan Greenblatt, Buck Shlegeris, Nicholas Schiefer, Ethan Perez
Published: 1.11.2024
Metapoi son: Practical general-purpose clean-label data poisoning
W. Ronny Huang, Jonas Geiping, Liam Fowl, Gavin Taylor, Tom Goldstein
Published: 2020
Surprises in high-dimensional ridgeless least squares interpolation
Trevor Hastie, Andrea Montanari, Saharon Rosset, Ryan J Tibshirani
Published: 2022
Gradient inversion with generative image prior
Jinwoo Jeon, Jaechang Kim, Kangwook Lee, Sewoong Oh, Jungseul Ok
Published: 2021
Subpopulation Data Poisoning Attacks
Matthew Jagielski, Giorgio Severi, Niklas Pousette Harger, Alina Oprea
Published: 6.25.2020
Gradient descent with linearly correlated noise: Theory and applications to differential privacy
Anastasia Koloskova, Ryan McKenna, Zachary Charles, John Keith Rush, H. Brendan McMahan
Published: 2023
Adversarial machine learning-industry perspectives
Ram Shankar Siva Kumar, Magnus Nystrom, John Lambert, Andrew Marshall, Mario Goertzel, Andi Comissoneru, Matt Swann, Sharon Xia
Published: 2020
Bias plus variance decomposition for zero-one loss functions
Ron Kohavi, David H. Wolpert
Published: 1996
Residual unfairness in fair machine learning from prejudiced data
Nathan Kallus, Angela Zhou
Published: 2018
Deep Partition Aggregation: Provable Defense against General Poisoning Attacks
Alexander Levine, Soheil Feizi
Published: 6.26.2020
On the relation between s-estimators and m-estimators of multivariate location and covariance
Hendrik P Lopuhaa
Published: 1989
RSA: Byzantine-robust stochastic aggregation methods for distributed learning from heterogeneous datasets
Li, L., Xu, W., Chen, T., Giannakis, G. B., Ling, Q.
Published: 2019
Persia: An open, hybrid system scaling deep learning-based recommenders up to 100 trillion parameters
Xiangru Lian, Binhang Yuan, Xuefeng Zhu, Yulong Wang, Yongjun He, Honghuan Wu, Lei Sun, Haodong Lyu, Chengjun Liu, Xing Dong, Yiqiao Liao, Mingnan Luo, Congfei Zhang, Jingru Xie, Haonan Li, Lei Chen, Renjie Huang, Jianying Lin, Chengchun Shu, Xuezhong Qiu, Zhishan Liu, Dongying Kong, Lei Yuan, Hai Yu, Sen Yang, Ce Zhang, Ji Liu
Published: 2022
The Hidden Vulnerability of Distributed Learning in Byzantium
El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault
Published: 2.22.2018
Geometric median and robust estimation in banach spaces
Stanislav Minsker
Published: 2015
Data poisoning attacks on regression learning and corresponding defenses
N. M¨uller, D. Kowatsch, K. B¨ottinger
Published: 2020
The generalization error of random features regression: Precise asymptotics and the double descent curve
Song Mei, Andrea Montanari
Published: 2022
Harmless interpolation of noisy data in regression
Vidya Muthukumar, Kailas Vodrahalli, Anant Sahai
Published: 2019
Deep double descent: Where bigger models and more data hurt
Preetum Nakkiran, Gal Kaplun, Yamini Bansal, Tristan Yang, Boaz Barak, Ilya Sutskever
Published: 2020
Robust aggregation for federated learning
Krishna Pillutla, Sham M. Kakade, Za¨ıd Harchaoui
Published: 2022
Practical black-box attacks against machine learning
Nicolas Papernot, Patrick D. McDaniel, Ian J. Goodfellow, Somesh Jha, Z. Berkay Celik, Ananthram Swami
Published: 2017
Random features for large-scale kernel machines
A. Rahimi, B. Recht
Published: 2007
Certified Robustness to Label-Flipping Attacks via Randomized Smoothing
Elan Rosenfeld, Ezra Winston, Pradeep Ravikumar, J. Zico Kolter
Published: 2.8.2020
Glaze: Protecting artists from style mimicry by text-to-image models
Shawn Shan, Jenna Cryan, Emily Wenger, Haitao Zheng, Rana Hanocka, Ben Y. Zhao
Published: 2023
Election coding for distributed learning: Protecting signsgd against byzantine attacks
Jy-yong Sohn, Dong-Jun Han, Beongjun Choi, Jaekyun Moon
Published: 2020
Dirt cheap web-scale parallel text from the common crawl
Jason R. Smith, Herve Saint-Amand, Magdalena Plamada, Philipp Koehn, Chris Callison-Burch, Adam Lopez
Published: 2013
A comprehensive survey on poisoning attacks and countermeasures in machine learning
Z. Tian, L. Cui, J. Liang, S. Yu
Published: 2022
A theory of the learnable
Leslie G. Valiant
Published: 1984
High-dimensional probability: An introduction with applications in data science
Roman Vershynin
Published: 2018
High-dimensional statistics: A non-asymptotic viewpoint.
Wainwright, M.J.
Published: 2019
Learning to invert: Simple adaptive attacks for gradient inversion in federated learning
Ruihan Wu, Xiangyu Chen, Chuan Guo, Kilian Q. Weinberger
Published: 2023
Defense strategies toward model poisoning attacks in federated learning: A survey
Z. Wang, Q. Kang, X. Zhang, Q. Hu
Published: 2022
Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation
Wenxiao Wang, Alexander Levine, Soheil Feizi
Published: 2.6.2022
Lethal Dose Conjecture on Data Poisoning
Wenxiao Wang, Alexander Levine, Soheil Feizi
Published: 8.6.2022
Threats to training: A survey of poisoning attacks and defenses on machine learning systems
Zhibo Wang, Jingjing Ma, Xue Wang, Jiahui Hu, Zhan Qin, Kui Ren
Published: 2022
Manufacturing consensus: Understanding propaganda in the era of automation and anonymity
Samuel Woolley
Published: 2023
RAB: Provable Robustness Against Backdoor Attacks
Maurice Weber, Xiaojun Xu, Bojan Karlaš, Ce Zhang, Bo Li
Published: 3.20.2020
Poisoning attacks in federated learning: A survey
Geming Xia, Jian Chen, Chaodong Yu, Jun Ma
Published: 2023
Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms
H. Xiao, K. Rasul, R. Vollgraf
Published: 2017
Byrdie: Byzantine-resilient distributed coordinate descent for decentralized learning
Zhixiong Yang, Waheed U. Bajwa
Published: 2019
Byzantine-robust distributed learning: Towards optimal statistical rates
Yin, D., Chen, Y., Kannan, R., Bartlett, P.
Published: 2018
Facebook removed 2.2 billion fake accounts in three months
Kaya Yurieff
Published: 2019
Understanding deep learning requires rethinking generalization
Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, Oriol Vinyals
Published: 2017
Fldetector: Defending federated learning against model poisoning attacks via detecting malicious clients
Z. Zhang, X. Cao, J. Jia, N. Z. Gong
Published: 2022
Share