AIセキュリティポータル K Program
Detecting Functional Memorization in Code Language Models
Share
Abstract
Large language models (LLMs) are increasingly used to generate code at scale. Meanwhile, prior work has investigated whether training data may be recoverable from model outputs, by auditing the textual overlap between training examples and model generations. Code, however, can be functionally equivalent while textually dissimilar. In this work, we study functional memorization: extraction of functional logic beyond what verbatim metrics detect. We construct a counterfactual setup for Olmo-3-32B, comparing a midtrained model (exposed to target code) against a pretrained reference (not exposed). We prompt both models with Python function signatures and measure both textual and functional similarity (i.e., LLM-as-a-judge, execution-based). Our results show clear evidence of functional memorization, highlighting the need for auditing metrics that go beyond textual overlap.
Traces of Memorisation in Large Language Models for Code
Ali Al-Kaswan, Maliheh Izadi, Arie van Deursen
Published: 2023.12.19
Physics of language models: part 3.1, knowledge storage and extraction
Zeyuan Allen-Zhu, Yuanzhi Li
Published: 2024
Code clone detection techniques based on large language models
Afnan A Almatrafi, Fathy A Eassa, Sanaa A Sharaf
Published: 2025
Do membership inference attacks work on large language models?
Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi
Published: 2024
What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation
V. Feldman, C. Zhang
Published: 2020
Exploring the limits of strong membership inference attacks on large language models
Jamie Hayes, Ilia Shumailov, Christopher A Choquette-Choo, Matthew Jagielski, Georgios Kaissis, Milad Nasr, Meenatchi Sundaram Muthu Selva Annamalai, Niloofar Mireshghallah, Igor Shilov, Matthieu Meeus
Published: 2025
Measuring memorization in rlhf for code completion
Jamie Hayes, Ilia Shumailov, William P Porter, Aneesh Pappu
Published: 2025
Measuring memorization in language models via probabilistic extraction
Jamie Hayes, Marika Swanberg, Harsh Chaudhari, Itay Yona, Ilia Shumailov, Milad Nasr, Christopher A Choquette-Choo, Katherine Lee, A Feder Cooper
Published: 2025
Your code secret belongs to me: Neural code completion tools can memorize hard-coded credentials
Yizhan Huang, Yichen Li, Weibin Wu, Jianping Zhang, Michael R Lyu
Published: 2024
Deduplicating training data mitigates privacy risks in language models
Nikhil Kandpal, Eric Wallace, Colin Raffel
Published: 2022
An exploratory investigation into code license infringements in large language model training datasets
Jonathan Katzy, Razvan Popescu, Arie Van Deursen, Maliheh Izadi
Published: 2024
Competition-level code generation with alphacode
Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, et al.
Published: 2022
Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
Ken Ziyu Liu, Christopher A. Choquette-Choo, Matthew Jagielski, Peter Kairouz, Sanmi Koyejo, Percy Liang, Nicolas Papernot
Published: 2025.3.22
Starcoder 2 and the stack v2: The next generation
Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, Evgenii Zheltonozhskii, Nii Osae Osae Dade, Wenhao Yu, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Manan Dey, Edoardo Abati, Yekun Chai, Niklas Muennighoff, Xiangru Tang, Muhtasham Oblokulov, Christopher Akiki, Marc Marone, Chenghao Mou, Mayank Mishra, Alex Gu, Binyuan Hui, Tri Dao, Armel Zebaze, Olivier Dehaene, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sebastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Munoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries
Published: 2024
Membership Inference Attacks against Language Models via Neighbourhood Comparison
Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan, Taylor Berg-Kirkpatrick
Published: 2023.5.29
What can large language models capture about code functional equivalence?
Nickil Maveli, Antonio Vergari, Shay B Cohen
Published: 2025
Did the neurons read your book? document-level membership inference for large language models
Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye
Published: 2024
Sok: Membership inference attacks on llms are rushing nowhere (and how to fix it)
Matthieu Meeus, Igor Shilov, Shubham Jain, Manuel Faysse, Marek Rei, Yves-Alexandre de Montjoye
Published: 2025
Scalable extraction of training data from aligned, production language models
Milad Nasr, Javier Rando, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A Feder Cooper, Daphne Ippolito, Christopher A Choquette-Choo, Florian Tramèr, Katherine Lee
Published: 2025
Decoding Secret Memorization in Code LLMs Through Token-Level Characterization
Yuqing Nie, Chong Wang, Kailong Wang, Guoai Xu, Guosheng Xu, Haoyu Wang
Published: 2024.10.11
{CodexLeaks}: Privacy leaks from code generation language models in {GitHub} copilot
L. Niu, S. Mirza, Z. Maradni, C. Popper
Published: 2023
Quantifying contamination in evaluating code generation capabilities of language models
Martin Riddell, Ansong Ni, Arman Cohan
Published: 2024
A survey on software clone detection research
Chanchal Kumar Roy, James R Cordy
Published: 2007
How much do code language models remember? an investigation on data extraction attacks before and after fine-tuning
Fabio Salerno, Ali Al-Kaswan, Maliheh Izadi
Published: 2025
Rethinking llm memorization through the lens of adversarial compression
Avi Schwarzschild, Zhili Feng, Pratyush Maini, Zachary C Lipton, J Zico Kolter
Published: 2024
The mosaic memory of large language models
Igor Shilov, Matthieu Meeus, Yves-Alexandre de Montjoye
Published: 2026
Coding after coders: The end of computer programming as we know it
Clive Thompson
Published: 2026
VeriLeaky: Navigating IP Protection vs Utility in Fine-Tuning for LLM-Driven Verilog Coding
Zeng Wang, Minghao Shao, Mohammed Nabeel, Prithwish Basu Roy, Likhitha Mankali, Jitendra Bhandari, Ramesh Karri, Ozgur Sinanoglu, Muhammad Shafique, Johann Knechtel
Published: 2025.3.17
Unveiling memorization in code models
Zhou Yang, Zhipeng Zhao, Chenyu Wang, Jieke Shi, Dongsun Kim, Donggyun Han, David Lo
Published: 2024
A multiple representation transformer with optimized abstract syntax tree for efficient code clone detection
Tianchen Yu, Li Yuan, Liannan Lin, Hongkui He
Published: 2025
Counterfactual memorization in neural language models
Chiyuan Zhang, Daphne Ippolito, Katherine Lee, Matthew Jagielski, Florian Tramèr, Nicholas Carlini
Published: 2023
Share