Towards Code Watermarking with Dual-Channel Transformations

2021 IEEE Symposium on Security and Privacy (SP)

Adversarial watermarking transformer: Towards tracing text provenance with data hiding

Sahar Abdelnabi, Mario Fritz

Published: 2021

Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security

Large-scale and language-oblivious code authorship identification

Mohammed Abuhamad, Tamer AbuHmed, Aziz Mohaisen, DaeHun Nyang

Published: 2018

27th USENIX Security Symposium (USENIX Security)

Turning your weakness into a strength: Watermarking deep neural networks by backdooring

Y. Adi, C. Baum, M. Cisse, B. Pinkas, J. Keshet

Published: 2018

The Eleventh International Conference on Learning Representations

Multi-lingual evaluation of code generation models

Ben Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang, Qing Sun, Mingyue Shang, et al.

Published: 2022

2014 Eighth International Conference on Complex, Intelligent and Software Intensive Systems

Function level control flow obfuscation for software security

Vivek Balachandran, Ng Wee Keong, Sabu Emmanuel

Published: 2014

Advances in neural information processing systems

Hiding images in plain sight: Deep steganography

Shumeet Baluja

Published: 2017

2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER)

Learning-based recursive aggregation of abstract syntax trees for code clone detection

Lutz Buch, Artur Andrzejak

Published: 2019

24th USENIX Security Symposium (USENIX Security 15)

De-anonymizing programmers via code stylometry

Aylin Caliskan-Islam, Richard Harang, Andrew Liu, Arvind Narayanan, Clare Voss, Fabian Yamaguchi, Rachel Greenstadt

Published: 2015

Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: New Ideas and Emerging Results

A theory of dual channel constraints

Casey Casalnuovo, Earl T Barr, Santanu Kumar Dash, Prem Devanbu, Emily Morgan

Published: 2020

Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering

NatGen: generative pre-training by “naturalizing” source code

Saikat Chakraborty, Toufique Ahmed, Yangruibo Ding, Premkumar T Devanbu, Baishakhi Ray

Published: 2022

Computational linguistics

Practical linguistic steganography using contextual synonym substitution and a novel vertex coding method

Ching-Yun Chang, Stephen Clark

Published: 2014

Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2017

Software watermarking for Java program based on method name encoding

Jianping Chen, Kui Li, Wanzhi Wen, Weixu Chen, Chenxue Yan

Published: 2018

2017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

Hidden path: dynamic software watermarking based on control flow obfuscation

Zhe Chen, Chunfu Jia, Donghui Xu

Published: 2017

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Learning phrase representations using RNN encoder–decoder for statistical machine translation

Kyunghyun Cho, Bart Van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio

Published: 2014

Journal of Computer Security

Software watermarking in the frequency domain: implementation, analysis, and attacks

Christian Collberg, Tapas Ranjan Sahoo

Published: 2005

Proceedings of NAACL-HLT

BERT: Pre-training of deep bidirectional transformers for language understanding

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

Published: 2019

INAE Letters

Software watermarking: Progress and challenges

Ayan Dey, Sukriti Bhattacharya, Nabendu Chaki

Published: 2019

100 million developers and counting

Thomas Dohmke

Published: 2023

Advances in neural information processing systems

Rethinking deep neural network ownership verification: Embedding passports to defeat ambiguity attacks

Lixin Fan, Kam Woh Ng, Chee Seng Chan

Published: 2019

CodeBERT: A pre-trained model for programming and natural languages

Zhangyin Feng, Daya Guo, Duyu Tang, et al.

Published: 2020

Advances in neural information processing systems

Generating steganographic images via adversarial training

Jamie Hayes, George Danezis

Published: 2017

Communications of the ACM

On the naturalness of software

Abram Hindle, Earl T Barr, Mark Gabel, Zhendong Su, Premkumar Devanbu

Published: 2016

CoRR

Improving neural networks by preventing co-adaptation of feature detectors

Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov

Published: 2012

arXiv preprint

CodeSearchNet challenge: Evaluating the state of semantic code search

Hamel Husain, Ho-Hsiang Wu, Tiferet Gazit, Miltiadis Allamanis, Marc Brockschmidt

Published: 2019

Categorical reparameterization with Gumbel-Softmax

Eric Jang, Shixiang Gu, Ben Poole

Published: 2016

AAAI

CodeAttack: Code-based adversarial attacks for pre-trained programming language models

A. Jha, C. K. Reddy

Published: 2023

arXiv

Cited by: 7

Entangled Watermarks as a Defense against Model Extraction

Hengrui Jia, Christopher A. Choquette-Choo, Varun Chandrasekaran, Nicolas Papernot

Published: 2020.2.28

Machine learning involves expensive data collection and training procedures. Model owners may be concerned that valuable intellectual property can be leaked if adversaries mount model extraction attacks. As it is difficult to defend against model extraction without sacrificing significant prediction accuracy, watermarking instead leverages unused model capacity to have the model overfit to outlier input-output pairs. Such pairs are watermarks, which are not sampled from the task distribution and are only known to the defender. The defender then demonstrates knowledge of the input-output pairs to claim ownership of the model at inference. The effectiveness of watermarks remains limited because they are distinct from the task distribution and can thus be easily removed through compression or other forms of knowledge transfer. We introduce Entangled Watermarking Embeddings (EWE). Our approach encourages the model to learn features for classifying data that is sampled from the task distribution and data that encodes watermarks. An adversary attempting to remove watermarks that are entangled with legitimate data is also forced to sacrifice performance on legitimate data. Experiments on MNIST, Fashion-MNIST, CIFAR-10, and Speech Commands validate that the defender can claim model ownership with 95% confidence with less than 100 queries to the stolen copy, at a modest cost below 0.81 percentage points on average in the defended model's performance.

Tags: robustness evaluation, DNN IP protection methods, defense methods

ACM Computing Surveys (CSUR)

Code authorship attribution: Methods and challenges

Vaibhavi Kalgutkar, Ratinder Kaur, Hugo Gonzalez, Natalia Stakhanova, Alina Matyukhina

Published: 2019

IEEE Access

A review of text watermarking: theory, methods, and applications

Nurul Shamimi Kamaruddin, Amirrudin Kamsin, Lip Yee Por, Hameedur Rahman

Published: 2018

Annual Computer Security Applications Conference

SoftMark: Software watermarking via a binary function relocation

Honggoo Kang, Yonghwi Kwon, Sangjin Lee, Hyungjoon Koo

Published: 2021

arXiv

Cited by: 3

How Secure is Code Generated by ChatGPT?

Raphaël Khoury, Anderson R. Avila, Jacob Brunelle, Baba Mamadou Camara

Published: 2023.4.19

In recent years, large language models have been responsible for great advances in the field of artificial intelligence (AI). ChatGPT in particular, an AI chatbot developed and recently released by OpenAI, has taken the field to the next level. The conversational model is able not only to process human-like text, but also to translate natural language into code. However, the safety of programs generated by ChatGPT should not be overlooked. In this paper, we perform an experiment to address this issue. Specifically, we ask ChatGPT to generate a number of programs and evaluate the security of the resulting source code. We further investigate whether ChatGPT can be prodded to improve the security by appropriate prompts, and discuss the ethical aspects of using AI to generate code. Results suggest that ChatGPT is aware of potential vulnerabilities, but nonetheless often generates source code that is not robust to certain attacks.

Tags: security analysis, vulnerability prediction, program verification

3rd International Conference on Learning Representations, ICLR 2015

Adam: A method for stochastic optimization

Kingma, D. P., Ba, J.

Published: 2015

arXiv

Cited by: 1

International Conference on Machine Learning (ICML)

A Watermark for Large Language Models

John Kirchenbauer, Jonas Geiping, Yuxin Wen, Jonathan Katz, Ian Miers, Tom Goldstein

Published: 2023.1.25

Potential harms of large language models can be mitigated by watermarking model output, i.e., embedding signals into generated text that are invisible to humans but algorithmically detectable from a short span of tokens. We propose a watermarking framework for proprietary language models. The watermark can be embedded with negligible impact on text quality, and can be detected using an efficient open-source algorithm without access to the language model API or parameters. The watermark works by selecting a randomized set of "green" tokens before a word is generated, and then softly promoting use of green tokens during sampling. We propose a statistical test for detecting the watermark with interpretable p-values, and derive an information-theoretic framework for analyzing the sensitivity of the watermark. We test the watermark using a multi-billion parameter model from the Open Pretrained Transformer (OPT) family, and discuss robustness and security.

Tags: prompt injection, watermarking, analysis of detection methods

Proceedings of the 44th International Conference on Software Engineering

RoPGen: Towards robust code authorship attribution via automatic coding style transformation

Zhen Li, Guenevere Chen, Chen Chen, Yayi Zou, Shouhuai Xu

Published: 2022

Thirty-seventh Conference on Neural Information Processing Systems

Is your code generated by ChatGPT really correct? Rigorous evaluation of large language models for code generation

J. Liu, C. S. Xia, et al.

Published: 2023

RoBERTa: A robustly optimized BERT pretraining approach

Liu, Y.

Published: 2019

International Conference on Learning Representations

Decoupled weight decay regularization

Ilya Loshchilov, Frank Hutter

Published: 2018

NeurIPS

CodeXGLUE: A machine learning benchmark dataset for code understanding and generation

S. Lu, D. Guo, S. Ren, J. Huang, A. Svyatkovskiy, A. Blanco, C. Clement, D. Drain, D. Jiang, D. Tang

Published: 2022

IEEE Transactions on Information Forensics and Security

Xmark: dynamic software watermarking using Collatz conjecture

Haoyu Ma, Chunfu Jia, Shijia Li, Wantong Zheng, Dinghao Wu

Published: 2019

Security, Steganography, and Watermarking of Multimedia Contents IX

Syntactic tools for text watermarking

Hasan M Meral, Emre Sevinc, Bulent Sankur, A Sumru Ozsoy, Tunga Gungor

Published: 2007

Proceedings 24th Annual International Computer Software and Applications Conference. COMPSAC2000

A practical method for watermarking Java programs

A. Monden, H. Iida, K. Matsumoto, K. Inoue, K. Torii

Published: 2000

Introducing ChatGPT

OpenAI

Published: 2022

Temporary policy: ChatGPT is banned

OpenAI

Published: 2023

Proceedings of the 40th annual meeting of the Association for Computational Linguistics

BLEU: a method for automatic evaluation of machine translation

Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu

Published: 2002

arXiv

Cited by: 2

Misleading Authorship Attribution of Source Code using Adversarial Learning

Erwin Quiring, Alwin Maier, Konrad Rieck

Published: 2019.5.29

In this paper, we present a novel attack against authorship attribution of source code. We exploit that recent attribution methods rest on machine learning and thus can be deceived by adversarial examples of source code. Our attack performs a series of semantics-preserving code transformations that mislead learning-based attribution but appear plausible to a developer. The attack is guided by Monte-Carlo tree search that enables us to operate in the discrete domain of source code. In an empirical evaluation with source code from 204 programmers, we demonstrate that our attack has a substantial effect on two recent attribution methods, whose accuracy drops from over 88% to 1% under attack. Furthermore, we show that our attack can imitate the coding style of developers with high accuracy and thereby induce false attributions. We conclude that current approaches for authorship attribution are inappropriate for practical application and there is a need for resilient analysis techniques.

Tags: authorship attribution methods, adversarial attacks, attack evaluation

CodeBLEU: a method for automatic evaluation of code synthesis

Shuo Ren, Daya Guo, Shuai Lu, Long Zhou, Shujie Liu, Duyu Tang, Neel Sundaresan, Ming Zhou, Ambrosio Blanco, Shuai Ma

Published: 2020

International Journal of Engineering and Innovative Technology (IJEIT)

A survey of digital watermarking techniques, applications and attacks

Prabhishek Singh, Ramneet Singh Chadha

Published: 2013

Proceedings of the ACM Web Conference 2022

CoProtector: Protect open-source code against unauthorized training usage with data poisoning

Zhensu Sun, Xiaoning Du, Fu Song, Mingze Ni, Li Li

Published: 2022

IEEE Transactions on Software Engineering

Software plagiarism detection with birthmarks based on dynamic key instruction sequences

Zhenzhou Tian, Qinghua Zheng, Ting Liu, Ming Fan, Eryue Zhuang, Zijiang Yang

Published: 2015

Proceedings of the 4th ACM international workshop on Contents protection and security

Words are not enough: sentence level natural language watermarking

Mercan Topkara, Umut Topkara, Mikhail J Atallah

Published: 2006

Proceedings of the 8th workshop on Multimedia and security

The hiding virtues of ambiguity: quantifiably resilient watermarking of natural language text through synonym substitutions

Umut Topkara, Mercan Topkara, Mikhail J Atallah

Published: 2006

Advances in Neural Information Processing Systems

Attention is all you need

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I. Polosukhin

Published: 2017

Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing

Watermarking the outputs of structured prediction with an application in statistical machine translation

Ashish Venugopal, Jakob Uszkoreit, David Talbot, Franz Josef Och, Juri Ganitkevitch

Published: 2011

Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering

You see what I want you to see: poisoning vulnerabilities in neural code search

Yao Wan, Shijie Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Dezhong Yao, Hai Jin, Lichao Sun

Published: 2022

Proceedings of the 44th International Conference on Software Engineering

Bridging pre-trained models and downstream tasks for source code understanding

Deze Wang, Zhouyang Jia, Shanshan Li, Yue Yu, Yun Xiong, Wei Dong, Xiangke Liao

Published: 2022

IEEE Access

Exception handling-based dynamic software watermarking

Yilong Wang, Daofu Gong, Bin Lu, Fei Xiang, Fenlin Liu

Published: 2018

EMNLP

CodeT5: Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation

Y. Wang, W. Wang, S. Joty, S. C. Hoi

Published: 2021

Proceedings of the AAAI Conference on Artificial Intelligence

Tracing text provenance via context-aware lexical substitution

Xi Yang, Jie Zhang, Kejiang Chen, Weiming Zhang, Zehua Ma, Feng Wang, Nenghai Yu

Published: 2022

Proceedings of the 44th International Conference on Software Engineering

Natural attack for pre-trained models of code

Zhou Yang, Jieke Shi, Junda He, David Lo

Published: 2022