Abstract
MLFuzz, a work accepted at ACM FSE 2023, revisits the performance of NEUZZ,
a machine-learning-based fuzzer. We demonstrate that its main conclusion is
entirely wrong, owing to several fatal implementation bugs and flawed
evaluation setups: an initialization bug in persistent mode, a program crash,
an error in training dataset collection, and a mistake in fuzzing result
collection. Additionally, MLFuzz uses noisy training datasets without
sufficient cleaning and preprocessing, which further contributes to the
drastic performance drop it reports for NEUZZ. We address these issues and
provide a corrected implementation and evaluation setup, showing that NEUZZ
consistently outperforms AFL on the FuzzBench dataset. Finally, we reflect
on the evaluation methods used in MLFuzz and offer practical advice on
conducting fair and scientific fuzzing evaluations.