Machine Learning-based Fuzz Testing Techniques: A Survey

Fuzz testing is a vulnerability discovery technique that tests the robustness of target programs by providing them with unconventional data. With the rapid increase in software quantity, scale and complexity, traditional fuzzing has revealed issues such as incomplete logic coverage, low automation l...

Full description

Saved in:

Bibliographic Details
Published in	IEEE access Vol. 12; p. 1
Main Authors	Zhang, Ao, Zhang, Yiying, Xu, Yao, Wang, Cong, Li, Siwei
Format	Journal Article
Language	English
Published	Piscataway IEEE 01.01.2024 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Closed box Data analysis Fuzzing Glass box Machine learning Machine learning algorithms Optimization Performance gain Software testing Testing Vulnerability discovery Workflow
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Fuzz testing is a vulnerability discovery technique that tests the robustness of target programs by providing them with unconventional data. With the rapid increase in software quantity, scale and complexity, traditional fuzzing has revealed issues such as incomplete logic coverage, low automation level and insufficient test cases. Machine learning, with its exceptional capabilities in data analysis and classification prediction, presents a promising approach for improve fuzzing. This paper investigates the latest research results in fuzzing and provides a systematic review of machine learning-based fuzzing techniques. Firstly, by outlining the workflow of fuzzing, it summarizes the optimization of different stages of fuzzing using machine learning. Specifically, it focuses on the application of machine learning in the preprocessing phase, test case generation phase, input selection phase and result analysis phase. Secondly, it mentally focuses on the optimization methods of machine learning in the process of mutation, generation and filtering of test cases and compares and analyzes its technical principles. Furthermore, it analyzes the performance gains brought by applying machine learning techniques to fuzzing, mainly including coverage, vulnerability detection capability, efficiency and effectiveness of test cases. Lastly, it concludes by summarizing the challenges and difficulties in combining machine learning with fuzzing and presents prospects for future trends in this field.
ISSN:	2169-3536 2169-3536
DOI:	10.1109/ACCESS.2023.3347652