CEDAR: Continuous Testing of Deep Learning Libraries

Since Deep Learning (DL) libraries undergo rapid development with thousands of lines of code changes daily, they require continuous testing to detect software bugs and ensure code quality. In this paper, we explore DL testing approaches in a continuous testing setting. To make it feasible, we presen...

Full description

Saved in:
Bibliographic Details
Published in2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) pp. 371 - 382
Main Authors Xie, Danning, Wang, Jiannan, Pham, Hung Viet, Tan, Lin, Guo, Yu, Aziz, Adnan, Meijer, Erik
Format Conference Proceeding
LanguageEnglish
Published IEEE 12.03.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Since Deep Learning (DL) libraries undergo rapid development with thousands of lines of code changes daily, they require continuous testing to detect software bugs and ensure code quality. In this paper, we explore DL testing approaches in a continuous testing setting. To make it feasible, we present the first continuous testing framework for DL libraries-CEDAR-that integrates two state-of-the-art DL testing approaches (DocTer and EAGLE) efficiently to test two popular DL libraries, PyTorch and TensorFlow. Through the application of CEDAR to 20 versions of PyTorch and TensorFlow, CEDAR detects 83 bugs in 140 APIs. Out of the 83 bugs, 23 are previously unknown bugs with 21 confirmed or fixed by the developers. The results also show CEDAR has effectively shortened the bug detection latency by almost a year (338.6 days) on average. In addition, CEDAR demonstrates its effectiveness in detecting new regression bugs and masked bugs. With three optimization strategies, CEDAR reduces the time and space overhead by a factor of 15.4 and 9.7. We share insights and lessons learned from our research, aiming to advance the development of more effective and efficient continuous testing for DL libraries, benefiting both developers and researchers.
ISSN:2640-7574
DOI:10.1109/SANER60148.2024.00044