Defect Categorization in Compilers: A Multi-vocal Literature Review

Akond Rahman, Dibyendu Brinto Bose, Farhat Lamia Barsha, Rahul Pandita in ACM Computing Surveys (CSUR), 2024 Pre-print

Context: Compilers are the fundamental tools for software development. Thus, compiler defects can disrupt development productivity and propagate errors into developer-written software source code. Categorizing defects in compilers can inform practitioners and researchers about the existing defects in compilers and techniques that can be used to identify defects systematically.

Objective: The goal of this paper is to help researchers understand the nature of defects in compilers by conducting a review of Internet artifacts and peer-reviewed publications that study defect characteristics of compilers.

Methodology: We conduct a multi-vocal literature review (MLR) with 26 publications and 32 Internet artifacts to characterize compiler defects.

Results: From our MLR, we identify 13 categories of defects, amongst which optimization defects have been the most reported defects in our artifacts publications. We observed 15 defect identification techniques tailored for compilers and no single technique identifying all observed defect categories.

Conclusion: Our MLR lays the groundwork for practitioners and researchers to identify defects in compilers systematically.