Article

Program Transformation Landscapes for Automated Program Modification Using Gin

Full Text

Details

Citation

Petke J, Alexander B, Barr ET, Brownlee A, Wagner M & White D (2023) Program Transformation Landscapes for Automated Program Modification Using Gin. Empirical Software Engineering.

Abstract
Automated program modification underlies two successful research areas-genetic improvement and program repair. Under the generate-and-validate strategy, automated program modification transforms a program, then validates the result against a test suite. Much work has focused on the search space of application of single fine-grained operators-copy, delete, replace , and swap at both line and statement granularity. This work explores the limits of this strategy. We scale up existing findings an order of magnitude from small corpora to 10 real-world Java programs comprising up to 500k LoC. We decisively show that the grammar-specificity of statement granular edits pays off: its pass rate triples that of line edits and uses 10% less computational resources. We confirm previous findings that delete is the most effective operator for creating test-suite equivalent program variants. We go farther than prior work by exploring the limits of delete's effectiveness by exhaustively applying it. We show this strategy is too costly in practice to be used to search for improved software variants. We further find that pass rates drop from 12-34% for single statement edits to 2-6% for 5-edit sequences, which implies that further progress will need human-inspired operators that target specific faults or improvements. A program is amenable to automated modification to the extent to which automatically editing it is likely to produce test-suite passing variants. We are the first to systematically search for a code measure that correlates with a program's amenability to automated modification. We found no strong correlations , leaving the question open.

Keywords
Automated Program Modification; Genetic Improvement; Automated Program Repair; Search-Based Software Engineering

Notes
Output Status: Forthcoming

Journal
Empirical Software Engineering

Status	Accepted
Funders	The Carnegie Trust, Engineering and Physical Sciences Research Council, EPSRC Engineering and Physical Sciences Research Council and Australian Research Council
Date accepted by journal	23/05/2023
URL	http://hdl.handle.net/1893/35130
ISSN	1382-3256
eISSN	1573-7616

People (1)

People

Dr Sandy Brownlee

Senior Lecturer in Computing Science, Computing Science and Mathematics - Division

Projects (1)

Towards grammar-aware operators for automatic improvement of software
PI:
01/03/2019–29/02/2020