The Importance of the Correlation in Crossover Experiments

Kitchenham, Barbara; Madeyski, Lech; Scanniello, Giuseppe; Gravino, Carmine

doi:10.1109/tse.2021.3070480

The Importance of the Correlation in Crossover Experiments

Kitchenham, Barbara; Madeyski, Lech; Scanniello, Giuseppe; Gravino, Carmine

Authors

Barbara Kitchenham

Lech Madeyski

Giuseppe Scanniello

Carmine Gravino

Abstract

Context: In empirical software engineering, crossover designs are popular for experiments comparing software engineering techniques that must be undertaken by human participants. However, their value depends on the correlation ( r ) between the outcome measures on the same participants. Software engineering theory emphasizes the importance of individual skill differences, so we would expect the values of r to be relatively high. However, few researchers have reported the values of r . Goal: To investigate the values of r found in software engineering experiments. Method: We undertook simulation studies to investigate the theoretical and empirical properties of r . Then we investigated the values of r observed in 35 software engineering crossover experiments. Results: The level of r obtained by analysing our 35 crossover experiments was small. Estimates based on means, medians, and random effect analysis disagreed but were all between 0.2 and 0.3. As expected, our analyses found large variability among the individual r estimates for small sample sizes, but no indication that r estimates were larger for the experiments with larger sample sizes that exhibited smaller variability. Conclusions: Low observed r values cast doubts on the validity of crossover designs for software engineering experiments. However, if the cause of low r values relates to training limitations or toy tasks, this affects all Software Engineering (SE) experiments involving human participants. For all human-intensive SE experiments, we recommend more intensive training and then tracking the improvement of participants as they practice using specific techniques, before formally testing the effectiveness of the techniques.

Citation

Kitchenham, B., Madeyski, L., Scanniello, G., & Gravino, C. (2022). The Importance of the Correlation in Crossover Experiments. IEEE Transactions on Software Engineering, 48(8), 2802 - 2813. https://doi.org/10.1109/tse.2021.3070480

Journal Article Type	Article
Acceptance Date	Mar 28, 2021
Publication Date	Aug 1, 2022
Journal	IEEE Transactions on Software Engineering
Print ISSN	0098-5589
Publisher	Institute of Electrical and Electronics Engineers (IEEE)
Volume	48
Issue	8
Pages	2802 - 2813
DOI	https://doi.org/10.1109/tse.2021.3070480
Keywords	Software
Public URL	https://keele-repository.worktribe.com/output/424315
Publisher URL	https://ieeexplore.ieee.org/document/9395223