Boosting Spectrum-Based Fault Localization for Spreadsheets with Product Metrics in a Learning Approach

Adil Mukhtar, Birgit Gertraud Hofer, Dietmar Jannach, Franz Wotawa, Konstantin Schekotihin

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

Faults in spreadsheets are not uncommon and they can have significant negative consequences in practice. Various approaches for fault localization were proposed in recent years, among them techniques that transferred ideas from spectrum-based fault localization (SFL) to the spreadsheet domain. Applying SFL to spreadsheets proved to be effective, but has certain limitations. Specifically, the constrained computational structures of spreadsheets may lead to large sets of cells that have the same assumed fault probability according to SFL and thus have to be inspected manually. In this work, we propose to combine SFL with a fault prediction method based on spreadsheet metrics in a machine learning (ML) approach. In particular, we train supervised ML models using two orthogonal types of features: (i) variables that are used to compute similarity coefficients in SFL and (ii) spreadsheet metrics that have shown to be good predictors for faulty formulas in previous work. Experiments with a widely-used corpus of faulty spreadsheets indicate that the combined model helps to significantly improve fault localization performance in terms of wasted effort and accuracy.
Originalspracheenglisch
TitelASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering
UntertiteliWOAR 2022 - 7th International Workshop on Sensor-Based Activity Recognition and Artificial Intelligence
Herausgeber (Verlag)Association of Computing Machinery
Seitenumfang5
ISBN (elektronisch)978-1-4503-9475-8
DOIs
PublikationsstatusVeröffentlicht - 19 Sept. 2022
Veranstaltung37th IEEE/ACM International Conference on Automated Software Engineering: ASE '22 - Rochester, USA / Vereinigte Staaten
Dauer: 10 Okt. 202214 Okt. 2022

Publikationsreihe

NameACM International Conference Proceeding Series

Konferenz

Konferenz37th IEEE/ACM International Conference on Automated Software Engineering
KurztitelASE '22
Land/GebietUSA / Vereinigte Staaten
OrtRochester
Zeitraum10/10/2214/10/22

ASJC Scopus subject areas

  • Software
  • Human-computer interaction
  • Maschinelles Sehen und Mustererkennung
  • Computernetzwerke und -kommunikation

Fields of Expertise

  • Information, Communication & Computing

Fingerprint

Untersuchen Sie die Forschungsthemen von „Boosting Spectrum-Based Fault Localization for Spreadsheets with Product Metrics in a Learning Approach“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren