Predictive Reranking using Code Smells for Information Retrieval Fault Localization

Thomas Hirsch*, Birgit Gertraud Hofer

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Code metrics and static checker information can be used to increase the performance of Information Retrieval Fault Localization (IRFL). While previous work focused on the calculation of bug proneness scores for reranking IRFL tool outputs, we propose a dynamic reranking approach based on code smells. We hypothesize that code smell information can be predicted from textual bug reports and that these insights can support the fault localization task. To test this hypothesis, we implemented a fault localization pipeline consisting of a static checker (PMD) and a Neural Network based classifier to predict code smells. We apply this pipeline to predict code smells for a given textual bug report and use this information to rerank the outputs of BugLocator, BLIA, and BRTracer. We trained and evaluated our pipeline on the Bench4BL bug benchmark. While overall performance on the Bench4BL benchmark stayed the same, we found significant MAP localization performance increases for specific input software projects. We identify and discuss the characteristics of debugging benchmarks that limit their suitability for machine learning experiments. Despite these issues, we demonstrated that predicting certain smells based on textual bug reports is possible and that this information may be beneficial to fault localization.
Original languageEnglish
Title of host publication2024 IEEE 22nd World Symposium on Applied Machine Intelligence and Informatics, SAMI 2024 - Proceedings
PublisherIEEE Xplore
Pages277-282
Number of pages6
ISBN (Electronic)9798350317206
DOIs
Publication statusPublished - 2024
Event22nd IEEE World Symposium on Applied Machine Intelligence and Informatics : SAMI 2024 - Stara Lesna, Slovakia
Duration: 25 Jan 202427 Jan 2024

Conference

Conference22nd IEEE World Symposium on Applied Machine Intelligence and Informatics
Abbreviated titleSAMI 2024
Country/TerritorySlovakia
CityStara Lesna
Period25/01/2427/01/24

Keywords

  • NLP
  • Bug reports
  • Fault localization
  • IRFL
  • fault localization
  • bug reports

ASJC Scopus subject areas

  • Information Systems and Management
  • Artificial Intelligence
  • Information Systems
  • Education
  • Computer Science Applications
  • Modelling and Simulation

Fingerprint

Dive into the research topics of 'Predictive Reranking using Code Smells for Information Retrieval Fault Localization'. Together they form a unique fingerprint.

Cite this