Using Data Abstraction for Clustering in the Context of Test Case Generation

Nour Chetouane*, Franz Wotawa

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Data abstraction plays a crucial role in various application domains, allowing for simplification and representation of complex data sets. This paper focuses on data abstraction in the context of data clustering for test case generation, specifically in the automotive domain. Our main objective is to investigate whether we can use data abstraction to enhance the clustering outcome. We propose different abstraction functions for vehicle sensor data obtained from real-world driving data. We use these abstracted data sets as input to a clustering approach that identifies similar driving scenarios and extracts driving episodes. We evaluate the quality of the clusters using three clustering validation metrics and a Pearson correlation-based metric that assesses the similarity between the extracted driving episodes. To evaluate the effectiveness of data abstraction, we compare the metrics results to those obtained using clustering based on the original data sets comprising numerical data. The findings indicate that data abstraction primarily improves the three clustering validation metrics while delivering nearly comparable results regarding the Pearson correlation-based metric and comes with a substantially reduced runtime.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security, QRS 2023
PublisherIEEE
Pages260-271
Number of pages12
ISBN (Electronic)9798350319583
DOIs
Publication statusPublished - 2023
Event23rd IEEE International Conference on Software Quality, Reliability, and Security: QRS 2023 - Chiang Mai, Hybrid / Virtual, Thailand
Duration: 22 Oct 202326 Oct 2023

Conference

Conference23rd IEEE International Conference on Software Quality, Reliability, and Security
Country/TerritoryThailand
CityChiang Mai, Hybrid / Virtual
Period22/10/2326/10/23

Keywords

  • clustering
  • Data abstraction
  • test case generation

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Using Data Abstraction for Clustering in the Context of Test Case Generation'. Together they form a unique fingerprint.

Cite this