On Using k-means Clustering for Test Suite Reduction

Nour Chetouane, Franz Wotawa, Hermann Felbinger, Mihai Nica

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review


Testing large software or other systems where test execution is time consuming or requires high computational resources is difficult and requires the selection of appropriate test cases. In the context of testing, appropriate means to have tests that most likely reveal faults or at least indicate when passing that the important functionality of the system works. In practice it is often the case that we have to reduce available test suites in order to finalize testing in a given time not exceeding other resources. In this paper, we introduce a machine learning based algorithm for test suite reduction that combines k-means clustering with binary search. The idea behind the algorithm is to cluster test cases that are close together and to select a representative test case from each of the clusters to be used in the new reduced test suite. We use binary search for looking for the proper number of clusters that allows to reduce the test suite under the condition of not substantially deviating from coverage or mutation score obtained from the initial tests suite. Besides discussing the algorithm, we present experimental results using small to larger Java programs with different types of inputs and outputs. For all example cases we were able to considerably reduce the number of test cases requiring a short reduction time especially compared to other test suite reduction approaches.
Original languageEnglish
Title of host publicationProceedings - 2020 IEEE 13th International Conference on Software Testing, Verification and Validation Workshops, ICSTW 2020
PublisherInstitute of Electrical and Electronics Engineers
Pages380 - 385
Number of pages6
ISBN (Electronic)9781728110752
ISBN (Print)9781728110769
Publication statusPublished - Oct 2020
EventThe 15th Workshop on Testing: Academia-Industry Collaboration, Practice and Research Techniques
: Co-located with: ICST 2020
- Porto, Portugal
Duration: 24 Oct 2020 → …


WorkshopThe 15th Workshop on Testing: Academia-Industry Collaboration, Practice and Research Techniques
Abbreviated titleTAIC PART2020
Period24/10/20 → …
Internet address

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality
  • Modelling and Simulation


Dive into the research topics of 'On Using k-means Clustering for Test Suite Reduction'. Together they form a unique fingerprint.

Cite this