Abstract
Clustering is a core building block for data analysis, aiming to extract otherwise hidden structures and relations from raw datasets, such as particular groups that can be effectively related, compared, and interpreted. A plethora of visual-interactive cluster analysis techniques has been proposed to date, however, arriving at useful clusterings often requires several rounds of user interactions to fine-tune the data preprocessing and algorithms. We present a multi-stage Visual Analytics (VA) approach for iterative cluster refinement together with an implementation (SOMFlow) that uses Self-Organizing Maps (SOM) to analyze time series data. It supports exploration by offering the analyst a visual platform to analyze intermediate results, adapt the underlying computations, iteratively partition the data, and to reflect previous analytical activities. The history of previous decisions is explicitly visualized within a flow graph, allowing to compare earlier cluster refinements and to explore relations. We further leverage quality and interestingness measures to guide the analyst in the discovery of useful patterns, relations, and data partitions. We conducted two pair analytics experiments together with a subject matter expert in speech intonation research to demonstrate that the approach is effective for interactive data analysis, supporting enhanced understanding of clustering results as well as the interactive process itself.
Original language | English |
---|---|
Pages (from-to) | 120-130 |
Journal | IEEE Transactions on Visualization and Computer Graphics |
Volume | 24 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2018 |
Keywords
- Data analysis
- Data visualization
- Density measurement
- Feature extraction
- Layout
- Symmetric matrices
- Visualization
- Feature Detection/Selection
- Matrix Visualization
- Quality Metrics
- Relational Data
- Visual Quality Measures
Fields of Expertise
- Information, Communication & Computing