Dependent state space Student-t processes for imputation and data augmentation in plasma diagnostics

Katharina Rath*, David Rügamer, Bernd Bischl, Udo von Toussaint, Christopher G. Albert

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Multivariate time series measurements in plasma diagnostics present several challenges when training machine learning models: the availability of only a few labeled data increases the risk of overfitting, and missing data points or outliers due to sensor failures pose additional difficulties. To overcome these issues, we introduce a fast and robust regression model that enables imputation of missing points and data augmentation by massive sampling while exploiting the inherent correlation between input signals. The underlying Student-t process allows for a noise distribution with heavy tails and thus produces robust results in the case of outliers. We consider the state space form of the Student-t process, which reduces the computational complexity and makes the model suitable for high-resolution time series. We evaluate the performance of the proposed method using two test cases, one of which was inspired by measurements of flux loop signals.

Original languageEnglish
Article numbere202200175
Journal Contributions to Plasma Physics
Volume63
Issue number5-6
DOIs
Publication statusPublished - 1 Jun 2023

Keywords

  • data augmentation
  • data imputation
  • Gaussian processes
  • multivariate time series
  • state space models
  • Student-t processes
  • surrogate models

ASJC Scopus subject areas

  • Condensed Matter Physics

Fields of Expertise

  • Information, Communication & Computing
  • Sustainable Systems

Fingerprint

Dive into the research topics of 'Dependent state space Student-t processes for imputation and data augmentation in plasma diagnostics'. Together they form a unique fingerprint.

Cite this