DAPHNE Runtime: Harnessing Parallelism for Integrated Data Analysis Pipelines

Aristotelis Vontzalidis, Stratos Psomadakis, Constantinos Bitsakos, Mark Dokter, Kevin Innerebner, Patrick Damme, Matthias Boehm, Florina Ciorba, Ahmed Eleliemy, Vasileios Karakostas, Aleš Zamuda, Dimitrios Tsoumakos*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

Integrated data analysis pipelines combine rigorous data management and processing, high-performance computing and machine learning tasks. While these systems and operations share many compilation and runtime techniques, data analysts and scientists are currently dealing with multiple systems for each stage of their pipeline. DAPHNE is an open and extensible system infrastructure for such pipelines, including language abstractions, compilation and runtime techniques, multi-level scheduling, hardware accelerators and computational storage. In this demonstration, we focus on the DAPHNE runtime that provides the implementation of kernels for local, distributed and accelerator-enhanced operations, vectorized execution, integration with existing frameworks and libraries for productivity and interoperability, as well as efficient I/O and communication primitives.

Original language: English
Title of host publication: Euro-Par 2023
Subtitle of host publication: Parallel Processing Workshops - Euro-Par 2023 International Workshops, 2023, Revised Selected Papers
Editors: Demetris Zeinalipour, Dora Blanco Heras, George Pallis, Herodotos Herodotou, Demetris Trihinas, Daniel Balouek, Patrick Diehl, Terry Cojean, Karl Fürlinger, Maja Hanne Kirkeby, Matteo Nardelli, Pierangelo Di Sanzo
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 242-246
Number of pages: 5
ISBN (Print): 9783031488023
Publication status: Published - 2024
Event: 29th International Conference on Parallel and Distributed Computing: Euro-Par 2023 - Limassol, Cyprus
Duration: 28 Aug 2023 to 1 Sept 2023

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 14352 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 29th International Conference on Parallel and Distributed Computing
Abbreviated title: Euro-Par 2023
Country/Territory: Cyprus
City: Limassol
Period: 28/08/23 to 1/09/23

Keywords

  • Distributed Systems
  • High Performance Computing
  • Machine Learning Systems
  • Vectorized Execution

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
