The KuiSCIMA Dataset for Optical Music Recognition of Ancient Chinese Suzipu Notation

Tristan Repolusk, Eduardo Enrique Veas

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

In recent years, the development of Optical Music Recognition (OMR) has progressed significantly. However, music cultures with smaller communities have only recently been considered in this process. This results in a lack of adequate ground truth datasets needed for the development and benchmarking of OMR systems. In this work, the KuiSCIMA (Jiang Kui Score Images for Musicological Analysis) dataset is introduced. KuiSCIMA is the first machine-readable dataset of the suzipu notations in Jiang Kui’s collection Baishidaoren Gequ from 1202. Collected from five different woodblock print editions, the dataset contains 21797 manually annotated instances on 153 pages in total, from which 14500 are text character annotations, and 7297 are suzipu notation symbols. The dataset comes with an open-source tool which allows editing, visualizing, and exporting the contents of the dataset files. In total, this contribution promotes the preservation and understanding of cultural heritage through digitization.
Original languageEnglish
Title of host publicationDocument Analysis and Recognition - ICDAR 2024 - 18th International Conference, Proceedings
EditorsElisa H. Barney Smith, Marcus Liwicki, Liangrui Peng
PublisherSpringer Nature Switzerland AG
Pages38-54
Number of pages17
ISBN (Print)9783031705519
DOIs
Publication statusPublished - 11 Sept 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14809 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • optical music recognition
  • Cultural heritage
  • Optical Music Recognition
  • Jiang Kui
  • Suzipu
  • Ancient Chinese music
  • Banzipu

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'The KuiSCIMA Dataset for Optical Music Recognition of Ancient Chinese Suzipu Notation'. Together they form a unique fingerprint.

Cite this