Improved Acoustic Modeling for Automatic Piano Music Transcription Using Echo State Networks

Peter Steiner, Azarakhsh Jalalvand, Peter Birkholz

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Automatic music transcription (AMT) is one of the challenging problems in Music Information Retrieval with the goal of generating a score-like representation of a polyphonic audio signal. Typically, the starting point of AMT is an acoustic model that computes note likelihoods from feature vectors. In this work, we evaluate the capabilities of Echo State Networks (ESNs) in acoustic modeling of piano music. Our experiments show that the ESN-based models outperform state-of-the-art Convolutional Neural Networks (CNNs) by an absolute improvement of 0.5 F1 -score without using an extra language model. We also discuss that a two-layer ESN, which mimics a hybrid acoustic and language model, achieves better results than the best reference approach that combines Invertible Neural Networks (INNs) with a biGRU language model by an absolute improvement of 0.91 F1 -score.

Original languageEnglish (US)
Title of host publicationAdvances in Computational Intelligence - 16th International Work-Conference on Artificial Neural Networks, IWANN 2021, Proceedings
EditorsIgnacio Rojas, Gonzalo Joya, Andreu Catala
PublisherSpringer Science and Business Media Deutschland GmbH
Pages143-154
Number of pages12
ISBN (Print)9783030850982
DOIs
StatePublished - 2021
Event16th International Work-Conference on Artificial Neural Networks, IWANN 2021 - Virtual, Online
Duration: Jun 16 2021Jun 18 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12862 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Work-Conference on Artificial Neural Networks, IWANN 2021
CityVirtual, Online
Period6/16/216/18/21

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Keywords

  • Acoustic modeling
  • Automatic piano transcription
  • Echo state network

Fingerprint

Dive into the research topics of 'Improved Acoustic Modeling for Automatic Piano Music Transcription Using Echo State Networks'. Together they form a unique fingerprint.

Cite this