Multipitch tracking in music signals using echo state networks

Peter Steiner, Simon Stone, Peter Birkholz, Azarakhsh Jalalvand

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

Currently, convolutional neural networks (CNNs) define the state of the art for multipitch tracking in music signals. Echo State Networks (ESNs), a recently introduced recurrent neural network architecture, achieved similar results as CNNs for various tasks, such as phoneme or digit recognition. However, they have not yet received much attention in the community of Music Information Retrieval. The core of ESNs is a group of unordered, randomly connected neurons, i.e., the reservoir, by which the low-dimensional input space is non-linearly transformed into a high-dimensional feature space. Because only the weights of the connections between the reservoir and the output are trained using linear regression, ESNs are easier to train than deep neural networks. This paper presents a first exploration of ESNs for the challenging task of multipitch tracking in music signals. The best results presented in this paper were achieved with a bidirectional two-layer ESN with 20 000 neurons in each layer. Although the final F-score of 0.7198 still falls below the state of the art (0.7370), the proposed ESN-based approach serves as a baseline for further investigations of ESNs in audio signal processing in the future.

Original languageEnglish (US)
Title of host publication28th European Signal Processing Conference, EUSIPCO 2020 - Proceedings
PublisherEuropean Signal Processing Conference, EUSIPCO
Pages126-130
Number of pages5
ISBN (Electronic)9789082797053
DOIs
StatePublished - Jan 24 2021
Externally publishedYes
Event28th European Signal Processing Conference, EUSIPCO 2020 - Amsterdam, Netherlands
Duration: Aug 24 2020Aug 28 2020

Publication series

NameEuropean Signal Processing Conference
Volume2021-January
ISSN (Print)2219-5491

Conference

Conference28th European Signal Processing Conference, EUSIPCO 2020
Country/TerritoryNetherlands
CityAmsterdam
Period8/24/208/28/20

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Electrical and Electronic Engineering

Keywords

  • Echo State Network
  • MIR
  • Multipitch
  • RNN
  • Reservoir Computing

Fingerprint

Dive into the research topics of 'Multipitch tracking in music signals using echo state networks'. Together they form a unique fingerprint.

Cite this