Paper
17 January 2005 Towards automatic music transcription: note extraction based on independent subspace analysis
Author Affiliations +
Abstract
In this paper we present a technique for the separation of harmonic sounds within real sound mixtures for automatic music transcription using Independent Subspace Analysis (ISA). The algorithm is based on the assumption that tones played by an instrument within polyphonic music consist of components that are statistically independent from components of other tones. The first step of the algorithm is a temporal segmentation into note events. Both features in the time domain and in the frequency domain are used to detect segment boundaries, which are represented by starting or decaying tones. Each segment is now examined using the ISA and a set of statistically independent components is calculated. One tone played by an instrument consists of the fundamental frequency and its harmonics. Usually, the ISA results in more independent components than played notes, because not all harmonics are separated to the component containing their fundamental frequencies. Some harmonics are separated in components of its own. Using the Kullback-Leibler divergence components belonging together are grouped. A note classification, which is trained for piano music at the time, is the last step of the algorithm. Results show, that statistic independence is a promising measure for separating sounds into single notes using ISA as a step towards automatic music transcription.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jens Wellhausen and Michael Hoynck "Towards automatic music transcription: note extraction based on independent subspace analysis", Proc. SPIE 5682, Storage and Retrieval Methods and Applications for Multimedia 2005, (17 January 2005); https://doi.org/10.1117/12.587164
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Statistical analysis

Independent component analysis

Databases

Analytical research

Fourier transforms

Classification systems

Communication engineering

RELATED CONTENT

New generation of the multimedia search engines
Proceedings of SPIE (September 14 2016)
PNRS: personalized news retrieval system
Proceedings of SPIE (August 24 1999)
Music classification with MPEG-7
Proceedings of SPIE (January 10 2003)
Human perception of geometric distortions in images
Proceedings of SPIE (June 22 2004)

Back to Top