Vienna University of Technology
Institute of Software Technology and Interactive Systems
Information & Software Engineering Group

Music Information Retrieval

TU Logo  IFS Logo
  [Home] [People] [Publications]  

Audio Feature Extraction

Music Classification

PlaySOM - Organisation of Music Archives

Browsing Music Spaces on Mobile Devices

Map of Mozart

Visualization Enhancements

SOMeJB

With the creation of large audio collections, we need to devise ways to make those collections accessible to the users. Currently, access to music repositories is mostly limited to query-based retrieval based on textual meta-data, with some advanced systems supporting acoustic queries. What we would like to have additionally, is a way to facilitate exploration of musical libraries. We thus need to automatically organize music according to its sound characteristics in such a way that we find similar pieces of music grouped together, allowing us to find a classical section, or a hard-rock section etc. in a music repository.

We thus research various methods of indexing and structuring audio collections, as well as providing intuitive user interfaces for a wide range of devices. Feature extraction from audio signals, incorporating psychoacoustic information, is combined with textual features. Machine learning techniques are used to e xtract semantic information, group audio by similarity, or classify it into various genres. Advanced visualization techniques are employed to provide intuitive interfaces to audio collections on standard PC as well as mobile devices.

Contact: Andreas Rauber.

 

Audio Feature Extraction

Using methods from digital signal processing and psycho-acoustics we are extracting semantic information from music. The features extracted from the audio signal are able to describe the stylistic content of the music, e.g. beat, presence of voice, timbre, etc. Thus, a system using audio feature extraction is able to tell about the content of a piece of music without the need of annotated labels such as artist, song title or genre. Moreover, it is able to find similar music automatically. Different kinds of feature sets (Rhythm Patterns, amongst others) are the basis to many subsequent tasks, such as automatic music organization or classification into genres. [Details]
 

Project Members: Thomas Lidy, David Laister, Alexander Trinkl

 

Music Classification

Applying machine learning methods and employing the features calculated from audio signals we built a system that is able to learn the classification of music pieces into a pre-defined taxonomy, corresponding to the user's likes. The system has to be trained with a number of examples and is then able to categorize music into different classes, e.g. music genres (classical, jazz, hip hop, electronic, ...) or can identify artists. In another project we combine textual with audio information for music classification. A new project focuses on the recognition of mood (emotions) in music. [Details]
 

Project Members: Thomas Lidy, Alexander Trinkl, Robert Neumayer, Doris Baum

 

PlaySOM - Organisation of Music Archives

Contrary to Music Classification, this project is about unsupervised organisation of whole music archives, which means that no training of the system is necessary. We apply the Self Organizing Map (SOM) algorithm on audio features and are thus able to organize a music archive on an intuitive map, grouping audio tracks by their perceived acoustic similarity. Music with a similar style is located close to each other building clusters (or islands), while music with different style is located farther away. Different visualization techniques enable various browsing metaphors, such as geographic maps or weather charts. As the name says, the system is compeletely self-organizing, and thus overcomes traditional genre boundaries and adapts individually depending on the music archive used. [Details]
 

Project Members: Michael Dittenbach, Robert Neumayer, Rudolf Mayer, Thomas Lidy

 

Browsing Music Spaces on Mobile Devices

Large music collections are becoming more and more important also for the mobile user. People are able to carry their complete audio collection with them on small portable devices. We believe, that intuitive and intelligent access to music spaces is also needed on portable devices, such as PDAs and mobile phones. Manually searching through directory structures and sorting music into playlists is no way to enjoy music, especially when one wants to listen a music of a particular mood. Thus, we are making effort to implement our metaphors from the SOM enhanced JukeBox also on state-of-the-art portable devices. [Details]
 

Project Members: Robert Neumayer

 

Map of Mozart

We extracted acoustic features from the complete works of Wolfgang Amadeus Mozart and created an interactive Map of Mozart. The map provides an immediate overview of all works by Mozart. The shape of the map was chosen to resemble Mozart's silhouette by using the MnemonicSOM algorithm. A semi-transparent layer enables the visualization of both Mozart's head and the Islands of Music metaphor for intuitive browsing and selection of the music. All pieces of music ever composed by Mozart can thus be retrieved and listened to by using the PlaySOM application. [Explore]
 

Project Members: Thomas Lidy, Rudolf Mayer

 

Visualization Enhancements

Our SOMeJB system already provides a number of intuitive visualization methods making browsing through music a pleasure. We implemented the U-Matrix, showing distances between neighbouring SOM-clusters, Smoothed Data Histograms, showing a methaphor of Islands of Music, as well as Weather Charts, and more. Research is continuing to provide the most enjoyable means of access to the wealth of music possible.
 

Project Members: Georg Pölzlbauer, Rudolf Mayer

 

SOMeJB - The SOM-enhanced JukeBox

SOMeJB, the SOM-enhanced JukeBox, commenced in 1999, established the basis for our browsing interfaces to music archives. The SOMeJB Music Digital Library Project aims at creating a browsable music archive by combining a variety of technologies from the fields of audio processing, neural networks, and information visualization, to create maps of music archives. It has its roots in the SOMLib Digital Library for text archives. It is based on the self-organizing map (SOM), a popular unsupervised neural network, and its extension, the Growing Hierarchical Self-Organizing Map (GHSOM). The resulting maps of the music archive can be explored, and new, unknown pieces of music similar to ones personal likings can be discovered, with Islands of Music and Weather Charts providing an intuitive interface to the system. [Details]
 

(Former) Project Members: Markus Frühwirth, Elias Pampalk, Thomas Lidy

 
 
top
last edited 11.04.2006 by Thomas Lidy