Conference papers

Active Learning for Auditory Hierarchy

William Coleman, Technological University DublinFollow
Sarah Jane Delany, Technological University DublinFollow
Charlie Cullen, Technological University DublinFollow
Ming Yan, XperiFollow

Author ORCID Identifier

0000-0002-7551-882X

Document Type

Conference Paper

Rights

Available under a Creative Commons Attribution Non-Commercial Share Alike 4.0 International Licence

Disciplines

Computer Sciences

Publication Details

Cross Domain Conference for Machine Learning and Knowl- edge Extraction (CD-MAKE), Dublin, Ireland; 25-28 August, 2020.

Abstract

Much audio content today is rendered as a static stereo mix: fundamentally a fixed single entity. Object-based audio envisages the delivery of sound content using a collection of individual sound ‘objects’ controlled by accompanying metadata. This offers potential for audio to be delivered in a dynamic manner providing enhanced audio for consumers. One example of such treatment is the concept of applying varying levels of data compression to sound objects thereby reducing the volume of data to be transmitted in limited bandwidth situations. This application motivates the ability to accurately classify objects in terms of their ‘hierarchy’. That is, whether or not an object is a foreground sound, which should be reproduced at full quality if possible, or a background sound, which can be heavily compressed without causing a deterioration in the listening experience. Lack of suitably labelled data is an acknowledged problem in the domain. Active Learning is a method that can greatly reduce the manual effort required to label a large corpus by identifying the most effective instances to train a model to high accuracy levels. This paper compares a number of Active Learning methods to investigate which is most effective in the context of a hierarchical labelling task on an audio dataset. Results show that the number of manual labels required can be reduced to 1.7% of the total dataset while still retaining high prediction accuracy.

DOI

https://doi.org/10.1007/978-3-030-57321-8_20

Recommended Citation

Coleman, w. et al. (2020) Active Learning for Auditory Hierarchy, Cross Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Dublin, Ireland; 25-28 August, 2020. DOI:10.1007/978-3-030-57321-8_20

Funder

Irish Research Council

Download

Included in

Other Computer Engineering Commons

COinS

Conference papers

Active Learning for Auditory Hierarchy

Author ORCID Identifier

Document Type

Rights

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Funder

Included in

Search

Browse

Author Corner

Conference papers

Active Learning for Auditory Hierarchy

Authors

Author ORCID Identifier

Document Type

Rights

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Funder

Included in

Share

Search

Browse

Author Corner