Deep Learning on Low-Resource Datasets

Preprint

Article

Deep Learning on Low-Resource Datasets

Altmetrics

Downloads

357

Views

1080

Comments

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

10 July 2018

Posted:

10 July 2018

You are already at the latest version

Alerts

Abstract

In training a deep learning system to perform audio transcription, two practical problems may arise. Firstly, most datasets are weakly labelled, having only a list of events present in each recording without any temporal information for training. Secondly, deep neural networks need a very large amount of labelled training data to achieve good quality performance, yet in practice it is difficult to collect enough samples for most classes of interest. In this paper, we propose factorising the final task of audio transcription into multiple intermediate tasks in order to improve the training performance when dealing with this kind of low-resource datasets. We evaluate three data-efficient approaches of training a stacked convolutional and recurrent neural network for the intermediate tasks. Our results show that different methods of training have different advantages and disadvantages.

Keywords:

Subject: Engineering - Electrical and Electronic Engineering

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

MDPI Initiatives

Important Links

Choose an area of interest and we will send you notifications of new preprints at your preferred frequency.

Disclaimer

Deep Learning on Low-Resource Datasets

Abstract

MDPI Initiatives

Important Links

Subscribe