Singing voice detection with deep recurrent neural networks

Simon Leglaive; Romain Hennequin; Roland Badeau

Communication Dans Un Congrès Année : 2015

Singing voice detection with deep recurrent neural networks

(1, 2) , (1) , (2)

1
2

Simon Leglaive

Fonction : Auteur
PersonId : 20853
IdHAL : simon-leglaive
ORCID : 0000-0002-8219-1298
IdRef : 25312171X

Audionamix

Département Traitement du Signal et des Images

Romain Hennequin

Fonction : Auteur
PersonId : 963444

Audionamix

Roland Badeau

Fonction : Auteur
PersonId : 1121
IdHAL : rbadeau
ORCID : 0000-0002-9630-6877
IdRef : 106938134

Département Traitement du Signal et des Images

Résumé

In this paper, we propose a new method for singing voice detection based on a Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Network (RNN). This classifier is able to take into account a past and future temporal context to decide on the presence/absence of singing voice, thus using the inherent sequential aspect of a short-term feature extraction in a piece of music. The BLSTM-RNN contains several hidden layers, so it is able to extract from low-level features a simple representation fitted to our task. The results we obtain significantly outperform state-of-the-art methods on a common database.

Mots clés

Recurrent Neural Networks Singing Voice Detection Long Short-Term Memory Deep Learning

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

Leglaive-ICASSP-2015.pdf (660.51 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Roland Badeau : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01110035

Soumis le : lundi 27 avril 2015-14:45:09

Dernière modification le : lundi 9 octobre 2023-12:49:39

Archivage à long terme le : mercredi 19 avril 2017-07:32:13

Dates et versions

hal-01110035 , version 1 (27-04-2015)

Identifiants

HAL Id : hal-01110035 , version 1

Citer

Simon Leglaive, Romain Hennequin, Roland Badeau. Singing voice detection with deep recurrent neural networks. 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.121-125. ⟨hal-01110035⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM CNRS PARISTECH IDS S2A ANR

932 Consultations

2677 Téléchargements

Singing voice detection with deep recurrent neural networks

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager