Fine-grain voice strength estimation from vowel spectral cues

Jean-Sylvain Liénard; Claude Barras

Conference paper. Year: 2013

Jean-Sylvain Liénard

Role: Author
PersonId : 1026941

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Claude Barras

Role: Author
PersonId : 17217
IdHAL : claude-barras
IdRef : 165065583

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

This study investigates whether voice strength, i.e. the sound level produced by the speaker, can be recovered from the recorded signal. The dataset consists of 720 isolated vowel tokens recorded while two interlocutors interacted orally at distances between 0.40 and 6 meters in a furnished room. For each token, voice strength is measured at the intensity peak, and several sets of acoustic cues are extracted from the signal spectrum after frequency weighting and intensity normalization. In the first phase, the tokens are grouped into categories of increasing voice strength, and Discriminant Analysis produces a classifier that takes into account all the signal dimensions implicitly coded in the set of cues. In the second phase, the cues of a new token are given to the classifier, which returns the token's distances to the groups; these distances provide the basis for estimating the unknown voice strength. The quality of the process is evaluated either in self-consistency mode or by cross-validation, that is, by comparing the estimate with the value initially measured on the same token. The statistical margin of error is quite low, on the order of 3 dB, depending on the sets of cues used.
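The two-phase process described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not say how the distances to the groups are turned into an estimate, so this sketch assumes plain Euclidean distances to group centroids (a simplified stand-in for Discriminant Analysis) and an inverse-squared-distance weighted average of the group mean levels. The data are synthetic, and every name (`group_by_strength`, `fit`, `estimate`, `leave_one_out_rmse`, the bin edges, the two toy cues) is hypothetical.

```python
import math
import random
from statistics import mean

def group_by_strength(tokens, edges):
    """Phase 1a: put each (cues, level_db) token into a voice-strength bin."""
    groups = {}
    for cues, level in tokens:
        idx = sum(level >= e for e in edges)  # number of edges at or below level
        groups.setdefault(idx, []).append((cues, level))
    return groups

def fit(groups):
    """Phase 1b: per group, store the cue centroid and the mean level in dB."""
    model = []
    for idx in sorted(groups):
        members = groups[idx]
        dim = len(members[0][0])
        centroid = [mean(c[k] for c, _ in members) for k in range(dim)]
        model.append((centroid, mean(lvl for _, lvl in members)))
    return model

def estimate(model, cues, eps=1e-9):
    """Phase 2: weight each group's mean level by 1 / squared distance (assumed rule)."""
    weights, levels = [], []
    for centroid, level in model:
        d2 = sum((a - b) ** 2 for a, b in zip(cues, centroid))
        weights.append(1.0 / (d2 + eps))
        levels.append(level)
    return sum(w * l for w, l in zip(weights, levels)) / sum(weights)

def leave_one_out_rmse(tokens, edges):
    """Cross-validation: estimate each token from a model built without it."""
    errs = []
    for i, (cues, level) in enumerate(tokens):
        rest = tokens[:i] + tokens[i + 1:]
        model = fit(group_by_strength(rest, edges))
        errs.append((estimate(model, cues) - level) ** 2)
    return math.sqrt(mean(errs))

# Synthetic tokens: two toy cues that vary linearly with level, plus noise.
random.seed(0)
tokens = []
for i in range(80):
    level = 50.0 + 0.5 * i  # 50.0 .. 89.5 dB, purely illustrative
    cues = [0.1 * level + random.gauss(0, 0.2),
            -0.05 * level + random.gauss(0, 0.1)]
    tokens.append((cues, level))

edges = [60.0, 70.0, 80.0]  # hypothetical category boundaries in dB
model = fit(group_by_strength(tokens, edges))
self_consistency = estimate(model, tokens[40][0])  # token seen during fitting
rmse = leave_one_out_rmse(tokens, edges)
print(f"self-consistency estimate: {self_consistency:.1f} dB, LOO RMSE: {rmse:.1f} dB")
```

Because the estimate is a weighted average of group mean levels, it cannot fall outside the range spanned by those means, so tokens at the extremes of the strength scale are pulled inward. A real discriminant-analysis classifier would also account for the covariance of the cues, which this sketch ignores.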

Keywords

vocal effort; vocal intensity; voice quality; discriminant analysis

Domains

Signal and image processing [eess.SP]

Main file

i13_0128.pdf (177.79 KB)

Origin: Publisher files allowed on an open archive

https://hal.science/hal-01690249

Submitted on: Tuesday, January 23, 2018, 17:08:28

Last modified on: Saturday, October 7, 2023, 21:36:20

Long-term archiving on: Thursday, May 24, 2018, 10:15:07

Dates and versions

hal-01690249 , version 1 (23-01-2018)

Identifiers

HAL Id : hal-01690249 , version 1

Cite

Jean-Sylvain Liénard, Claude Barras. Fine-grain voice strength estimation from vowel spectral cues. Interspeech 2013, Aug 2013, Lyon, France. ⟨hal-01690249⟩

Collections

CNRS LIMSI SORBONNE-UNIVERSITE LISN

17 views

20 downloads
