Samoa Digital Library

Comparative Study on Normalisation in Emotion Recognition from Speech

Show simple item record

dc.contributor.author Böck, Ronald
dc.contributor.author Egorow, Olga
dc.contributor.author Siegert, Ingo ...et.al.
dc.date.accessioned 2021-12-10T03:01:19Z
dc.date.available 2021-12-10T03:01:19Z
dc.date.issued 2017
dc.identifier.citation Böck R., Egorow O., Siegert I., Wendemuth A. (2017) Comparative Study on Normalisation in Emotion Recognition from Speech. In: Horain P., Achard C., Mallem M. (eds) Intelligent Human Computer Interaction. IHCI 2017. Lecture Notes in Computer Science, vol 10688. Springer, Cham. https://doi.org/10.1007/978-3-319-72038-8_15 en_US
dc.identifier.uri https://doi.org/10.1007/978-3-319-72038-8_15
dc.identifier.uri ${sadil.baseUrl}/handle/123456789/1652
dc.description 13 p. ; PDF en_US
dc.description.abstract The recognition performance of a classifier is affected by various aspects. A huge influence is given by the input data pre-processing.In the current paper we analysed the relation between different normalisation methods for emotionally coloured speech samples deriving general trends to be considered during data pre-processing. From the best of our knowledge, various normalisation approaches are used in the spoken affect recognition community but so far no multi-corpus comparison was conducted. Therefore, well-known methods from literature were compared in a larger study based on nine benchmark corpora, where within each data set a leave-one-speaker-out validation strategy was applied. As normalisation approaches, we investigated standardisation, range normalisation, and centering. These were tested in two possible options: (1) The normalisation parameters were estimated on the whole data set and (2) we obtained the parameters by using emotionally neutral samples only. For classification Support Vector Machines with linear and polynomial kernels as well as Random Forest were used as representatives of classifiers handling input material in different ways. Besides further recommendations we showed that standardisation leads to a significant improvement of the recognition performance. It is also discussed when and how to apply normalisation methods. en_US
dc.language.iso en en_US
dc.publisher Springer Cham en_US
dc.title Comparative Study on Normalisation in Emotion Recognition from Speech en_US
dc.type Book en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Saili Sadil


Vaavaai

O a'u faʻamatalaga