Automatic motherese detection for face-to-face interaction analysis

A. Mahdhaoui, M. Chetouani, C. Zong, R. S. Cassel, C. Saint-Georges, M. C. Laznik, S. Maestro, F. Apicella, [No Value] MuratoriF., D. Cohen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper deals with emotional speech detection in home movies. In this study, we focus on infant-directed speech also called "motherese" which is characterized by higher pitch, slower tempo, and exaggerated intonation. In this work, we show the robustness of approaches to automatic discrimination between infant-directed speech and normal directed speech. Specifically, we estimate the generalization capability of two feature extraction schemes extracted from supra-segmental and segmental information. In addition, two machine learning approaches are considered: k-nearest neighbors (k-NN) and Gaussian mixture models (GMM). Evaluations are carried out on real-life databases: home movies of the first year of an infant.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages248-255
Number of pages8
Volume5398 LNAI
DOIs
Publication statusPublished - 2009
EventCOST Action 2102 and euCognition International School on Multimodal Signals: Cognitive and Algorithmic Issues - Vietri sul Mare, Italy
Duration: Apr 21 2008Apr 26 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5398 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

OtherCOST Action 2102 and euCognition International School on Multimodal Signals: Cognitive and Algorithmic Issues
CountryItaly
CityVietri sul Mare
Period4/21/084/26/08

Fingerprint

Interaction
Emotional Speech
Gaussian Mixture Model
Feature Extraction
Discrimination
Nearest Neighbor
Machine Learning
Robustness
Learning systems
Feature extraction
Evaluation
Estimate
Speech
Life
Generalization

Keywords

  • feature and classifier fusion
  • Motherese detection

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Mahdhaoui, A., Chetouani, M., Zong, C., Cassel, R. S., Saint-Georges, C., Laznik, M. C., ... Cohen, D. (2009). Automatic motherese detection for face-to-face interaction analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5398 LNAI, pp. 248-255). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5398 LNAI). https://doi.org/10.1007/978-3-642-00525-1_25

Automatic motherese detection for face-to-face interaction analysis. / Mahdhaoui, A.; Chetouani, M.; Zong, C.; Cassel, R. S.; Saint-Georges, C.; Laznik, M. C.; Maestro, S.; Apicella, F.; MuratoriF., [No Value]; Cohen, D.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5398 LNAI 2009. p. 248-255 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5398 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mahdhaoui, A, Chetouani, M, Zong, C, Cassel, RS, Saint-Georges, C, Laznik, MC, Maestro, S, Apicella, F, MuratoriF., NV & Cohen, D 2009, Automatic motherese detection for face-to-face interaction analysis. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 5398 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5398 LNAI, pp. 248-255, COST Action 2102 and euCognition International School on Multimodal Signals: Cognitive and Algorithmic Issues, Vietri sul Mare, Italy, 4/21/08. https://doi.org/10.1007/978-3-642-00525-1_25
Mahdhaoui A, Chetouani M, Zong C, Cassel RS, Saint-Georges C, Laznik MC et al. Automatic motherese detection for face-to-face interaction analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5398 LNAI. 2009. p. 248-255. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-00525-1_25
Mahdhaoui, A. ; Chetouani, M. ; Zong, C. ; Cassel, R. S. ; Saint-Georges, C. ; Laznik, M. C. ; Maestro, S. ; Apicella, F. ; MuratoriF., [No Value] ; Cohen, D. / Automatic motherese detection for face-to-face interaction analysis. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5398 LNAI 2009. pp. 248-255 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{ba65c7038c9842f5bea115deec8b4bb4,
title = "Automatic motherese detection for face-to-face interaction analysis",
abstract = "This paper deals with emotional speech detection in home movies. In this study, we focus on infant-directed speech also called {"}motherese{"} which is characterized by higher pitch, slower tempo, and exaggerated intonation. In this work, we show the robustness of approaches to automatic discrimination between infant-directed speech and normal directed speech. Specifically, we estimate the generalization capability of two feature extraction schemes extracted from supra-segmental and segmental information. In addition, two machine learning approaches are considered: k-nearest neighbors (k-NN) and Gaussian mixture models (GMM). Evaluations are carried out on real-life databases: home movies of the first year of an infant.",
keywords = "feature and classifier fusion, Motherese detection",
author = "A. Mahdhaoui and M. Chetouani and C. Zong and Cassel, {R. S.} and C. Saint-Georges and Laznik, {M. C.} and S. Maestro and F. Apicella and MuratoriF., {[No Value]} and D. Cohen",
year = "2009",
doi = "10.1007/978-3-642-00525-1_25",
language = "English",
isbn = "3642005241",
volume = "5398 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "248--255",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Automatic motherese detection for face-to-face interaction analysis

AU - Mahdhaoui, A.

AU - Chetouani, M.

AU - Zong, C.

AU - Cassel, R. S.

AU - Saint-Georges, C.

AU - Laznik, M. C.

AU - Maestro, S.

AU - Apicella, F.

AU - MuratoriF., [No Value]

AU - Cohen, D.

PY - 2009

Y1 - 2009

N2 - This paper deals with emotional speech detection in home movies. In this study, we focus on infant-directed speech also called "motherese" which is characterized by higher pitch, slower tempo, and exaggerated intonation. In this work, we show the robustness of approaches to automatic discrimination between infant-directed speech and normal directed speech. Specifically, we estimate the generalization capability of two feature extraction schemes extracted from supra-segmental and segmental information. In addition, two machine learning approaches are considered: k-nearest neighbors (k-NN) and Gaussian mixture models (GMM). Evaluations are carried out on real-life databases: home movies of the first year of an infant.

AB - This paper deals with emotional speech detection in home movies. In this study, we focus on infant-directed speech also called "motherese" which is characterized by higher pitch, slower tempo, and exaggerated intonation. In this work, we show the robustness of approaches to automatic discrimination between infant-directed speech and normal directed speech. Specifically, we estimate the generalization capability of two feature extraction schemes extracted from supra-segmental and segmental information. In addition, two machine learning approaches are considered: k-nearest neighbors (k-NN) and Gaussian mixture models (GMM). Evaluations are carried out on real-life databases: home movies of the first year of an infant.

KW - feature and classifier fusion

KW - Motherese detection

UR - http://www.scopus.com/inward/record.url?scp=67650477797&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67650477797&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-00525-1_25

DO - 10.1007/978-3-642-00525-1_25

M3 - Conference contribution

AN - SCOPUS:67650477797

SN - 3642005241

SN - 9783642005244

VL - 5398 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 248

EP - 255

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -