Volume 49, Issue 2, 2007, Pages 144-158

Automatic discrimination between laughter and speech

Author keywords

Automatic detection emotion; Automatic detection laughter

Indexed keywords

ACOUSTIC WAVES; ERROR DETECTION; GESTURE RECOGNITION; LEARNING SYSTEMS; MATHEMATICAL MODELS;

EID: 33846907749     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.01.001     Document Type: Article
Times cited: 129

References (38)
  • 1
    • 84893194243
    • Adami, A.G., Hermansky, H., 2003. Segmentation of speech for speaker and language recognition. In: Proc. of the Eurospeech 2003, Geneva, Switzerland, pp. 841-844.
  • 2
    • 0033884857
    • Auckenthaler, R., Carey, M., Lloyd-Thomas, H., 2000. Score normalization for text-independent speaker verification systems. Digital Signal Process. 10, 42-54.
  • 4
    • 33846920217
    • Bett, M., Gross, R., Yu, H., Zhu, X., Pan, Y., Yang, J., Waibel, A., 2000. Multimodal Meeting Tracker. In: Proc. of the RIAO 2000, Paris, France.
  • 5
    • 33846902469
    • Bickley, C., Hunnicutt, S., 1992. Acoustic analysis of laughter. In: Proc. of the ICSLP 1992, Banff, Canada, pp. 927-930.
  • 6
    • 33846937197
    • Boersma, P., Weenink, D., 2005. Praat: doing phonetics by computer (Version 4.3.01) [Computer program]. Retrieved from .
  • 7
    • 0037382560
    • ten Bosch, L., 2003. Emotions, speech and the ASR framework. Speech Commun. 40, 213-225.
  • 8
    • 33846940605
    • Cai, R., Lu, L., Zhang, H.-J., Cai, L.-H., 2003. Highlight sound effects detection in audio stream. In: Proc. of the IEEE International Conference on Multimedia and Expo 2003, Baltimore, USA, pp. 37-40.
  • 9
    • 0036289656
    • Campbell, W.M., 2002. Generalized linear discriminant sequence kernels for speaker recognition. In: Proc. of the IEEE International Conference on Acoustics Speech and Signal Processing 2002, Orlando, USA, pp. 161-164.
  • 10
    • 33846915245
    • Campbell, W.M., Reynolds, D.A., Campbell, J.P., 2004. Fusing discriminative and generative methods for speaker recognition: experiments on switchboard and NFI/TNO field data. In: Proc. Odyssey: The Speaker and Language Recognition Workshop 2004, Toledo, Spain, pp. 41-44.
  • 11
    • 33745209459
    • Campbell, N., Kashioka, H., Ohara, R., 2005. No laughing matter. In: Proc. of the Interspeech 2005, Lisbon, Portugal, pp. 465-468.
  • 12
    • 33846897071
    • Carey, M.J., Parris, E.S., Lloyd-Thomas, H., 1999. A comparison of features for speech, music discrimination. In: Proc. of the ICASSP 1999, Phoenix, USA, pp. 1432-1435.
  • 13
    • 0000913324
    • Collobert, R., Bengio, S., 2001. SVMTorch: support vector machines for large-scale regression problems. J. Mach. Learning Res. 1, 143-160.
  • 14
    • 0033738539
    • Doddington, G., Przybocki, M., Martin, A., Reynolds, D., 2000. The NIST speaker recognition evaluation - overview, methodology, systems, results, perspective. Speech Commun. 31, 225-254.
  • 15
    • 0027957839
    • Drullman, R., Festen, J.M., Plomp, R., 1994. Effect of temporal envelope smearing on speech reception. J. Acoust. Soc. Amer. 95 (2), 1053-1064.
  • 16
    • 33846926194
    • El Hannani, A., Petrovska-Delacretaz, D., 2005. Exploiting high-level information provided by ALISP in speaker recognition. In: Proceedings of Non Linear Speech Processing Workshop (NOLISP05), Barcelona, Spain, pp. 19-24.
  • 17
    • 0024909979
    • Gillick, L., Cox, S., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: ICASSP 1989, Glasgow, Scotland.
  • 18
    • 0025041264
    • Hermansky, H., 1990. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Amer. 87 (4), 1738-1752.
  • 19
    • 33846904652
    • Janin, A., Ang, J., Bhagat, S., Dhillon, R., Edwards, J., Macias-Guarasa, J., Morgan, N., Peskin, B., Shriberg, E., Stolcke, A., Wooters, C., Wrede, B., 2004. The ICSI meeting project: resources and research. In: NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada.
  • 20
    • 33846925416
    • Kennedy, L.S., Ellis, D.P.W., 2004. Laughter detection in meetings. In: NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada.
  • 21
    • 0012046853
    • Lippmann, R.P., Kukolich, L., Singer, E., 1993. LNKnet: neural network, machine learning, and statistical software for pattern classification. Lincoln Lab. J. 6 (2), 249-268.
  • 22
    • 0038726242
    • Lockerd, A., Mueller, F., 2002. LAFCam - leveraging affective feedback camcorder. In: Proc. of the CHI 2002 Conference on Human Factors in Computing Systems, Minneapolis, USA, pp. 574-575.
  • 23
    • 33846898390
    • Martin, A., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M., 1997. The DET curve in assessment of detection task performance. In: Proc. of the Eurospeech 1997, Rhodes, Greece, pp. 1895-1898.
  • 25
    • 0242721417
    • Nwe, T.L., Foo, S.W., De Silva, L.C., 2003. Speech emotion recognition using hidden Markov models. Speech Commun. 41, 603-623.
  • 26
    • 0027140974
    • Nwokah, E.E., Davies, P., Islam, A., Hsu, H.-C., Fogel, A., 1993. Vocal affect in three-year-olds: a quantitative acoustic analysis of child laughter. J. Acoust. Soc. Amer. 94 (6), 3076-3090.
  • 27
    • 33846917039
    • Ohara, R., 2004. Analysis of a laughing voice and the method of laughter in dialogue speech. Unpublished Masters Thesis, Nara Institute of Science and Technology.
  • 28
    • 78649270174
    • Oostdijk, N., 2000. The spoken Dutch corpus: overview and first evaluation. In: Proc. of the LREC 2000, Athens, Greece, pp. 887-894.
  • 29
    • 0033884858
    • Reynolds, D.A., Quatieri, T.F., Dunn, R., 2000. Speaker verification using adapted Gaussian mixture models. Digital Signal Process. 10, 19-41.
  • 30
    • 0031668260
    • Rothganger, H., Hauser, G., Cappellini, A.C., Guidotti, A., 1998. Analysis of laughter and speech sounds in Italian and German students. Naturwissenschaften 85 (8), 394-402.
  • 31
    • 0000679696
    • Scherer, K.R., 1982. Methods of research on vocal communication: paradigms and parameters. In: Scherer, K.R., Ekman, P. (Eds.), Handbook of Methods in Nonverbal Behavior Research. Cambridge UP, New York, pp. 36-198.
  • 32
    • 33846939129
    • Trouvain, J., 2003. Segmenting phonetic units in laughter. In: Proc. of the ICPhS, Barcelona, Spain, pp. 2793-2796.
  • 33
    • 33745188038
    • Truong, K.P., Van Leeuwen, D.A., 2005. Automatic detection of laughter. In: Proc. of the Interspeech 2005, Lisbon, Portugal, pp. 485-488.
  • 36
    • 0015409613
    • Williams, C.E., Stevens, K.N., 1972. Emotions and speech: some acoustical correlates. J. Acoust. Soc. Amer. 52, 1238-1250.
  • 38
    • 84947280249
    • Yacoub, S., Simske, S., Lin, X., Burns, J., 2003. Recognition of emotions in interactive voice response systems. In: Proc. of the Eurospeech 2003, Geneva, Switzerland, pp. 729-732.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.