메뉴 건너뛰기




Volumn , Issue , 2010, Pages 1539-1544

Modified LTSE-VAD algorithm for applications requiring reduced silence frame misclassification

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH RECOGNITION;

EID: 79959855030     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (17)
  • 5
    • 34547958553 scopus 로고    scopus 로고
    • Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
    • Salvatore Casale, Alessandra Russo, and Salvatore Serano. 2007. Multistyle classification of speech under stress using feature subset selection based on genetic algorithms. Speech Communication, 49(10):801-810.
    • (2007) Speech Communication , vol.49 , Issue.10 , pp. 801-810
    • Casale, S.1    Russo, A.2    Serano, S.3
  • 10
    • 48149087416 scopus 로고    scopus 로고
    • Real-time emotion detection system using speech: Multi-modal fusion of different timescale features
    • Crete
    • Samuel Kim, Panayiotis G. Georgiou, Sungbok Lee, and Shrikanth Narayanan. 2007. Real-time emotion detection system using speech: Multi-modal fusion of different timescale features. In IEEE Workshop on Multimedia Signal Processing, pages 48-51, Crete.
    • (2007) IEEE Workshop on Multimedia Signal Processing , pp. 48-51
    • Kim, S.1    Georgiou, P.G.2    Lee, S.3    Narayanan, S.4
  • 11
    • 0141478766 scopus 로고    scopus 로고
    • Pitch maxima for robust speaker recognition
    • Hong Kong
    • S. Krishnakumar, K.R. Prasanna Kumar, and N. Balakrishnan. 2003. Pitch maxima for robust speaker recognition. In ICASSP, Volume 2, pages 201-204, Hong Kong.
    • (2003) ICASSP , vol.2 , pp. 201-204
    • Krishnakumar, S.1    Prasanna Kumar, K.R.2    Balakrishnan, N.3
  • 12
    • 85046873967 scopus 로고    scopus 로고
    • The DET curve in assessment of detection task performance
    • Rhodes, Greece
    • Alvin F. Martin, George R. Doddington, Terri Kamm, Mark Ordowski, and Mark A. Przybocki. 1997. The DET curve in assessment of detection task performance. In Eurospeech, pages 1895-1898, Rhodes, Greece.
    • (1997) Eurospeech , pp. 1895-1898
    • Martin, A.F.1    Doddington, G.R.2    Kamm, T.3    Ordowski, M.4    Przybocki, M.A.5
  • 13
    • 0242721417 scopus 로고    scopus 로고
    • Speech emotion recognition using hidden Markov models
    • Tin Lay Nwe, Say Wei Foo, and Liyanage C. de Silva. 2003. Speech emotion recognition using hidden Markov models. Speech Communication, 41(4):603-623.
    • (2003) Speech Communication , vol.41 , Issue.4 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3
  • 14
    • 33646093001 scopus 로고    scopus 로고
    • Feature representation and discrimination based on Gaussian mixture model probability densities - Practices and algorithms
    • Pekka Paalanen, Joni-Kristian Kamarainen, Jarmo Ilonen, and Heikki Klviinen. 2006. Feature representation and discrimination based on gaussian mixture model probability densities - practices and algorithms. Pattern Recognition, 39(7):1346-1358.
    • (2006) Pattern Recognition , vol.39 , Issue.7 , pp. 1346-1358
    • Paalanen, P.1    Kamarainen, J.-K.2    Ilonen, J.3    Klviinen, H.4
  • 15
    • 23144440245 scopus 로고    scopus 로고
    • Global trend of fundamental frequency in emotional speech
    • Nara, Japan
    • A. Paeschke. 2004. Global trend of fundamental frequency in emotional speech. In Speech Prosody, pages 671-674, Nara, Japan.
    • (2004) Speech Prosody , pp. 671-674
    • Paeschke, A.1
  • 16
    • 1842476689 scopus 로고    scopus 로고
    • Efficient voice activity detection algorithms using long term speech information
    • Javier Ramirez, Jose C. Segura, Carmen Benitez, Angel de la Torre, and Antonio Rubio. 2004. Efficient voice activity detection algorithms using long term speech information. Speech Communication, 42:271-287.
    • (2004) Speech Communication , vol.42 , pp. 271-287
    • Ramirez, J.1    Segura, J.C.2    Benitez, C.3    De La Torre, A.4    Rubio, A.5
  • 17
    • 38049048651 scopus 로고    scopus 로고
    • Frame vs. Turn-level: Emotion recognition from speech considering static and dynamic processing
    • Bogdan Vlasenko, Bjrn Schuller, Andreas Wendemuth, and Gerhard Rigoll. 2007. Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing. Lecture Notes on Computer Science, 4738:139-147.
    • (2007) Lecture Notes on Computer Science , vol.4738 , pp. 139-147
    • Vlasenko, B.1    Schuller, B.2    Wendemuth, A.3    Rigoll, G.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.