



Volume 36, Issue 2-3, 2004, Pages 91-104

Acoustic Feature Analysis and Discriminative Modeling of Filled Pauses for Spontaneous Speech Recognition

Author keywords

Disfluency; Filled pause; Gaussian mixture model; Karhunen-Loève transform; Linear discriminant analysis; Speech recognition

Indexed keywords

ALGORITHMS; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; OPTIMIZATION; SYSTEMS ANALYSIS;

EID: 1542783199     PISSN: 13875485     EISSN: None     Source Type: Journal    
DOI: 10.1023/b:vlsi.0000015089.17975.f4     Document Type: Conference Paper
Times cited: 12

References (20)
  • 1
    • 0026366683
    • Understanding Spontaneous Speech: The Phoenix System
    • W. Ward, "Understanding Spontaneous Speech: The Phoenix System," Proc. of ICASSP-91, 1991, pp. 365-367.
    • (1991) Proc. of ICASSP-91 , pp. 365-367
    • Ward, W.1
  • 2
    • 1542600432
    • Investigation on Unknown Word Processing and Strategies for Spontaneous Speech Understanding
    • A. Kai and S. Nakagawa, "Investigation on Unknown Word Processing and Strategies for Spontaneous Speech Understanding," Proc. of Eurospeech'95, 1995, pp. 2095-2098.
    • (1995) Proc. of Eurospeech'95 , pp. 2095-2098
    • Kai, A.1    Nakagawa, S.2
  • 3
    • 0029765629
    • Statistical Language Model for Speech Disfluencies
    • A. Stolcke and E. Shriberg, "Statistical Language Model for Speech Disfluencies," Proc. of ICASSP-96, vol. 1, 1996, pp. 405-408.
    • (1996) Proc. of ICASSP-96 , vol.1 , pp. 405-408
    • Stolcke, A.1    Shriberg, E.2
  • 4
    • 0030365510
    • Modeling Disfluencies in Conversation Speech
    • M. Siu and M. Ostendorf, "Modeling Disfluencies in Conversation Speech," Proc. of ICSLP-96, vol. 1, 1996, pp. 386-389.
    • (1996) Proc. of ICSLP-96 , vol.1 , pp. 386-389
    • Siu, M.1    Ostendorf, M.2
  • 5
    • 0033873049
    • Variable N-Grams and Extensions for Conversational Speech Language Modeling
    • M. Siu and M. Ostendorf, "Variable N-Grams and Extensions for Conversational Speech Language Modeling," IEEE Trans. Speech and Audio Processing, vol. 8, no. 1, 2000, pp. 63-75.
    • (2000) IEEE Trans. Speech and Audio Processing , vol.8 , Issue.1 , pp. 63-75
    • Siu, M.1    Ostendorf, M.2
  • 6
    • 0033692777
    • Linguistic Properties of Non-Native Speech
    • L.M. Tomokiyo, "Linguistic Properties of Non-Native Speech," Proc. of ICASSP-2000, vol. 3, 2000, pp. 1335-1338.
    • (2000) Proc. of ICASSP-2000 , vol.3 , pp. 1335-1338
    • Tomokiyo, L.M.1
  • 7
    • 0030365533
    • Filled Pauses as Markers of Discourse Structure
    • M. Swerts, A. Wichmann, and R.J. Beun, "Filled Pauses as Markers of Discourse Structure," Proc. ICSLP-96, vol. 2, 1996, pp. 1033-1036.
    • (1996) Proc. ICSLP-96 , vol.2 , pp. 1033-1036
    • Swerts, M.1    Wichmann, A.2    Beun, R.J.3
  • 8
    • 85009071758
    • Recognition of Hesitations in Spontaneous Speech
    • D. O'Shaughnessy, "Recognition of Hesitations in Spontaneous Speech," Proc. of ICASSP-92, vol. 1, 1992, pp. 521-524.
    • (1992) Proc. of ICASSP-92 , vol.1 , pp. 521-524
    • O'Shaughnessy, D.1
  • 9
    • 85009070294
    • Detection of Filled Pauses in Spontaneous Conversation Speech
    • M. Gabrea and D. O'Shaughnessy, "Detection of Filled Pauses in Spontaneous Conversation Speech," Proc. of ICSLP-2000, 2000.
    • (2000) Proc. of ICSLP-2000
    • Gabrea, M.1    O'Shaughnessy, D.2
  • 10
    • 0029953272
    • Some Acoustic Feature of Nasal and Nasalized Vowels: A Target for Vowel Nasalization
    • G. Feng and E. Castelli, "Some Acoustic Feature of Nasal and Nasalized Vowels: A Target for Vowel Nasalization," J. Acoust. Soc. Am., vol. 99, no. 6, 1996, pp. 3694-3706.
    • (1996) J. Acoust. Soc. Am. , vol.99 , Issue.6 , pp. 3694-3706
    • Feng, G.1    Castelli, E.2
  • 11
    • 0030772174
    • Acoustic Correlates of English and French Nasalized Vowels
    • M.Y. Chen, "Acoustic Correlates of English and French Nasalized Vowels," J. Acoust. Soc. Am., vol. 102, no. 4, 1997, pp. 2360-2370.
    • (1997) J. Acoust. Soc. Am. , vol.102 , Issue.4 , pp. 2360-2370
    • Chen, M.Y.1
  • 12
    • 0001559782
    • Analysis of Nasal Consonants
    • O. Fujimura, "Analysis of Nasal Consonants," J. Acoust. Soc. Am., vol. 34, 1962, pp. 1865-1875.
    • (1962) J. Acoust. Soc. Am. , vol.34 , pp. 1865-1875
    • Fujimura, O.1
  • 13
    • 0020741527
    • Place Cues for Nasal Consonants with Special Reference to Catalan
    • D. Recasens, "Place Cues for Nasal Consonants with Special Reference to Catalan," J. Acoust. Soc. Am., vol. 73, no. 4, 1983, pp. 1346-1353.
    • (1983) J. Acoust. Soc. Am. , vol.73 , Issue.4 , pp. 1346-1353
    • Recasens, D.1
  • 14
    • 85009090967
    • Discriminative Disfluency Modeling for Spontaneous Speech Recognition
    • C.-H. Wu and G.-L. Yan, "Discriminative Disfluency Modeling for Spontaneous Speech Recognition," EuroSpeech, vol. 3, 2001, pp. 1955-1958.
    • (2001) EuroSpeech , vol.3 , pp. 1955-1958
    • Wu, C.-H.1    Yan, G.-L.2
  • 17
    • 0031546409
    • Hierarchical Approach to Formant Detection and Tracking Through Instantaneous Frequency Estimation
    • S. Ghaemmaghami, M. Deriche, and B. Boashash, "Hierarchical Approach to Formant Detection and Tracking Through Instantaneous Frequency Estimation," Electronics Letters, vol. 33, no. 1, 1997, pp. 17-18.
    • (1997) Electronics Letters , vol.33 , Issue.1 , pp. 17-18
    • Ghaemmaghami, S.1    Deriche, M.2    Boashash, B.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.