Volume 46, Issue 3-4, 2005, Pages 418-439

Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus

Author keywords

Automatic speech recognition; Prosody

Indexed keywords

ACOUSTIC PROPERTIES; MATHEMATICAL MODELS; PHONOGRAPHS; RADIO COMMUNICATION; STATISTICAL METHODS

EID: 21844465704     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.01.009     Document Type: Conference Paper
Times cited: 40

References (60)
  • 3
    • 0003597966
    • Guidelines for ToBI labelling
    • Ohio State University
    • Beckman, M.E., Elam, G.A., 1994. Guidelines for ToBI labelling. Technical report, Ohio State University. Available from <http://www.ling.ohio-state.edu/research/phonetics/E_ToBI/singer_tobi.html>.
    • (1994) Technical Report
    • Beckman, M.E.1    Elam, G.A.2
  • 6
    • 21844474476
    • Acoustic differentiation of ip and IP boundary levels: Comparison of L- and L-L% in the Switchboard corpus
    • Nara, Japan
    • Chavarria, S., Yoon, T., Cole, J., Hasegawa-Johnson, M., 2004. Acoustic differentiation of ip and IP boundary levels: Comparison of L- and L-L% in the Switchboard corpus. In: Proc. SpeechProsody, Nara, Japan.
    • (2004) Proc. SpeechProsody
    • Chavarria, S.1    Yoon, T.2    Cole, J.3    Hasegawa-Johnson, M.4
  • 9
    • 85009211730
    • Prosody dependent speech recognition with explicit duration modeling at intonational phrase boundaries
    • Geneva
    • Chen, K., Borys, S., Hasegawa-Johnson, M., 2003a. Prosody dependent speech recognition with explicit duration modeling at intonational phrase boundaries. In: Proc. EUROSPEECH, Geneva, pp. 393-396.
    • (2003) Proc. EUROSPEECH , pp. 393-396
    • Chen, K.1    Borys, S.2    Hasegawa-Johnson, M.3
  • 11
    • 4544275067
    • An automatic prosody labeling system using ANN-based syntactic prosodic model and GMM-based acoustic prosodic model
    • Chen, K., Hasegawa-Johnson, M., Cohen, A., 2004a. An automatic prosody labeling system using ANN-based syntactic prosodic model and GMM-based acoustic prosodic model. In: Proc. ICASSP.
    • (2004) Proc. ICASSP
    • Chen, K.1    Hasegawa-Johnson, M.2    Cohen, A.3
  • 17
    • 0023921973
    • Segmental durations in connected-speech signals: Current results
    • Crystal, T.H., House, A.S., 1988. Segmental durations in connected-speech signals: Current results. J. Acoust. Soc. Amer. 83, 1553-1573.
    • (1988) J. Acoust. Soc. Amer. , vol.83 , pp. 1553-1573
    • Crystal, T.H.1    House, A.S.2
  • 18
    • 21844469157
    • The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation
    • DeJong, K., 1995. The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation. J. Acoust. Soc. Amer. 89 (1), 369-382.
    • (1995) J. Acoust. Soc. Amer. , vol.89 , Issue.1 , pp. 369-382
    • DeJong, K.1
  • 19
    • 0030268342
    • Glottalization of word-initial vowels as a function of prosodic structure
    • Dilley, L., Shattuck-Hufnagel, S., Ostendorf, M., 1996. Glottalization of word-initial vowels as a function of prosodic structure. J. Phonet. 24, 423-444.
    • (1996) J. Phonet. , vol.24 , pp. 423-444
    • Dilley, L.1    Shattuck-Hufnagel, S.2    Ostendorf, M.3
  • 20
  • 21
    • 0141702354
    • A prosody-based approach to end-of-utterance detection that does not require speech recognition
    • Ferrer, L., Shriberg, E., Stolcke, A., 2003. A prosody-based approach to end-of-utterance detection that does not require speech recognition. In: Proc. ICASSP. pp. 608-611.
    • (2003) Proc. ICASSP , pp. 608-611
    • Ferrer, L.1    Shriberg, E.2    Stolcke, A.3
  • 22
    • 0031009252
    • Articulatory strengthening at edges of prosodic domains
    • Fougeron, C., Keating, P.A., 1997. Articulatory strengthening at edges of prosodic domains. J. Acoust. Soc. Amer. 101 (6), 3728-3740.
    • (1997) J. Acoust. Soc. Amer. , vol.101 , Issue.6 , pp. 3728-3740
    • Fougeron, C.1    Keating, P.A.2
  • 23
    • 85011187169
    • Analysis of voice fundamental frequency contours for declarative sentence of Japanese
    • Fujisaki, H., Hirose, K., 1984. Analysis of voice fundamental frequency contours for declarative sentence of Japanese. J. Acoust. Soc. Jpn. 5 (4), 233-242.
    • (1984) J. Acoust. Soc. Jpn. , vol.5 , Issue.4 , pp. 233-242
    • Fujisaki, H.1    Hirose, K.2
  • 24
    • 85016587886
    • SWITCHBOARD: Telephone speech corpus for research and development
    • Godfrey, J., Holliman, E., McDaniel, J., 1992. SWITCHBOARD: telephone speech corpus for research and development. In: Proc. ICASSP. pp. 517-520.
    • (1992) Proc. ICASSP , pp. 517-520
    • Godfrey, J.1    Holliman, E.2    McDaniel, J.3
  • 28
    • 0037795510
    • Automatic extraction of F0 control rules using statistical analysis
    • van Santen, J.P.H., Sproat, R.W., Olive, J.P., Hirschberg, J. (Eds.), Springer-Verlag, New York
    • Hirai, T., Iwahashi, N., Higuchi, N., Sagisaka, Y., 1997. Automatic extraction of F0 control rules using statistical analysis. In: van Santen, J.P.H., Sproat, R.W., Olive, J.P., Hirschberg, J. (Eds.), Progress in Speech Synthesis. Springer-Verlag, New York, pp. 333-346.
    • (1997) Progress in Speech Synthesis , pp. 333-346
    • Hirai, T.1    Iwahashi, N.2    Higuchi, N.3    Sagisaka, Y.4
  • 30
    • 0002838041
    • Consonant types, vowel quality and tone
    • Fromkin, V. (Ed.)
    • Hombert, J., 1978. Consonant types, vowel quality and tone. In: Fromkin, V. (Ed.), Tone: A Linguistic Survey. pp. 77-112.
    • (1978) Tone: A Linguistic Survey , pp. 77-112
    • Hombert, J.1
  • 31
    • 0032203256
    • Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method
    • Katagiri, S., Juang, B.-H., Lee, C.-H., 1998. Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method. Proc. IEEE 86 (11), 2345-2373.
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2345-2373
    • Katagiri, S.1    Juang, B.-H.2    Lee, C.-H.3
  • 32
    • 0015196653
    • Effects of stress contrasts on certain articulatory parameters
    • Kent, Netsell, 1971. Effects of stress contrasts on certain articulatory parameters. Phonetica 24, 23-44.
    • (1971) Phonetica , vol.24 , pp. 23-44
    • Kent1    Netsell2
  • 33
    • 0032584970
    • Time-delay recurrent neural network for temporal correlations and prediction
    • Kim, S.-S., 1998. Time-delay recurrent neural network for temporal correlations and prediction. Neurocomputing 20, 253-263.
    • (1998) Neurocomputing , vol.20 , pp. 253-263
    • Kim, S.-S.1
  • 34
    • 21844462286
    • The effect of accent on acoustic cues to stop voicing and place of articulation in radio news speech
    • Nara, Japan
    • Kim, H., Cole, J., Choi, H., Hasegawa-Johnson, M., 2004a. The effect of accent on acoustic cues to stop voicing and place of articulation in radio news speech. In: Proc. SpeechProsody, Nara, Japan.
    • (2004) Proc. SpeechProsody
    • Kim, H.1    Cole, J.2    Choi, H.3    Hasegawa-Johnson, M.4
  • 35
    • 3142765506
    • Automatic recognition of pitch movements using multi-layer perceptron and time-delay recursive neural network
    • Kim, S.-S., Hasegawa-Johnson, M., Chen, K., 2004. Automatic recognition of pitch movements using multi-layer perceptron and time-delay recursive neural network. IEEE Signal Process. Lett. 11 (7), 645-648.
    • (2004) IEEE Signal Process. Lett. , vol.11 , Issue.7 , pp. 645-648
    • Kim, S.-S.1    Hasegawa-Johnson, M.2    Chen, K.3
  • 36
    • 0016952322
    • Linguistic uses of segmental duration in English: Acoustic and perceptual evidence
    • Klatt, D.H., 1976. Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. J. Acoust. Soc. Amer. 59 (5), 1208-1221.
    • (1976) J. Acoust. Soc. Amer. , vol.59 , Issue.5 , pp. 1208-1221
    • Klatt, D.H.1
  • 38
    • 0024768209
    • Speaker-independent phone recognition using hidden Markov models
    • Lee, K.-F., Hon, H.-W., 1989. Speaker-independent phone recognition using hidden Markov models. IEEE Trans. Acoust., Speech, Signal Process. 37 (11), 1641-1648.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.-F.1    Hon, H.-W.2
  • 39
    • 85009223733
    • Automatic disfluency identification in conversational speech using multiple knowledge sources
    • Liu, Y., Shriberg, E., Stolcke, A., 2003. Automatic disfluency identification in conversational speech using multiple knowledge sources. In: Proc. EUROSPEECH.
    • (2003) Proc. EUROSPEECH
    • Liu, Y.1    Shriberg, E.2    Stolcke, A.3
  • 40
    • 0346707540
    • Approximately independent factors of speech using non-linear symplectic transformation
    • Omar, M.K., Hasegawa-Johnson, M., 2003. Approximately independent factors of speech using non-linear symplectic transformation. IEEE Trans. Speech and Audio Process. 11 (6), 660-671.
    • (2003) IEEE Trans. Speech and Audio Process. , vol.11 , Issue.6 , pp. 660-671
    • Omar, M.K.1    Hasegawa-Johnson, M.2
  • 47
    • 34848820349
    • Direct modeling of prosody: An overview of applications in automatic speech processing
    • Shriberg, E., Stolcke, A., 2004. Direct modeling of prosody: An overview of applications in automatic speech processing. In: Proc. SpeechProsody.
    • (2004) Proc. SpeechProsody
    • Shriberg, E.1    Stolcke, A.2
  • 51
    • 79959823252
    • Modeling the prosody of hidden events for improved word recognition
    • Stolcke, A., Shriberg, E., Hakkani-Tür, D., Tür, G., 1999. Modeling the prosody of hidden events for improved word recognition. In: Proc. EUROSPEECH, pp. 307-310.
    • (1999) Proc. EUROSPEECH , pp. 307-310
    • Stolcke, A.1    Shriberg, E.2    Hakkani-Tür, D.3    Tür, G.4
  • 52
    • 0034008810
    • Analysis and synthesis of intonation using the Tilt model
    • Taylor, P., 2000. Analysis and synthesis of intonation using the Tilt model. J. Acoust. Soc. Amer. 107 (3), 1697-1714.
    • (2000) J. Acoust. Soc. Amer. , vol.107 , Issue.3 , pp. 1697-1714
    • Taylor, P.1
  • 53
    • 0033096914
    • Acoustic characteristics of lexical stress in continuous telephone speech
    • van Kuijk, D., Boves, L., 1999. Acoustic characteristics of lexical stress in continuous telephone speech. Speech Comm. 27, 95-111.
    • (1999) Speech Comm. , vol.27 , pp. 95-111
    • Van Kuijk, D.1    Boves, L.2
  • 57
    • 0026734712
    • Segmental durations in the vicinity of prosodic phrase boundaries
    • Wightman, C., Shattuck-Hufnagel, S., Ostendorf, M., Price, P.J., 1992. Segmental durations in the vicinity of prosodic phrase boundaries. J. Acoust. Soc. Amer. 91 (3), 1707-1717.
    • (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.3 , pp. 1707-1717
    • Wightman, C.1    Shattuck-Hufnagel, S.2    Ostendorf, M.3    Price, P.J.4
  • 60
    • 0025477640
    • Speech database development at MIT: TIMIT and beyond
    • Zue, V., Seneff, S., Glass, J., 1990. Speech database development at MIT: TIMIT and beyond. Speech Comm. 9, 351-356.
    • (1990) Speech Comm. , vol.9 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.