메뉴 건너뛰기




Volumn 358, Issue 1769, 2000, Pages 1325-1338

Incorporating linguistic theories of pronunciation variation into speech-recognition models

Author keywords

Acoustic modelling; Phonetic variation; Pronunciation modelling

Indexed keywords


EID: 0347307067     PISSN: 1364503X     EISSN: None     Source Type: Journal    
DOI: 10.1098/rsta.2000.0589     Document Type: Article
Times cited : (5)

References (42)
  • 5
    • 84958907242 scopus 로고
    • The geometry of phonological features
    • Clements, G. 1985 The geometry of phonological features. In Phonology Yearbook, vol. 2, pp. 223-252.
    • (1985) Phonology Yearbook , vol.2 , pp. 223-252
    • Clements, G.1
  • 8
    • 0026458724 scopus 로고
    • Structural design of HMM speech recognizer using multi-valued phonetic features: Comparison with segmental speech units
    • Deng, L. & Erler, K. 1992 Structural design of HMM speech recognizer using multi-valued phonetic features: comparison with segmental speech units. J. Acoust. Soc. Am. 92, 3058-3067.
    • (1992) J. Acoust. Soc. Am. , vol.92 , pp. 3058-3067
    • Deng, L.1    Erler, K.2
  • 9
    • 0030359816 scopus 로고    scopus 로고
    • Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition
    • Deng, L. & Wu, J. 1996 Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition. In Proc. Int. Conf. Spoken Language Processing, pp. 2266-2269.
    • (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2266-2269
    • Deng, L.1    Wu, J.2
  • 10
    • 0030268342 scopus 로고    scopus 로고
    • Glottalization of vowel-initial syllables as a function of prosodic structure
    • Dilley, L., Shattuck-Hufnagel, S. & Ostendorf, M. 1996 Glottalization of vowel-initial syllables as a function of prosodic structure. J. Phonetics 24, 423-444.
    • (1996) J. Phonetics , vol.24 , pp. 423-444
    • Dilley, L.1    Shattuck-Hufnagel, S.2    Ostendorf, M.3
  • 12
    • 0000003732 scopus 로고
    • Articulatory timing and the prosodic interpretation of syllable duration
    • Edwards, J. & Beckman, M. 1988 Articulatory timing and the prosodic interpretation of syllable duration. Phonetica 45, 156-174.
    • (1988) Phonetica , vol.45 , pp. 156-174
    • Edwards, J.1    Beckman, M.2
  • 13
    • 0346833363 scopus 로고    scopus 로고
    • Automatic modeling of pronunciation variations
    • Eide, E. 1999 Automatic modeling of pronunciation variations. In Proc. DARPA Broadcast News Workshop, pp. 95-98.
    • (1999) Proc. DARPA Broadcast News Workshop , pp. 95-98
    • Eide, E.1
  • 14
    • 0027627252 scopus 로고
    • Hidden Markov model representation of quantized articulatory features for speech recognition
    • Erler, K. & Deng, L. 1993 Hidden Markov model representation of quantized articulatory features for speech recognition. Comp. Speech Language 7, 265-282.
    • (1993) Comp. Speech Language , vol.7 , pp. 265-282
    • Erler, K.1    Deng, L.2
  • 15
    • 85027454087 scopus 로고    scopus 로고
    • Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
    • Finke, M. & Waibel, A. 1997 Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In Proc. Eur. Conf. Speech Communication and Technology, pp. 2379-2382.
    • (1997) Proc. Eur. Conf. Speech Communication and Technology , pp. 2379-2382
    • Finke, M.1    Waibel, A.2
  • 17
    • 0010881006 scopus 로고    scopus 로고
    • Demarcating prosodic groups with articulation
    • Fougeron, C. & Keating, P. 1997 Demarcating prosodic groups with articulation. J. Acoust. Soc. Am. 97, 3384.
    • (1997) J. Acoust. Soc. Am. , vol.97 , pp. 3384
    • Fougeron, C.1    Keating, P.2
  • 19
  • 22
    • 84973969285 scopus 로고
    • Feature geometry and the vocal tract
    • Keyser, S. & Stevens, K. 1994 Feature geometry and the vocal tract. Phonology 11, 207-236.
    • (1994) Phonology , vol.11 , pp. 207-236
    • Keyser, S.1    Stevens, K.2
  • 24
    • 0030355367 scopus 로고    scopus 로고
    • Syllable-level desynchronisation of phonetic features for speech recognition
    • Kirchhoff, K. 1996 Syllable-level desynchronisation of phonetic features for speech recognition. In Proc. Int. Conf. Spoken Language Processing, pp. 2274-2276.
    • (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2274-2276
    • Kirchhoff, K.1
  • 25
    • 85128370668 scopus 로고    scopus 로고
    • Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments
    • Kirchhoff, K. 1998 Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments. In Proc. Int. Conf. Spoken Language Processing, pp. 891-894.
    • (1998) Proc. Int. Conf. Spoken Language Processing , pp. 891-894
    • Kirchhoff, K.1
  • 26
    • 0342759107 scopus 로고    scopus 로고
    • Speech recognition with phonological features
    • Lahiri, A. 1999 Speech recognition with phonological features. In Proc. Int. Congr. Phonetic Sciences, pp. 715-718.
    • (1999) Proc. Int. Congr. Phonetic Sciences , pp. 715-718
    • Lahiri, A.1
  • 27
    • 0000665734 scopus 로고
    • Explaining phonetic variation: A sketch of the H&H theory
    • ed. W. Hardcastle & A. Marchal, Dordrecht: Kluwer
    • Lindblom, B. 1990 Explaining phonetic variation: a sketch of the H&H theory. In Speech production and speech modelling (ed. W. Hardcastle & A. Marchal), pp. 403-439. Dordrecht: Kluwer.
    • (1990) Speech Production and Speech Modelling , pp. 403-439
    • Lindblom, B.1
  • 28
    • 84943154470 scopus 로고    scopus 로고
    • Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
    • McAllister, D., Gillick, L. Scattone, F. & Newman, M. 1998 Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch. In Proc. Int. Conf. Spoken Language Processing, pp. 1847-1850.
    • (1998) Proc. Int. Conf. Spoken Language Processing , pp. 1847-1850
    • McAllister, D.1    Gillick, L.2    Scattone, F.3    Newman, M.4
  • 30
    • 0030715097 scopus 로고    scopus 로고
    • HMM topology design using maximum likelihood successive state splitting
    • Ostendorf, M. & Singer, H. 1997 HMM topology design using maximum likelihood successive state splitting. Comp. Speech Language 11, 17-42.
    • (1997) Comp. Speech Language , vol.11 , pp. 17-42
    • Ostendorf, M.1    Singer, H.2
  • 32
  • 33
    • 0030682299 scopus 로고    scopus 로고
    • Extensions to phone-state decision-tree clustering: Single tree and tagged clustering
    • Paul, D. 1997 Extensions to phone-state decision-tree clustering: single tree and tagged clustering. In Proc. Int. Conf. Acoustics, Speech and Signal Processing, pp. 1487-1490.
    • (1997) Proc. Int. Conf. Acoustics, Speech and Signal Processing , pp. 1487-1490
    • Paul, D.1
  • 34
    • 0033353288 scopus 로고    scopus 로고
    • Stochastic pronunciation modelling from hand-labelled phonetic corpora
    • Riley, M. (and 10 others) 1999 Stochastic pronunciation modelling from hand-labelled phonetic corpora. Speech Commun. 29, 209-224.
    • (1999) Speech Commun. , vol.29 , pp. 209-224
    • Riley, M.1
  • 36
    • 0030095762 scopus 로고    scopus 로고
    • A prosody tutorial for investigators of auditory sentence processing
    • Shattuck-Hufnagel, S. & Turk, A. 1996 A prosody tutorial for investigators of auditory sentence processing. J. Psycholing. Res. 25, 193-247.
    • (1996) J. Psycholing. Res. , vol.25 , pp. 193-247
    • Shattuck-Hufnagel, S.1    Turk, A.2
  • 38
    • 84958912927 scopus 로고
    • Primary features and their enhancement in consonants
    • Stevens, K. & Keyser, S. 1989 Primary features and their enhancement in consonants. Language 65, 81-106.
    • (1989) Language , vol.65 , pp. 81-106
    • Stevens, K.1    Keyser, S.2
  • 39
    • 85135194422 scopus 로고
    • Building multiple pronunciation models for novel words using exploratory computational phonology
    • Tajchman, G., Fosler, E. & Jurafsky, D. 1995 Building multiple pronunciation models for novel words using exploratory computational phonology. In Proc. Eur. Conf. Speech Communication and Technology, pp. 2247-2250.
    • (1995) Proc. Eur. Conf. Speech Communication and Technology , pp. 2247-2250
    • Tajchman, G.1    Fosler, E.2    Jurafsky, D.3
  • 40
    • 0037954807 scopus 로고    scopus 로고
    • Segmental duration and speech timing
    • ed. Y. Sagisaka, N. Campbell & N. Higuchi. Springer
    • Van Santen, J. 1997 Segmental duration and speech timing. In Computing prosody (ed. Y. Sagisaka, N. Campbell & N. Higuchi). Springer.
    • (1997) Computing Prosody
    • Van Santen, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.