SCOPUS Information Search Platform

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences

Volume 358, Issue 1769, 2000, Pages 1325-1338

Incorporating linguistic theories of pronunciation variation into speech-recognition models

(1) Ostendorf, Mari a

a University of Washington (United States)

Author keywords

Acoustic modelling; Phonetic variation; Pronunciation modelling

EID: 0347307067; PISSN: 1364503X; EISSN: None; Source Type: Journal
DOI: 10.1098/rsta.2000.0589; Document Type: Article

Times cited: 5

References (42)

1
- 84937187378
- PhD thesis, Department of EECS, University of California, Berkeley, USA
- Bilmes, J. 1999 Natural statistical models for automatic speech recognition. PhD thesis, Department of EECS, University of California, Berkeley, USA.
- (1999) Natural Statistical Models for Automatic Speech Recognition
- Bilmes, J.¹

2
- 0029725523
- Knowledge-based parameters for HMM speech recognition
- Bitar, N. & Espy-Wilson, C. 1996 Knowledge-based parameters for HMM speech recognition. In Proc. Int. Conf. Acoustics, Speech and Signal Processing, pp. 29-32.
- (1996) Proc. Int. Conf. Acoustics, Speech and Signal Processing , pp. 29-32
- Bitar, N.¹ Espy-Wilson, C.²

3
- 0003802343
- Monterey, CA: Wadsworth and Brooks
- Breiman, L., Friedman, J., Olshen, R. & Stone, C. 1984 Classification and regression trees. Monterey, CA: Wadsworth and Brooks.
- (1984) Classification and Regression Trees
- Breiman, L.¹ Friedman, J.² Olshen, R.³ Stone, C.⁴

4
- 0004147298
- London: Blackwell
- Clark, J. & Yallop, C. 1995 An introduction to phonetics and phonology. London: Blackwell.
- (1995) An Introduction to Phonetics and Phonology
- Clark, J.¹ Yallop, C.²

5
- 84958907242
- The geometry of phonological features
- Clements, G. 1985 The geometry of phonological features. In Phonology Yearbook, vol. 2, pp. 223-252.
- (1985) Phonology Yearbook , vol.2 , pp. 223-252
- Clements, G.¹

6
- 0003721728
- PhD thesis, University of California, Berkeley, USA
- Cohen, M. 1989 Phonological structures for speech recognition. PhD thesis, University of California, Berkeley, USA.
- (1989) Phonological Structures for Speech Recognition
- Cohen, M.¹

7
- 0039503389
- Computational models for speech production
- ed. K. Ponting. Springer
- Deng, L. 1998 Computational models for speech production. In Computational models of speech pattern processing (ed. K. Ponting). Springer.
- (1998) Computational Models of Speech Pattern Processing
- Deng, L.¹

8
- 0026458724
- Structural design of HMM speech recognizer using multi-valued phonetic features: Comparison with segmental speech units
- Deng, L. & Erler, K. 1992 Structural design of HMM speech recognizer using multi-valued phonetic features: comparison with segmental speech units. J. Acoust. Soc. Am. 92, 3058-3067.
- (1992) J. Acoust. Soc. Am. , vol.92 , pp. 3058-3067
- Deng, L.¹ Erler, K.²

9
- 0030359816
- Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition
- Deng, L. & Wu, J. 1996 Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition. In Proc. Int. Conf. Spoken Language Processing, pp. 2266-2269.
- (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2266-2269
- Deng, L.¹ Wu, J.²

10
- 0030268342
- Glottalization of vowel-initial syllables as a function of prosodic structure
- Dilley, L., Shattuck-Hufnagel, S. & Ostendorf, M. 1996 Glottalization of vowel-initial syllables as a function of prosodic structure. J. Phonetics 24, 423-444.
- (1996) J. Phonetics , vol.24 , pp. 423-444
- Dilley, L.¹ Shattuck-Hufnagel, S.² Ostendorf, M.³

11
- 0004131347
- PhD thesis, University of Cambridge, UK
- Donovan, R. 1996 Trainable speech synthesis. PhD thesis, University of Cambridge, UK.
- (1996) Trainable Speech Synthesis
- Donovan, R.¹

12
- 0000003732
- Articulatory timing and the prosodic interpretation of syllable duration
- Edwards, J. & Beckman, M. 1988 Articulatory timing and the prosodic interpretation of syllable duration. Phonetica 45, 156-174.
- (1988) Phonetica , vol.45 , pp. 156-174
- Edwards, J.¹ Beckman, M.²

13
- 0346833363
- Automatic modeling of pronunciation variations
- Eide, E. 1999 Automatic modeling of pronunciation variations. In Proc. DARPA Broadcast News Workshop, pp. 95-98.
- (1999) Proc. DARPA Broadcast News Workshop , pp. 95-98
- Eide, E.¹

14
- 0027627252
- Hidden Markov model representation of quantized articulatory features for speech recognition
- Erler, K. & Deng, L. 1993 Hidden Markov model representation of quantized articulatory features for speech recognition. Comp. Speech Language 7, 265-282.
- (1993) Comp. Speech Language , vol.7 , pp. 265-282
- Erler, K.¹ Deng, L.²

15
- 85027454087
- Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
- Finke, M. & Waibel, A. 1997 Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In Proc. Eur. Conf. Speech Communication and Technology, pp. 2379-2382.
- (1997) Proc. Eur. Conf. Speech Communication and Technology , pp. 2379-2382
- Finke, M.¹ Waibel, A.²

16
- 0342931803
- Incorporating contextual phonetics into automatic speech recognition
- Fosler-Lussier, E., Greenberg, S. & Morgan, N. 1999 Incorporating contextual phonetics into automatic speech recognition. In Proc. Int. Congr. Phonetic Sciences, pp. 611-614.
- (1999) Proc. Int. Congr. Phonetic Sciences , pp. 611-614
- Fosler-Lussier, E.¹ Greenberg, S.² Morgan, N.³

17
- 0010881006
- Demarcating prosodic groups with articulation
- Fougeron, C. & Keating, P. 1997 Demarcating prosodic groups with articulation. J. Acoust. Soc. Am. 97, 3384.
- (1997) J. Acoust. Soc. Am. , vol.97 , pp. 3384
- Fougeron, C.¹ Keating, P.²

18
- 0012588925
- Speaking in shorthand - A syllable-centric perspective for understanding pronunciation variation
- Greenberg, S. 1998 Speaking in shorthand - a syllable-centric perspective for understanding pronunciation variation. In Proc. ESCA Workshop on Modelling Pronunciation Variation for Automatic Speech Recognition, pp. 47-56.
- (1998) Proc. ESCA Workshop on Modelling Pronunciation Variation for Automatic Speech Recognition , pp. 47-56
- Greenberg, S.¹

19
- 0038681656
- Phonological features
- ed. W. Bright. Oxford University Press
- Halle, M. 1992 Phonological features. In International encyclopedia of linguistics (ed. W. Bright). Oxford University Press.
- (1992) International Encyclopedia of Linguistics
- Halle, M.¹

20
- 4544342862
- Phoneme recognition using acoustic events
- Hübener, K. & Carson-Berndsen, J. 1994 Phoneme recognition using acoustic events. In Proc. Int. Conf. Spoken Language Processing, pp. 1919-1922.
- (1994) Proc. Int. Conf. Spoken Language Processing , pp. 1919-1922
- Hübener, K.¹ Carson-Berndsen, J.²

21
- 0000304248
- MIT Computational Cognitive Science technical report 9606
- Jordan, M., Ghahramani, Z. & Saul, L. 1996 Hidden Markov decision trees. MIT Computational Cognitive Science technical report 9606.
- (1996) Hidden Markov Decision Trees
- Jordan, M.¹ Ghahramani, Z.² Saul, L.³

22
- 84973969285
- Feature geometry and the vocal tract
- Keyser, S. & Stevens, K. 1994 Feature geometry and the vocal tract. Phonology 11, 207-236.
- (1994) Phonology , vol.11 , pp. 207-236
- Keyser, S.¹ Stevens, K.²

23
- 79952968027
- Speech recognition via phonetically featured syllables
- King, S., Stephenson, T., Isard, S., Taylor, P. & Strachan, A. 1998 Speech recognition via phonetically featured syllables. In Proc. Int. Conf. Spoken Language Processing, pp. 1031-1034.
- (1998) Proc. Int. Conf. Spoken Language Processing , pp. 1031-1034
- King, S.¹ Stephenson, T.² Isard, S.³ Taylor, P.⁴ Strachan, A.⁵

24
- 0030355367
- Syllable-level desynchronisation of phonetic features for speech recognition
- Kirchhoff, K. 1996 Syllable-level desynchronisation of phonetic features for speech recognition. In Proc. Int. Conf. Spoken Language Processing, pp. 2274-2276.
- (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2274-2276
- Kirchhoff, K.¹

25
- 85128370668
- Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments
- Kirchhoff, K. 1998 Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments. In Proc. Int. Conf. Spoken Language Processing, pp. 891-894.
- (1998) Proc. Int. Conf. Spoken Language Processing , pp. 891-894
- Kirchhoff, K.¹

26
- 0342759107
- Speech recognition with phonological features
- Lahiri, A. 1999 Speech recognition with phonological features. In Proc. Int. Congr. Phonetic Sciences, pp. 715-718.
- (1999) Proc. Int. Congr. Phonetic Sciences , pp. 715-718
- Lahiri, A.¹

27
- 0000665734
- Explaining phonetic variation: A sketch of the H&H theory
- ed. W. Hardcastle & A. Marchal, Dordrecht: Kluwer
- Lindblom, B. 1990 Explaining phonetic variation: a sketch of the H&H theory. In Speech production and speech modelling (ed. W. Hardcastle & A. Marchal), pp. 403-439. Dordrecht: Kluwer.
- (1990) Speech Production and Speech Modelling , pp. 403-439
- Lindblom, B.¹

28
- 84943154470
- Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
- McAllister, D., Gillick, L., Scattone, F. & Newman, M. 1998 Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch. In Proc. Int. Conf. Spoken Language Processing, pp. 1847-1850.
- (1998) Proc. Int. Conf. Spoken Language Processing , pp. 1847-1850
- McAllister, D.¹ Gillick, L.² Scattone, F.³ Newman, M.⁴

29
- 44849088789
- A detection framework for locating phonetic events
- Niyogi, P., Mitra, P. & Sondhi, M. M. 1998 A detection framework for locating phonetic events. In Proc. Int. Conf. Spoken Language Processing, pp. 1067-1070.
- (1998) Proc. Int. Conf. Spoken Language Processing , pp. 1067-1070
- Niyogi, P.¹ Mitra, P.² Sondhi, M.M.³

30
- 0030715097
- HMM topology design using maximum likelihood successive state splitting
- Ostendorf, M. & Singer, H. 1997 HMM topology design using maximum likelihood successive state splitting. Comp. Speech Language 11, 17-42.
- (1997) Comp. Speech Language , vol.11 , pp. 17-42
- Ostendorf, M.¹ Singer, H.²

31
- 79959854240
- Ostendorf, M. (and 12 others) 1997 Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode. Available from http://www.clsp.jhu.edu/ws96/.
- (1997) Modeling Systematic Variations in Pronunciation Via a Language-dependent Hidden Speaking Mode
- Ostendorf, M.¹

32
- 0001895107
- 1998 Broadcast News benchmark test results: English and non-English word error rate performance measures
- Pallett, D., Fiscus, J., Garofolo, J., Martin, A. & Przybocki, M. 1999 1998 Broadcast News benchmark test results: English and non-English word error rate performance measures. In Proc. DARPA Broadcast News Workshop, pp. 5-12.
- (1999) Proc. DARPA Broadcast News Workshop , pp. 5-12
- Pallett, D.¹ Fiscus, J.² Garofolo, J.³ Martin, A.⁴ Przybocki, M.⁵

33
- 0030682299
- Extensions to phone-state decision-tree clustering: Single tree and tagged clustering
- Paul, D. 1997 Extensions to phone-state decision-tree clustering: single tree and tagged clustering. In Proc. Int. Conf. Acoustics, Speech and Signal Processing, pp. 1487-1490.
- (1997) Proc. Int. Conf. Acoustics, Speech and Signal Processing , pp. 1487-1490
- Paul, D.¹

34
- 0033353288
- Stochastic pronunciation modelling from hand-labelled phonetic corpora
- Riley, M. (and 10 others) 1999 Stochastic pronunciation modelling from hand-labelled phonetic corpora. Speech Commun. 29, 209-224.
- (1999) Speech Commun. , vol.29 , pp. 209-224
- Riley, M.¹

35
- 85135262341
- Pronunciation modeling by sharing Gaussian densities across phonetic models
- Saraclar, M., Nock, H. & Khudanpur, S. 1999 Pronunciation modeling by sharing Gaussian densities across phonetic models. In Proc. Eur. Conf. Speech Communication and Technology, pp. 515-518.
- (1999) Proc. Eur. Conf. Speech Communication and Technology , pp. 515-518
- Saraclar, M.¹ Nock, H.² Khudanpur, S.³

36
- 0030095762
- A prosody tutorial for investigators of auditory sentence processing
- Shattuck-Hufnagel, S. & Turk, A. 1996 A prosody tutorial for investigators of auditory sentence processing. J. Psycholing. Res. 25, 193-247.
- (1996) J. Psycholing. Res. , vol.25 , pp. 193-247
- Shattuck-Hufnagel, S.¹ Turk, A.²

37
- 0002220140
- Applying phonetic knowledge to lexical access
- Stevens, K. 1995 Applying phonetic knowledge to lexical access. In Proc. Eur. Conf. Speech Communication and Technology, pp. 3-11.
- (1995) Proc. Eur. Conf. Speech Communication and Technology , pp. 3-11
- Stevens, K.¹

38
- 84958912927
- Primary features and their enhancement in consonants
- Stevens, K. & Keyser, S. 1989 Primary features and their enhancement in consonants. Language 65, 81-106.
- (1989) Language , vol.65 , pp. 81-106
- Stevens, K.¹ Keyser, S.²

39
- 85135194422
- Building multiple pronunciation models for novel words using exploratory computational phonology
- Tajchman, G., Fosler, E. & Jurafsky, D. 1995 Building multiple pronunciation models for novel words using exploratory computational phonology. In Proc. Eur. Conf. Speech Communication and Technology, pp. 2247-2250.
- (1995) Proc. Eur. Conf. Speech Communication and Technology , pp. 2247-2250
- Tajchman, G.¹ Fosler, E.² Jurafsky, D.³

40
- 0037954807
- Segmental duration and speech timing
- ed. Y. Sagisaka, N. Campbell & N. Higuchi. Springer
- Van Santen, J. 1997 Segmental duration and speech timing. In Computing prosody (ed. Y. Sagisaka, N. Campbell & N. Higuchi). Springer.
- (1997) Computing Prosody
- Van Santen, J.¹

41
- 85020794646
- Weintraub, M., Fosler, E., Galles, C., Kao, Y.-H., Khudanpur, S., Saraclar, M. & Wegmann, S. 1996 WS96 project report: automatic learning of word pronunciation from data. Available from http://www.clsp.jhu.edu/ws96/.
- (1996) WS96 Project Report: Automatic Learning of Word Pronunciation from Data
- Weintraub, M.¹ Fosler, E.² Galles, C.³ Kao, Y.-H.⁴ Khudanpur, S.⁵ Saraclar, M.⁶ Wegmann, S.⁷

42
- 0002144369
- Tree-based state tying for high accuracy acoustic modelling
- Young, S., Odell, J. & Woodland, P. 1994 Tree-based state tying for high accuracy acoustic modelling. In Proc. Int. Conf. Acoustics, Speech and Signal Processing, pp. 307-312.
- (1994) Proc. Int. Conf. Acoustics, Speech and Signal Processing , pp. 307-312
- Young, S.¹ Odell, J.² Woodland, P.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.