SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 29, Issue 2, 1999, Pages 137-158

Effects of speaking rate and word frequency on pronunciations in conversational speech

(2) Fosler Lussier, Eric a,b Morgan, Nelson a,b

a INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

b UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; MATHEMATICAL MODELS; PATTERN RECOGNITION SYSTEMS; SPEECH ANALYSIS;

AUTOMATIC SPEECH RECOGNITION (ASR) SYSTEMS; PRONUNCIATION MODELS;

SPEECH RECOGNITION;

EID: 0033321442 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(99)00035-7 Document Type: Article

Times cited : (110)

References (42)

1
- 0343367210
- Phonological studies for speech recognition
- Palo Alto, CA
- Bernstein, J., Baldwin, G., Cohen, M., Murveit, H., Weintraub, M., 1992. Phonological studies for speech recognition. In: DARPA Speech Recognition Workshop, Palo Alto, CA, pp. 41-48.
- (1992) DARPA Speech Recognition Workshop , pp. 41-48
- Bernstein, J.¹ Baldwin, G.² Cohen, M.³ Murveit, H.⁴ Weintraub, M.⁵

2
- 0039971087
- The phonology of the lexicon: Evidence from lexical diffusion
- Barlow, M., Kemmer, S. (Eds.)
- Bybee, J., 1996. The phonology of the lexicon: evidence from lexical diffusion. In: Barlow, M., Kemmer, S. (Eds.), Usage-based Models of Language.
- (1996) Usage-based Models of Language
- Bybee, J.¹

3
- 0025692329
- Identification of contextual factors for pronounciation networks
- Chen, F., 1990. Identification of contextual factors for pronounciation networks. In: IEEE ICASSP-90, pp. 753-756.
- (1990) IEEE ICASSP-90 , pp. 753-756
- Chen, F.¹

4
- 0004119259
- Harper and Row, New York, NY
- Chomsky, N., Halle, M., 1968. The Sound Pattern of English, Harper and Row, New York, NY.
- (1968) The Sound Pattern of English
- Chomsky, N.¹ Halle, M.²

5
- 0343367213
- Transcription of broadcast television and radio news: The 1996 ABBOT system
- Chantilly, VA
- Cook, G., Kershaw, D., Christie, J., Robinson, A., 1997. Transcription of broadcast television and radio news: The 1996 ABBOT system. In: DARPA Speech Recognition Workshop, Chantilly, VA.
- (1997) DARPA Speech Recognition Workshop
- Cook, G.¹ Kershaw, D.² Christie, J.³ Robinson, A.⁴

6
- 0343190990
- The SPRACH system for the transcription of broadcast news
- Herndon, VA
- Cook, G., Christie, J., Ellis, D., Fosler-Lussier, E., Gotoh, Y., Kingsbury, B., Morgan, N., Renals, S., Robinson, T., Williams, G., 1999. The SPRACH system for the transcription of broadcast news. In: DARPA Broadcast News Workshop, Herndon, VA.
- (1999) DARPA Broadcast News Workshop
- Cook, G.¹ Christie, J.² Ellis, D.³ Fosler-Lussier, E.⁴ Gotoh, Y.⁵ Kingsbury, B.⁶ Morgan, N.⁷ Renals, S.⁸ Robinson, T.⁹ Williams, G.¹⁰

7
- 0030635306
- Flexible transcription alignment
- Santa Barabara, CA
- Finke, M., Waibel, A., 1997a. Flexible transcription alignment. In: 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barabara, CA, pp. 34-40.
- (1997) 1997 IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 34-40
- Finke, M.¹ Waibel, A.²

8
- 85027454087
- Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
- Finke, M., Waibel, A., 1997b. Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In: Eurospeech-97.
- (1997) Eurospeech-97
- Finke, M.¹ Waibel, A.²

9
- 33646912719
- Factors affecting recognition error rate
- Chantilly, VA
- Fisher, W., 1996a. Factors affecting recognition error rate. In: DARPA Speech Recognition Workshop, Chantilly, VA.
- (1996) DARPA Speech Recognition Workshop
- Fisher, W.¹

10
- 0342497676
- NIST. Part of the tsylb2-1.1 software package
- Fisher, W., 1996b. The tsylb2 Program: Algorithm Description. NIST. Part of the tsylb2-1.1 software package.
- (1996) The Tsylb2 Program: Algorithm Description
- Fisher, W.¹

11
- 0037906252
- Not just what, but also when: Guided automatic pronunciation modeling for broadcast news
- Herndon, VA
- Fosler-Lussier, E., Williams, G., 1999. Not just what, but also when: Guided automatic pronunciation modeling for broadcast news. In: DARPA Broadcast News Workshop, Herndon, VA.
- (1999) DARPA Broadcast News Workshop
- Fosler-Lussier, E.¹ Williams, G.²

12
- 0018978177
- Phonetic categorization in auditory word perception
- Ganong, W., 1980. Phonetic categorization in auditory word perception. Journal of Experimental Psychology: Human Performance and Perception 6, 110-125.
- (1980) Journal of Experimental Psychology: Human Performance and Perception , vol.6 , pp. 110-125
- Ganong, W.¹

13
- 0001893347
- Transcribing broadcast news: The LIMSI Nov96 Hub4 system
- Chantilly, VA
- Gauvain, J.L., Adda, G., Lamel, L., Adda-Decker, M., 1997. Transcribing broadcast news: The LIMSI Nov96 Hub4 system. In: DARPA Speech Recognition Workshop, Chantilly, VA.
- (1997) DARPA Speech Recognition Workshop
- Gauvain, J.L.¹ Adda, G.² Lamel, L.³ Adda-Decker, M.⁴

14
- 26844484699
- WS96 project report: The Switchboard transcription project
- Jelinek, F. (Ed.), Center for Language and Speech Processing, Johns Hopkins University, Chapter 6
- Greenberg, S., 1997. WS96 project report: The Switchboard transcription project. In: Jelinek, F. (Ed.), 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University, Chapter 6.
- (1997) 1996 LVCSR Summer Research Workshop Technical Reports
- Greenberg, S.¹

15
- 0012588925
- Speaking in shorthand - A syllable-centric perspective for understanding pronunciation variation
- Kerkrade, The Netherlands
- Greenberg, S., 1998. Speaking in shorthand - a syllable-centric perspective for understanding pronunciation variation. In: ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, The Netherlands, pp. 47-56.
- (1998) ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition , pp. 47-56
- Greenberg, S.¹

16
- 85128383420
- Reduction of English function words in Switchboard
- Sydney, Australia
- Jurafsky, D., Bell, A., Fosler-Lussier, E., Girand, C., Raymond, W., 1998. Reduction of English function words in Switchboard. In: ICSLP-98, Sydney, Australia.
- (1998) ICSLP-98
- Jurafsky, D.¹ Bell, A.² Fosler-Lussier, E.³ Girand, C.⁴ Raymond, W.⁵

17
- 0003843502
- Garland, New York
- Kahn, D., 1980. Syllable-Based Generalizations in English Phonology. Garland, New York.
- (1980) Syllable-Based Generalizations in English Phonology
- Kahn, D.¹

18
- 0003434858
- Ph.D. thesis, University of California, Berkeley, CA
- Kingsbury, B.E.D., 1998. Perceptually-inspired signal processing strategies for robust speech recognition in reverberant environments. Ph.D. thesis, University of California, Berkeley, CA.
- (1998) Perceptually-inspired Signal Processing Strategies for Robust Speech Recognition in Reverberant Environments
- Kingsbury, B.E.D.¹

19
- 0343802846
- Extraction and representation of rhythmic components of spontaneous speech
- Rhodes, Greece
- Kitazawa, S., Ichikawa, H., Kobayashi, S., Nishinuma, Y., 1997. Extraction and representation of rhythmic components of spontaneous speech. In: Eurospeech-97, Rhodes, Greece, pp. 641-644.
- (1997) Eurospeech-97 , pp. 641-644
- Kitazawa, S.¹ Ichikawa, H.² Kobayashi, S.³ Nishinuma, Y.⁴

20
- 0343802845
- Available from the LDC, ldc@unagi.cis.upenn.edu. Part of the COMLEX distribution
- Linguistic Data Consortium (LDC), 1996. The PRONLEX pronunciation dictionary. Available from the LDC, ldc@unagi.cis.upenn.edu. Part of the COMLEX distribution.
- (1996) The PRONLEX Pronunciation Dictionary

21
- 84943154470
- Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
- Sydney, Australia
- McAllaster, D., Gillick, L., Scattone, F., Newman, M., 1998. Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch. In: ICSLP-98, Sydney, Australia, pp. 1847-1850.
- (1998) ICSLP-98 , pp. 1847-1850
- McAllaster, D.¹ Gillick, L.² Scattone, F.³ Newman, M.⁴

22
- 0019533618
- How the components of speaking rate influence perception of phonetic segments
- Miller, J., Grosjean, F., 1981. How the components of speaking rate influence perception of phonetic segments. Journal of Experimental Psychology: Human Performance and Perception 7 (1), 208-215.
- (1981) Journal of Experimental Psychology: Human Performance and Perception , vol.7 , Issue.1 , pp. 208-215
- Miller, J.¹ Grosjean, F.²

23
- 0342931849
- Fast speakers in large vocabulary continuous speech recognition: Analysis & antidotes
- Mirghafori, N., Fosler, E., Morgan, N., 1995. Fast speakers in large vocabulary continuous speech recognition: Analysis & antidotes. In: Eurospeech-95.
- (1995) Eurospeech-95
- Mirghafori, N.¹ Fosler, E.² Morgan, N.³

24
- 0029748337
- Towards robustness to fast speech in ASR
- Atlanta, Georgia
- Mirghafori, N., Fosler, E., Morgan, N., 1996. Towards robustness to fast speech in ASR. In: ICASSP-96, Atlanta, Georgia, pp. 1335-338.
- (1996) ICASSP-96 , pp. 1335-1338
- Mirghafori, N.¹ Fosler, E.² Morgan, N.³

25
- 0038194622
- Combining multiple estimators of speaking rate
- Seattle, WA
- Morgan, N., Fosler-Lussier, E., 1998. Combining multiple estimators of speaking rate. In: IEEE ICASSP-98, Seattle, WA.
- (1998) IEEE ICASSP-98
- Morgan, N.¹ Fosler-Lussier, E.²

26
- 85135173867
- Speech recognition using on-line estimation of speaking rate
- Morgan, N., Fosler, E., Mirghafori, N., 1997. Speech recognition using on-line estimation of speaking rate. In: Eurospeech-97.
- (1997) Eurospeech-97
- Morgan, N.¹ Fosler, E.² Mirghafori, N.³

27
- 0141588148
- National Institute of Standards and Technology Speech Disc 9-1 to 9-25
- NIST, 1992. Switchboard corpus: Recorded telephone conversations. National Institute of Standards and Technology Speech Disc 9-1 to 9-25.
- (1992) Switchboard Corpus: Recorded Telephone Conversations

28
- 85031582936
- CSR-V, Hub 4, Produced by the Lingustic Data Consortium
- NIST, 1996. 1996 broadcast news speech corpus. CSR-V, Hub 4, Produced by the Lingustic Data Consortium.
- (1996) 1996 Broadcast News Speech Corpus

29
- 79959854240
- Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode
- Jelinek, F. (Ed.), Center for Language and Speech Processing, Johns Hopkins University, Chapter 4
- Ostendorf, M., Byrne, B., Bacchiani, M., Finke, M., Gunawardana, A., Ross, K., Roweis, S., Shriberg, E., Talkin, D., Waibel, A., Wheatley, B., Zeppenfeld, T., 1997. Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode. In: Jelinek, F. (Ed.), 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University, Chapter 4.
- (1997) 1996 LVCSR Summer Research Workshop Technical Reports
- Ostendorf, M.¹ Byrne, B.² Bacchiani, M.³ Finke, M.⁴ Gunawardana, A.⁵ Ross, K.⁶ Roweis, S.⁷ Shriberg, E.⁸ Talkin, D.⁹ Waibel, A.¹⁰ Wheatley, B.¹¹ Zeppenfeld, T.¹²

30
- 85031590734
- 1993 WSJ-CSR benchmark test results
- Princeton, NJ
- Pallett, D.S., Fiscus, J.G., Fisher, W.M., Garofolo, J.S., Lund, B. A., Przybocki, M.A., 1994. 1993 WSJ-CSR benchmark test results. In: ARPA Spoken Language Systems Technology Workshop, Princeton, NJ.
- (1994) ARPA Spoken Language Systems Technology Workshop
- Pallett, D.S.¹ Fiscus, J.G.² Fisher, W.M.³ Garofolo, J.S.⁴ Lund, B.A.⁵ Przybocki, M.A.⁶

31
- 0008746009
- The 1996 hub-4 sphinx-3 system
- Chantilly, VA
- Placeway, P., Chen, S., Eskenazi, M., Jain, U., Parikh, V., Raj, B., Ravishankar, M., Rosenfeld, R., Seymore, K., Siegler, M., Stern, R., Thayer, E., 1997. The 1996 hub-4 sphinx-3 system. In: DARPA Speech Recognition Workshop, Chantilly, VA.
- (1997) DARPA Speech Recognition Workshop
- Placeway, P.¹ Chen, S.² Eskenazi, M.³ Jain, U.⁴ Parikh, V.⁵ Raj, B.⁶ Ravishankar, M.⁷ Rosenfeld, R.⁸ Seymore, K.⁹ Siegler, M.¹⁰ Stern, R.¹¹ Thayer, E.¹²

32
- 0026405248
- A statistical model for generating pronunciation networks
- Riley, M., 1991. A statistical model for generating pronunciation networks. In: IEEE ICASSP-91, pp. 737-740.
- (1991) IEEE ICASSP-91 , pp. 737-740
- Riley, M.¹

33
- 0002802333
- Stochastic pronunciation modelling from hand-labelled phonetic corpora
- Kerkrade, The Netherlands
- Riley, M., Byrne, W., Finke, M., Khudanpur, S., Ljolje, A., McDonough, J., Nock, H., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Stochastic pronunciation modelling from hand-labelled phonetic corpora. In: ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, The Netherlands, pp. 109-116.
- (1998) ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition , pp. 109-116
- Riley, M.¹ Byrne, W.² Finke, M.³ Khudanpur, S.⁴ Ljolje, A.⁵ McDonough, J.⁶ Nock, H.⁷ Saraclar, M.⁸ Wooters, C.⁹ Zavaliagkos, G.¹⁰

34
- 85031594498
- Automatic learning of a model for word pronunciations: Status report
- Baltimore, MD
- Saraclar, M., 1997. Automatic learning of a model for word pronunciations: Status report. In: Conversational Speech Recognition Workshop: DARPA Hub-5E Evaluation, Baltimore, MD.
- (1997) Conversational Speech Recognition Workshop: DARPA Hub-5E Evaluation
- Saraclar, M.¹

35
- 0342497654
- On the effects of speech rate in large vocabulary speech recognition systems
- Siegler, M.A., Stern, R.M., 1995. On the effects of speech rate in large vocabulary speech recognition systems. In: IEEE ICASSP-95.
- (1995) IEEE ICASSP-95
- Siegler, M.A.¹ Stern, R.M.²

36
- 0030363039
- Dictionary learning for spontaneous speech recognition
- Sloboda, T., Waibel, A., 1996. Dictionary learning for spontaneous speech recognition. In: ICSLP-96.
- (1996) ICSLP-96
- Sloboda, T.¹ Waibel, A.²

37
- 0019622592
- Articulatory rate and perceptual constancy in phonetic perception
- Summerfield, Q., 1981. Articulatory rate and perceptual constancy in phonetic perception. Journal of Experimental Psychology: Human Performance and Perception 7, 1074-1095.
- (1981) Journal of Experimental Psychology: Human Performance and Perception , vol.7 , pp. 1074-1095
- Summerfield, Q.¹

38
- 85135194422
- Building multiple pronunciation models for novel words using exploratory computational phonology
- Madrid, Spain
- Tajchman, G., Fosler, E., Jurafsky, D., 1995. Building multiple pronunciation models for novel words using exploratory computational phonology. In: Eurospeech-95, Madrid, Spain.
- (1995) Eurospeech-95
- Tajchman, G.¹ Fosler, E.² Jurafsky, D.³

39
- 0030376403
- A fast and reliable rate of speech detector
- Philadelphia, PA
- Verhasselt, J.P., Martens, J.-P., 1996. A fast and reliable rate of speech detector. In: ICSLP-96, Philadelphia, PA, pp. 2258-2261.
- (1996) ICSLP-96 , pp. 2258-2261
- Verhasselt, J.P.¹ Martens, J.-P.²

40
- 0343367178
- WS96 project report: Automatic learning of word pronunciation from data
- Jelinek, F. (Ed.), Center for Language and Speech Processing, Johns Hopkins University, Chapter 3
- Weintraub, M., Fosler, E., Galles, C., Kao, Y.-H., Khudanpur, S., Saraclar, M., Wegmann, S., 1997. WS96 project report: Automatic learning of word pronunciation from data. In: Jelinek, F. (Ed.), 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University, Chapter 3.
- (1997) 1996 LVCSR Summer Research Workshop Technical Reports
- Weintraub, M.¹ Fosler, E.² Galles, C.³ Kao, Y.-H.⁴ Khudanpur, S.⁵ Saraclar, M.⁶ Wegmann, S.⁷

41
- 0343802842
- Center for the Study of Language and Information, Stanford, CA
- Withgott, M.M., Chen, F.R., 1993. Computational Models of American Speech, Center for the Study of Language and Information, Stanford, CA.
- (1993) Computational Models of American Speech
- Withgott, M.M.¹ Chen, F.R.²

42
- 0002144369
- Tree-based state tying for high accuracy acoustic modelling
- Young, S.J., Odell, J.J., Woodland, P.C., 1994. Tree-based state tying for high accuracy acoustic modelling. In: IEEE ICASSP-94, pp. 307-312.
- (1994) IEEE ICASSP-94 , pp. 307-312
- Young, S.J.¹ Odell, J.J.² Woodland, P.C.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.