-
1
-
-
0343367210
-
Phonological studies for speech recognition
-
Palo Alto, CA
-
Bernstein, J., Baldwin, G., Cohen, M., Murveit, H., Weintraub, M., 1992. Phonological studies for speech recognition. In: DARPA Speech Recognition Workshop, Palo Alto, CA, pp. 41-48.
-
(1992)
DARPA Speech Recognition Workshop
, pp. 41-48
-
-
Bernstein, J.1
Baldwin, G.2
Cohen, M.3
Murveit, H.4
Weintraub, M.5
-
2
-
-
0039971087
-
The phonology of the lexicon: Evidence from lexical diffusion
-
Barlow, M., Kemmer, S. (Eds.)
-
Bybee, J., 1996. The phonology of the lexicon: evidence from lexical diffusion. In: Barlow, M., Kemmer, S. (Eds.), Usage-based Models of Language.
-
(1996)
Usage-based Models of Language
-
-
Bybee, J.1
-
3
-
-
0025692329
-
Identification of contextual factors for pronounciation networks
-
Chen, F., 1990. Identification of contextual factors for pronounciation networks. In: IEEE ICASSP-90, pp. 753-756.
-
(1990)
IEEE ICASSP-90
, pp. 753-756
-
-
Chen, F.1
-
4
-
-
0004119259
-
-
Harper and Row, New York, NY
-
Chomsky, N., Halle, M., 1968. The Sound Pattern of English, Harper and Row, New York, NY.
-
(1968)
The Sound Pattern of English
-
-
Chomsky, N.1
Halle, M.2
-
5
-
-
0343367213
-
Transcription of broadcast television and radio news: The 1996 ABBOT system
-
Chantilly, VA
-
Cook, G., Kershaw, D., Christie, J., Robinson, A., 1997. Transcription of broadcast television and radio news: The 1996 ABBOT system. In: DARPA Speech Recognition Workshop, Chantilly, VA.
-
(1997)
DARPA Speech Recognition Workshop
-
-
Cook, G.1
Kershaw, D.2
Christie, J.3
Robinson, A.4
-
6
-
-
0343190990
-
The SPRACH system for the transcription of broadcast news
-
Herndon, VA
-
Cook, G., Christie, J., Ellis, D., Fosler-Lussier, E., Gotoh, Y., Kingsbury, B., Morgan, N., Renals, S., Robinson, T., Williams, G., 1999. The SPRACH system for the transcription of broadcast news. In: DARPA Broadcast News Workshop, Herndon, VA.
-
(1999)
DARPA Broadcast News Workshop
-
-
Cook, G.1
Christie, J.2
Ellis, D.3
Fosler-Lussier, E.4
Gotoh, Y.5
Kingsbury, B.6
Morgan, N.7
Renals, S.8
Robinson, T.9
Williams, G.10
-
7
-
-
0030635306
-
Flexible transcription alignment
-
Santa Barabara, CA
-
Finke, M., Waibel, A., 1997a. Flexible transcription alignment. In: 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barabara, CA, pp. 34-40.
-
(1997)
1997 IEEE Workshop on Automatic Speech Recognition and Understanding
, pp. 34-40
-
-
Finke, M.1
Waibel, A.2
-
8
-
-
85027454087
-
Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
-
Finke, M., Waibel, A., 1997b. Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In: Eurospeech-97.
-
(1997)
Eurospeech-97
-
-
Finke, M.1
Waibel, A.2
-
9
-
-
33646912719
-
Factors affecting recognition error rate
-
Chantilly, VA
-
Fisher, W., 1996a. Factors affecting recognition error rate. In: DARPA Speech Recognition Workshop, Chantilly, VA.
-
(1996)
DARPA Speech Recognition Workshop
-
-
Fisher, W.1
-
11
-
-
0037906252
-
Not just what, but also when: Guided automatic pronunciation modeling for broadcast news
-
Herndon, VA
-
Fosler-Lussier, E., Williams, G., 1999. Not just what, but also when: Guided automatic pronunciation modeling for broadcast news. In: DARPA Broadcast News Workshop, Herndon, VA.
-
(1999)
DARPA Broadcast News Workshop
-
-
Fosler-Lussier, E.1
Williams, G.2
-
13
-
-
0001893347
-
Transcribing broadcast news: The LIMSI Nov96 Hub4 system
-
Chantilly, VA
-
Gauvain, J.L., Adda, G., Lamel, L., Adda-Decker, M., 1997. Transcribing broadcast news: The LIMSI Nov96 Hub4 system. In: DARPA Speech Recognition Workshop, Chantilly, VA.
-
(1997)
DARPA Speech Recognition Workshop
-
-
Gauvain, J.L.1
Adda, G.2
Lamel, L.3
Adda-Decker, M.4
-
14
-
-
26844484699
-
WS96 project report: The Switchboard transcription project
-
Jelinek, F. (Ed.), Center for Language and Speech Processing, Johns Hopkins University, Chapter 6
-
Greenberg, S., 1997. WS96 project report: The Switchboard transcription project. In: Jelinek, F. (Ed.), 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University, Chapter 6.
-
(1997)
1996 LVCSR Summer Research Workshop Technical Reports
-
-
Greenberg, S.1
-
16
-
-
85128383420
-
Reduction of English function words in Switchboard
-
Sydney, Australia
-
Jurafsky, D., Bell, A., Fosler-Lussier, E., Girand, C., Raymond, W., 1998. Reduction of English function words in Switchboard. In: ICSLP-98, Sydney, Australia.
-
(1998)
ICSLP-98
-
-
Jurafsky, D.1
Bell, A.2
Fosler-Lussier, E.3
Girand, C.4
Raymond, W.5
-
19
-
-
0343802846
-
Extraction and representation of rhythmic components of spontaneous speech
-
Rhodes, Greece
-
Kitazawa, S., Ichikawa, H., Kobayashi, S., Nishinuma, Y., 1997. Extraction and representation of rhythmic components of spontaneous speech. In: Eurospeech-97, Rhodes, Greece, pp. 641-644.
-
(1997)
Eurospeech-97
, pp. 641-644
-
-
Kitazawa, S.1
Ichikawa, H.2
Kobayashi, S.3
Nishinuma, Y.4
-
20
-
-
0343802845
-
-
Available from the LDC, ldc@unagi.cis.upenn.edu. Part of the COMLEX distribution
-
Linguistic Data Consortium (LDC), 1996. The PRONLEX pronunciation dictionary. Available from the LDC, ldc@unagi.cis.upenn.edu. Part of the COMLEX distribution.
-
(1996)
The PRONLEX Pronunciation Dictionary
-
-
-
21
-
-
84943154470
-
Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
-
Sydney, Australia
-
McAllaster, D., Gillick, L., Scattone, F., Newman, M., 1998. Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch. In: ICSLP-98, Sydney, Australia, pp. 1847-1850.
-
(1998)
ICSLP-98
, pp. 1847-1850
-
-
McAllaster, D.1
Gillick, L.2
Scattone, F.3
Newman, M.4
-
22
-
-
0019533618
-
How the components of speaking rate influence perception of phonetic segments
-
Miller, J., Grosjean, F., 1981. How the components of speaking rate influence perception of phonetic segments. Journal of Experimental Psychology: Human Performance and Perception 7 (1), 208-215.
-
(1981)
Journal of Experimental Psychology: Human Performance and Perception
, vol.7
, Issue.1
, pp. 208-215
-
-
Miller, J.1
Grosjean, F.2
-
23
-
-
0342931849
-
Fast speakers in large vocabulary continuous speech recognition: Analysis & antidotes
-
Mirghafori, N., Fosler, E., Morgan, N., 1995. Fast speakers in large vocabulary continuous speech recognition: Analysis & antidotes. In: Eurospeech-95.
-
(1995)
Eurospeech-95
-
-
Mirghafori, N.1
Fosler, E.2
Morgan, N.3
-
24
-
-
0029748337
-
Towards robustness to fast speech in ASR
-
Atlanta, Georgia
-
Mirghafori, N., Fosler, E., Morgan, N., 1996. Towards robustness to fast speech in ASR. In: ICASSP-96, Atlanta, Georgia, pp. 1335-338.
-
(1996)
ICASSP-96
, pp. 1335-1338
-
-
Mirghafori, N.1
Fosler, E.2
Morgan, N.3
-
25
-
-
0038194622
-
Combining multiple estimators of speaking rate
-
Seattle, WA
-
Morgan, N., Fosler-Lussier, E., 1998. Combining multiple estimators of speaking rate. In: IEEE ICASSP-98, Seattle, WA.
-
(1998)
IEEE ICASSP-98
-
-
Morgan, N.1
Fosler-Lussier, E.2
-
26
-
-
85135173867
-
Speech recognition using on-line estimation of speaking rate
-
Morgan, N., Fosler, E., Mirghafori, N., 1997. Speech recognition using on-line estimation of speaking rate. In: Eurospeech-97.
-
(1997)
Eurospeech-97
-
-
Morgan, N.1
Fosler, E.2
Mirghafori, N.3
-
27
-
-
0141588148
-
-
National Institute of Standards and Technology Speech Disc 9-1 to 9-25
-
NIST, 1992. Switchboard corpus: Recorded telephone conversations. National Institute of Standards and Technology Speech Disc 9-1 to 9-25.
-
(1992)
Switchboard Corpus: Recorded Telephone Conversations
-
-
-
28
-
-
85031582936
-
-
CSR-V, Hub 4, Produced by the Lingustic Data Consortium
-
NIST, 1996. 1996 broadcast news speech corpus. CSR-V, Hub 4, Produced by the Lingustic Data Consortium.
-
(1996)
1996 Broadcast News Speech Corpus
-
-
-
29
-
-
79959854240
-
Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode
-
Jelinek, F. (Ed.), Center for Language and Speech Processing, Johns Hopkins University, Chapter 4
-
Ostendorf, M., Byrne, B., Bacchiani, M., Finke, M., Gunawardana, A., Ross, K., Roweis, S., Shriberg, E., Talkin, D., Waibel, A., Wheatley, B., Zeppenfeld, T., 1997. Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode. In: Jelinek, F. (Ed.), 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University, Chapter 4.
-
(1997)
1996 LVCSR Summer Research Workshop Technical Reports
-
-
Ostendorf, M.1
Byrne, B.2
Bacchiani, M.3
Finke, M.4
Gunawardana, A.5
Ross, K.6
Roweis, S.7
Shriberg, E.8
Talkin, D.9
Waibel, A.10
Wheatley, B.11
Zeppenfeld, T.12
-
30
-
-
85031590734
-
1993 WSJ-CSR benchmark test results
-
Princeton, NJ
-
Pallett, D.S., Fiscus, J.G., Fisher, W.M., Garofolo, J.S., Lund, B. A., Przybocki, M.A., 1994. 1993 WSJ-CSR benchmark test results. In: ARPA Spoken Language Systems Technology Workshop, Princeton, NJ.
-
(1994)
ARPA Spoken Language Systems Technology Workshop
-
-
Pallett, D.S.1
Fiscus, J.G.2
Fisher, W.M.3
Garofolo, J.S.4
Lund, B.A.5
Przybocki, M.A.6
-
31
-
-
0008746009
-
The 1996 hub-4 sphinx-3 system
-
Chantilly, VA
-
Placeway, P., Chen, S., Eskenazi, M., Jain, U., Parikh, V., Raj, B., Ravishankar, M., Rosenfeld, R., Seymore, K., Siegler, M., Stern, R., Thayer, E., 1997. The 1996 hub-4 sphinx-3 system. In: DARPA Speech Recognition Workshop, Chantilly, VA.
-
(1997)
DARPA Speech Recognition Workshop
-
-
Placeway, P.1
Chen, S.2
Eskenazi, M.3
Jain, U.4
Parikh, V.5
Raj, B.6
Ravishankar, M.7
Rosenfeld, R.8
Seymore, K.9
Siegler, M.10
Stern, R.11
Thayer, E.12
-
32
-
-
0026405248
-
A statistical model for generating pronunciation networks
-
Riley, M., 1991. A statistical model for generating pronunciation networks. In: IEEE ICASSP-91, pp. 737-740.
-
(1991)
IEEE ICASSP-91
, pp. 737-740
-
-
Riley, M.1
-
33
-
-
0002802333
-
Stochastic pronunciation modelling from hand-labelled phonetic corpora
-
Kerkrade, The Netherlands
-
Riley, M., Byrne, W., Finke, M., Khudanpur, S., Ljolje, A., McDonough, J., Nock, H., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Stochastic pronunciation modelling from hand-labelled phonetic corpora. In: ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, The Netherlands, pp. 109-116.
-
(1998)
ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition
, pp. 109-116
-
-
Riley, M.1
Byrne, W.2
Finke, M.3
Khudanpur, S.4
Ljolje, A.5
McDonough, J.6
Nock, H.7
Saraclar, M.8
Wooters, C.9
Zavaliagkos, G.10
-
35
-
-
0342497654
-
On the effects of speech rate in large vocabulary speech recognition systems
-
Siegler, M.A., Stern, R.M., 1995. On the effects of speech rate in large vocabulary speech recognition systems. In: IEEE ICASSP-95.
-
(1995)
IEEE ICASSP-95
-
-
Siegler, M.A.1
Stern, R.M.2
-
36
-
-
0030363039
-
Dictionary learning for spontaneous speech recognition
-
Sloboda, T., Waibel, A., 1996. Dictionary learning for spontaneous speech recognition. In: ICSLP-96.
-
(1996)
ICSLP-96
-
-
Sloboda, T.1
Waibel, A.2
-
38
-
-
85135194422
-
Building multiple pronunciation models for novel words using exploratory computational phonology
-
Madrid, Spain
-
Tajchman, G., Fosler, E., Jurafsky, D., 1995. Building multiple pronunciation models for novel words using exploratory computational phonology. In: Eurospeech-95, Madrid, Spain.
-
(1995)
Eurospeech-95
-
-
Tajchman, G.1
Fosler, E.2
Jurafsky, D.3
-
39
-
-
0030376403
-
A fast and reliable rate of speech detector
-
Philadelphia, PA
-
Verhasselt, J.P., Martens, J.-P., 1996. A fast and reliable rate of speech detector. In: ICSLP-96, Philadelphia, PA, pp. 2258-2261.
-
(1996)
ICSLP-96
, pp. 2258-2261
-
-
Verhasselt, J.P.1
Martens, J.-P.2
-
40
-
-
0343367178
-
WS96 project report: Automatic learning of word pronunciation from data
-
Jelinek, F. (Ed.), Center for Language and Speech Processing, Johns Hopkins University, Chapter 3
-
Weintraub, M., Fosler, E., Galles, C., Kao, Y.-H., Khudanpur, S., Saraclar, M., Wegmann, S., 1997. WS96 project report: Automatic learning of word pronunciation from data. In: Jelinek, F. (Ed.), 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University, Chapter 3.
-
(1997)
1996 LVCSR Summer Research Workshop Technical Reports
-
-
Weintraub, M.1
Fosler, E.2
Galles, C.3
Kao, Y.-H.4
Khudanpur, S.5
Saraclar, M.6
Wegmann, S.7
-
41
-
-
0343802842
-
-
Center for the Study of Language and Information, Stanford, CA
-
Withgott, M.M., Chen, F.R., 1993. Computational Models of American Speech, Center for the Study of Language and Information, Stanford, CA.
-
(1993)
Computational Models of American Speech
-
-
Withgott, M.M.1
Chen, F.R.2
-
42
-
-
0002144369
-
Tree-based state tying for high accuracy acoustic modelling
-
Young, S.J., Odell, J.J., Woodland, P.C., 1994. Tree-based state tying for high accuracy acoustic modelling. In: IEEE ICASSP-94, pp. 307-312.
-
(1994)
IEEE ICASSP-94
, pp. 307-312
-
-
Young, S.J.1
Odell, J.J.2
Woodland, P.C.3
|