-
1
-
-
85135188280
-
The temporal properties of spoken Japanese are similar to those of English
-
Rhodes, Greece
-
Arai, T., Greenberg, S., 1997. The temporal properties of spoken Japanese are similar to those of English. In: Proceedings of Eurospeech, Rhodes, Greece, pp. 1011-1014.
-
(1997)
Proceedings of Eurospeech
, pp. 1011-1014
-
-
Arai, T.1
Greenberg, S.2
-
3
-
-
0343367210
-
Phonological studies for speech recognition
-
Bernstein, J., Baldwin, G., Cohen, M., Murveit, H., Weintraub, M., 1992. Phonological studies for speech recognition. In: Proceedings of the DARPA Speech Recognition Workshop, pp. 41-48.
-
(1992)
Proceedings of the DARPA Speech Recognition Workshop
, pp. 41-48
-
-
Bernstein, J.1
Baldwin, G.2
Cohen, M.3
Murveit, H.4
Weintraub, M.5
-
4
-
-
0030637976
-
Pronunciation modelling for conversational speech recognition - A status report from WS97
-
Byrne, W., Finke, M., Khudanpur, S., McDonnough, J., Nock, H., Saraclar, M., Wooters, C., Zavaliagkos, G., 1997. Pronunciation modelling for conversational speech recognition - A status report from WS97. In: Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 26-33.
-
(1997)
Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding
, pp. 26-33
-
-
Byrne, W.1
Finke, M.2
Khudanpur, S.3
McDonnough, J.4
Nock, H.5
Saraclar, M.6
Wooters, C.7
Zavaliagkos, G.8
-
5
-
-
0031624621
-
Pronunciation modeling using a hand-labelled corpus for conversational speech recognition
-
Byrne, W., Finke, M., Khudanpur, S., McDonnough, J., Nock, H., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Pronunciation modeling using a hand-labelled corpus for conversational speech recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 313-316.
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, pp. 313-316
-
-
Byrne, W.1
Finke, M.2
Khudanpur, S.3
McDonnough, J.4
Nock, H.5
Saraclar, M.6
Wooters, C.7
Zavaliagkos, G.8
-
6
-
-
0010968444
-
The phonetic interpretation of headed phonological structures containing overlapping constituents
-
Coleman, J., 1992. The phonetic interpretation of headed phonological structures containing overlapping constituents. Phonetics Yearbook 9, 1-44.
-
(1992)
Phonetics Yearbook
, vol.9
, pp. 1-44
-
-
Coleman, J.1
-
10
-
-
0043272135
-
Automatic learning of word pronunciation from data
-
Fosler, E., Weintraub, M., Wegmann, S., Kao, Y.-H., Khudanpur, S., Galles, C., Saraclar, M., 1996. Automatic learning of word pronunciation from data. In: Proceedings of the International Conference on Spoken Language Processing, pp. S28-29.
-
(1996)
Proceedings of the International Conference on Spoken Language Processing
-
-
Fosler, E.1
Weintraub, M.2
Wegmann, S.3
Kao, Y.-H.4
Khudanpur, S.5
Galles, C.6
Saraclar, M.7
-
11
-
-
0033321442
-
Effects of speaking rate and word frequency on pronunciations in conversational speech
-
Fosler-Lussier, E., Morgan, N., 1998. Effects of speaking rate and word frequency on pronunciations in conversational speech. Speech Communication 29 (2-4), 137-158.
-
(1998)
Speech Communication
, vol.29
, Issue.2-4
, pp. 137-158
-
-
Fosler-Lussier, E.1
Morgan, N.2
-
12
-
-
0342931803
-
Incorporating contextual phonetics into automatic speech recognition
-
San Francisco
-
Fosler-Lussier, E., Greenberg, S., Morgan, N., 1999. Incorporating contextual phonetics into automatic speech recognition. In: Proceedings of the International Congress of Phonetic Sciences, San Francisco.
-
(1999)
Proceedings of the International Congress of Phonetic Sciences
-
-
Fosler-Lussier, E.1
Greenberg, S.2
Morgan, N.3
-
13
-
-
0039129495
-
The words and sounds of telephone conversations
-
French, N.R., Carter, C.W., Koenig, W., 1930. The words and sounds of telephone conversations. Bell System Tech. J. 9, 290-324.
-
(1930)
Bell System Tech. J.
, vol.9
, pp. 290-324
-
-
French, N.R.1
Carter, C.W.2
Koenig, W.3
-
14
-
-
0030638030
-
Syllable - A promising recognition unit for LVCSR
-
Ganapathiraju, A., Goel, V., Picone, J., Corrada, A., Doddington, G., Kirchhoff, K., Ordowski, M., Wheatley, B., 1997. Syllable - A promising recognition unit for LVCSR. In: Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 207-214.
-
(1997)
Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding
, pp. 207-214
-
-
Ganapathiraju, A.1
Goel, V.2
Picone, J.3
Corrada, A.4
Doddington, G.5
Kirchhoff, K.6
Ordowski, M.7
Wheatley, B.8
-
15
-
-
85028690016
-
The LIMSI continuous speech dictation system: Evaluation on the ARPA Wall Street Journal task
-
Gauvain, J., Lamel, L., Adda, G., Adda-Decker, M., 1994. The LIMSI continuous speech dictation system: Evaluation on the ARPA Wall Street Journal task. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 557-560.
-
(1994)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, pp. 557-560
-
-
Gauvain, J.1
Lamel, L.2
Adda, G.3
Adda-Decker, M.4
-
16
-
-
85016587886
-
SWITCHBOARD: Telephone speech corpus for research and development
-
Godfrey, J.J., Holliman, E.C., McDaniel, J., 1992. SWITCHBOARD: Telephone speech corpus for research and development. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 517-520.
-
(1992)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, pp. 517-520
-
-
Godfrey, J.J.1
Holliman, E.C.2
McDaniel, J.3
-
17
-
-
0002733797
-
Speech perception and spoken word recognition: Research and theory
-
Lass N. (Ed.), Mosby St. Louis
-
Goldinger, S.D., Pisoni, D.B., Luce, P., 1996. Speech perception and spoken word recognition: Research and theory. In: Lass N. (Ed.), Principles of Experimental Phonetics, Mosby St. Louis, pp. 277-327.
-
(1996)
Principles of Experimental Phonetics
, pp. 277-327
-
-
Goldinger, S.D.1
Pisoni, D.B.2
Luce, P.3
-
19
-
-
26844484699
-
The switchboard transcription project
-
Center for Language and Speech Processing. Johns Hopkins University Press, Baltimore, MD.
-
Greenberg, S., 1997b. The switchboard transcription project. Research Report #24, Large Vocabulary Continuous Speech Recognition Summer Research Workshop Technical Report Series. Center for Language and Speech Processing. Johns Hopkins University Press, Baltimore, MD.
-
(1997)
Research Report #24, Large Vocabulary Continuous Speech Recognition Summer Research Workshop Technical Report Series
, vol.24
-
-
Greenberg, S.1
-
20
-
-
0008390269
-
Auditory function
-
Crocker, M. (Ed.), Wiley, New York
-
Greenberg, S., 1997c. Auditory function. In: Crocker, M. (Ed.), Encyclopedia of Acoustics. Wiley, New York, pp. 1301-1323.
-
(1997)
Encyclopedia of Acoustics
, pp. 1301-1323
-
-
Greenberg, S.1
-
22
-
-
0002076795
-
Insights into spoken language gleaned from phonetic transcription of the Switchboard corpus
-
Philadelphia
-
Greenberg, S., Hollenback, J., Ellis, D., 1996. Insights into spoken language gleaned from phonetic transcription of the Switchboard corpus. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, pp. S32-35.
-
(1996)
Proceedings of the International Conference on Spoken Language Processing
-
-
Greenberg, S.1
Hollenback, J.2
Ellis, D.3
-
23
-
-
0343802832
-
Phonetic transcription of spontaneous American English (the Switchboard corpus)
-
submitted
-
Greenberg, S., Ellis, D.A., Hollenback, J., Fosler-Lussier, E., 1999. Phonetic transcription of spontaneous American English (the Switchboard corpus). Speech Communication (submitted).
-
(1999)
Speech Communication
-
-
Greenberg, S.1
Ellis, D.A.2
Hollenback, J.3
Fosler-Lussier, E.4
-
26
-
-
0040373542
-
-
Merriam, Springfield, MA
-
Kenyon, J.S., Knott, T.A., 1953. A Pronouncing Dictionary of American English. Merriam, Springfield, MA.
-
(1953)
A Pronouncing Dictionary of American English
-
-
Kenyon, J.S.1
Knott, T.A.2
-
27
-
-
0032136330
-
Robust speech recognition using the modulation spectrogram
-
Kingsbury, B.E.D., Morgan, N., Greenberg, S., 1998. Robust speech recognition using the modulation spectrogram. Speech Communication 25, 117-132.
-
(1998)
Speech Communication
, vol.25
, pp. 117-132
-
-
Kingsbury, B.E.D.1
Morgan, N.2
Greenberg, S.3
-
31
-
-
0003695433
-
-
University of Pennsylvania Press, Philadelphia
-
Labov, W., 1972. Sociolinguistic Patterns. University of Pennsylvania Press, Philadelphia.
-
(1972)
Sociolinguistic Patterns
-
-
Labov, W.1
-
32
-
-
0002695711
-
Suprasegmental features of speech
-
Lass, N. (Ed.), Mosby, St. Louis
-
Lehiste, I., 1996. Suprasegmental features of speech. In: Lass, N. (Ed.), Principles of Experimental Phonetics. Mosby, St. Louis, pp. 226-244.
-
(1996)
Principles of Experimental Phonetics
, pp. 226-244
-
-
Lehiste, I.1
-
33
-
-
0003409001
-
-
MIT Press, Cambridge, MA
-
Levelt, W., 1989. Speaking. MIT Press, Cambridge, MA.
-
(1989)
Speaking
-
-
Levelt, W.1
-
34
-
-
84942397864
-
A spectrographic study of vowel reduction
-
Lindblom, B., 1963. A spectrographic study of vowel reduction. J. Acoust. Soc. Amer. 35, 1773-1781.
-
(1963)
J. Acoust. Soc. Amer.
, vol.35
, pp. 1773-1781
-
-
Lindblom, B.1
-
35
-
-
0000665734
-
Explaining phonetic variation: A sketch of the H-H theory
-
Hardcastle, W., Marchal. A. (Eds.), Kluwer Academic Publishers, Dordrecht
-
Lindblom, B., 1990. Explaining phonetic variation: a sketch of the H-H theory. In: Hardcastle, W., Marchal. A. (Eds.), Speech Production and Speech Modeling. Kluwer Academic Publishers, Dordrecht, pp. 403-439.
-
(1990)
Speech Production and Speech Modeling
, pp. 403-439
-
-
Lindblom, B.1
-
37
-
-
84898105320
-
Explorations with fabricated data
-
Hub-5
-
McAllaster, D., Gillick, L., Scattone, F., Newman, M., 1998. Explorations with fabricated data. In: Proceedings of the DARPA Workshop on Conversational Speech Recognition, Hub-5.
-
(1998)
Proceedings of the DARPA Workshop on Conversational Speech Recognition
-
-
McAllaster, D.1
Gillick, L.2
Scattone, F.3
Newman, M.4
-
38
-
-
0030643784
-
Prosodic processing and its use in Verbmobil
-
Niemann, H., Noth, E., Kiessling, A., Kompe, R., Batliner, A., 1997. Prosodic processing and its use in Verbmobil. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 75-78.
-
(1997)
Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing
, pp. 75-78
-
-
Niemann, H.1
Noth, E.2
Kiessling, A.3
Kompe, R.4
Batliner, A.5
-
39
-
-
79959854240
-
Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode
-
Center for Language and Speech Processing. Johns Hopkins University, Baltimore, MD.
-
Ostendorf, M., Byrne, B., Macchiani, M., Finke, M., Gunawardana, A., Ross, K., Roweis, S., Shriberg, E., Talkin, D., Waibel, A., Wheatley, B., Zeppenfeld, T., 1997. Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode. Research Report #24, Large Vocabulary Continuous Speech Recognition Workshop Technical Report Series. Center for Language and Speech Processing. Johns Hopkins University, Baltimore, MD.
-
(1997)
Research Report #24, Large Vocabulary Continuous Speech Recognition Workshop Technical Report Series
, vol.24
-
-
Ostendorf, M.1
Byrne, B.2
Macchiani, M.3
Finke, M.4
Gunawardana, A.5
Ross, K.6
Roweis, S.7
Shriberg, E.8
Talkin, D.9
Waibel, A.10
Wheatley, B.11
Zeppenfeld, T.12
-
40
-
-
0004244302
-
-
Prentice-Hall, Englewood Cliffs, NJ.
-
Rabiner, L.R., Juang, B.H., 1993. Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs, NJ.
-
(1993)
Fundamentals of Speech Recognition
-
-
Rabiner, L.R.1
Juang, B.H.2
-
41
-
-
0003921935
-
Automatic generation of detailed pronunciation lexicons
-
Lee, C.H., Soong, F.K., Paliwal, K.K. (Eds.), Kluwer Academic Publishers, Boston
-
Riley, M., Ljolje, A., 1995. Automatic generation of detailed pronunciation lexicons. In: Lee, C.H., Soong, F.K., Paliwal, K.K. (Eds.), Automatic Speech and Speaker Recognition: Advanced Topics. Kluwer Academic Publishers, Boston.
-
(1995)
Automatic Speech and Speaker Recognition: Advanced Topics
-
-
Riley, M.1
Ljolje, A.2
-
42
-
-
0002802333
-
Stochastic pronunciation modelling and hand-labelled phonetic corpora
-
Kerkrade
-
Riley, M., Finke, M., Khudanpur, S., Llolje, A., McDonough, J., Nock, H., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Stochastic pronunciation modelling and hand-labelled phonetic corpora. In: Proceedings of the ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, pp. 109-116.
-
(1998)
Proceedings of the ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition
, pp. 109-116
-
-
Riley, M.1
Finke, M.2
Khudanpur, S.3
Llolje, A.4
McDonough, J.5
Nock, H.6
Saraclar, M.7
Wooters, C.8
Zavaliagkos, G.9
-
43
-
-
0037795516
-
Statistical modeling of pronunciation: It's not the model, it's the data
-
Kerkrade
-
Schiel, F.A., Tillmann, H., 1998. Statistical modeling of pronunciation: it's not the model, it's the data. In: Proceedings of the ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, pp. 131-136.
-
(1998)
Proceedings of the ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition
, pp. 131-136
-
-
Schiel, F.A.1
Tillmann, H.2
-
45
-
-
84941763441
-
Efficiency as an organizing principle of natural speech
-
van Son, R.J.J.H., Koopmans-van Beinum, J., Pols, L.C.W., 1998. Efficiency as an organizing principle of natural speech. In: Proceedings of the International Conference on Spoken Language Processing, pp. 2375-2378.
-
(1998)
Proceedings of the International Conference on Spoken Language Processing
, pp. 2375-2378
-
-
Van Son, R.J.J.H.1
Koopmans-van Beinum, J.2
Pols, L.C.W.3
-
46
-
-
0033096914
-
Acoustic correlates of lexical stress in continuous telephone speech
-
van Kuik, D., Boves, L., 1999. Acoustic correlates of lexical stress in continuous telephone speech. Speech Communication 27, 95-111.
-
(1999)
Speech Communication
, vol.27
, pp. 95-111
-
-
Van Kuik, D.1
Boves, L.2
-
49
-
-
0043086491
-
Effect of speaking style on LVCSR performance
-
Philadelphia
-
Weintraub, M., Taussig, K., Smith, K.H., Snodgrass, A., 1996. Effect of speaking style on LVCSR performance. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia.
-
(1996)
Proceedings of the International Conference on Spoken Language Processing
-
-
Weintraub, M.1
Taussig, K.2
Smith, K.H.3
Snodgrass, A.4
-
50
-
-
85031578975
-
WS96 project report: Automatic learning of word pronunciation from data
-
Center for Language and Speech Processing. Johns Hopkins University, Baltimore, MD.
-
Weintraub, M., Fosler, E., Galles, C., Kao, Y.-H., Khudanpur, S., Saraclar, M., Wegmann, S., 1997. WS96 project report: Automatic learning of word pronunciation from data. Research Report #24, Large Vocabulary Continuous Speech Recognition Summer Research Workshop Technical Report Series. Center for Language and Speech Processing. Johns Hopkins University, Baltimore, MD.
-
(1997)
Research Report #24, Large Vocabulary Continuous Speech Recognition Summer Research Workshop Technical Report Series
, vol.24
-
-
Weintraub, M.1
Fosler, E.2
Galles, C.3
Kao, Y.-H.4
Khudanpur, S.5
Saraclar, M.6
Wegmann, S.7
-
51
-
-
84892186467
-
Incorporating information from syllable-length time scales into automatic speech recognition
-
Seattle
-
Wu, S.-L., Kingsbury, B., Morgan, N., Greenberg, S., 1998a. Incorporating information from syllable-length time scales into automatic speech recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Seattle, pp. 721-724.
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, pp. 721-724
-
-
Wu, S.-L.1
Kingsbury, B.2
Morgan, N.3
Greenberg, S.4
-
52
-
-
0343249600
-
Performance improvements through combining phone- and syllable-length information in automatic speech recognition
-
Sydney
-
Wu, S.-L., Kingsbury, B., Morgan, N., Greenberg, S., 1998b. Performance improvements through combining phone- and syllable-length information in automatic speech recognition. In: Proceedings of the International Conference on Spoken Language Processing, Sydney, pp. 854-857.
-
(1998)
Proceedings of the International Conference on Spoken Language Processing
, pp. 854-857
-
-
Wu, S.-L.1
Kingsbury, B.2
Morgan, N.3
Greenberg, S.4
-
53
-
-
84872452207
-
The meaning-frequency relationship of words
-
Zipf, O.K., 1945. The meaning-frequency relationship of words. J. Gen. Psych. 33, 251-256.
-
(1945)
J. Gen. Psych.
, vol.33
, pp. 251-256
-
-
Zipf, O.K.1
-
54
-
-
0342497635
-
Transcription and alignment of the TIMIT database
-
Fujisaki, H. (Ed.), Elsevier, Amsterdam
-
Zue, V.W., Seneff, S., 1996. Transcription and alignment of the TIMIT database. In: Fujisaki, H. (Ed.), Recent Research Towards Advanced Man-Machine Interface Through Spoken Language. Elsevier, Amsterdam, pp. 515-525.
-
(1996)
Recent Research Towards Advanced Man-Machine Interface Through Spoken Language
, pp. 515-525
-
-
Zue, V.W.1
Seneff, S.2
|