-
1
-
-
0000353178
-
A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
-
Baum, L.E., Petrie, T., Soules, G., Weiss, N., 1970. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics 41 (1), 164-171.
-
(1970)
Annals of Mathematical Statistics
, vol.41
, Issue.1
, pp. 164-171
-
-
Baum, L.E.1
Petrie, T.2
Soules, G.3
Weiss, N.4
-
2
-
-
0037841255
-
Multi-stream speech recognition
-
Technical Report IDIAP-RR 96-07, IDIAP
-
Bourlard, H., Dupont, S., Ris, C., 1996. Multi-stream speech recognition. Technical Report IDIAP-RR 96-07, IDIAP.
-
(1996)
-
-
Bourlard, H.1
Dupont, S.2
Ris, C.3
-
3
-
-
0030685285
-
Coupled hidden Markov models for complex action recognition
-
Brand, M., Oliver, N., Pentland, A., 1997. Coupled hidden Markov models for complex action recognition. In: Proceedings of IEEE CVPR, pp. 994-999.
-
(1997)
Proceedings of IEEE CVPR
, pp. 994-999
-
-
Brand, M.1
Oliver, N.2
Pentland, A.3
-
4
-
-
0031624621
-
Pronunciation modelling using a hand-labelled corpus for conversational speech recognition
-
Byrne, W., Finke, M., Khudanpur, S., McDonough, J., Nock, H., Riley, M., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Pronunciation modelling using a hand-labelled corpus for conversational speech recognition. In: Proceedings of ICASSP, pp. 313-316.
-
(1998)
Proceedings of ICASSP
, pp. 313-316
-
-
Byrne, W.1
Finke, M.2
Khudanpur, S.3
McDonough, J.4
Nock, H.5
Riley, M.6
Saraclar, M.7
Wooters, C.8
Zavaliagkos, G.9
-
5
-
-
0003640523
-
The ISOLET spoken letter database
-
Technical Report CSE 90-004, OGI
-
Cole, R., Muthusamy, Y., Fanty, M., 1990. The ISOLET spoken letter database. Technical Report CSE 90-004, OGI.
-
(1990)
-
-
Cole, R.1
Muthusamy, Y.2
Fanty, M.3
-
6
-
-
84875738293
-
A new approach for multi-band speech recognition based on probabilistic graphical models
-
Daoudi, K., Fohr, D., Antoine, C., 2000. A new approach for multi-band speech recognition based on probabilistic graphical models. In: Proceedings of ICSLP, pp. I:329-332.
-
(2000)
Proceedings of ICSLP
, pp. 329-332
-
-
Daoudi, K.1
Fohr, D.2
Antoine, C.3
-
7
-
-
85002022542
-
Maximum likelihood from incomplete data via the EM algorithm
-
Dempster, A.P., Laird, N.M., Rubin, D.B., 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39 (1), 1-38.
-
(1977)
Journal of the Royal Statistical Society
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
8
-
-
0026458724
-
Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech units
-
Deng, L., Erler, K., 1992. Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: comparison with segmental speech units. Journal of the Acoustical Society of America 92 (6), 3058-3067.
-
(1992)
Journal of the Acoustical Society of America
, vol.92
, Issue.6
, pp. 3058-3067
-
-
Deng, L.1
Erler, K.2
-
9
-
-
80053229524
-
Modeling and efficient decoding of large vocabulary conversational speech
-
Finke, M., Fritsch, J., Koll, D., Waibel, A., 1999. Modeling and efficient decoding of large vocabulary conversational speech. In: Proceedings of Eurospeech, pp. 467-470.
-
(1999)
Proceedings of Eurospeech
, pp. 467-470
-
-
Finke, M.1
Fritsch, J.2
Koll, D.3
Waibel, A.4
-
10
-
-
85027454087
-
Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
-
Finke, M., Waibel, A., 1997. Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In: Proceedings of Eurospeech, pp. 2379-2382.
-
(1997)
Proceedings of Eurospeech
, pp. 2379-2382
-
-
Finke, M.1
Waibel, A.2
-
11
-
-
0003671941
-
Model-based techniques for noise robust speech recognition
-
PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
-
Gales, M.J.F., 1995. Model-based techniques for noise robust speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
-
(1995)
-
-
Gales, M.J.F.1
-
13
-
-
0024909979
-
Some statistical issues in the comparison of speech recognition algorithms
-
Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proceedings of ICASSP, pp. 532-535.
-
(1989)
Proceedings of ICASSP
, pp. 532-535
-
-
Gillick, L.1
Cox, S.J.2
-
14
-
-
0142005771
-
Dynamic HMM selection for continuous speech recognition
-
Hain, T., Woodland, P.C., 1999. Dynamic HMM selection for continuous speech recognition. In: Proceedings of Eurospeech, pp. 532-535.
-
(1999)
Proceedings of Eurospeech
, pp. 532-535
-
-
Hain, T.1
Woodland, P.C.2
-
15
-
-
85009113591
-
Modelling sub-phone insertions and deletions in continuous speech recognition
-
Hain, T., Woodland, P.C., 2000. Modelling sub-phone insertions and deletions in continuous speech recognition. In: Proceedings of ICSLP, pp. IV:172-175.
-
(2000)
Proceedings of ICSLP
, pp. 172-175
-
-
Hain, T.1
Woodland, P.C.2
-
16
-
-
0012236195
-
The CU-HTK March 2000 Hub5e transcription system
-
Hain, T., Woodland, P.C., Evermann, G., Povey, D., 2000. The CU-HTK March 2000 Hub5e transcription system. In: Proceedings of Speech Transcription Workshop.
-
(2000)
Proceedings of Speech Transcription Workshop
-
-
Hain, T.1
Woodland, P.C.2
Evermann, G.3
Povey, D.4
-
17
-
-
0030365517
-
Towards ASR on partially corrupted speech
-
Hermansky, H., Tibrewala, S., Pavel, M., 1996. Towards ASR on partially corrupted speech. In: Proceedings of ICSLP, pp. 462-465.
-
(1996)
Proceedings of ICSLP
, pp. 462-465
-
-
Hermansky, H.1
Tibrewala, S.2
Pavel, M.3
-
19
-
-
85032275702
-
Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition
-
Humphries, J.J., Woodland, P.C., 1997. Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition. In: Proceedings of Eurospeech, pp. 2367-2370.
-
(1997)
Proceedings of Eurospeech
, pp. 2367-2370
-
-
Humphries, J.J.1
Woodland, P.C.2
-
20
-
-
0003857779
-
Discriminative training of hidden Markov models
-
PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
-
Kapadia, S., 1998. Discriminative training of hidden Markov models. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
-
(1998)
-
-
Kapadia, S.1
-
21
-
-
4243728511
-
Word-level phonetic variation in large speech corpora
-
In: Pompino-Marschal, B. (Ed.), ZAS Working Papers in Linguistics
-
Keating, P., 1997. Word-level phonetic variation in large speech corpora. In: Pompino-Marschal, B. (Ed.), ZAS Working Papers in Linguistics. Available from
-
(1997)
-
-
Keating, P.1
-
22
-
-
79952968027
-
Speech recognition via phonetically featured syllables
-
King, S., Stephenson, T., Isard, S., Taylor, P., Strachan, A., 1998. Speech recognition via phonetically featured syllables. In: Proceedings of ICSLP, vol. 3, pp. 1031-1034.
-
(1998)
Proceedings of ICSLP
, vol.3
, pp. 1031-1034
-
-
King, S.1
Stephenson, T.2
Isard, S.3
Taylor, P.4
Strachan, A.5
-
23
-
-
0034297586
-
Detection of phonological features in continuous speech using neural networks
-
King, S., Taylor, P., 2000. Detection of phonological features in continuous speech using neural networks. Computer Speech and Language 14 (4), 333-353.
-
(2000)
Computer Speech and Language
, vol.14
, Issue.4
, pp. 333-353
-
-
King, S.1
Taylor, P.2
-
24
-
-
19544365323
-
COMLEX pronouncing lexicon (renamed in 1997 release as CALLHOME American English lexicon)
-
Available from Linguistic Data Consortium
-
Kingsbury, P., Strassel, S., McLemore, C., 1997. COMLEX pronouncing lexicon (renamed in 1997 release as CALLHOME American English lexicon). Available from Linguistic Data Consortium
-
(1997)
-
-
Kingsbury, P.1
Strassel, S.2
McLemore, C.3
-
25
-
-
0003424928
-
Robust speech recognition using articulatory information
-
PhD Thesis, University of Bielefeld, Germany
-
Kirchhoff, K., 1999. Robust speech recognition using articulatory information. PhD Thesis, University of Bielefeld, Germany.
-
(1999)
-
-
Kirchhoff, K.1
-
26
-
-
0030351374
-
On designing pronunciation lexicons for large vocabulary, continuous speech recognition
-
Lamel, L., Adda, G., 1996. On designing pronunciation lexicons for large vocabulary, continuous speech recognition. In: Proceedings of ICSLP, pp. 6-9.
-
(1996)
Proceedings of ICSLP
, pp. 6-9
-
-
Lamel, L.1
Adda, G.2
-
28
-
-
0021226391
-
A database for speaker-independent digit recognition
-
Leonard, R.G., 1984. A database for speaker-independent digit recognition. In: Proceedings of ICASSP, pp. 42.11-14.
-
(1984)
Proceedings of ICASSP
, pp. 11-14
-
-
Leonard, R.G.1
-
29
-
-
0018918171
-
An algorithm for vector quantizer design
-
Linde, Y., Buzo, A., Gray, R.M., 1980. An algorithm for vector quantizer design. IEEE Transactions on Communications 28, 84-95.
-
(1980)
IEEE Transactions on Communications
, vol.28
, pp. 84-95
-
-
Linde, Y.1
Buzo, A.2
Gray, R.M.3
-
30
-
-
0038178805
-
Factorial hidden Markov models for speech recognition: Preliminary experiments
-
Technical Report 97/7, Cambridge Research Laboratory
-
Logan, B., Moreno, P.J., 1997. Factorial hidden Markov models for speech recognition: preliminary experiments. Technical Report 97/7, Cambridge Research Laboratory.
-
(1997)
-
-
Logan, B.1
Moreno, P.J.2
-
31
-
-
85009078254
-
Asynchrony with trained transition probabilities improves performance in multi-band speech recognition
-
Mak, B., Tam, Y.-C., 2000. Asynchrony with trained transition probabilities improves performance in multi-band speech recognition. In: Proceedings of ICSLP, pp. IV:149-152.
-
(2000)
Proceedings of ICSLP
, pp. 149-152
-
-
Mak, B.1
Tam, Y.-C.2
-
32
-
-
0006133268
-
Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition
-
McMahon, P., McCourt, P., Vaseghi, S., 1998. Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition. In: Proceedings of ICSLP, pp. 1055-1058.
-
(1998)
Proceedings of ICSLP
, pp. 1055-1058
-
-
McMahon, P.1
McCourt, P.2
Vaseghi, S.3
-
33
-
-
0004119130
-
A multi-band approach to automatic speech recognition
-
PhD Thesis, ICSI, UC Berkeley, CA, USA
-
Mirghafori, N., 1999. A multi-band approach to automatic speech recognition. PhD Thesis, ICSI, UC Berkeley, CA, USA.
-
(1999)
-
-
Mirghafori, N.1
-
34
-
-
0004052871
-
Audio-visual speech recognition
-
Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop
-
Neti, C., Potamianos, G., Luettin, J., Matthews, I., Glotin, H., Vergyri, D., Sison, J., Mashari, J., Zhou, J., 2000. Audio-visual speech recognition. Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop.
-
(2000)
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
Sison, J.7
Mashari, J.8
Zhou, J.9
-
35
-
-
85002102457
-
-
NIST. Score package
-
NIST. Score package. Available from.
-
-
-
-
36
-
-
0003951389
-
Techniques for modelling phonological processes in automatic speech recognition
-
PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK, August
-
Nock, H.J., 2001. Techniques for modelling phonological processes in automatic speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK, August.
-
(2001)
-
-
Nock, H.J.1
-
38
-
-
0036081023
-
Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models
-
Nock, H.J., Young, S.J., 2002. Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models. Cognitive Science 26 (3), 283-301.
-
(2002)
Cognitive Science
, vol.26
, Issue.3
, pp. 283-301
-
-
Nock, H.J.1
Young, S.J.2
-
39
-
-
0003805597
-
The use of context in large vocabulary speech recognition
-
PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
-
Odell, J., 1995. The use of context in large vocabulary speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
-
(1995)
-
-
Odell, J.1
-
40
-
-
0347307067
-
Incorporating linguistic theories of pronunciation variation into speech recognition models
-
London, UK
-
Ostendorf, M., 2000. Incorporating linguistic theories of pronunciation variation into speech recognition models. In: Philosophical Transactions of Royal Society, vol. 358, London, UK, pp. 1325-1338.
-
(2000)
Philosophical Transactions of Royal Society
, vol.358
, pp. 1325-1338
-
-
Ostendorf, M.1
-
41
-
-
0030715097
-
HMM topology design using maximum likelihood successive state splitting
-
Ostendorf, M., Singer, H., 1997. HMM topology design using maximum likelihood successive state splitting. Computer Speech and Language 11, 17-41.
-
(1997)
Computer Speech and Language
, vol.11
, pp. 17-41
-
-
Ostendorf, M.1
Singer, H.2
-
43
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Rabiner, L.R., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE 77 (2), 257-285.
-
(1989)
Proceedings of IEEE
, vol.77
, Issue.2
, pp. 257-285
-
-
Rabiner, L.R.1
-
45
-
-
0026405248
-
A statistical model for generating pronunciation networks
-
Riley, M.D., 1991. A statistical model for generating pronunciation networks. In: Proceedings of ICASSP, pp. 737-740.
-
(1991)
Proceedings of ICASSP
, pp. 737-740
-
-
Riley, M.D.1
-
46
-
-
0005921215
-
Pronunciation modeling for conversational speech recognition
-
PhD Thesis, The Johns Hopkins University, MD, USA
-
Saraclar, M., 2000. Pronunciation modeling for conversational speech recognition. PhD Thesis, The Johns Hopkins University, MD, USA.
-
(2000)
-
-
Saraclar, M.1
-
47
-
-
0000114416
-
Pronunciation modeling by sharing Gaussian densities across phonetic models
-
Saraclar, M., Nock, H., Khudanpur, S., 2000. Pronunciation modeling by sharing Gaussian densities across phonetic models. Computer Speech and Language 14 (2), 137-160.
-
(2000)
Computer Speech and Language
, vol.14
, Issue.2
, pp. 137-160
-
-
Saraclar, M.1
Nock, H.2
Khudanpur, S.3
-
49
-
-
0029747053
-
Integrating audio and visual information to provide highly robust speech recognition
-
Tomlinson, M.J., Russell, M.J., Brooke, N.M., 1996. Integrating audio and visual information to provide highly robust speech recognition. In: Proceedings of ICASSP, vol. II, pp. 821-824.
-
(1996)
Proceedings of ICASSP
, vol.2
, pp. 821-824
-
-
Tomlinson, M.J.1
Russell, M.J.2
Brooke, N.M.3
-
50
-
-
0030643684
-
Modelling asynchrony in speech using elementary single-signal decomposition
-
Tomlinson, M.J., Russell, M.J., Moore, R.K., Buckland, A.P., Fawley, M.A., 1997. Modelling asynchrony in speech using elementary single-signal decomposition. In: Proceedings of ICASSP, pp. 1247-1250.
-
(1997)
Proceedings of ICASSP
, pp. 1247-1250
-
-
Tomlinson, M.J.1
Russell, M.J.2
Moore, R.K.3
Buckland, A.P.4
Fawley, M.A.5
-
51
-
-
0025681008
-
Hidden Markov model decomposition of speech and noise
-
Varga, A.P., Moore, R.K., 1990. Hidden Markov model decomposition of speech and noise. In: Proceedings of ICASSP, pp. 845-848.
-
(1990)
Proceedings of ICASSP
, pp. 845-848
-
-
Varga, A.P.1
Moore, R.K.2
-
52
-
-
85009113617
-
Gestural overlap, place of articulation and speech rate: An X-ray investigation
-
Vaxelaire, B., Sock, R., Perrier, P., 2000. Gestural overlap, place of articulation and speech rate: an X-ray investigation. In: Proceedings of ICSLP, pp. II:166-II:169.
-
(2000)
Proceedings of ICSLP
, pp. II:166-II:169
-
-
Vaxelaire, B.1
Sock, R.2
Perrier, P.3
-
54
-
-
4243661293
-
Automatic learning of word pronunciation from data
-
Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop
-
Weintraub, M., Wegmann, S., Kao, Y.-H., Khudanpur, S., Galles, C., Fosler, E., Saraclar, M., 1996. Automatic learning of word pronunciation from data. Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop.
-
(1996)
-
-
Weintraub, M.1
Wegmann, S.2
Kao, Y.-H.3
Khudanpur, S.4
Galles, C.5
Fosler, E.6
Saraclar, M.7
-
55
-
-
0038517643
-
Progress towards improved speech modelling using asynchronous sub-bands and formant frequencies
-
Wilkinson, N., Russell, M.J., 2001. Progress towards improved speech modelling using asynchronous sub-bands and formant frequencies. In: Proceedings of Institute of Acoustics WISP, Stratford-upon-Avon, UK.
-
(2001)
Proceedings of Institute of Acoustics WISP, Stratford-Upon-Avon, UK
-
-
Wilkinson, N.1
Russell, M.J.2
-
56
-
-
0003822743
-
-
ECRL
-
Young, S., Jansen, J., Odell, J., Ollason, D., Woodland, P., 1995. The HTK Book (Version 2.0), ECRL.
-
(1995)
The HTK Book (Version 2.0)
-
-
Young, S.1
Jansen, J.2
Odell, J.3
Ollason, D.4
Woodland, P.5