-
1
-
-
0030718943
-
Multilingual large vocabulary speech recognition in the European SQALE project
-
S.J. Young, M. Adda-Decker, X. Aubert, C. Dugast, J.L. Gauvain, D.J. Kershaw, L. Lamel, and D.A. Leeuwen, "Multilingual large vocabulary speech recognition in the European SQALE project," Computer Speech & Language, vol.11, pp.73-89, 1997.
-
(1997)
Computer Speech & Language
, vol.11
, pp. 73-89
-
-
Young, S.J.1
Adda-Decker, M.2
Aubert, X.3
Dugast, C.4
Gauvain, J.L.5
Kershaw, D.J.6
Lamel, L.7
Leeuwen, D.A.8
-
2
-
-
0031187171
-
Speech recognition by machine and humans
-
R.P. Lippmann, "Speech recognition by machine and humans," Speech Communication, vol.22, pp.1-15, 1997.
-
(1997)
Speech Communication
, vol.22
, pp. 1-15
-
-
Lippmann, R.P.1
-
4
-
-
0011510426
-
Capabilities and limitations of stochastic language models
-
March
-
S. Nakagawa, "Capabilities and limitations of stochastic language models," Conf. Record, Acoust. Soc. Japan, pp.23-26, March 1998.
-
(1998)
Conf. Record, Acoust. Soc. Japan
, pp. 23-26
-
-
Nakagawa, S.1
-
5
-
-
0011458455
-
Relationship among perplexity, word accuracy and phoneme accuracy, and drawback and modification of perplexity
-
S. Nakagawa, "Relationship among perplexity, word accuracy and phoneme accuracy, and drawback and modification of perplexity," Proc. First Int. Workshop East Asian Language Resources and Evaluation, pp.123-128, 1998.
-
(1998)
Proc. First Int. Workshop East Asian Language Resources and Evaluation
, pp. 123-128
-
-
Nakagawa, S.1
-
6
-
-
0011450087
-
Robust speech recognition using HMM's with Toeplitz state covariance matrices
-
W.J.J. Roberts and Y. Ephraim, "Robust speech recognition using HMM's with Toeplitz state covariance matrices," Proc. ICSLP, pp.369-372, 1998.
-
(1998)
Proc. ICSLP
, pp. 369-372
-
-
Roberts, W.J.J.1
Ephraim, Y.2
-
8
-
-
0003563803
-
-
IOS Press
-
S. Nakagawa, K. Shikano, and Y. Tohkura, Speech, Hearing and Neural Network Model, IOS Press, 1995.
-
(1995)
Speech, Hearing and Neural Network Model
-
-
Nakagawa, S.1
Shikano, K.2
Tohkura, Y.3
-
9
-
-
0011449585
-
-
Iwanami-shoten
-
N. Takubo, K. Maekawa, Y. Kubozono, K. Honda, K. Shirai, and S. Nakagawa, Speech, Iwanami-shoten, 1998.
-
(1998)
Speech
-
-
Takubo, N.1
Maekawa, K.2
Kubozono, Y.3
Honda, K.4
Shirai, K.5
Nakagawa, S.6
-
11
-
-
85009114626
-
Relationship among speaking style, inter-phoneme's distance and speech recognition performance
-
K. Yamamoto and S. Nakagawa, "Relationship among speaking style, inter-phoneme's distance and speech recognition performance," Proc. ICSLP, pp.859-862, 2000.
-
(2000)
Proc. ICSLP
, pp. 859-862
-
-
Yamamoto, K.1
Nakagawa, S.2
-
12
-
-
0000940883
-
Acoustic signal processing techniques for robust speech recognition
-
S. Nakagawa, "Acoustic signal processing techniques for robust speech recognition," J. Acoust. Soc. Japan, vol.53, no.11, pp.864-871, 1997.
-
(1997)
J. Acoust. Soc. Japan
, vol.53
, Issue.11
, pp. 864-871
-
-
Nakagawa, S.1
-
13
-
-
0030779363
-
Noise compensation methods for hidden Markov model speech recognition in adverse environments
-
S.V. Vaseghi and B.P. Milner, "Noise compensation methods for hidden Markov model speech recognition in adverse environments," IEEE Trans. Speech Audio Process., vol.5, no.1, pp.11-21, 1997.
-
(1997)
IEEE Trans. Speech Audio Process.
, vol.5
, Issue.1
, pp. 11-21
-
-
Vaseghi, S.V.1
Milner, B.P.2
-
14
-
-
0023263708
-
Multi-style training for robust isolated-word speech recognition
-
R.P. Lippmann, E.A. Martin, and D.B. Paul, "Multi-style training for robust isolated-word speech recognition," Proc. ICASSP, pp.705-708, 1987.
-
(1987)
Proc. ICASSP
, pp. 705-708
-
-
Lippmann, R.P.1
Martin, E.A.2
Paul, D.B.3
-
15
-
-
0022181749
-
Some acoustic-phonetic correlates of speech produced in noise
-
D. Pisoni, R. Bernacki, H. Nusbaum, and M. Yuchtman, "Some acoustic-phonetic correlates of speech produced in noise," Proc. ICASSP, pp.1581-1584, 1985.
-
(1985)
Proc. ICASSP
, pp. 1581-1584
-
-
Pisoni, D.1
Bernacki, R.2
Nusbaum, H.3
Yuchtman, M.4
-
16
-
-
0011496722
-
Normalizing Lombard speech under different conditions
-
July
-
A. Wakao, K. Takeda, and F. Itakura, "Normalizing Lombard speech under different conditions," IEICE Trans., vol.J80-D-II, no.7, pp.1643-1650, July 1997.
-
(1997)
IEICE Trans.
, vol.J80-D-II
, Issue.7
, pp. 1643-1650
-
-
Wakao, A.1
Takeda, K.2
Itakura, F.3
-
17
-
-
0029345416
-
A comparison of signal processing front ends for automatic word recognition
-
C.R. Jankowski, H.-D.H. Vo, and R.P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech & Audio Process., vol.3, no.4, pp.286-292, 1995.
-
(1995)
IEEE Trans. Speech & Audio Process.
, vol.3
, Issue.4
, pp. 286-292
-
-
Jankowski, C.R.1
Vo, H.-D.H.2
Lippmann, R.P.3
-
20
-
-
0022667694
-
Speaker independent isolated word recognition using dynamic features of speech spectrum
-
S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech & Signal Process., vol.34, no.1, pp.52-59, 1986.
-
(1986)
IEEE Trans. Acoust. Speech & Signal Process.
, vol.34
, Issue.1
, pp. 52-59
-
-
Furui, S.1
-
21
-
-
0032676337
-
On the relative importance of various components of the modulation spectrum for automatic speech recognition
-
N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Communication, vol.28, pp.43-55, 1999.
-
(1999)
Speech Communication
, vol.28
, pp. 43-55
-
-
Kanedera, N.1
Arai, T.2
Hermansky, H.3
Pavel, M.4
-
22
-
-
0031221099
-
Filtering the time sequences of spectral parameters for speech recognition
-
C. Nadeu, P.P. Leal, and B.-H. Juang, "Filtering the time sequences of spectral parameters for speech recognition," Speech Communication, vol.22, pp.315-332, 1997.
-
(1997)
Speech Communication
, vol.22
, pp. 315-332
-
-
Nadeu, C.1
Leal, P.P.2
Juang, B.-H.3
-
23
-
-
0011468569
-
An evaluation of mel-LPC cepstrum in noisy speech recognition
-
Y. Nakatoh and H. Matsumoto, "An evaluation of mel-LPC cepstrum in noisy speech recognition," Conf. Record, Acoust. Soc. Japan, pp.23-24, 1999.
-
(1999)
Conf. Record, Acoust. Soc. Japan
, pp. 23-24
-
-
Nakatoh, Y.1
Matsumoto, H.2
-
24
-
-
0032761999
-
Scale transform in speech analysis
-
S. Umesh, L. Cohen, N. Marinovic, and D.J. Nelson, "Scale transform in speech analysis," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.40-45, 1999.
-
(1999)
IEEE Trans. Speech & Audio Process.
, vol.7
, Issue.1
, pp. 40-45
-
-
Umesh, S.1
Cohen, L.2
Marinovic, N.3
Nelson, D.J.4
-
25
-
-
0011498037
-
A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition
-
J. Chen, B. Xu, and T. Huang, "A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition," Proc. ICASSP, pp.629-632, 1998.
-
(1998)
Proc. ICASSP
, pp. 629-632
-
-
Chen, J.1
Xu, B.2
Huang, T.3
-
26
-
-
0031176764
-
Hidden Markov model-based speech recognition with intermediate wavelet transform domains
-
R. Singh, K. Davis, and P.V.S. Rao, "Hidden Markov model-based speech recognition with intermediate wavelet transform domains," Computer Speech and Language, vol.11, pp.252-273, 1997.
-
(1997)
Computer Speech and Language
, vol.11
, pp. 252-273
-
-
Singh, R.1
Davis, K.2
Rao, P.V.S.3
-
27
-
-
0026189808
-
Speech recognition in adverse environments
-
B.H. Juang, "Speech recognition in adverse environments," Computer Speech and Language, vol.5, pp.275-294, 1991.
-
(1991)
Computer Speech and Language
, vol.5
, pp. 275-294
-
-
Juang, B.H.1
-
28
-
-
33947656987
-
Speech recognition in noise using a projection based likelihood measure for mixture density HMM's
-
B.A. Carlson and M.A. Clements, "Speech recognition in noise using a projection based likelihood measure for mixture density HMM's," Proc. ICASSP, vol.I, pp.237-240, 1992.
-
(1992)
Proc. ICASSP
, vol.1
, pp. 237-240
-
-
Carlson, B.A.1
Clements, M.A.2
-
29
-
-
0032116602
-
A novel projection-based likelihood measure for noisy speech recognition
-
J.-T. Chien, H.-C. Wang, and L.-M. Lee, "A novel projection-based likelihood measure for noisy speech recognition," Speech Communication, vol.24, pp.287-297, 1998.
-
(1998)
Speech Communication
, vol.24
, pp. 287-297
-
-
Chien, J.-T.1
Wang, H.-C.2
Lee, L.-M.3
-
30
-
-
0032203256
-
Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method
-
S. Katagiri, B.-H. Juang, and C.-H. Lee, "Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method," Proc. IEEE, vol.86, no.11, pp.2345-2372, 1998.
-
(1998)
Proc. IEEE
, vol.86
, Issue.11
, pp. 2345-2372
-
-
Katagiri, S.1
Juang, B.-H.2
Lee, C.-H.3
-
31
-
-
0029723602
-
Discriminative feature extraction to filter design
-
A. Biem, E. McDermott, and S. Katagiri, "Discriminative feature extraction to filter design," Proc. IEEE Workshop Neural Networks for Signal Processing, vol.IV, pp.273-282, 1996.
-
(1996)
Proc. IEEE Workshop Neural Networks for Signal Processing
, vol.4
, pp. 273-282
-
-
Biem, A.1
McDermott, E.2
Katagiri, S.3
-
32
-
-
0001286647
-
Minimum classification error training algorithm for feature extractor and pattern classification in speech recognition
-
K.K. Paliwal, M. Bacchiani, and Y. Sagisaka, "Minimum classification error training algorithm for feature extractor and pattern classification in speech recognition," Proc. EuroSpeech, pp.541-545, 1995.
-
(1995)
Proc. EuroSpeech
, pp. 541-545
-
-
Paliwal, K.K.1
Bacchiani, M.2
Sagisaka, Y.3
-
33
-
-
0032674196
-
Feature extraction for speech recognition based on orthogonal acoustic-feature planes and LDA
-
T. Nitta, "Feature extraction for speech recognition based on orthogonal acoustic-feature planes and LDA," Proc. ICASSP, pp.421-424, 1999.
-
(1999)
Proc. ICASSP
, pp. 421-424
-
-
Nitta, T.1
-
34
-
-
84893207073
-
Continuous speech recognition in noise using spectral subtraction and HMM adaptation
-
J.A.N. Flores and S.J. Young, "Continuous speech recognition in noise using spectral subtraction and HMM adaptation," Proc. ICASSP, vol.I, pp.409-412, 1994.
-
(1994)
Proc. ICASSP
, vol.1
, pp. 409-412
-
-
Flores, J.A.N.1
Young, S.J.2
-
35
-
-
11044237174
-
An evaluation of speech enhancement approach E-CMN/CSS for speech recognition
-
Jan.
-
M. Shozakai, S. Nakamura, and K. Shikano, "An evaluation of speech enhancement approach E-CMN/CSS for speech recognition," IEICE Trans., vol.J81-D, no.1, pp.1-9, Jan. 1998.
-
(1998)
IEICE Trans.
, vol.J81-D
, Issue.1
, pp. 1-9
-
-
Shozakai, M.1
Nakamura, S.2
Shikano, K.3
-
36
-
-
0026882842
-
Experiments with a nonlinear spectral subtractor (NSS), hidden Markov model and the projection, for robust speech recognition in cars
-
P. Lockwood and J. Boudy, "Experiments with a nonlinear spectral subtractor (NSS), hidden Markov model and the projection, for robust speech recognition in cars," Speech Communication, vol.11, pp.215-228, 1992.
-
(1992)
Speech Communication
, vol.11
, pp. 215-228
-
-
Lockwood, P.1
Boudy, J.2
-
37
-
-
0030711159
-
Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification
-
D. Hardt and K. Fellbaum, "Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification," Proc. ICASSP, pp.867-870, 1997.
-
(1997)
Proc. ICASSP
, pp. 867-870
-
-
Hardt, D.1
Fellbaum, K.2
-
38
-
-
0011498039
-
A smoothing method of time direction on speech recognition under noisy environments using spectral subtraction
-
N. Kitaoka, I. Akahori, and S. Nakagawa, "A smoothing method of time direction on speech recognition under noisy environments using spectral subtraction," Proc. Int. Conf. Speech Processing, pp.381-386, 1999.
-
(1999)
Proc. Int. Conf. Speech Processing
, pp. 381-386
-
-
Kitaoka, N.1
Akahori, I.2
Nakagawa, S.3
-
39
-
-
0011464161
-
Improved robust speech recognition considering signal correlation approximated by Taylor series
-
J.-L. Shen, J.-W. Hung, and L.-S. Lee, "Improved robust speech recognition considering signal correlation approximated by Taylor series," Proc. ICSLP, pp.1499-1502, 1998.
-
(1998)
Proc. ICSLP
, pp. 1499-1502
-
-
Shen, J.-L.1
Hung, J.-W.2
Lee, L.-S.3
-
40
-
-
0025681008
-
Hidden Markov model decomposition of speech and noise
-
A.P. Varga and R.K. Moore, "Hidden Markov model decomposition of speech and noise," Proc. ICASSP, pp.845-848, 1990.
-
(1990)
Proc. ICASSP
, pp. 845-848
-
-
Varga, A.P.1
Moore, R.K.2
-
41
-
-
0027622731
-
Cepstral parameter compensation for HMM recognition in noise
-
M.J.F. Gales and S.J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Communication, vol.12, pp.231-239, 1993.
-
(1993)
Speech Communication
, vol.12
, pp. 231-239
-
-
Gales, M.J.F.1
Young, S.J.2
-
42
-
-
0030245128
-
Robust continuous speech recognition using parallel model combination
-
M.J.F. Gales and S.J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech & Audio Process., vol.4, pp.352-359, 1996.
-
(1996)
IEEE Trans. Speech & Audio Process.
, vol.4
, pp. 352-359
-
-
Gales, M.J.F.1
Young, S.J.2
-
43
-
-
0003524869
-
Recognition of noisy speech by composition of hidden Markov models
-
IEICE Technical Report, SP92-96
-
F. Martin, K. Shikano, Y. Minami, and Y. Okabe, "Recognition of noisy speech by composition of hidden Markov models," IEICE Technical Report, SP92-96, 1992.
-
(1992)
-
-
Martin, F.1
Shikano, K.2
Minami, Y.3
Okabe, Y.4
-
44
-
-
0011400310
-
Robust HMM to variation of noisy environments based on variance extension of noisy models
-
H. Matsumoto and H. Ubukata, "Robust HMM to variation of noisy environments based on variance extension of noisy models," Proc. EuroSpeech, pp.2387-2390, 1999.
-
(1999)
Proc. EuroSpeech
, pp. 2387-2390
-
-
Matsumoto, H.1
Ubukata, H.2
-
45
-
-
0032623471
-
Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences
-
K.H. You and H.-C. Wang, "Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences," Speech Communication, vol.28, pp.13-24, 1999.
-
(1999)
Speech Communication
, vol.28
, pp. 13-24
-
-
You, K.H.1
Wang, H.-C.2
-
46
-
-
0011448901
-
HMM composition of segmental unit input HMM for noisy speech recognition
-
K. Yamamoto and S. Nakagawa, "HMM composition of segmental unit input HMM for noisy speech recognition," Proc. EuroSpeech, pp.2865-2868, 1999.
-
(1999)
Proc. EuroSpeech
, pp. 2865-2868
-
-
Yamamoto, K.1
Nakagawa, S.2
-
47
-
-
0011406317
-
Difference in speech recognition performance caused by difference in front-end devices and its compensations
-
K. Yamamoto and S. Nakagawa, "Difference in speech recognition performance caused by difference in front-end devices and its compensations," Proc. 7th Western Pacific Regional Acoust. Conf., pp.85-88, 2000.
-
(2000)
Proc. 7th Western Pacific Regional Acoust. Conf.
, pp. 85-88
-
-
Yamamoto, K.1
Nakagawa, S.2
-
48
-
-
0011501273
-
Real-time cepstrum mean subtraction using the most likely partial state sequence
-
March
-
S. Kuroiwa, T. Kato, and N. Higuchi, "Real-time cepstrum mean subtraction using the most likely partial state sequence," IEICE Trans., vol.J82-D-II, no.3, pp.332-339, March 1999.
-
(1999)
IEICE Trans.
, vol.J82-D-II
, Issue.3
, pp. 332-339
-
-
Kuroiwa, S.1
Kato, T.2
Higuchi, N.3
-
49
-
-
0030149866
-
A maximum-likelihood approach to stochastic matching for robust speech recognition
-
A. Sankar and C.H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.190-202, 1996.
-
(1996)
IEEE Trans. Speech & Audio Process.
, vol.4
, Issue.5
, pp. 190-202
-
-
Sankar, A.1
Lee, C.H.2
-
50
-
-
0029369804
-
Rapid environment adaptation for speech recognition
-
K. Takagi, H. Hattori, and T. Watanabe, "Rapid environment adaptation for speech recognition," J. Acoust. Soc. Japan, (E), vol.16, no.5, pp.273-281, 1995.
-
(1995)
J. Acoust. Soc. Japan, (E)
, vol.16
, Issue.5
, pp. 273-281
-
-
Takagi, K.1
Hattori, H.2
Watanabe, T.3
-
51
-
-
0011510430
-
An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation
-
Y. Tsurumi and S. Nakagawa, "An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation," Proc. ICSLP, pp.431-434, 1994.
-
(1994)
Proc. ICSLP
, pp. 431-434
-
-
Tsurumi, Y.1
Nakagawa, S.2
-
52
-
-
0011410507
-
Acoustical and environmental robustness
-
Kluwer Academic Pub., Dordrecht
-
A. Acero, "Acoustical and Environmental Robustness," in Automatic Speech Recognition, Kluwer Academic Pub., Dordrecht, 1993.
-
(1993)
Automatic Speech Recognition
-
-
Acero, A.1
-
53
-
-
0032116601
-
Data-driven environmental compensation for speech recognition: A unified approach
-
P.J. Moreno, B. Raj, and R.M. Stern, "Data-driven environmental compensation for speech recognition: A unified approach," Speech Communication, vol.24, pp.267-285, 1998.
-
(1998)
Speech Communication
, vol.24
, pp. 267-285
-
-
Moreno, P.J.1
Raj, B.2
Stern, R.M.3
-
54
-
-
0029725301
-
A vector Taylor series approach for environment-independent speech recognition
-
P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," Proc. ICASSP, pp.733-736, 1996.
-
(1996)
Proc. ICASSP
, pp. 733-736
-
-
Moreno, P.J.1
Raj, B.2
Stern, R.M.3
-
55
-
-
0032048385
-
Speech recognition in noisy environments using first-order vector Taylor series
-
D.Y. Kim, C.K. Un, and N.S. Kim, "Speech recognition in noisy environments using first-order vector Taylor series," Speech Communication, vol.24, no.1, pp.39-49, 1998.
-
(1998)
Speech Communication
, vol.24
, Issue.1
, pp. 39-49
-
-
Kim, D.Y.1
Un, C.K.2
Kim, N.S.3
-
56
-
-
0011496725
-
HMM adaptation method for noise and distortion by maximizing likelihood
-
July
-
Y. Minami and S. Furui, "HMM adaptation method for noise and distortion by maximizing likelihood," IEICE Trans., vol.J80-A, no.7, pp.1179-1186, July 1997.
-
(1997)
IEICE Trans.
, vol.J80-A
, Issue.7
, pp. 1179-1186
-
-
Minami, Y.1
Furui, S.2
-
57
-
-
0032203405
-
A general joint additive and convolutive bias approach applied to noisy Lombard speech recognition
-
M. Afify, Y. Gong, and J.P. Haton, "A general joint additive and convolutive bias approach applied to noisy Lombard speech recognition," IEEE Trans. Speech & Audio Process., vol.6, no.6, pp.524-537, 1998.
-
(1998)
IEEE Trans. Speech & Audio Process.
, vol.6
, Issue.6
, pp. 524-537
-
-
Afify, M.1
Gong, Y.2
Haton, J.P.3
-
58
-
-
0035249243
-
HMM-separation-based speech recognition for a distant moving speaker
-
T. Takiguchi, S. Nakamura, and K. Shikano, "HMM-separation-based speech recognition for a distant moving speaker," IEEE Trans. Speech & Audio Process., vol.9, no.3, pp.127-140, 2001.
-
(2001)
IEEE Trans. Speech & Audio Process.
, vol.9
, Issue.3
, pp. 127-140
-
-
Takiguchi, T.1
Nakamura, S.2
Shikano, K.3
-
59
-
-
0032139769
-
Automatic segmentation of speech recorded in unknown noisy channel characteristics
-
B.L. Pellom and J.H.L. Hansen, "Automatic segmentation of speech recorded in unknown noisy channel characteristics," Speech Communication, vol.25, no.1-3, pp.97-116, 1998.
-
(1998)
Speech Communication
, vol.25
, Issue.1-3
, pp. 97-116
-
-
Pellom, B.L.1
Hansen, J.H.L.2
-
60
-
-
0011448902
-
Japanese phoneme recognition using continuous parameter hidden Markov models
-
June
-
S. Nakagawa, Y. Hirata, and Y. Hashimoto, "Japanese phoneme recognition using continuous parameter hidden Markov models," J. Acoust. Soc. Japan, vol.46, no.6, pp.486-496, June 1990.
-
(1990)
J. Acoust. Soc. Japan
, vol.46
, Issue.6
, pp. 486-496
-
-
Nakagawa, S.1
Hirata, Y.2
Hashimoto, Y.3
-
61
-
-
0023211284
-
Integration of acoustic information in a large vocabulary word recognizer
-
V.N. Gupta, M. Lennig, and P. Mermelstein, "Integration of acoustic information in a large vocabulary word recognizer," ICASSP, vol.II, pp.697-700, 1987.
-
(1987)
ICASSP
, vol.2
, pp. 697-700
-
-
Gupta, V.N.1
Lennig, M.2
Mermelstein, P.3
-
62
-
-
20344368952
-
Hidden Markov model embedded dynamic features of speech spectrum
-
Feb.
-
E. Tsuboka and J. Nakahashi, "Hidden Markov model embedded dynamic features of speech spectrum," IEICE Trans., vol.J77-A, no.2, pp.162-172, Feb. 1994.
-
(1994)
IEICE Trans.
, vol.J77-A
, Issue.2
, pp. 162-172
-
-
Tsuboka, E.1
Nakahashi, J.2
-
63
-
-
0029325484
-
Neural predictive hidden Markov model for speech recognition
-
June
-
E. Tsuboka and Y. Takada, "Neural predictive hidden Markov model for speech recognition," IEICE Trans., Inf. & Syst., vol.E78-D, no.6, pp.676-684, June 1995.
-
(1995)
IEICE Trans., Inf. & Syst.
, vol.E78-D
, Issue.6
, pp. 676-684
-
-
Tsuboka, E.1
Takada, Y.2
-
64
-
-
84911676598
-
Linear and nonlinear prediction for speech recognition with hidden Markov models
-
M. Saerens and H. Bourlard, "Linear and nonlinear prediction for speech recognition with hidden Markov models," Proc. EuroSpeech, pp.807-810, 1993.
-
(1993)
Proc. EuroSpeech
, pp. 807-810
-
-
Saerens, M.1
Bourlard, H.2
-
65
-
-
0030262262
-
An MLP/HMM hybrid model using linear predictors
-
Y.J. Chung and C.K. Un, "An MLP/HMM hybrid model using linear predictors," Speech Communication, vol.19, pp.307-316, 1996.
-
(1996)
Speech Communication
, vol.19
, pp. 307-316
-
-
Chung, Y.J.1
Un, C.K.2
-
66
-
-
0028516022
-
Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
-
L. Deng, M. Aksmanovic, X. Sun, and C.F.J. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech & Audio Process., vol.2, no.4, pp.507-520, 1994.
-
(1994)
IEEE Trans. Speech & Audio Process.
, vol.2
, Issue.4
, pp. 507-520
-
-
Deng, L.1
Aksmanovic, M.2
Sun, X.3
Wu, C.F.J.4
-
67
-
-
0011495820
-
Speech recognition by hidden Markov model using segmental statistics
-
IEICE Technical Report, SP90-69
-
Y. Hirata, I. Hayakawa, Y. Ono, and S. Nakagawa, "Speech recognition by hidden Markov model using segmental statistics," IEICE Technical Report, SP90-69, 1990.
-
(1990)
-
-
Hirata, Y.1
Hayakawa, I.2
Ono, Y.3
Nakagawa, S.4
-
68
-
-
0011400311
-
Syllable recognition by hidden Markov model using fixed-length segmental statistics
-
May
-
S. Nakagawa, Y. Hirata, and Y. Ono, "Syllable recognition by hidden Markov model using fixed-length segmental statistics," IEICE Trans., vol.J75-D-II, no.5, pp.843-851, May 1992.
-
(1992)
IEICE Trans.
, vol.J75-D-II
, Issue.5
, pp. 843-851
-
-
Nakagawa, S.1
Hirata, Y.2
Ono, Y.3
-
69
-
-
0000321310
-
Explicit correlation in hidden Markov model for speech recognition
-
C.J. Wellekens, "Explicit correlation in hidden Markov model for speech recognition," Proc. ICASSP, vol.I, pp.383-386, 1987.
-
(1987)
Proc. ICASSP
, vol.1
, pp. 383-386
-
-
Wellekens, C.J.1
-
70
-
-
0030261616
-
Modelling of the interframe dependence in an HMM using conditional Gaussian mixtures
-
J. Ming and F.J. Smith, "Modelling of the interframe dependence in an HMM using conditional Gaussian mixtures," Computer Speech and Language, vol.10, pp.229-242, 1996.
-
(1996)
Computer Speech and Language
, vol.10
, pp. 229-242
-
-
Ming, J.1
Smith, F.J.2
-
71
-
-
0027167185
-
A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
-
K. Aikawa, H. Singer, H. Kawahara, and Y. Tohkura, "A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition," Proc. ICASSP, pp.668-671, 1993.
-
(1993)
Proc. ICASSP
, pp. 668-671
-
-
Aikawa, K.1
Singer, H.2
Kawahara, H.3
Tohkura, Y.4
-
72
-
-
0011453543
-
Comparative evaluation of segmental unit input HMM and conditional density HMM
-
K. Yamamoto and S. Nakagawa, "Comparative evaluation of segmental unit input HMM and conditional density HMM," Proc. EuroSpeech, pp.1615-1618, 1995.
-
(1995)
Proc. EuroSpeech
, pp. 1615-1618
-
-
Yamamoto, K.1
Nakagawa, S.2
-
73
-
-
85128367481
-
Continuous speech recognition using segmental unit input HMM with a mixture of probability density functions and context dependency
-
K. Hanai, K. Yamamoto, N. Minematsu, and S. Nakagawa, "Continuous speech recognition using segmental unit input HMM with a mixture of probability density functions and context dependency," Proc. ICSLP, pp.2935-2938, 1998.
-
(1998)
Proc. ICSLP
, pp. 2935-2938
-
-
Hanai, K.1
Yamamoto, K.2
Minematsu, N.3
Nakagawa, S.4
-
74
-
-
0011494012
-
Speaker-independent phoneme and word recognition by statistical classification methods for time-sequential patterns
-
Oct.
-
S. Nakagawa and Y. Enomoto, "Speaker-independent phoneme and word recognition by statistical classification methods for time-sequential patterns," IEICE Trans., vol.J71-D, no.10, pp.1977-1983, Oct. 1988.
-
(1988)
IEICE Trans.
, vol.J71-D
, Issue.10
, pp. 1977-1983
-
-
Nakagawa, S.1
Enomoto, Y.2
-
75
-
-
84926271491
-
Recognition on unvoiced plosive using time spectrum pattern
-
May
-
K. Ide, S. Makino, and K. Kido, "Recognition on unvoiced plosive using time spectrum pattern," J. Acoust. Soc. Japan, vol.39, no.5, pp.321-329, May 1983.
-
(1983)
J. Acoust. Soc. Japan
, vol.39
, Issue.5
, pp. 321-329
-
-
Ide, K.1
Makino, S.2
Kido, K.3
-
76
-
-
0024900279
-
A stochastic segment model for phoneme-based continuous speech recognition
-
M. Ostendorf and S. Roukos, "A stochastic segment model for phoneme-based continuous speech recognition," IEEE Trans. Acoust., Speech & Signal Process., vol.37, no.12, pp.1857-1869, 1989.
-
(1989)
IEEE Trans. Acoust., Speech & Signal Process.
, vol.37
, Issue.12
, pp. 1857-1869
-
-
Ostendorf, M.1
Roukos, S.2
-
77
-
-
0025594074
-
Connectionist Viterbi training: A new hybrid for continuous speech recognition
-
M. Franzini and K.-F. Lee, "Connectionist Viterbi training: A new hybrid for continuous speech recognition," Proc. ICASSP, vol.I, pp.425-428, 1990.
-
(1990)
Proc. ICASSP
, vol.1
, pp. 425-428
-
-
Franzini, M.1
Lee, K.-F.2
-
78
-
-
0028194709
-
Connectionist probability estimators in HMM speech recognition
-
S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Trans. Speech & Audio Process., vol.2, no.1, pp.161-174, 1994.
-
(1994)
IEEE Trans. Speech & Audio Process.
, vol.2
, Issue.1
, pp. 161-174
-
-
Renals, S.1
Morgan, N.2
Bourlard, H.3
Cohen, M.4
Franco, H.5
-
79
-
-
77954383749
-
Data-driven extensions to HMM statistical dependencies
-
J.A. Bilmes, "Data-driven extensions to HMM statistical dependencies," Proc. ICSLP, pp.69-72, 1998.
-
(1998)
Proc. ICSLP
, pp. 69-72
-
-
Bilmes, J.A.1
-
80
-
-
0011498040
-
Inter-frame dependence arising from preceding and succeeding frames - Application to speech recognition
-
P. Hanna, J. Ming, and F.J. Smith, "Inter-frame dependence arising from preceding and succeeding frames - Application to speech recognition," Speech Communication, vol.31, no.4, pp.1301-1312, 1999.
-
(1999)
Speech Communication
, vol.31
, Issue.4
, pp. 1301-1312
-
-
Hanna, P.1
Ming, J.2
Smith, F.J.3
-
81
-
-
0009626005
-
The IBM large vocabulary continuous speech recognition system for the ARPA NAB news task
-
L.R. Bahl, P.F. Brown, P.V. Souza, and R.L. Mercer, "The IBM large vocabulary continuous speech recognition system for the ARPA NAB news task," Proc. Spoken Language Systems Technology Workshop, pp.121-126, 1995.
-
(1995)
Proc. Spoken Language Systems Technology Workshop
, pp. 121-126
-
-
Bahl, L.R.1
Brown, P.F.2
Souza, P.V.3
Mercer, R.L.4
-
82
-
-
0028996957
-
A unified way in incorporating segmental feature and segmental model into HMM
-
J. He and H. Leich, "A unified way in incorporating segmental feature and segmental model into HMM," Proc. ICASSP, vol.I, pp.532-535, 1995.
-
(1995)
Proc. ICASSP
, vol.1
, pp. 532-535
-
-
He, J.1
Leich, H.2
-
83
-
-
85027200620
-
The property of asymmetric segment
-
IEICE Technical Report, SP98-30
-
T. Ohtuki and T. Ohtomo, "The property of asymmetric segment," IEICE Technical Report, SP98-30, 1998.
-
(1998)
-
-
Ohtuki, T.1
Ohtomo, T.2
-
84
-
-
0030245363
-
From HMMs to segment models: A unified view of stochastic modeling for speech recognition
-
M. Ostendorf, V.V. Digalakis, and O.A. Kimball, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.360-378, 1996.
-
(1996)
IEEE Trans. Speech & Audio Process.
, vol.4
, Issue.5
, pp. 360-378
-
-
Ostendorf, M.1
Digalakis, V.V.2
Kimball, O.A.3
-
85
-
-
0032048095
-
Assessing the importance of the segmentation probability in segment-based speech recognition
-
J. Verhasselt, I. Illina, J.P. Martens, Y. Gong, and J.-P. Haton, "Assessing the importance of the segmentation probability in segment-based speech recognition," Speech Communication, vol.24, pp.51-72, 1998.
-
(1998)
Speech Communication
, vol.24
, pp. 51-72
-
-
Verhasselt, J.1
Illina, I.2
Martens, J.P.3
Gong, Y.4
Haton, J.-P.5
-
86
-
-
0023846644
-
Stochastic segment modeling using the estimate-maximize algorithm
-
S. Roukos, M. Ostendorf, H. Gish, and A. Derr, "Stochastic segment modeling using the estimate-maximize algorithm," Proc. ICASSP, pp.127-130, 1988.
-
(1988)
Proc. ICASSP
, pp. 127-130
-
-
Roukos, S.1
Ostendorf, M.2
Gish, H.3
Derr, A.4
-
87
-
-
0031185482
-
Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions
-
L. Deng and M. Aksmanovic, "Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions," IEEE Trans. Speech & Audio Process., vol.5, no.4, pp.319-324, 1997.
-
(1997)
IEEE Trans. Speech & Audio Process.
, vol.5
, Issue.4
, pp. 319-324
-
-
Deng, L.1
Aksmanovic, M.2
-
88
-
-
0032206267
-
Speech trajectory discrimination using the minimum classification error learning
-
R. Chengalvarayan and L. Deng, "Speech trajectory discrimination using the minimum classification error learning," IEEE Trans. Speech & Audio Process., vol.6, no.6, pp.505-515, 1998.
-
(1998)
IEEE Trans. Speech & Audio Process.
, vol.6
, Issue.6
, pp. 505-515
-
-
Chengalvarayan, R.1
Deng, L.2
-
90
-
-
0034478708
-
Improving phoneme classification performance using observation context-dependent segment models
-
M. Szarvas and S. Matsunaga, "Improving phoneme classification performance using observation context-dependent segment models," Int. J. Speech Technology, vol.3, pp.253-262, 2000.
-
(2000)
Int. J. Speech Technology
, vol.3
, pp. 253-262
-
-
Szarvas, M.1
Matsunaga, S.2
-
91
-
-
0027681974
-
ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
-
V. Digalakis, J.R. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech & Audio Process., vol.1, no.4, pp.431-442, 1993.
-
(1993)
IEEE Trans. Speech & Audio Process.
, vol.1
, Issue.4
, pp. 431-442
-
-
Digalakis, V.1
Rohlicek, J.R.2
Ostendorf, M.3
-
92
-
-
0011458458
-
Kalman-filter solved by personal computer
-
Maruzen
-
M. Nakano and K. Nishiyama, Kalman-filter solved by personal computer, Maruzen, 1993.
-
(1993)
-
-
Nakano, M.1
Nishiyama, K.2
-
93
-
-
0011432608
-
Time series analysis programming
-
Iwanami shoten
-
G. Kitagawa, Time series analysis programming, Iwanami shoten, 1993.
-
(1993)
-
-
Kitagawa, G.1
-
94
-
-
0029755019
-
Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition
-
M. Afify, Y. Gong, and J.-P. Haton, "Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition," Computer Speech and Language, vol.10, pp.23-36, 1996.
-
(1996)
Computer Speech and Language
, vol.10
, pp. 23-36
-
-
Afify, M.1
Gong, Y.2
Haton, J.-P.3
-
95
-
-
0011450090
-
Constraining model duration variance in HMM-based connected speech recognition
-
M.M. Hochberg and H.F. Silverman, "Constraining model duration variance in HMM-based connected speech recognition," Proc. EuroSpeech, pp.323-326, 1993.
-
(1993)
Proc. EuroSpeech
, pp. 323-326
-
-
Hochberg, M.M.1
Silverman, H.F.2
-
96
-
-
0029368174
-
Nonstationary hidden Markov model
-
B. Sin and J.H. Kim, "Nonstationary hidden Markov model," Signal Processing, vol.46, pp.31-46, 1995.
-
(1995)
Signal Processing
, vol.46
, pp. 31-46
-
-
Sin, B.1
Kim, J.H.2
-
97
-
-
0030247529
-
Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities
-
Y.K. Park, C.K. Un, and O.W. Kwon, "Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.389-392, 1996.
-
(1996)
IEEE Trans. Speech & Audio Process.
, vol.4
, Issue.5
, pp. 389-392
-
-
Park, Y.K.1
Un, C.K.2
Kwon, O.W.3
-
98
-
-
0000698482
-
Japanese dictation toolkit - 1997 version
-
May
-
T. Kawahara, A. Lee, T. Kobayashi, K. Takeda, N. Minematsu, K. Itoh, A. Itoh, M. Yamamoto, A. Yamada, T. Utsuro, and K. Shikano, "Japanese dictation toolkit - 1997 version," J. Acoust. Soc. Japan, vol.E20, no.3, pp.223-239, May 1999.
-
(1999)
J. Acoust. Soc. Japan
, vol.E20
, Issue.3
, pp. 223-239
-
-
Kawahara, T.1
Lee, A.2
Kobayashi, T.3
Takeda, K.4
Minematsu, N.5
Itoh, K.6
Itoh, A.7
Yamamoto, M.8
Yamada, A.9
Utsuro, T.10
Shikano, K.11
-
99
-
-
0029352735
-
Continuous speech dictation - From theory to practice
-
V. Steinbiss, H. Ney, U. Essen, B.-H. Tran, X. Aubert, C. Dugast, R. Kneser, H.-G. Meier, M. Oerder, R. Haeb-Umbach, D. Geller, W. Höllerbauer, and H. Bartosik, "Continuous speech dictation - From theory to practice," Speech Communication, vol.17, pp.19-38, 1995.
-
(1995)
Speech Communication
, vol.17
, pp. 19-38
-
-
Steinbiss, V.1
Ney, H.2
Essen, U.3
Tran, B.-H.4
Aubert, X.5
Dugast, C.6
Kneser, R.7
Meier, H.-G.8
Oerder, M.9
Haeb-Umbach, R.10
Geller, D.11
Höllerbauer, W.12
Bartosik, H.13
-
100
-
-
0011453546
-
Recognition of spoken words based on VCV syllable unit
-
May
-
R. Nakatsu and M. Kohda, "Recognition of spoken words based on VCV syllable unit," IEICE Trans., vol.J61-A, no.5, pp.464-471, May 1978.
-
(1978)
IEICE Trans.
, vol.J61-A
, Issue.5
, pp. 464-471
-
-
Nakatsu, R.1
Kohda, M.2
-
101
-
-
0022185407
-
Context-dependent modeling for acoustic-phonetic recognition of continuous speech
-
R. Schwartz, Y. Chow, O. Kimball, S. Roucos, M. Krasner, and J. Makhoul, "Context-dependent modeling for acoustic-phonetic recognition of continuous speech," Proc. ICASSP, pp.1203-1208, 1985.
-
(1985)
Proc. ICASSP
, pp. 1203-1208
-
-
Schwartz, R.1
Chow, Y.2
Kimball, O.3
Roucos, S.4
Krasner, M.5
Makhoul, J.6
-
103
-
-
0028996852
-
The 1994 HTK large vocabulary speech recognition system
-
P.C. Woodland, C.J. Leggetter, J.J. Odell, V. Valtchev, and S.J. Young, "The 1994 HTK large vocabulary speech recognition system," Proc. ICASSP, pp.73-76, 1995.
-
(1995)
Proc. ICASSP
, pp. 73-76
-
-
Woodland, P.C.1
Leggetter, C.J.2
Odell, J.J.3
Valtchev, V.4
Young, S.J.5
-
104
-
-
0011453547
-
Comparison of syntax-oriented spoken Japanese understanding with semantic-oriented system
-
July
-
S. Nakagawa, Y. Hirata, I. Murase, and T. Tanoue, "Comparison of syntax-oriented spoken Japanese understanding with semantic-oriented system," IEICE Trans., vol.E74, no.7, pp.1854-1862, July 1991.
-
(1991)
IEICE Trans.
, vol.E74
, Issue.7
, pp. 1854-1862
-
-
Nakagawa, S.1
Hirata, Y.2
Murase, I.3
Tanoue, T.4
-
105
-
-
0024889251
-
Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data
-
T. Watanabe, "Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data," Proc. ICASSP, S1.1, 1985.
-
Proc. ICASSP, S1.1, 1985.
-
-
Watanabe, T.1
-
106
-
-
0011448906
-
Multivariate statistical analysis of VCV syllables
-
Jan.
-
T. Sakai and K. Tabata, "Multivariate statistical analysis of VCV syllables," IEICE Trans., vol.56-D, no.1, pp.63-70, Jan. 1973.
-
(1973)
IEICE Trans.
, vol.56-D
, Issue.1
, pp. 63-70
-
-
Sakai, T.1
Tabata, K.2
-
107
-
-
34248800020
-
Mora or syllable? Speech segmentation in Japanese
-
T. Otake, G. Hatano, A. Cutler, and J. Mehler, "Mora or syllable? Speech segmentation in Japanese," J. Mem. Lang, vol.32, pp.358-378, 1993.
-
(1993)
J. Mem. Lang
, vol.32
, pp. 358-378
-
-
Otake, T.1
Hatano, G.2
Cutler, A.3
Mehler, J.4
-
108
-
-
0031632630
-
Advances in alphadigit recognition using syllables
-
J. Hamaker, A. Ganapathiraju, J. Picone, and J.J. Godfrey, "Advances in alphadigit recognition using syllables," Proc. ICASSP, pp.421-424, 1998.
-
(1998)
Proc. ICASSP
, pp. 421-424
-
-
Hamaker, J.1
Ganapathiraju, A.2
Picone, J.3
Godfrey, J.J.4
-
109
-
-
0003462715
-
Hidden Markov model for speech recognition
-
Edinburgh University Press
-
X.D. Huang, Y. Ariki, and M.A. Jack, Hidden Markov model for speech recognition, Edinburgh University Press, 1990.
-
(1990)
-
-
Huang, X.D.1
Ariki, Y.2
Jack, M.A.3
-
110
-
-
85015539783
-
Subphonetic modeling with Markov states-SENONE
-
M.-Y. Hwang, and X. Huang, "Subphonetic modeling with Markov states-SENONE," Proc. ICASSP, pp.33-36, 1992.
-
(1992)
Proc. ICASSP
, pp. 33-36
-
-
Hwang, M.-Y.1
Huang, X.2
-
111
-
-
0030193422
-
Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers
-
V.V. Digalakis, P. Monaco, and H. Murveit, "Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers," IEEE Trans. Speech & Audio Process., vol.4, no.4, pp.281-288, 1996.
-
(1996)
IEEE Trans. Speech & Audio Process.
, vol.4
, Issue.4
, pp. 281-288
-
-
Digalakis, V.V.1
Monaco, P.2
Murveit, H.3
-
112
-
-
0028530231
-
State clustering in hidden Markov model-based continuous speech recognition
-
S.J. Young and P.C. Woodland, "State clustering in hidden Markov model-based continuous speech recognition," Computer Speech and Language, vol.8, pp.369-383, 1994.
-
(1994)
Computer Speech and Language
, vol.8
, pp. 369-383
-
-
Young, S.J.1
Woodland, P.C.2
-
113
-
-
85027105819
-
Prediction about unknown phonetic context by tree-based phone modeling
-
Technical Report, SP90-64, IEICE
-
S. Hayamizu and K. Tanaka, "Prediction about unknown phonetic contexts by tree-based phone modeling," Technical Report, SP90-64, IEICE, 1990.
-
(1990)
-
-
Hayamizu, S.1
Tanaka, K.2
-
114
-
-
85013744934
-
A successive state splitting algorithm for efficient allophone modeling
-
J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," Proc. ICASSP, pp.574-577, 1992.
-
(1992)
Proc. ICASSP
, pp. 574-577
-
-
Takami, J.1
Sagayama, S.2
-
115
-
-
0011471866
-
A study on HM-nets using phonetic decision tree-based successive state splitting
-
Oct.
-
T. Hori, M. Katoh, A. Itoh, and M. Kohda, "A study on HM-nets using phonetic decision tree-based successive state splitting," IEICE Trans. Inf. & Syst., vol.J80-D-II, no.10, pp.2645-2654, Oct. 1997.
-
(1997)
IEICE Trans. Inf. & Syst.
, vol.J80-D-II
, Issue.10
, pp. 2645-2654
-
-
Hori, T.1
Katoh, M.2
Itoh, A.3
Kohda, M.4
-
117
-
-
85007758082
-
Minimum error classification training of HMMs implementation details and experimental results
-
D. Rainton, and S. Sagayama, "Minimum error classification training of HMMs implementation details and experimental results," J. Acoust. Soc. Japan, vol.13, no.6, pp.379-388, 1992.
-
(1992)
J. Acoust. Soc. Japan
, vol.13
, Issue.6
, pp. 379-388
-
-
Rainton, D.1
Sagayama, S.2
-
118
-
-
0011400313
-
Estimating hidden Markov model parameters so as to maximize speech recognition accuracy
-
L.R. Bahl, P.F. Brown, P.V. Souza, and R.L. Mercer, "Estimating hidden Markov model parameters so as to maximize speech recognition accuracy," IEEE Trans. Speech & Audio Process., vol.1, no.1, pp.77-82, 1993.
-
(1993)
IEEE Trans. Speech & Audio Process.
, vol.1
, Issue.1
, pp. 77-82
-
-
Bahl, L.R.1
Brown, P.F.2
Souza, P.V.3
Mercer, R.L.4
-
119
-
-
0028412908
-
High performance connected digit recognition using maximum mutual information estimation
-
Y. Normandin, R. Cardin, and R. de Mori, "High performance connected digit recognition using maximum mutual information estimation," IEEE Trans. Speech & Audio Process., vol.2, pp.299-311, 1994.
-
(1994)
IEEE Trans. Speech & Audio Process.
, vol.2
, pp. 299-311
-
-
Normandin, Y.1
Cardin, R.2
De Mori, R.3
-
120
-
-
0031222490
-
MMIE training of large vocabulary recognition systems
-
V. Valtchev, J. Odell, P. Woodland, and S. Young, "MMIE training of large vocabulary recognition systems," Speech Communication, vol.22, pp.303-314, 1997.
-
(1997)
Speech Communication
, vol.22
, pp. 303-314
-
-
Valtchev, V.1
Odell, J.2
Woodland, P.3
Young, S.4
-
121
-
-
85128400029
-
Discriminative training of GMM using a modified EM algorithm for speaker recognition
-
K. Markov and S. Nakagawa, "Discriminative training of GMM using a modified EM algorithm for speaker recognition," Proc. ICSLP, vol.2, pp.177-180, 1998.
-
(1998)
Proc. ICSLP
, vol.2
, pp. 177-180
-
-
Markov, K.1
Nakagawa, S.2
-
122
-
-
0030235132
-
Performance of HMM-based speech recognizers with discriminative state-weights
-
O.W. Kwon and C.K. Un, "Performance of HMM-based speech recognizers with discriminative state-weights," Speech Communication, vol.19, pp.197-205, 1996.
-
(1996)
Speech Communication
, vol.19
, pp. 197-205
-
-
Kwon, O.W.1
Un, C.K.2
-
123
-
-
0032762247
-
Selective training for hidden Markov models with applications to speech classification
-
L.M. Arslan and J.H.L. Hansen, "Selective training for hidden Markov models with applications to speech classification," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.46-64, 1999.
-
(1999)
IEEE Trans. Speech & Audio Process.
, vol.7
, Issue.1
, pp. 46-64
-
-
Arslan, L.M.1
Hansen, J.H.L.2
-
124
-
-
0002235014
-
Improved feature decorrelation for HMM-based speech recognition
-
K. Demuynck, J. Duchateau, D. Van Compernolle, and P. Wambacq, "Improved feature decorrelation for HMM-based speech recognition," Proc. ICSLP, pp.2907-2910, 1998.
-
(1998)
Proc. ICSLP
, pp. 2907-2910
-
-
Demuynck, K.1
Duchateau, J.2
Van Compernolle, D.3
Wambacq, P.4
-
125
-
-
0029725604
-
A parametric approach to vocal tract length normalization
-
E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," Proc. ICASSP, pp.346-349, 1996.
-
(1996)
Proc. ICASSP
, pp. 346-349
-
-
Eide, E.1
Gish, H.2
-
126
-
-
0034847002
-
The 1998 HTK system for transcription of conversational telephone speech
-
T. Hain, P.C. Woodland, T.R. Niesler, and E.W.D. Whittaker, "The 1998 HTK system for transcription of conversational telephone speech," Proc. ICASSP, pp.57-60, 1999.
-
(1999)
Proc. ICASSP
, pp. 57-60
-
-
Hain, T.1
Woodland, P.C.2
Niesler, T.R.3
Whittaker, E.W.D.4
-
127
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
J.-L. Gauvain and C.H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech & Audio Process., vol.2, pp.291-298, 1994.
-
(1994)
IEEE Trans. Speech & Audio Process.
, vol.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.H.2
-
128
-
-
0030263447
-
Mean and variance adaptation within the MLLR framework
-
M.J.F. Gales and P.C. Woodland, "Mean and variance adaptation within the MLLR framework," Computer Speech and Language, vol.10, pp.249-264, 1996.
-
(1996)
Computer Speech and Language
, vol.10
, pp. 249-264
-
-
Gales, M.J.F.1
Woodland, P.C.2
-
129
-
-
0033100038
-
Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
-
V.D. Diakoloukas and V.V. Digalakis, "Maximum-likelihood stochastic-transformation adaptation of hidden Markov models," IEEE Trans. Speech & Audio Process., vol.7, no.2, pp.177-187, 1999.
-
(1999)
IEEE Trans. Speech & Audio Process.
, vol.7
, Issue.2
, pp. 177-187
-
-
Diakoloukas, V.D.1
Digalakis, V.V.2
-
130
-
-
0031704151
-
Speaker clustering and transformation for speaker adaptation in speech recognition systems
-
M. Padmanabhan, L.R. Bahl, D. Nahamoo, and M.A. Picheny, "Speaker clustering and transformation for speaker adaptation in speech recognition systems," IEEE Trans. Speech & Audio Process., vol.6, no.1, pp.71-77, 1998.
-
(1998)
IEEE Trans. Speech & Audio Process.
, vol.6
, Issue.1
, pp. 71-77
-
-
Padmanabhan, M.1
Bahl, L.R.2
Nahamoo, D.3
Picheny, M.A.4
-
131
-
-
85135109228
-
Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs
-
K. Ohkura, M. Sugiyama, and S. Sagayama, "Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs," Proc. ICSLP, pp.369-372, 1992.
-
(1992)
Proc. ICSLP
, pp. 369-372
-
-
Ohkura, K.1
Sugiyama, M.2
Sagayama, S.3
-
132
-
-
0011411817
-
Speaker adaptation of acoustic models using correlations of transfer vectors
-
March
-
S. Takahashi and S. Sagayama, "Speaker adaptation of acoustic models using correlations of transfer vectors," IEICE Trans., vol.J82-D-II, no.3, pp.324-331, March 1999.
-
(1999)
IEICE Trans.
, vol.J82-D-II
, Issue.3
, pp. 324-331
-
-
Takahashi, S.1
Sagayama, S.2
-
133
-
-
0002488301
-
Speaker adaptation with autonomous control using tree structure
-
K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous control using tree structure," Proc. EuroSpeech, pp.1143-1146, 1995.
-
(1995)
Proc. EuroSpeech
, pp. 1143-1146
-
-
Shinoda, K.1
Watanabe, T.2
-
134
-
-
0030189744
-
Speaker adaptation using combined transformation and Bayesian methods
-
V.V. Digalakis and L.G. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech & Audio Process., vol.4, no.4, pp.249-300, 1996.
-
(1996)
IEEE Trans. Speech & Audio Process.
, vol.4
, Issue.4
, pp. 249-300
-
-
Digalakis, V.V.1
Neumeyer, L.G.2
-
135
-
-
0000521080
-
Speaker adaptation using maximum a posteriori probability estimation and data size dependent parameter smoothing
-
March
-
M. Tonomura, T. Kosaka, and S. Matsumura, "Speaker adaptation using maximum a posteriori probability estimation and data size dependent parameter smoothing," IEICE Trans., vol.J81-D-II, no.3, pp.465-471, March 1998.
-
(1998)
IEICE Trans.
, vol.J81-D-II
, Issue.3
, pp. 465-471
-
-
Tonomura, M.1
Kosaka, T.2
Matsumura, S.3
-
136
-
-
0035279111
-
A structural Bayes approach to speaker adaptation
-
K. Shinoda and C.H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech & Audio Process., vol.9, no.3, pp.276-287, 2001.
-
(2001)
IEEE Trans. Speech & Audio Process.
, vol.9
, Issue.3
, pp. 276-287
-
-
Shinoda, K.1
Lee, C.H.2
-
137
-
-
0011448907
-
Automatic speech recognition by stochastic approaches
-
Feb.
-
S. Nakagawa, "Automatic speech recognition by stochastic approaches," J. Acoust. Soc. Japan, vol.50, no.2, pp.126-132, Feb. 1994.
-
(1994)
J. Acoust. Soc. Japan
, vol.50
, Issue.2
, pp. 126-132
-
-
Nakagawa, S.1
-
138
-
-
0011458461
-
Automatic learning of stochastic context-free grammar for spontaneous speech by integration of bigram
-
March
-
S. Nakagawa and K. Ohtani, "Automatic learning of stochastic context-free grammar for spontaneous speech by integration of bigram," Trans. Inf. Process. Soc. Japan, vol.39, no.3, pp.575-584, March 1998.
-
(1998)
Trans. Inf. Process. Soc. Japan
, vol.39
, Issue.3
, pp. 575-584
-
-
Nakagawa, S.1
Ohtani, K.2
-
139
-
-
0011509488
-
A study of large-vocabulary continuous speech recognition using higher order n-gram language models
-
Spring
-
K. Ohtsuki, K. Yoshida, T. Matsuoka, and S. Furui, "A study of large-vocabulary continuous speech recognition using higher order n-gram language models," Conf. Record, Acoust. Soc. Japan, pp.47-48, Spring 1997.
-
(1997)
Conf. Record, Acoust. Soc. Japan
, pp. 47-48
-
-
Ohtsuki, K.1
Yoshida, K.2
Matsuoka, T.3
Furui, S.4
-
140
-
-
0028996884
-
Phrase bigrams for continuous speech recognition
-
E.P. Giachin, "Phrase bigrams for continuous speech recognition," Proc. ICASSP, pp.225-227, 1995.
-
(1995)
Proc. ICASSP
, pp. 225-227
-
-
Giachin, E.P.1
-
141
-
-
0011496729
-
Effect of vocabulary extension using word sequence concatenation for large vocabulary continuous speech recognition
-
April
-
Y. Wada, N. Kobayashi, Y. Nakano and T. Kobayashi, "Effect of vocabulary extension using word sequence concatenation for large vocabulary continuous speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1413-1420, April 1999.
-
(1999)
Trans. Inf. Process. Soc. Japan
, vol.40
, Issue.4
, pp. 1413-1420
-
-
Wada, Y.1
Kobayashi, N.2
Nakano, Y.3
Kobayashi, T.4
-
142
-
-
0011501276
-
A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition
-
Jan.
-
S. Nakagawa, H. Akamatsu, and H. Nishizaki, "A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition," Natural Language Processing, vol.6, no.2, pp.97-115, Jan. 1999.
-
(1999)
Natural Language Processing
, vol.6
, Issue.2
, pp. 97-115
-
-
Nakagawa, S.1
Akamatsu, H.2
Nishizaki, H.3
-
143
-
-
0028996879
-
Language modeling by variable length sequences, theoretical formulation and evaluation of multigrams
-
S. Deligne and F. Bimbot, "Language modeling by variable length sequences, theoretical formulation and evaluation of multigrams," Proc. ICASSP, pp.169-172, 1995.
-
(1995)
Proc. ICASSP
, pp. 169-172
-
-
Deligne, S.1
Bimbot, F.2
-
144
-
-
0029762785
-
Variable-order N-gram generation by word-class splitting and consecutive word grouping
-
H. Masataki and Y. Sagisaka, "Variable-order N-gram generation by word-class splitting and consecutive word grouping," Proc. ICASSP, pp. 188-191, 1996.
-
(1996)
Proc. ICASSP
, pp. 188-191
-
-
Masataki, H.1
Sagisaka, Y.2
-
145
-
-
0024700466
-
Tree-based statistical language model for natural language speech recognition
-
L.R. Bahl, P.F. Brown, P.V. Souza, and R.L. Mercer, "Tree-based statistical language model for natural language speech recognition," IEEE Trans. Acoust. Speech & Signal Process., vol.37, no.7, pp.1001-1008, 1989.
-
(1989)
IEEE Trans. Acoust. Speech & Signal Process.
, vol.37
, Issue.7
, pp. 1001-1008
-
-
Bahl, L.R.1
Brown, P.F.2
Souza, P.V.3
Mercer, R.L.4
-
146
-
-
0011464163
-
Word clustering for class-based language models
-
S. Mori, M. Nishimura, and N. Itoh, "Word clustering for class-based language models," Trans. Inf. Process. Soc. Japan, vol.38, no.11, pp.2200-2207, 1997.
-
(1997)
Trans. Inf. Process. Soc. Japan
, vol.38
, Issue.11
, pp. 2200-2207
-
-
Mori, S.1
Nishimura, M.2
Itoh, N.3
-
147
-
-
0032650074
-
Variable-length category n-gram language models
-
T.R. Niesler and P.C. Woodland, "Variable-length category n-gram language models," Computer Speech and Language, vol.13, pp.99-124, 1999.
-
(1999)
Computer Speech and Language
, vol.13
, pp. 99-124
-
-
Niesler, T.R.1
Woodland, P.C.2
-
148
-
-
0000797420
-
An estimation of an upper bound for the entropy of Japanese
-
S. Mori, and O. Yamaji, "An estimation of an upper bound for the entropy of Japanese," Trans. Inf. Process. Soc. Japan, vol.38, no.11, pp.2191-2199, 1997.
-
(1997)
Trans. Inf. Process. Soc. Japan
, vol.38
, Issue.11
, pp. 2191-2199
-
-
Mori, S.1
Yamaji, O.2
-
149
-
-
0030181951
-
A maximum entropy approach to adaptive statistical language modeling
-
R. Rosenfeld, "A maximum entropy approach to adaptive statistical language modeling," Computer Speech and Language, vol.10, pp.187-228, 1996.
-
(1996)
Computer Speech and Language
, vol.10
, pp. 187-228
-
-
Rosenfeld, R.1
-
150
-
-
0033106616
-
Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition
-
Z.G. Dong, and L.K. Teng, "Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition," Computer Speech and Language, vol.13, pp.125-141, 1999.
-
(1999)
Computer Speech and Language
, vol.13
, pp. 125-141
-
-
Dong, Z.G.1
Teng, L.K.2
-
151
-
-
0032165145
-
A multispan language modeling framework for large vocabulary speech recognition
-
J.R. Bellegarda, "A multispan language modeling framework for large vocabulary speech recognition," IEEE Trans. Speech & Audio Process., vol.6, no.5, pp.456-467, 1998.
-
(1998)
IEEE Trans. Speech & Audio Process.
, vol.6
, Issue.5
, pp. 456-467
-
-
Bellegarda, J.R.1
-
152
-
-
0011471867
-
Multispan statistical language modeling for large vocabulary speech recognition
-
J.R. Bellegarda, "Multispan statistical language modeling for large vocabulary speech recognition," Proc. ICSLP, pp.2395-2398, 1998.
-
(1998)
Proc. ICSLP
, pp. 2395-2398
-
-
Bellegarda, J.R.1
-
153
-
-
0032785782
-
Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
-
R.M. Iyer and M. Ostendorf, "Modeling long distance dependence in language: Topic mixtures versus dynamic cache models," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.31-39, 1999.
-
(1999)
IEEE Trans. Speech & Audio Process.
, vol.7
, Issue.1
, pp. 31-39
-
-
Iyer, R.M.1
Ostendorf, M.2
-
154
-
-
0002235611
-
Adaptive topic-dependent language modeling using word-based varigrams
-
S. Martin, J. Liermann, and H. Ney, "Adaptive topic-dependent language modeling using word-based varigrams," Proc. EuroSpeech, pp.1447-1450, 1997.
-
(1997)
Proc. EuroSpeech
, pp. 1447-1450
-
-
Martin, S.1
Liermann, J.2
Ney, H.3
-
155
-
-
0011408731
-
Dictation of broadcast news speech using word pronunciation probability
-
Spring
-
K. Takagi and S. Furui, "Dictation of broadcast news speech using word pronunciation probability," Conf. Record, Acoust. Soc. Japan, pp.9-10, Spring 1998.
-
(1998)
Conf. Record, Acoust. Soc. Japan
, pp. 9-10
-
-
Takagi, K.1
Furui, S.2
-
156
-
-
0011451282
-
An improvement of language modeling for automatic transcription of Japanese broadcast-news speech
-
Spring
-
N. Sakurai and S. Furui, "An improvement of language modeling for automatic transcription of Japanese broadcast-news speech," Conf. Record, Acoust. Soc. Japan, pp.57-58, Spring 1999.
-
(1999)
Conf. Record, Acoust. Soc. Japan
, pp. 57-58
-
-
Sakurai, N.1
Furui, S.2
-
157
-
-
0011408732
-
A language model for recognition of continuously uttered sentences
-
Spring
-
T. Imai, Y. Saito, A. Ando, and S. Furui, "A language model for recognition of continuously uttered sentences," Conf. Record, Acoust. Soc. Japan, pp.63-64, Spring 1999.
-
(1999)
Conf. Record, Acoust. Soc. Japan
, pp. 63-64
-
-
Imai, T.1
Saito, Y.2
Ando, A.3
Furui, S.4
-
158
-
-
0011404832
-
Time dependent language model for broadcast news transcription
-
April
-
A. Kobayashi, T. Imai, A. Ando, and K. Nakabayashi, "Time dependent language model for broadcast news transcription," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1421-1429, April 1999.
-
(1999)
Trans. Inf. Process. Soc. Japan
, vol.40
, Issue.4
, pp. 1421-1429
-
-
Kobayashi, A.1
Imai, T.2
Ando, A.3
Nakabayashi, K.4
-
159
-
-
0011402513
-
The influence of morpheme analysis systems on language model for continuous speech recognition
-
Autumn
-
N. Yodo, K. Itoh, S. Nakamura, and K. Shikano, "The influence of morpheme analysis systems on language model for continuous speech recognition," Conf. Record, Acoust. Soc. Japan, pp.53-54, Autumn 1997.
-
(1997)
Conf. Record, Acoust. Soc. Japan
, pp. 53-54
-
-
Yodo, N.1
Itoh, K.2
Nakamura, S.3
Shikano, K.4
-
160
-
-
85024115120
-
An empirical study of smoothing techniques for language modeling
-
S.F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Proc. ACL, pp.310-318, 1996.
-
(1996)
Proc. ACL
, pp. 310-318
-
-
Chen, S.F.1
Goodman, J.2
-
161
-
-
0030124373
-
Succeeding word prediction for speech recognition based on stochastic language model
-
April
-
M. Zhou and S. Nakagawa, "Succeeding word prediction for speech recognition based on stochastic language model," IEICE Trans. Inf. & Syst., vol.E79-D, no.4, pp.333-341, April 1996.
-
(1996)
IEICE Trans. Inf. & Syst.
, vol.E79-D
, Issue.4
, pp. 333-341
-
-
Zhou, M.1
Nakagawa, S.2
-
162
-
-
0010032271
-
Inside-outside reestimation from partially bracketed corpora
-
F. Pereira and Y. Schabes, "Inside-outside reestimation from partially bracketed corpora," Proc. ACL, pp.31-37, 1992.
-
(1992)
Proc. ACL
, pp. 31-37
-
-
Pereira, F.1
Schabes, Y.2
-
163
-
-
84894805373
-
An empirical evaluation of probabilistic lexicalized tree insertion grammars
-
R. Hwa, "An empirical evaluation of probabilistic lexicalized tree insertion grammars," Proc. ACL, pp.557-563, 1998.
-
(1998)
Proc. ACL
, pp. 557-563
-
-
Hwa, R.1
-
164
-
-
85027133681
-
Construction and evaluation of language models based on stochastic context free grammar for speech recognition
-
Technical Report, SP99-37, Inst. Elect. Inf. Comm. Engrs., June
-
C. Hori, M. Katoh, A. Itoh, and M. Kohda, "Construction and evaluation of language models based on stochastic context free grammar for speech recognition," Technical Report, SP99-37, Inst. Elect. Inf. Comm. Engrs., June 1999.
-
(1999)
-
-
Hori, C.1
Katoh, M.2
Itoh, A.3
Kohda, M.4
-
165
-
-
0032673481
-
An automatic acquisition method of statistical finite-state automation sentences
-
M. Suzuki and S. Makino, "An automatic acquisition method of statistical finite-state automation sentences," Proc. ICASSP, pp.737-740, 1999.
-
(1999)
Proc. ICASSP
, pp. 737-740
-
-
Suzuki, M.1
Makino, S.2
-
166
-
-
0011403721
-
Construction of language models using probabilistic GLR methods toward speech recognition
-
April
-
H. Imai, H. Tanaka, and T. Tokunaga, "Construction of language models using probabilistic GLR methods toward speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1404-1411, April 1999.
-
(1999)
Trans. Inf. Process. Soc. Japan
, vol.40
, Issue.4
, pp. 1404-1411
-
-
Imai, H.1
Tanaka, H.2
Tokunaga, T.3
-
167
-
-
0011449593
-
Spontaneous speech understanding method based on LR parsing of keyword lattice
-
Feb.
-
H. Tsuboi, Y. Takebayashi, and H. Hashimoto, "Spontaneous speech understanding method based on LR parsing of keyword lattice," Trans. Inf. Process. Soc. Japan, vol.38, no.2, pp.260-268, Feb. 1997.
-
(1997)
Trans. Inf. Process. Soc. Japan
, vol.38
, Issue.2
, pp. 260-268
-
-
Tsuboi, H.1
Takebayashi, Y.2
Hashimoto, H.3
-
168
-
-
0025517070
-
Automatic recognition of keywords in unconstrained speech using hidden Markov models
-
J.G. Wilpon, L.R. Rabiner, C.-H. Lee, and E.R. Goldman, "Automatic recognition of keywords in unconstrained speech using hidden Markov models," IEEE Trans. Acoust. Speech & Signal Process., vol.38, no.11, pp.1870-1878, 1990.
-
(1990)
IEEE Trans. Acoust. Speech & Signal Process.
, vol.38
, Issue.11
, pp. 1870-1878
-
-
Wilpon, J.G.1
Rabiner, L.R.2
Lee, C.-H.3
Goldman, E.R.4
-
169
-
-
0011449594
-
Processing unknown words in continuous speech recognition
-
July
-
K. Kita, T. Ehara, and T. Morimoto, "Processing unknown words in continuous speech recognition," IEICE Trans., vol.E74, no.7, pp.1811-1816, July 1991.
-
(1991)
IEICE Trans.
, vol.E74
, Issue.7
, pp. 1811-1816
-
-
Kita, K.1
Ehara, T.2
Morimoto, T.3
-
170
-
-
0011501278
-
Comparison of dictation and word spotting techniques in classification of news speech articles
-
IEICE Technical Report, SP98-32, June
-
J. Ogata and Y. Ariki, "Comparison of dictation and word spotting techniques in classification of news speech articles," IEICE Technical Report, SP98-32, June 1998.
-
(1998)
-
-
Ogata, J.1
Ariki, Y.2
-
171
-
-
0011498043
-
Voice-operated projector using utterance verification and its application to hyper-text generation of lectures
-
April
-
T. Kawahara, K. Ishizuka, and S. Doshita, "Voice-operated projector using utterance verification and its application to hyper-text generation of lectures," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1491-1498, April 1999.
-
(1999)
Trans. Inf. Process. Soc. Japan
, vol.40
, Issue.4
, pp. 1491-1498
-
-
Kawahara, T.1
Ishizuka, K.2
Doshita, S.3
-
172
-
-
0011408733
-
Dealing with out-of-vocabulary words and speech disfluencies in an N-gram based speech understanding system
-
Dec.
-
A. Kai, Y. Hirose, and S. Nakagawa, "Dealing with out-of-vocabulary words and speech disfluencies in an N-gram based speech understanding system," Proc. ICSLP, pp.2427-2430, Dec. 1999.
-
(1999)
Proc. ICSLP
, pp. 2427-2430
-
-
Kai, A.1
Hirose, Y.2
Nakagawa, S.3
-
173
-
-
0001079615
-
A*-admissible key-phrase spotting with sub-syllable level utterance verification
-
B. Chen, H. Wong, L. Chen, and L. Lee, "A*-admissible key-phrase spotting with sub-syllable level utterance verification," Proc. ICSLP, pp.783-786, 1998.
-
(1998)
Proc. ICSLP
, pp. 783-786
-
-
Chen, B.1
Wong, H.2
Chen, L.3
Lee, L.4
-
174
-
-
84902052756
-
A new confidence measure based on rank-ordering subphone scores
-
Q. Lin, S. Das, D. Lubensky, and M. Picheny, "A new confidence measure based on rank-ordering subphone scores," Proc. ICSLP, pp.3249-3252, 1998.
-
(1998)
Proc. ICSLP
, pp. 3249-3252
-
-
Lin, Q.1
Das, S.2
Lubensky, D.3
Picheny, M.4
-
175
-
-
0032091375
-
Text-independent speaker recognition using non-linear frame likelihood transformation
-
K.P. Markov, and S. Nakagawa, "Text-independent speaker recognition using non-linear frame likelihood transformation," Speech Communication, vol.24, pp.193-209, 1998.
-
(1998)
Speech Communication
, vol.24
, pp. 193-209
-
-
Markov, K.P.1
Nakagawa, S.2
-
176
-
-
0011408734
-
Word-based approach to large-vocabulary continuous speech recognition for Japanese
-
April
-
M. Nishimura, N. Itoh, and K. Yamasaki, "Word-based approach to large-vocabulary continuous speech recognition for Japanese," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1395-1403, April 1999.
-
(1999)
Trans. Inf. Process. Soc. Japan
, vol.40
, Issue.4
, pp. 1395-1403
-
-
Nishimura, M.1
Itoh, N.2
Yamasaki, K.3
-
177
-
-
0011450876
-
Unknown utterance rejection using likelihood normalization based on syllable recognition
-
Dec.
-
T. Watanabe and S. Tsukada, "Unknown utterance rejection using likelihood normalization based on syllable recognition," IEICE Trans., vol.J75-D-II, no.12, pp.2002-2009, Dec. 1992.
-
(1992)
IEICE Trans.
, vol.J75-D-II
, Issue.12
, pp. 2002-2009
-
-
Watanabe, T.1
Tsukada, S.2
-
178
-
-
0029323659
-
Relationship among recognition rate, rejection rate and false alarm rate in a spoken word recognition system
-
June
-
A. Kai, and S. Nakagawa, "Relationship among recognition rate, rejection rate and false alarm rate in a spoken word recognition system," IEICE Trans. Inf. & Syst., vol.E78-D, no.6, pp.698-704, June 1995.
-
(1995)
IEICE Trans. Inf. & Syst.
, vol.E78-D
, Issue.6
, pp. 698-704
-
-
Kai, A.1
Nakagawa, S.2
-
179
-
-
0011501675
-
Large vocabulary continuous speech recognition: From laboratory systems towards real-world applications
-
Dec.
-
J.-L. Gauvain and L. Lamel, "Large vocabulary continuous speech recognition: From laboratory systems towards real-world applications," IEICE Trans., vol.J79-D-II, no.12, pp.2005-2021, Dec. 1996.
-
(1996)
IEICE Trans.
, vol.J79-D-II
, Issue.12
, pp. 2005-2021
-
-
Gauvain, J.-L.1
Lamel, L.2
-
180
-
-
4544364908
-
A decoder for broadcast news transcription
-
Autumn
-
T. Imai, K. Onoe, A. Kobayashi, and A. Ando, "A decoder for broadcast news transcription," Acoust. Soc. Japan, pp.105-106, Autumn 1998.
-
(1998)
Acoust. Soc. Japan
, pp. 105-106
-
-
Imai, T.1
Onoe, K.2
Kobayashi, A.3
Ando, A.4
-
181
-
-
0011495826
-
A new computation method of perplexity for text corpus including unknown words
-
Autumn
-
S. Nakagawa and H. Akamatsu, "A new computation method of perplexity for text corpus including unknown words," Conf. Record, Acoust. Soc. Japan, pp.63-64, Autumn 1998.
-
(1998)
Conf. Record, Acoust. Soc. Japan
, pp. 63-64
-
-
Nakagawa, S.1
Akamatsu, H.2
-
182
-
-
0030715922
-
Task adaptation using MAP estimation in N-gram language modeling
-
H. Masataki, Y. Sagisaka, K. Hisaki, and T. Kawahara, "Task adaptation using MAP estimation in N-gram language modeling," Proc. ICASSP, pp.783-786, 1997.
-
(1997)
Proc. ICASSP
, pp. 783-786
-
-
Masataki, H.1
Sagisaka, Y.2
Hisaki, K.3
Kawahara, T.4
-
183
-
-
85009128031
-
Relationship between phoneme recognition performance and word recognition rate
-
May
-
S. Nakagawa, "Relationship between phoneme recognition performance and word recognition rate," Trans. Inf. Process, Japan, vol.22, no.5, pp.488-496, May 1996.
-
(1996)
Trans. Inf. Process, Japan
, vol.22
, Issue.5
, pp. 488-496
-
-
Nakagawa, S.1
-
184
-
-
0011451284
-
Spontaneous speech understanding for a dialogue system
-
M. Hidano, T. Itoh, M. Yamamoto, and S. Nakagawa, "Spontaneous speech understanding for a dialogue system," Proc. ESCA Workshop on Spoken Dialogue Systems, pp.25-28, 1995.
-
(1995)
Proc. ESCA Workshop on Spoken Dialogue Systems
, pp. 25-28
-
-
Hidano, M.1
Itoh, T.2
Yamamoto, M.3
Nakagawa, S.4
-
185
-
-
84989448320
-
Evaluation of FFT cepstrum and LPC cepstrum for speech and speaker recognition
-
Feb.
-
S. Nakagawa and M. Sakamoto, "Evaluation of FFT cepstrum and LPC cepstrum for speech and speaker recognition," IEICE Trans., vol.J66-A, no.2, pp.1199-1206, Feb. 1983.
-
(1983)
IEICE Trans.
, vol.J66-A
, Issue.2
, pp. 1199-1206
-
-
Nakagawa, S.1
Sakamoto, M.2
-
186
-
-
84987195640
-
Perception of vowels and C-V syllables segmented from connected speech
-
May
-
H. Kuwabara and H. Sakai, "Perception of vowels and C-V syllables segmented from connected speech," J. Acoust. Soc. Japan, vol.28, no.5, pp.225-234, May 1972.
-
(1972)
J. Acoust. Soc. Japan
, vol.28
, Issue.5
, pp. 225-234
-
-
Kuwabara, H.1
Sakai, H.2
-
187
-
-
85027151219
-
A study on speech recognition unit based on speech perceptual experiments
-
IEICE Technical Report, SP99-43, July
-
K. Yamamoto and S. Nakagawa, "A study on speech recognition unit based on speech perceptual experiments," IEICE Technical Report, SP99-43, July 1999.
-
(1999)
-
-
Yamamoto, K.1
Nakagawa, S.2
-
188
-
-
0011501282
-
Toward spoken language understanding from speech recognition
-
Nov.
-
S. Nakagawa, "Toward spoken language understanding from speech recognition," J. Acoust. Soc. Japan, vol.52, no.11, pp.859-856, Nov. 1996.
-
(1996)
J. Acoust. Soc. Japan
, vol.52
, Issue.11
, pp. 859-856
-
-
Nakagawa, S.1
-
189
-
-
0011403914
-
Evaluation of auditory front-ends in DTW word recognition system
-
June
-
K. Obara and T. Hirahara, "Evaluation of auditory front-ends in DTW word recognition system," J. Acoust. Soc. Japan, vol.50, no.6, pp.452-464, June 1994.
-
(1994)
J. Acoust. Soc. Japan
, vol.50
, Issue.6
, pp. 452-464
-
-
Obara, K.1
Hirahara, T.2
-
190
-
-
0032677422
-
Recent experiments in large vocabulary conversational speech recognition
-
J. Billa, T. Colthurst, A. El-Jaroudi, R. Iyer, K. Ma, S. Matsoukas, C. Quillen, F. Richardson, M. Siu, G. Zavaliagkos, and H. Gish, "Recent experiments in large vocabulary conversational speech recognition," Proc. ICASSP, pp.41-44, 1999.
-
(1999)
Proc. ICASSP
, pp. 41-44
-
-
Billa, J.1
Colthurst, T.2
El-Jaroudi, A.3
Iyer, R.4
Ma, K.5
Matsoukas, S.6
Quillen, C.7
Richardson, F.8
Siu, M.9
Zavaliagkos, G.10
Gish, H.11
-
191
-
-
0031643048
-
Multiresolution cepstral features for phoneme recognition across speech sub-bands
-
P. McCourt, S. Vaseghi, and N. Harte, "Multiresolution cepstral features for phoneme recognition across speech sub-bands," Proc. ICASSP, pp.557-560, 1998.
-
(1998)
Proc. ICASSP
, pp. 557-560
-
-
McCourt, P.1
Vaseghi, S.2
Harte, N.3
-
192
-
-
0032654472
-
Channel and noise adaptation via HMM mixture mean transform and stochastic matching
-
S. Kong and B. Shi, "Channel and noise adaptation via HMM mixture mean transform and stochastic matching," Proc. ICASSP, pp. 301-304, 1999.
-
(1999)
Proc. ICASSP
, pp. 301-304
-
-
Kong, S.1
Shi, B.2
-
193
-
-
0025388113
-
A linear predictive HMM for vector valued observation with application to speech recognition
-
P. Kenny, M. Lennig, and P. Mermelstein, "A linear predictive HMM for vector valued observation with application to speech recognition," IEEE Trans. Acoust. Speech & Signal Process., vol.38, no.1, pp.220-225, 1990.
-
(1990)
IEEE Trans. Acoust. Speech & Signal Process.
, vol.38
, Issue.1
, pp. 220-225
-
-
Kenny, P.1
Lennig, M.2
Mermelstein, P.3
-
194
-
-
0011406323
-
Proposal of a stochastic context-free grammar for continuous observation vector sequences
-
Spring
-
S. Nakagawa, "Proposal of a stochastic context-free grammar for continuous observation vector sequences," Conf. Record, pp.73-74, Spring 1992.
-
(1992)
Conf. Record
, pp. 73-74
-
-
Nakagawa, S.1
-
195
-
-
0026171582
-
Application of the Gibbs distribution to hidden Markov modeling in speaker independent isolated word recognition
-
Y. Zhao, L.E. Atlas, and X. Zhuang, "Application of the Gibbs distribution to hidden Markov modeling in speaker independent isolated word recognition," IEEE Trans. Signal Process., vol.39, no.6, pp.1291-1298, 1991.
-
(1991)
IEEE Trans. Signal Process.
, vol.39
, Issue.6
, pp. 1291-1298
-
-
Zhao, Y.1
Atlas, L.E.2
Zhuang, X.3
-
196
-
-
0011411822
-
Probabilistic modeling with Bayesian networks for automatic speech recognition
-
G. Zweig and S. Russell, "Probabilistic modeling with Bayesian networks for automatic speech recognition," Proc. ICSLP, pp.3011-3014, 1998.
-
(1998)
Proc. ICSLP
, pp. 3011-3014
-
-
Zweig, G.1
Russell, S.2
-
197
-
-
0029325616
-
A comparative study of output probability functions in HMMs
-
June
-
S. Nakagawa, L. Zhao, and H. Suzuki, "A comparative study of output probability functions in HMMs," IEICE Trans. Inf. & Syst., vol.E78-D, no.6, pp.669-675, June 1995.
-
(1995)
IEICE Trans. Inf. & Syst.
, vol.E78-D
, Issue.6
, pp. 669-675
-
-
Nakagawa, S.1
Zhao, L.2
Suzuki, H.3
-
198
-
-
85009181766
-
Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees
-
H. Singer and A. Nakamura, "Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees," Proc. EuroSpeech, pp.1355-1358, 1999.
-
(1999)
Proc. EuroSpeech
, pp. 1355-1358
-
-
Singer, H.1
Nakamura, A.2
-
199
-
-
85027098626
-
Learning and normalizing of the talker differences in the recognition of spoken words
-
Technical Report, Acoust. Soc. Japan, SP75-25, Nov.
-
S. Furui, "Learning and normalizing of the talker differences in the recognition of spoken words," Technical Report, Acoust. Soc. Japan, SP75-25, Nov. 1975.
-
(1975)
-
-
Furui, S.1
-
200
-
-
0017961869
-
A real time spoken word recognition system with various learning capabilities of the speaker differences
-
Scripta Publishing Co.
-
S. Nakagawa and T. Sakai, "A real time spoken word recognition system with various learning capabilities of the speaker differences," Syst. Comp. Controls, vol.9, no.3, pp.63-71, Scripta Publishing Co., 1978.
-
(1978)
Syst. Comp. Controls
, vol.9
, Issue.3
, pp. 63-71
-
-
Nakagawa, S.1
Sakai, T.2
-
201
-
-
85009195509
-
A missing-word test comparison of human and statistical language model performance
-
M. Owens, A. Kruger, P. Donnelly, F.J. Smith, and J. Ming, "A missing-word test comparison of human and statistical language model performance," Proc. EuroSpeech, pp.145-148, 1999.
-
(1999)
Proc. EuroSpeech
, pp. 145-148
-
-
Owens, M.1
Kruger, A.2
Donnelly, P.3
Smith, F.J.4
Ming, J.5
-
202
-
-
0011400318
-
Robust language modeling for small corpus of target task using class combined word statistics and selective use of general corpus
-
Nov.
-
Y. Wada, N. Kobayashi, and T. Kobayashi, "Robust language modeling for small corpus of target task using class combined word statistics and selective use of general corpus," IEICE Trans., vol.J83-D-II, no.11, pp.2397-2406, Nov. 2000.
-
(2000)
IEICE Trans.
, vol.J83-D-II
, Issue.11
, pp. 2397-2406
-
-
Wada, Y.1
Kobayashi, N.2
Kobayashi, T.3
-
203
-
-
0011404834
-
Part-of-speech N-gram and word N-gram fused language model
-
H. Yamamoto and Y. Sagisaka, "Part-of-speech N-gram and word N-gram fused language model," Proc. Euro-Speech, pp.1803-1806, 1999.
-
(1999)
Proc. Euro-Speech
, pp. 1803-1806
-
-
Yamamoto, H.1
Sagisaka, Y.2
-
204
-
-
0030719155
-
A word graph algorithm for large vocabulary continuous speech recognition
-
S. Ortmanns, H. Ney, and X. Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Computer Speech and Language, vol.11, pp.43-72, 1997.
-
(1997)
Computer Speech and Language
, vol.11
, pp. 43-72
-
-
Ortmanns, S.1
Ney, H.2
Aubert, X.3
-
205
-
-
0001100613
-
A study on a phoneme-graph-based hypothesis restriction for large vocabulary continuous speech recognition
-
April
-
T. Hori, N. Oka, M. Katoho, A. Itoh, and M. Kohda, "A study on a phoneme-graph-based hypothesis restriction for large vocabulary continuous speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1365-1373, April 1999.
-
(1999)
Trans. Inf. Process. Soc. Japan
, vol.40
, Issue.4
, pp. 1365-1373
-
-
Hori, T.1
Oka, N.2
Katoho, M.3
Itoh, A.4
Kohda, M.5
-
206
-
-
29144491321
-
Large vocabulary continuous speech recognition based on multi-pass search using word trellis index
-
Jan.
-
A. Lee, T. Kawahara, and S. Doshita, "Large vocabulary continuous speech recognition based on multi-pass search using word trellis index," IEICE Trans., vol.J82-D, no.1, pp.1-9, Jan. 1999.
-
(1999)
IEICE Trans.
, vol.J82-D
, Issue.1
, pp. 1-9
-
-
Lee, A.1
Kawahara, T.2
Doshita, S.3
-
207
-
-
85027104898
-
Some problems on automatic speech recognition
-
IEICE Technical Report, SP99-93, Dec.
-
S. Nakagawa, "Some problems on automatic speech recognition," IEICE Technical Report, SP99-93, Dec. 1999.
-
(1999)
-
-
Nakagawa, S.1
-
208
-
-
0031619371
-
Balancing acoustic and linguistic probabilities
-
A. Ogawa, K. Takeda, and F. Itakura, "Balancing acoustic and linguistic probabilities," Proc. ICASSP, pp.181-184, 1998.
-
(1998)
Proc. ICASSP
, pp. 181-184
-
-
Ogawa, A.1
Takeda, K.2
Itakura, F.3
-
209
-
-
0032649321
-
Partly hidden Markov model and its application to speech recognition
-
T. Kobayashi, J. Furuyama, and K. Masumitsu, "Partly hidden Markov model and its application to speech recognition," Proc. ICASSP, pp.121-124, 1999.
-
(1999)
Proc. ICASSP
, pp. 121-124
-
-
Kobayashi, T.1
Furuyama, J.2
Masumitsu, K.3
-
210
-
-
0011451288
-
Comparison of SCFG and HMM based speaker independent spoken digit recognition
-
Dec.
-
M. Zhou and S. Nakagawa, "Comparison of SCFG and HMM based speaker independent spoken digit recognition," Proc. Int. Workshop on Automatic Speech Recognition, pp.30-31, Dec. 1993.
-
(1993)
Proc. Int. Workshop on Automatic Speech Recognition
, pp. 30-31
-
-
Zhou, M.1
Nakagawa, S.2
-
211
-
-
85032751521
-
Dynamic programming search for continuous speech recognition
-
Sept.
-
H. Ney and S. Ortmanns, "Dynamic programming search for continuous speech recognition," IEEE Signal Process. Mag., pp.64-82, Sept. 1999.
-
(1999)
IEEE Signal Process. Mag.
, pp. 64-82
-
-
Ney, H.1
Ortmanns, S.2
-
212
-
-
85032751683
-
Hierarchical search for large-vocabulary conversational speech recognition
-
Sept.
-
N. Deshmukh, A. Ganapathiraju, and J. Picone, "Hierarchical search for large-vocabulary conversational speech recognition," IEEE Signal Process. Mag., pp.84-107, Sept. 1999.
-
(1999)
IEEE Signal Process. Mag.
, pp. 84-107
-
-
Deshmukh, N.1
Ganapathiraju, A.2
Picone, J.3
-
216
-
-
85007838242
-
Pitch dependent phone modeling for HMM-based speech recognition
-
H. Singer and S. Sagayama, "Pitch dependent phone modeling for HMM-based speech recognition," J. Acoust. Soc. Japan, (E), vol.15, no.2, pp.77-86, 1994.
-
(1994)
J. Acoust. Soc. Japan, (E)
, vol.15
, Issue.2
, pp. 77-86
-
-
Singer, H.1
Sagayama, S.2
-
217
-
-
85067723733
-
Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing
-
Dec.
-
N. Minematsu and S. Nakagawa, "Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing," Proc. ICSLP, pp.2427-2430, Dec. 1998.
-
(1998)
Proc. ICSLP
, pp. 2427-2430
-
-
Minematsu, N.1
Nakagawa, S.2
-
218
-
-
0028392167
-
An application of recurrent nets to phone probability estimation
-
A.J. Robinson, "An application of recurrent nets to phone probability estimation," IEEE Trans. Neural Networks, vol.5, no.2, pp.298-304, 1994.
-
(1994)
IEEE Trans. Neural Networks
, vol.5
, Issue.2
, pp. 298-304
-
-
Robinson, A.J.1
-
219
-
-
0011403725
-
Speech understanding and language model
-
Nov.
-
S. Nakagawa, "Speech understanding and language model," J. Signal Process., vol.2, no.6, pp.434-442, Nov. 1998.
-
(1998)
J. Signal Process.
, vol.2
, Issue.6
, pp. 434-442
-
-
Nakagawa, S.1
-
220
-
-
0011450878
-
Introduction to the special issue-some research problems on spoken dialogue systems
-
Nov.
-
S. Nakagawa, "Introduction to the special issue-some research problems on spoken dialogue systems," J. Acoust. Soc. Japan, vol.54, no.11, pp.783-790, Nov. 1998.
-
(1998)
J. Acoust. Soc. Japan
, vol.54
, Issue.11
, pp. 783-790
-
-
Nakagawa, S.1
-
222
-
-
85027158035
-
HMM-based speaker recognition
-
IEICE Technical Report, SP95-111, Jan.
-
T. Matsui, "HMM-based speaker recognition," IEICE Technical Report, SP95-111, Jan. 1996.
-
(1996)
-
-
Matsui, T.1
-
223
-
-
0031233424
-
Speaker recognition: A tutorial
-
J.P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol.85, no.9, pp.1437-1462, 1997.
-
(1997)
Proc. IEEE
, vol.85
, Issue.9
, pp. 1437-1462
-
-
Campbell, J.P.1
-
236
-
-
84944486544
-
Prediction and entropy of printed English
-
C.E. Shannon, "Prediction and entropy of printed English," Bell System Tech. J., vol.30, pp.50-64, 1951.
-
(1951)
Bell System Tech. J.
, vol.30
, pp. 50-64
-
-
Shannon, C.E.1
-
237
-
-
0017994420
-
A convergent gambling estimate of the entropy of English
-
T.M. Cover and R.C. King, "A convergent gambling estimate of the entropy of English," IEEE Trans. Inf. Theory, vol.24, no.4, pp.413-421, 1978.
-
(1978)
IEEE Trans. Inf. Theory
, vol.24
, Issue.4
, pp. 413-421
-
-
Cover, T.M.1
King, R.C.2