SCOPUS information retrieval platform

IEICE Transactions on Information and Systems

Volume E85-D, Issue 3, 2002, Pages 465-486

A survey on automatic speech recognition

a TOYOHASHI UNIVERSITY OF TECHNOLOGY (Japan)

Author keywords

Acoustic model; HMM; Language model; Ngram; Speech recognition

Indexed keywords

ACOUSTIC NOISE; ARTIFICIAL INTELLIGENCE; INFORMATION THEORY; LINGUISTICS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH ANALYSIS; SPEECH CODING; SPEECH SYNTHESIS;

ACOUSTIC MODELS; AUTOMATIC SPEECH RECOGNITION; HIDDEN MARKOV MODEL;

SPEECH RECOGNITION;

EID: 0036522866 PISSN: 09168532 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (16)

References (237)

1
- 0030718943
- Multilingual large vocabulary speech recognition in the European SQALE project
- S.J. Young, M. Adda-Decker, X. Aubert, C. Dugast, J.L. Gauvain, D.J. Kershaw, L. Lamel, and D.A. Leeuwen, "Multilingual large vocabulary speech recognition in the European SQALE project," Computer Speech & Language, vol.11, pp.73-89, 1997.
- (1997) Computer Speech & Language , vol.11 , pp. 73-89
- Young, S.J.¹ Adda-Decker, M.² Aubert, X.³ Dugast, C.⁴ Gauvain, J.L.⁵ Kershaw, D.J.⁶ Lamel, L.⁷ Leeuwen, D.A.⁸

2
- 0031187171
- Speech recognition by machine and humans
- R.P. Lippmann, "Speech recognition by machine and humans," Speech Communication, vol.22, pp.1-15, 1997.
- (1997) Speech Communication , vol.22 , pp. 1-15
- Lippmann, R.P.¹

3
- 0011495813
- Kindai Kagakusha
- S. Nakagawa, Information Theory: Fundamental and application, Kindai Kagakusha, 1992.
- (1992) Information Theory: Fundamental and Application
- Nakagawa, S.¹

4
- 0011510426
- Capabilities and limitations of stochastic language models
- March
- S. Nakagawa, "Capabilities and limitations of stochastic language models," Conf. Record, Acoust. Soc. Japan, pp.23-26, March 1998.
- (1998) Conf. Record, Acoust. Soc. Japan , pp. 23-26
- Nakagawa, S.¹

5
- 0011458455
- Relationship among perplexity, word accuracy and phoneme accuracy, and drawback and modification of perplexity
- S. Nakagawa, "Relationship among perplexity, word accuracy and phoneme accuracy, and drawback and modification of perplexity," Proc. First Int. Workshop East Asian Language Resources and Evaluation, pp.123-128, 1998.
- (1998) Proc. First Int. Workshop East Asian Language Resources and Evaluation , pp. 123-128
- Nakagawa, S.¹

6
- 0011450087
- Robust speech recognition using HMM's with Toeplitz state covariance matrices
- W.J.J. Roberts and Y. Ephraim, "Robust speech recognition using HMM's with Toeplitz state covariance matrices," Proc. ICSLP, pp.369-372, 1998.
- (1998) Proc. ICSLP , pp. 369-372
- Roberts, W.J.J.¹ Ephraim, Y.²

7
- 0003303280
- Speech recognition based on stochastic models
- S. Nakagawa, "Speech recognition based on stochastic models," Inst. Elect. Inf. Comm. Engrs., 1988.
- (1988) Inst. Elect. Inf. Comm. Engrs.
- Nakagawa, S.¹

8
- 0003563803
- IOS Press
- S. Nakagawa, K. Shikano, and Y. Tohkura, Speech, Hearing and Neural Network Model, IOS Press, 1995.
- (1995) Speech, Hearing and Neural Network Model
- Nakagawa, S.¹ Shikano, K.² Tohkura, Y.³

9
- 0011449585
- Iwanami-shoten
- N. Takubo, K. Maekawa, Y. Kubozono, K. Honda, K. Shirai, and S. Nakagawa, Speech, Iwanami-shoten, 1998.
- (1998) Speech
- Takubo, N.¹ Maekawa, K.² Kubozono, Y.³ Honda, K.⁴ Shirai, K.⁵ Nakagawa, S.⁶

10
- 0011499173
- Maruzen
- S. Nakagawa, Pattern Information Processing, Maruzen, 1999.
- (1999) Pattern Information Processing
- Nakagawa, S.¹

11
- 85009114626
- Relationship among speaking style, inter-phoneme's distance and speech recognition performance
- K. Yamamoto and S. Nakagawa, "Relationship among speaking style, inter-phoneme's distance and speech recognition performance," Proc. ICSLP, pp.859-862, 2000.
- (2000) Proc. ICSLP , pp. 859-862
- Yamamoto, K.¹ Nakagawa, S.²

12
- 0000940883
- Acoustic signal processing techniques for robust speech recognition
- S. Nakagawa, "Acoustic signal processing techniques for robust speech recognition," J. Acoust. Soc. Japan, vol.53, no.11, pp.864-871, 1997.
- (1997) J. Acoust. Soc. Japan , vol.53 , Issue.11 , pp. 864-871
- Nakagawa, S.¹

13
- 0030779363
- Noise compensation methods for hidden Markov model speech recognition in adverse environments
- S.V. Vaseghi and B.P. Milner, "Noise compensation methods for hidden Markov model speech recognition in adverse environments," IEEE Trans. Speech Audio Process., vol.5, no.1, pp.11-21, 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.1 , pp. 11-21
- Vaseghi, S.V.¹ Milner, B.P.²

14
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- R.P. Lippmann, E.A. Martin, and D.B. Paul, "Multi-style training for robust isolated-word speech recognition," Proc. ICASSP, pp.705-708, 1987.
- (1987) Proc. ICASSP , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

15
- 0022181749
- Some acoustic-phonetic correlates of speech produced in noise
- D. Pisoni, R. Bernacki, H. Nusbaum, and M. Yuchtman, "Some acoustic-phonetic correlates of speech produced in noise," Proc. ICASSP, pp.1581-1584, 1985.
- (1985) Proc. ICASSP , pp. 1581-1584
- Pisoni, D.¹ Bernacki, R.² Nusbaum, H.³ Yuchtman, M.⁴

16
- 0011496722
- Normalizing Lombard speech under different conditions
- July
- A. Wakao, K. Takeda, and F. Itakura, "Normalizing Lombard speech under different conditions," IEICE Trans., vol.J80-D-II, no.7, pp.1643-1650, July 1997.
- (1997) IEICE Trans. , vol.J80-D-II , Issue.7 , pp. 1643-1650
- Wakao, A.¹ Takeda, K.² Itakura, F.³

17
- 0029345416
- A comparison of signal processing front ends for automatic word recognition
- C.R. Jankowski, H.-D.H. Vo, and R.P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech & Audio Process., vol.3, no.4, pp.286-292, 1995.
- (1995) IEEE Trans. Speech & Audio Process. , vol.3 , Issue.4 , pp. 286-292
- Jankowski, C.R.¹ Vo, H.-D.H.² Lippmann, R.P.³

18
- 0003770709
- Kluwer Academic Pub., Dordrecht
- J.-C. Junqua and J.P. Haton, Robustness in Automatic Speech Recognition, Kluwer Academic Pub., Dordrecht, 1996.
- (1996) Robustness in Automatic Speech Recognition
- Junqua, J.-C.¹ Haton, J.P.²

19
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol.2, pp.578-589, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

20
- 0022667694
- Speaker independent isolated word recognition using dynamic features of speech spectrum
- S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech & Signal Process., vol.34, no.1, pp.52-59, 1986.
- (1986) IEEE Trans. Acoust. Speech & Signal Process. , vol.34 , Issue.1 , pp. 52-59
- Furui, S.¹

21
- 0032676337
- On the relative importance of various components of the modulation spectrum for automatic speech recognition
- N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Communication, vol.28, pp.43-55, 1999.
- (1999) Speech Communication , vol.28 , pp. 43-55
- Kanedera, N.¹ Arai, T.² Hermansky, H.³ Pavel, M.⁴

22
- 0031221099
- Filtering the time sequences of spectral parameters for speech recognition
- C. Nadeu, P.P. Leal, and B.-H. Juang, "Filtering the time sequences of spectral parameters for speech recognition," Speech Communication, vol.22, pp.315-332, 1997.
- (1997) Speech Communication , vol.22 , pp. 315-332
- Nadeu, C.¹ Leal, P.P.² Juang, B.-H.³

23
- 0011468569
- An evaluation of mel-LPC cepstrum in noisy speech recognition
- Y. Nakatoh and H. Matsumoto, "An evaluation of mel-LPC cepstrum in noisy speech recognition," Conf. Record, Acoust. Soc. Japan, pp.23-24, 1999.
- (1999) Conf. Record, Acoust. Soc. Japan , pp. 23-24
- Nakatoh, Y.¹ Matsumoto, H.²

24
- 0032761999
- Scale transform in speech analysis
- S. Umesh, L. Cohen, N. Marinovic, and D.J. Nelson, "Scale transform in speech analysis," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.40-45, 1999.
- (1999) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.1 , pp. 40-45
- Umesh, S.¹ Cohen, L.² Marinovic, N.³ Nelson, D.J.⁴

25
- 0011498037
- A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition
- J. Chen, B. Xu, and T. Huang, "A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition," Proc. ICASSP, pp.629-632, 1998.
- (1998) Proc. ICASSP , pp. 629-632
- Chen, J.¹ Xu, B.² Huang, T.³

26
- 0031176764
- Hidden Markov model-based speech recognition with intermediate wavelet transform domains
- R. Singh, K. Davis, and P.V.S. Rao, "Hidden Markov model-based speech recognition with intermediate wavelet transform domains," Computer Speech and Language, vol.11, pp.252-273, 1997.
- (1997) Computer Speech and Language , vol.11 , pp. 252-273
- Singh, R.¹ Davis, K.² Rao, P.V.S.³

27
- 0026189808
- Speech recognition in adverse environments
- B.H. Juang, "Speech recognition in adverse environments," Computer Speech Language, vol.5, pp.275-294, 1991.
- (1991) Computer Speech Language , vol.5 , pp. 275-294
- Juang, B.H.¹

28
- 33947656987
- Speech recognition in noise using a projection based likelihood measure for mixture density HMM's
- B.A. Carlson and M.A. Clements, "Speech recognition in noise using a projection based likelihood measure for mixture density HMM's," Proc. ICASSP, vol.I, pp.237-240, 1992.
- (1992) Proc. ICASSP , vol.1 , pp. 237-240
- Carlson, B.A.¹ Clements, M.A.²

29
- 0032116602
- A novel projection-based likelihood measure for noisy speech recognition
- J.-T. Chien, H.-C. Wang, and L.-M. Lee, "A novel projection-based likelihood measure for noisy speech recognition," Speech Communication, vol.24, pp.287-297, 1998.
- (1998) Speech Communication , vol.24 , pp. 287-297
- Chien, J.-T.¹ Wang, H.-C.² Lee, L.-M.³

30
- 0032203256
- Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method
- S. Katagiri, B.-H. Juang, and C.-H. Lee, "Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method," Proc. IEEE, vol.86, no.11, pp.2345-2372, 1998.
- (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2345-2372
- Katagiri, S.¹ Juang, B.-H.² Lee, C.-H.³

31
- 0029723602
- Discriminative feature extraction to filter design
- A. Biem, E. McDermott, and S. Katagiri, "Discriminative feature extraction to filter design," Proc. IEEE Workshop Neural Networks for Signal Processing, vol.IV, pp.273-282, 1996.
- (1996) Proc. IEEE Workshop Neural Networks for Signal Processing , vol.4 , pp. 273-282
- Biem, A.¹ McDermott, E.² Katagiri, S.³

32
- 0001286647
- Minimum classification error training algorithm for feature extractor and pattern classification in speech recognition
- K.K. Paliwal, M. Bacchiani, and Y. Sagisaka, "Minimum classification error training algorithm for feature extractor and pattern classification in speech recognition," Proc. EuroSpeech, pp.541-545, 1995.
- (1995) Proc. EuroSpeech , pp. 541-545
- Paliwal, K.K.¹ Bacchiani, M.² Sagisaka, Y.³

33
- 0032674196
- Feature extraction for speech recognition based on orthogonal acoustic-feature planes and LDA
- T. Nitta, "Feature extraction for speech recognition based on orthogonal acoustic-feature planes and LDA," Proc. ICASSP, pp.421-424, 1999.
- (1999) Proc. ICASSP , pp. 421-424
- Nitta, T.¹

34
- 84893207073
- Continuous speech recognition in noise using spectral subtraction and HMM adaptation
- J.A.N. Flores and S.J. Young, "Continuous speech recognition in noise using spectral subtraction and HMM adaptation," Proc. ICASSP, vol.I, pp.409-412, 1994.
- (1994) Proc. ICASSP , vol.1 , pp. 409-412
- Flores, J.A.N.¹ Young, S.J.²

35
- 11044237174
- An evaluation of speech enhancement approach E-CMN/CSS for speech recognition
- Jan.
- M. Shozakai, S. Nakamura, and K. Shikano, "An evaluation of speech enhancement approach E-CMN/CSS for speech recognition," IEICE Trans., vol.J81-D, no.1, pp.1-9, Jan. 1998.
- (1998) IEICE Trans. , vol.J81-D , Issue.1 , pp. 1-9
- Shozakai, M.¹ Nakamura, S.² Shikano, K.³

36
- 0026882842
- Experiments with a nonlinear spectral subtractor (NSS), hidden Markov model and the projection, for robust speech recognition in cars
- P. Lockwood and J. Boudy, "Experiments with a nonlinear spectral subtractor (NSS), hidden Markov model and the projection, for robust speech recognition in cars," Speech Communication, vol.11, pp.215-228, 1992.
- (1992) Speech Communication , vol.11 , pp. 215-228
- Lockwood, P.¹ Boudy, J.²

37
- 0030711159
- Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification
- D. Hardt and K. Fellbaum, "Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification," Proc. ICASSP, pp.867-870, 1997.
- (1997) Proc. ICASSP , pp. 867-870
- Hardt, D.¹ Fellbaum, K.²

38
- 0011498039
- A smoothing method of time direction on speech recognition under noisy environments using spectral subtraction
- N. Kitaoka, I. Akahori, and S. Nakagawa, "A smoothing method of time direction on speech recognition under noisy environments using spectral subtraction," Proc. Int. Conf. Speech Processing, pp.381-386, 1999.
- (1999) Proc. Int. Conf. Speech Processing , pp. 381-386
- Kitaoka, N.¹ Akahori, I.² Nakagawa, S.³

39
- 0011464161
- Improved robust speech recognition considering signal correlation approximated by Taylor series
- J.-L. Shen, J.-W. Hung, and L.-S. Lee, "Improved robust speech recognition considering signal correlation approximated by Taylor series," Proc. ICSLP, pp.1499-1502, 1998.
- (1998) Proc. ICSLP , pp. 1499-1502
- Shen, J.-L.¹ Hung, J.-W.² Lee, L.-S.³

40
- 0025681008
- Hidden Markov model decomposition of speech and noise
- A.P. Varga and R.K. Moore, "Hidden Markov model decomposition of speech and noise," Proc. ICASSP, pp.845-848, 1990.
- (1990) Proc. ICASSP , pp. 845-848
- Varga, A.P.¹ Moore, R.K.²

41
- 0027622731
- Cepstral parameter compensation for HMM recognition in noise
- M.J.F. Gales and S.J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Communication, vol.12, pp.231-239, 1993.
- (1993) Speech Communication , vol.12 , pp. 231-239
- Gales, M.J.F.¹ Young, S.J.²

42
- 0030245128
- Robust continuous speech recognition using parallel model combination
- M.J.F. Gales and S.J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech & Audio Process., vol.4, pp.352-359, 1996.
- (1996) IEEE Trans. Speech & Audio Process. , vol.4 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

43
- 0003524869
- Recognition of noisy speech by composition of hidden Markov models
- IEICE Technical Report, SP92-96
- F. Martin, K. Shikano, Y. Minami, and Y. Okabe, "Recognition of noisy speech by composition of hidden Markov models," IEICE Technical Report, SP92-96, 1992.
- (1992)
- Martin, F.¹ Shikano, K.² Minami, Y.³ Okabe, Y.⁴

44
- 0011400310
- Robust HMM to variation of noisy environments based on variance extension of noisy models
- H. Matsumoto and H. Ubukata, "Robust HMM to variation of noisy environments based on variance extension of noisy models," Proc. EuroSpeech, pp.2387-2390, 1999.
- (1999) Proc. EuroSpeech , pp. 2387-2390
- Matsumoto, H.¹ Ubukata, H.²

45
- 0032623471
- Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences
- K.H. You and H.-C. Wang, "Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences," Speech Communication, vol.28, pp.13-24, 1999.
- (1999) Speech Communication , vol.28 , pp. 13-24
- You, K.H.¹ Wang, H.-C.²

46
- 0011448901
- HMM composition of segmental unit input HMM for noisy speech recognition
- K. Yamamoto and S. Nakagawa, "HMM composition of segmental unit input HMM for noisy speech recognition," Proc. EuroSpeech, pp.2865-2868, 1999.
- (1999) Proc. EuroSpeech , pp. 2865-2868
- Yamamoto, K.¹ Nakagawa, S.²

47
- 0011406317
- Difference in speech recognition performance caused by difference in front-end devices and its compensations
- K. Yamamoto and S. Nakagawa, "Difference in speech recognition performance caused by difference in front-end devices and its compensations," Proc. 7th Western Pacific Regional Acoust. Conf., pp.85-88, 2000.
- (2000) Proc. 7th Western Pacific Regional Acoust. Conf. , pp. 85-88
- Yamamoto, K.¹ Nakagawa, S.²

48
- 0011501273
- Real-time cepstrum mean subtraction using the most likely partial state sequence
- March
- S. Kuroiwa, T. Kato, and N. Higuchi, "Real-time cepstrum mean subtraction using the most likely partial state sequence," IEICE Trans., vol.J82-D-II, no.3, pp.332-339, March 1999.
- (1999) IEICE Trans. , vol.J82-D-II , Issue.3 , pp. 332-339
- Kuroiwa, S.¹ Kato, T.² Higuchi, N.³

49
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C.H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.190-202, 1996.
- (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.5 , pp. 190-202
- Sankar, A.¹ Lee, C.H.²

50
- 0029369804
- Rapid environment adaptation for speech recognition
- K. Takagi, H. Hattori, and T. Watanabe, "Rapid environment adaptation for speech recognition," J. Acoust. Soc. Japan, (E), vol.16, no.5, pp.273-281, 1995.
- (1995) J. Acoust. Soc. Japan, (E) , vol.16 , Issue.5 , pp. 273-281
- Takagi, K.¹ Hattori, H.² Watanabe, T.³

51
- 0011510430
- An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation
- Y. Tsurumi and S. Nakagawa, "An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation," Proc. ICSLP, pp.431-434, 1994.
- (1994) Proc. ICSLP , pp. 431-434
- Tsurumi, Y.¹ Nakagawa, S.²

52
- 0011410507
- Acoustical and environmental robustness
- Kluwer Academic Pub., Dordrecht
- A. Acero, "Acoustical and Environmental Robustness," in Automatic Speech Recognition, Kluwer Academic Pub., Dordrecht, 1993.
- (1993) Automatic Speech Recognition
- Acero, A.¹

53
- 0032116601
- Data-driven environmental compensation for speech recognition: A unified approach
- P.J. Moreno, B. Raj, and R.M. Stern, "Data-driven environmental compensation for speech recognition: A unified approach," Speech Communication, vol.24, pp.267-285, 1998.
- (1998) Speech Communication , vol.24 , pp. 267-285
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

54
- 0029725301
- A vector Taylor series approach for environment-independent speech recognition
- P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," Proc. ICASSP, pp.733-736, 1996.
- (1996) Proc. ICASSP , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

55
- 0032048385
- Speech recognition in noisy environments using first-order vector Taylor series
- D.Y. Kim, C.K. Un, and N.S. Kim, "Speech recognition in noisy environments using first-order vector Taylor series," Speech Communication, vol.24, no.1, pp.39-49, 1998.
- (1998) Speech Communication , vol.24 , Issue.1 , pp. 39-49
- Kim, D.Y.¹ Un, C.K.² Kim, N.S.³

56
- 0011496725
- HMM adaptation method for noise and distortion by maximizing likelihood
- July
- Y. Minami and S. Furui, "HMM adaptation method for noise and distortion by maximizing likelihood," IEICE Trans., vol.J80-A, no.7, pp.1179-1186, July 1997.
- (1997) IEICE Trans. , vol.J80-A , Issue.7 , pp. 1179-1186
- Minami, Y.¹ Furui, S.²

57
- 0032203405
- A general joint additive and convolutive bias approach applied to noisy Lombard speech recognition
- M. Afify, Y. Gong, and J.P. Haton, "A general joint additive and convolutive bias approach applied to noisy Lombard speech recognition," IEEE Trans. Speech & Audio Process., vol.6, no.6, pp.524-537, 1998.
- (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.6 , pp. 524-537
- Afify, M.¹ Gong, Y.² Haton, J.P.³

58
- 0035249243
- HMM-separation-based speech recognition for a distant moving speaker
- T. Takiguchi, S. Nakamura, and K. Shikano, "HMM-separation-based speech recognition for a distant moving speaker," IEEE Trans. Speech & Audio Process., vol.9, no.3, pp.127-140, 2001.
- (2001) IEEE Trans. Speech & Audio Process. , vol.9 , Issue.3 , pp. 127-140
- Takiguchi, T.¹ Nakamura, S.² Shikano, K.³

59
- 0032139769
- Automatic segmentation of speech recorded in unknown noisy channel characteristics
- B.L. Pellom and J.H.L. Hansen, "Automatic segmentation of speech recorded in unknown noisy channel characteristics," Speech Communication, vol.25, no.1-3, pp.97-116, 1998.
- (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 97-116
- Pellom, B.L.¹ Hansen, J.H.L.²

60
- 0011448902
- Japanese phoneme recognition using continuous parameter hidden Markov models
- June
- S. Nakagawa, Y. Hirata, and Y. Hashimoto, "Japanese phoneme recognition using continuous parameter hidden Markov models," J. Acoust. Soc. Japan, vol.46, no.6, pp.486-496, June 1990.
- (1990) J. Acoust. Soc. Japan , vol.46 , Issue.6 , pp. 486-496
- Nakagawa, S.¹ Hirata, Y.² Hashimoto, Y.³

61
- 0023211284
- Integration of acoustic information in a large vocabulary word recognizer
- V.N. Gupta, M. Lennig, and P. Mermelstein, "Integration of acoustic information in a large vocabulary word recognizer," ICASSP, vol.II, pp.697-700, 1987.
- (1987) ICASSP , vol.2 , pp. 697-700
- Gupta, V.N.¹ Lennig, M.² Mermelstein, P.³

62
- 20344368952
- Hidden Markov model embedded dynamic features of speech spectrum
- Feb.
- E. Tsuboka and J. Nakahashi, "Hidden Markov model embedded dynamic features of speech spectrum," IEICE Trans., vol.J77-A, no.2, pp.162-172, Feb. 1994.
- (1994) IEICE Trans. , vol.J77-A , Issue.2 , pp. 162-172
- Tsuboka, E.¹ Nakahashi, J.²

63
- 0029325484
- Neural predictive hidden Markov model for speech recognition
- June
- E. Tsuboka and Y. Takada, "Neural predictive hidden Markov model for speech recognition," IEICE Trans., Inf. & Syst., vol.E78-D, no.6, pp.676-684, June 1995.
- (1995) IEICE Trans., Inf. & Syst. , vol.E78-D , Issue.6 , pp. 676-684
- Tsuboka, E.¹ Takada, Y.²

64
- 84911676598
- Linear and nonlinear prediction for speech recognition with hidden Markov models
- M. Saerens and H. Bourlard, "Linear and nonlinear prediction for speech recognition with hidden Markov models," Proc. EuroSpeech, pp.807-810, 1993.
- (1993) Proc. EuroSpeech , pp. 807-810
- Saerens, M.¹ Bourlard, H.²

65
- 0030262262
- An MLP/HMM hybrid model using linear predictors
- Y.J. Chung and C.K. Un, "An MLP/HMM hybrid model using linear predictors," Speech Communication, vol.19, pp.307-316, 1996.
- (1996) Speech Communication , vol.19 , pp. 307-316
- Chung, Y.J.¹ Un, C.K.²

66
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Aksmanovic, X. Sun, and C.F.J. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech & Audio Process., vol.2, no.4, pp.507-520, 1994.
- (1994) IEEE Trans. Speech & Audio Process. , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, X.³ Wu, C.F.J.⁴

67
- 0011495820
- Speech recognition by hidden Markov model using segmental statistics
- IEICE Technical Report, SP90-69
- Y. Hirata, I. Hayakawa, Y. Ono, and S. Nakagawa, "Speech recognition by hidden Markov model using segmental statistics," IEICE Technical Report, SP90-69, 1990.
- (1990)
- Hirata, Y.¹ Hayakawa, I.² Ono, Y.³ Nakagawa, S.⁴

68
- 0011400311
- Syllable recognition by hidden Markov model using fixed-length segmental statistics
- May
- S. Nakagawa, Y. Hirata, and Y. Ono, "Syllable recognition by hidden Markov model using fixed-length segmental statistics," IEICE Trans., vol.J75-D-II, no.5, pp.843-851, May 1992.
- (1992) IEICE Trans. , vol.J75-D-II , Issue.5 , pp. 843-851
- Nakagawa, S.¹ Hirata, Y.² Ono, Y.³

69
- 0000321310
- Explicit correlation in hidden Markov model for speech recognition
- C.J. Wellekens, "Explicit correlation in hidden Markov model for speech recognition," Proc. ICASSP, vol.I, pp.383-386, 1987.
- (1987) Proc. ICASSP , vol.1 , pp. 383-386
- Wellekens, C.J.¹

70
- 0030261616
- Modelling of the interframe dependence in an HMM using conditional Gaussian mixtures
- J. Ming and F.J. Smith, "Modelling of the interframe dependence in an HMM using conditional Gaussian mixtures," Computer Speech and Language, vol.10, pp.229-242, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 229-242
- Ming, J.¹ Smith, F.J.²

71
- 0027167185
- A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
- K. Aikawa, H. Singer, H. Kawahara, and Y. Tohkura, "A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition," Proc. ICASSP, pp.668-671, 1993.
- (1993) Proc. ICASSP , pp. 668-671
- Aikawa, K.¹ Singer, H.² Kawahara, H.³ Tohkura, Y.⁴

72
- 0011453543
- Comparative evaluation of segmental unit input HMM and conditional density HMM
- K. Yamamoto and S. Nakagawa, "Comparative evaluation of segmental unit input HMM and conditional density HMM," Proc. EuroSpeech, pp.1615-1618, 1995.
- (1995) Proc. EuroSpeech , pp. 1615-1618
- Yamamoto, K.¹ Nakagawa, S.²

73
- 85128367481
- Continuous speech recognition using segmental unit input HMM with a mixture of probability density functions and context dependency
- K. Hanai, K. Yamamoto, N. Minematsu, and S. Nakagawa, "Continuous speech recognition using segmental unit input HMM with a mixture of probability density functions and context dependency," Proc. ICSLP, pp.2935-2938, 1998.
- (1998) Proc. ICSLP , pp. 2935-2938
- Hanai, K.¹ Yamamoto, K.² Minematsu, N.³ Nakagawa, S.⁴

74
- 0011494012
- Speaker-independent phoneme and word recognition by statistical classification methods for time-sequential patterns
- Oct.
- S. Nakagawa and Y. Enomoto, "Speaker-independent phoneme and word recognition by statistical classification methods for time-sequential patterns," IEICE Trans., vol.J71-D, no.10, pp.1977-1983, Oct. 1988.
- (1988) IEICE Trans. , vol.J71-D , Issue.10 , pp. 1977-1983
- Nakagawa, S.¹ Enomoto, Y.²

75
- 84926271491
- Recognition on unvoiced plosive using time spectrum pattern
- May
- K. Ide, S. Makino, and K. Kido, "Recognition on unvoiced plosive using time spectrum pattern," J. Acoust. Soc. Japan, vol.39, no.5, pp.321-329, May 1983.
- (1983) J. Acoust. Soc. Japan , vol.39 , Issue.5 , pp. 321-329
- Ide, K.¹ Makino, S.² Kido, K.³

76
- 0024900279
- A stochastic segment model for phoneme-based continuous speech recognition
- M. Ostendorf and S. Roukos, "A stochastic segment model for phoneme-based continuous speech recognition," IEEE Trans. Acoust., Speech & Signal Process., vol.37, no.12, pp.1857-1869, 1989.
- (1989) IEEE Trans. Acoust., Speech & Signal Process. , vol.37 , Issue.12 , pp. 1857-1869
- Ostendorf, M.¹ Roukos, S.²

77
- 0025594074
- Connectionist Viterbi training: A new hybrid for continuous speech recognition
- M. Franzini and K.-F. Lee, "Connectionist Viterbi training: A new hybrid for continuous speech recognition," Proc. ICASSP, vol.I, pp.425-428, 1990.
- (1990) Proc. ICASSP , vol.1 , pp. 425-428
- Franzini, M.¹ Lee, K.-F.²

78
- 0028194709
- Connectionist probability estimators in HMM speech recognition
- S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Trans. Speech & Audio Process., vol.2, no.1, pp.161-174, 1994.
- (1994) IEEE Trans. Speech & Audio Process. , vol.2 , Issue.1 , pp. 161-174
- Renals, S.¹ Morgan, N.² Bourlard, H.³ Cohen, M.⁴ Franco, H.⁵

79
- 77954383749
- Data-driven extensions to HMM statistical dependencies
- J.A. Bilmes, "Data-driven extensions to HMM statistical dependencies," Proc. ICSLP, pp.69-72, 1998.
- (1998) Proc. ICSLP , pp. 69-72
- Bilmes, J.A.¹

80
- 0011498040
- Inter-frame dependence arising from preceding and succeeding frames - Application to speech recognition
- P. Hanna, J. Ming, and F.J. Smith, "Inter-frame dependence arising from preceding and succeeding frames - Application to speech recognition," Speech Communication, vol.31, no.4, pp.1301-1312, 1999.
- (1999) Speech Communication , vol.31 , Issue.4 , pp. 1301-1312
- Hanna, P.¹ Ming, J.² Smith, F.J.³

81
- 0009626005
- The IBM large vocabulary continuous speech recognition system for the ARPA NAB news task
- L.R. Bahl, P.F. Brown, P.V. Souza, and R.L. Mercer, "The IBM large vocabulary continuous speech recognition system for the ARPA NAB news task," Proc. Spoken Language Systems Technology Workshop, pp.121-126, 1995.
- (1995) Proc. Spoken Language Systems Technology Workshop , pp. 121-126
- Bahl, L.R.¹ Brown, P.F.² Souza, P.V.³ Mercer, R.L.⁴

82
- 0028996957
- A unified way in incorporating segmental feature and segmental model into HMM
- J. He and H. Leich, "A unified way in incorporating segmental feature and segmental model into HMM," Proc. ICASSP, vol.I, pp.532-535, 1995.
- (1995) Proc. ICASSP , vol.1 , pp. 532-535
- He, J.¹ Leich, H.²

83
- 85027200620
- The property of asymmetric segment
- IEICE Technical Report, SP98-30
- T. Ohtuki and T. Ohtomo, "The property of asymmetric segment," IEICE Technical Report, SP98-30, 1998.
- (1998)
- Ohtuki, T.¹ Ohtomo, T.²

84
- 0030245363
- From HMMs to segment models: A unified view of stochastic modeling for speech recognition
- M. Ostendorf, V.V. Digalakis, and O.A. Kimball, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.360-378, 1996.
- (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.V.² Kimball, O.A.³

85
- 0032048095
- Assessing the importance of the segmentation probability in segment-based speech recognition
- J. Verhasselt, I. Illina, J.P. Martens, Y. Gong, and J.-P. Haton, "Assessing the importance of the segmentation probability in segment-based speech recognition," Speech Communication, vol.24, pp.51-72, 1998.
- (1998) Speech Communication , vol.24 , pp. 51-72
- Verhasselt, J.¹ Illina, I.² Martens, J.P.³ Gong, Y.⁴ Haton, J.-P.⁵

86
- 0023846644
- Stochastic segment modeling using the estimate-maximize algorithm
- S. Roukos, M. Ostendorf, H. Gish, and A. Derr, "Stochastic segment modeling using the estimate-maximize algorithm," Proc. ICASSP, pp.127-130, 1988.
- (1988) Proc. ICASSP , pp. 127-130
- Roukos, S.¹ Ostendorf, M.² Gish, H.³ Derr, A.⁴

87
- 0031185482
- Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions
- L. Deng and M. Aksmanovic, "Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions," IEEE Trans. Speech & Audio Process., vol.5, no.4, pp.319-324, 1997.
- (1997) IEEE Trans. Speech & Audio Process. , vol.5 , Issue.4 , pp. 319-324
- Deng, L.¹ Aksmanovic, M.²

88
- 0032206267
- Speech trajectory discrimination using the minimum classification error learning
- R. Chengalvarayan and L. Deng, "Speech trajectory discrimination using the minimum classification error learning," IEEE Trans. Speech & Audio Process., vol.6, no.6, pp.505-515, 1998.
- (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.6 , pp. 505-515
- Chengalvarayan, R.¹ Deng, L.²

89
- 0032673963
- Probabilistic-trajectory segmental HMMs
- W.J. Holmes and M.J. Russell, "Probabilistic-trajectory segmental HMMs," Computer Speech and Language, vol.13, pp.3-37, 1999.
- (1999) Computer Speech and Language , vol.13 , pp. 3-37
- Holmes, W.J.¹ Russell, M.J.²

90
- 0034478708
- Improving phoneme classification performance using observation context-dependent segment models
- M. Szarvas and S. Matsunaga, "Improving phoneme classification performance using observation context-dependent segment models," Int. J. Speech Technology, vol.3, pp.253-262, 2000.
- (2000) Int. J. Speech Technology , vol.3 , pp. 253-262
- Szarvas, M.¹ Matsunaga, S.²

91
- 0027681974
- ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
- V. Digalakis, J.R. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech & Audio Process., vol.1, no.4, pp.431-442, 1993.
- (1993) IEEE Trans. Speech & Audio Process. , vol.1 , Issue.4 , pp. 431-442
- Digalakis, V.¹ Rohlicek, J.R.² Ostendorf, M.³

92
- 0011458458
- Kalman-filter solved by personal computer
- Maruzen
- M. Nakano and K. Nishiyama, Kalman-filter solved by personal computer, Maruzen, 1993.
- (1993)
- Nakano, M.¹ Nishiyama, K.²

93
- 0011432608
- Time series analysis programming
- Iwanami shoten
- G. Kitagawa, Time series analysis programming, Iwanami shoten, 1993.
- (1993)
- Kitagawa, G.¹

94
- 0029755019
- Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition
- M. Afify, Y. Gong, and J.-P. Haton, "Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition," Computer Speech and Language, vol.10, pp.23-36, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 23-36
- Afify, M.¹ Gong, Y.² Haton, J.-P.³

95
- 0011450090
- Constraining model duration variance in HMM-based connected speech recognition
- M.M. Hochberg and H.F. Silverman, "Constraining model duration variance in HMM-based connected speech recognition," Proc. EuroSpeech, pp.323-326, 1993.
- (1993) Proc. EuroSpeech , pp. 323-326
- Hochberg, M.M.¹ Silverman, H.F.²

96
- 0029368174
- Nonstationary hidden Markov model
- B. Sin and J.H. Kim, "Nonstationary hidden Markov model," Signal Processing, vol.46, pp.31-46, 1995.
- (1995) Signal Processing , vol.46 , pp. 31-46
- Sin, B.¹ Kim, J.H.²

97
- 0030247529
- Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities
- Y.K. Park, C.K. Un, and O.W. Kwon, "Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.389-392, 1996.
- (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.5 , pp. 389-392
- Park, Y.K.¹ Un, C.K.² Kwon, O.W.³

98
- 0000698482
- Japanese dictation toolkit - 1997 version
- May
- T. Kawahara, A. Lee, T. Kobayashi, K. Takeda, N. Minematsu, K. Itoh, A. Itoh, M. Yamamoto, A. Yamada, T. Utsuro, and K. Shikano, "Japanese dictation toolkit - 1997 version," J. Acoust. Soc. Japan, vol.E20, no.3, pp.223-239, May 1999.
- (1999) J. Acoust. Soc. Japan , vol.E20 , Issue.3 , pp. 223-239
- Kawahara, T.¹ Lee, A.² Kobayashi, T.³ Takeda, K.⁴ Minematsu, N.⁵ Itoh, K.⁶ Itoh, A.⁷ Yamamoto, M.⁸ Yamada, A.⁹ Utsuro, T.¹⁰ Shikano, K.¹¹

99
- 0029352735
- Continuous speech dictation - From theory to practice
- V. Steinbiss, H. Ney, U. Essen, B.-H. Tran, X. Aubert, C. Dugast, R. Kneser, H.-G. Meier, M. Oerder, R. Haeb-Umbach, D. Geller, W. Höllerbauer, and H. Bartosik, "Continuous speech dictation - From theory to practice," Speech Communication, vol.17, pp.19-38, 1995.
- (1995) Speech Communication , vol.17 , pp. 19-38
- Steinbiss, V.¹ Ney, H.² Essen, U.³ Tran, B.-H.⁴ Aubert, X.⁵ Dugast, C.⁶ Kneser, R.⁷ Meier, H.-G.⁸ Oerder, M.⁹ Haeb-Umbach, R.¹⁰ Geller, D.¹¹ Höllerbauer, W.¹² Bartosik, H.¹³

100
- 0011453546
- Recognition of spoken words based on VCV syllable unit
- May
- R. Nakatsu and M. Kohda, "Recognition of spoken words based on VCV syllable unit," IEICE Trans., vol.J61-A, no.5, pp.464-471, May 1978.
- (1978) IEICE Trans. , vol.J61-A , Issue.5 , pp. 464-471
- Nakatsu, R.¹ Kohda, M.²

101
- 0022185407
- Context-dependent modeling for acoustic-phonetic recognition of continuous speech
- R. Schwartz, Y. Chow, O. Kimball, S. Roucos, M. Krasner, and J. Makhoul, "Context-dependent modeling for acoustic-phonetic recognition of continuous speech," Proc. ICASSP, pp.1203-1208, 1985.
- (1985) Proc. ICASSP , pp. 1203-1208
- Schwartz, R.¹ Chow, Y.² Kimball, O.³ Roucos, S.⁴ Krasner, M.⁵ Makhoul, J.⁶

102
- 0003770715
- Kluwer Academic Publishers
- K.F. Lee, Automatic Speech Recognition, the Development of the SPHINX System, Kluwer Academic Publishers, 1989.
- (1989) Automatic Speech Recognition, the Development of the SPHINX System
- Lee, K.F.¹

103
- 0028996852
- The 1994 HTK large vocabulary speech recognition system
- P.C. Woodland, C.J. Leggetter, J.J. Odell, V. Valtchev, and S.J. Young, "The 1994 HTK large vocabulary speech recognition system," Proc. ICASSP, pp.73-76, 1995.
- (1995) Proc. ICASSP , pp. 73-76
- Woodland, P.C.¹ Leggetter, C.J.² Odell, J.J.³ Valtchev, V.⁴ Young, S.J.⁵

104
- 0011453547
- Comparison of syntax-oriented spoken Japanese understanding with semantic-oriented system
- July
- S. Nakagawa, Y. Hirata, I. Murase, and T. Tanoue, "Comparison of syntax-oriented spoken Japanese understanding with semantic-oriented system," IEICE Trans., vol.E74, no.7, pp.1854-1862, July 1991.
- (1991) IEICE Trans. , vol.E74 , Issue.7 , pp. 1854-1862
- Nakagawa, S.¹ Hirata, Y.² Murase, I.³ Tanoue, T.⁴

105
- 0024889251
- Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data
- T. Watanabe, "Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data," Proc. ICASSP, S1.1, 1985.
- (1985) Proc. ICASSP , S1.1
- Watanabe, T.¹

106
- 0011448906
- Multivariate statistical analysis of VCV syllables
- Jan.
- T. Sakai and K. Tabata, "Multivariate statistical analysis of VCV syllables," IEICE Trans., vol.56-D, no.1, pp.63-70, Jan. 1973.
- (1973) IEICE Trans. , vol.56-D , Issue.1 , pp. 63-70
- Sakai, T.¹ Tabata, K.²

107
- 34248800020
- Mora or syllable? Speech segmentation in Japanese
- T. Otake, G. Hatano, A. Cutler, and J. Mehler, "Mora or syllable? Speech segmentation in Japanese," J. Mem. Lang., vol.32, pp.358-378, 1993.
- (1993) J. Mem. Lang. , vol.32 , pp. 358-378
- Otake, T.¹ Hatano, G.² Cutler, A.³ Mehler, J.⁴

108
- 0031632630
- Advances in alphadigit recognition using syllables
- J. Hamaker, A. Ganapathiraju, J. Picone, and J.J. Godfrey, "Advances in alphadigit recognition using syllables," Proc. ICASSP, pp.421-424, 1998.
- (1998) Proc. ICASSP , pp. 421-424
- Hamaker, J.¹ Ganapathiraju, A.² Picone, J.³ Godfrey, J.J.⁴

109
- 0003462715
- Hidden Markov model for speech recognition
- Edinburgh University Press
- X.D. Huang, Y. Ariki, and M.A. Jack, Hidden Markov model for speech recognition, Edinburgh University Press, 1990.
- (1990)
- Huang, X.D.¹ Ariki, Y.² Jack, M.A.³

110
- 85015539783
- Subphonetic modeling with Markov states-SENONE
- M.-Y. Hwang and X. Huang, "Subphonetic modeling with Markov states-SENONE," Proc. ICASSP, pp.33-36, 1992.
- (1992) Proc. ICASSP , pp. 33-36
- Hwang, M.-Y.¹ Huang, X.²

111
- 0030193422
- Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers
- V.V. Digalakis, P. Monaco, and H. Murveit, "Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers," IEEE Trans. Speech & Audio Process., vol.4, no.4, pp.281-288, 1996.
- (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.4 , pp. 281-288
- Digalakis, V.V.¹ Monaco, P.² Murveit, H.³

112
- 0028530231
- State clustering in hidden Markov model-based continuous speech recognition
- S.J. Young and P.C. Woodland, "State clustering in hidden Markov model-based continuous speech recognition," Computer Speech and Language, vol.8, pp.369-383, 1994.
- (1994) Computer Speech and Language , vol.8 , pp. 369-383
- Young, S.J.¹ Woodland, P.C.²

113
- 85027105819
- Prediction about unknown phonetic context by tree-based phone modeling
- Technical Report, SP90-64, IEICE
- S. Hayamizu and K. Tanaka, "Prediction about unknown phonetic contexts by tree-based phone modeling," Technical Report, SP90-64, IEICE, 1990.
- (1990)
- Hayamizu, S.¹ Tanaka, K.²

114
- 85013744934
- A successive state splitting algorithm for efficient allophone modeling
- J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," Proc. ICASSP, pp.574-577, 1992.
- (1992) Proc. ICASSP , pp. 574-577
- Takami, J.¹ Sagayama, S.²

115
- 0011471866
- A study on HM-nets using phonetic decision tree-based successive state splitting
- Oct.
- T. Hori, M. Katoh, A. Itoh, and M. Kohda, "A study on HM-nets using phonetic decision tree-based successive state splitting," IEICE Trans. Inf. & Syst., vol.J80-D-II, no.10, pp.2645-2654, Oct. 1997.
- (1997) IEICE Trans. Inf. & Syst. , vol.J80-D-II , Issue.10 , pp. 2645-2654
- Hori, T.¹ Katoh, M.² Itoh, A.³ Kohda, M.⁴

116
- 0033106612
- A Bayesian triphone model
- J. Ming and J.F. Smith, "A Bayesian triphone model," Computer Speech and Language, vol.13, pp.195-206, 1999.
- (1999) Computer Speech and Language , vol.13 , pp. 195-206
- Ming, J.¹ Smith, J.F.²

117
- 85007758082
- Minimum error classification training of HMMs implementation details and experimental results
- D. Rainton and S. Sagayama, "Minimum error classification training of HMMs implementation details and experimental results," J. Acoust. Soc. Japan, vol.13, no.6, pp.379-388, 1992.
- (1992) J. Acoust. Soc. Japan , vol.13 , Issue.6 , pp. 379-388
- Rainton, D.¹ Sagayama, S.²

118
- 0011400313
- Estimating hidden Markov model parameters so as to maximize speech recognition accuracy
- L.R. Bahl, P.F. Brown, P.V. Souza, and R.L. Mercer, "Estimating hidden Markov model parameters so as to maximize speech recognition accuracy," IEEE Trans. Speech & Audio Process., vol.1, no.1, pp.77-82, 1993.
- (1993) IEEE Trans. Speech & Audio Process. , vol.1 , Issue.1 , pp. 77-82
- Bahl, L.R.¹ Brown, P.F.² Souza, P.V.³ Mercer, R.L.⁴

119
- 0028412908
- High performance connected digit recognition using maximum mutual information estimation
- Y. Normandin, R. Cardin, and R. de Mori, "High performance connected digit recognition using maximum mutual information estimation," IEEE Trans. Speech & Audio Process., vol.2, pp.299-311, 1994.
- (1994) IEEE Trans. Speech & Audio Process. , vol.2 , pp. 299-311
- Normandin, Y.¹ Cardin, R.² De Mori, R.³

120
- 0031222490
- MMIE training of large vocabulary recognition systems
- V. Valtchev, J. Odell, P. Woodland, and S. Young, "MMIE training of large vocabulary recognition systems," Speech Communication, vol.22, pp.303-314, 1997.
- (1997) Speech Communication , vol.22 , pp. 303-314
- Valtchev, V.¹ Odell, J.² Woodland, P.³ Young, S.⁴

121
- 85128400029
- Discriminative training of GMM using a modified EM algorithm for speaker recognition
- K. Markov and S. Nakagawa, "Discriminative training of GMM using a modified EM algorithm for speaker recognition," Proc. ICSLP, vol.2, pp.177-180, 1998.
- (1998) Proc. ICSLP , vol.2 , pp. 177-180
- Markov, K.¹ Nakagawa, S.²

122
- 0030235132
- Performance of HMM-based speech recognizers with discriminative state-weights
- O.W. Kwon and C.K. Un, "Performance of HMM-based speech recognizers with discriminative state-weights," Speech Communication, vol.19, pp.197-205, 1996.
- (1996) Speech Communication , vol.19 , pp. 197-205
- Kwon, O.W.¹ Un, C.K.²

123
- 0032762247
- Selective training for hidden Markov models with applications to speech classification
- L.M. Arslan and J.H.L. Hansen, "Selective training for hidden Markov models with applications to speech classification," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.46-64, 1999.
- (1999) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.1 , pp. 46-64
- Arslan, L.M.¹ Hansen, J.H.L.²

124
- 0002235014
- Improved feature decorrelation for HMM-based speech recognition
- K. Demuynck, J. Duchateau, D. Van Compernolle, and P. Wambacq, "Improved feature decorrelation for HMM-based speech recognition," Proc. ICSLP, pp.2907-2910, 1998.
- (1998) Proc. ICSLP , pp. 2907-2910
- Demuynck, K.¹ Duchateau, J.² Van Compernolle, D.³ Wambacq, P.⁴

125
- 0029725604
- A parametric approach to vocal tract length normalization
- E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," Proc. ICASSP, pp.346-349, 1996.
- (1996) Proc. ICASSP , pp. 346-349
- Eide, E.¹ Gish, H.²

126
- 0034847002
- The 1998 HTK system for transcription of conversational telephone speech
- T. Hain, P.C. Woodland, T.R. Niesler, and E.W.D. Whittaker, "The 1998 HTK system for transcription of conversational telephone speech," Proc. ICASSP, pp.57-60, 1999.
- (1999) Proc. ICASSP , pp. 57-60
- Hain, T.¹ Woodland, P.C.² Niesler, T.R.³ Whittaker, E.W.D.⁴

127
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- J.-L. Gauvain and C.H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech & Audio Process., vol.2, pp.291-298, 1994.
- (1994) IEEE Trans. Speech & Audio Process. , vol.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.H.²

128
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M.J.F. Gales and P.C. Woodland, "Mean and variance adaptation within the MLLR framework," Computer Speech and Language, vol.10, pp.249-264, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

129
- 0033100038
- Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
- V.D. Diakoloukas and V.V. Digalakis, "Maximum-likelihood stochastic-transformation adaptation of hidden Markov models," IEEE Trans. Speech & Audio Process., vol.7, no.2, pp.177-187, 1999.
- (1999) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.2 , pp. 177-187
- Diakoloukas, V.D.¹ Digalakis, V.V.²

130
- 0031704151
- Speaker clustering and transformation for speaker adaptation in speech recognition systems
- M. Padmanabhan, L.R. Bahl, D. Nahamoo, and M.A. Picheny, "Speaker clustering and transformation for speaker adaptation in speech recognition systems," IEEE Trans. Speech & Audio Process., vol.6, no.1, pp.71-77, 1998.
- (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.1 , pp. 71-77
- Padmanabhan, M.¹ Bahl, L.R.² Nahamoo, D.³ Picheny, M.A.⁴

131
- 85135109228
- Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs
- K. Ohkura, M. Sugiyama, and S. Sagayama, "Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs," Proc. ICSLP, pp.369-372, 1992.
- (1992) Proc. ICSLP , pp. 369-372
- Ohkura, K.¹ Sugiyama, M.² Sagayama, S.³

132
- 0011411817
- Speaker adaptation of acoustic models using correlations of transfer vectors
- March
- S. Takahashi and S. Sagayama, "Speaker adaptation of acoustic models using correlations of transfer vectors," IEICE Trans., vol.J82-D-II, no.3, pp.324-331, March 1999.
- (1999) IEICE Trans. , vol.J82-D-II , Issue.3 , pp. 324-331
- Takahashi, S.¹ Sagayama, S.²

133
- 0002488301
- Speaker adaptation with autonomous control using tree structure
- K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous control using tree structure," Proc. Euro-Speech, pp.1143-1146, 1995.
- (1995) Proc. Euro-Speech , pp. 1143-1146
- Shinoda, K.¹ Watanabe, T.²

134
- 0030189744
- Speaker adaptation using combined transformation and Bayesian methods
- V.V. Digalakis and L.G. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech & Audio Process., vol.4, no.4, pp.294-300, 1996.
- (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.4 , pp. 294-300
- Digalakis, V.V.¹ Neumeyer, L.G.²

135
- 0000521080
- Speaker adaptation using maximum a posteriori probability estimation and data size dependent parameter smoothing
- March
- M. Tonomura, T. Kosaka, and S. Matsumura, "Speaker adaptation using maximum a posteriori probability estimation and data size dependent parameter smoothing," IEICE Trans., vol.J81-D-II, no.3, pp.465-471, March 1998.
- (1998) IEICE Trans. , vol.J81-D-II , Issue.3 , pp. 465-471
- Tonomura, M.¹ Kosaka, T.² Matsumura, S.³

136
- 0035279111
- A structural Bayes approach to speaker adaptation
- K. Shinoda and C.H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech & Audio Process., vol.9, no.3, pp.276-287, 2001.
- (2001) IEEE Trans. Speech & Audio Process. , vol.9 , Issue.3 , pp. 276-287
- Shinoda, K.¹ Lee, C.H.²

137
- 0011448907
- Automatic speech recognition by stochastic approaches
- Feb.
- S. Nakagawa, "Automatic speech recognition by stochastic approaches," J. Acoust. Soc. Japan, vol.50, no.2, pp.126-132, Feb. 1994.
- (1994) J. Acoust. Soc. Japan , vol.50 , Issue.2 , pp. 126-132
- Nakagawa, S.¹

138
- 0011458461
- Automatic learning of stochastic context-free grammar for spontaneous speech by integration of bigram
- March
- S. Nakagawa and K. Ohtani, "Automatic learning of stochastic context-free grammar for spontaneous speech by integration of bigram," Trans. Inf. Process. Soc. Japan, vol.39, no.3, pp.575-584, March 1998.
- (1998) Trans. Inf. Process. Soc. Japan , vol.39 , Issue.3 , pp. 575-584
- Nakagawa, S.¹ Ohtani, K.²

139
- 0011509488
- A study of large-vocabulary continuous speech recognition using higher order n-gram language models
- Spring
- K. Ohtsuki, K. Yoshida, T. Matsuoka, and S. Furui, "A study of large-vocabulary continuous speech recognition using higher order n-gram language models," Conf. Record, Acoust. Soc. Japan, pp.47-48, Spring 1997.
- (1997) Conf. Record, Acoust. Soc. Japan , pp. 47-48
- Ohtsuki, K.¹ Yoshida, K.² Matsuoka, T.³ Furui, S.⁴

140
- 0028996884
- Phrase bigrams for continuous speech recognition
- E.P. Giachin, "Phrase bigrams for continuous speech recognition," Proc. ICASSP, pp.225-227, 1995.
- (1995) Proc. ICASSP , pp. 225-227
- Giachin, E.P.¹

141
- 0011496729
- Effect of vocabulary extension using word sequence concatenation for large vocabulary continuous speech recognition
- April
- Y. Wada, N. Kobayashi, Y. Nakano, and T. Kobayashi, "Effect of vocabulary extension using word sequence concatenation for large vocabulary continuous speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1413-1420, April 1999.
- (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1413-1420
- Wada, Y.¹ Kobayashi, N.² Nakano, Y.³ Kobayashi, T.⁴

142
- 0011501276
- A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition
- Jan.
- S. Nakagawa, H. Akamatsu, and H. Nishizaki, "A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition," Natural Language Processing, vol.6, no.2, pp.97-115, Jan. 1999.
- (1999) Natural Language Processing , vol.6 , Issue.2 , pp. 97-115
- Nakagawa, S.¹ Akamatsu, H.² Nishizaki, H.³

143
- 0028996879
- Language modeling by variable length sequences, theoretical formulation and evaluation of multigrams
- S. Deligne and F. Bimbot, "Language modeling by variable length sequences, theoretical formulation and evaluation of multigrams," Proc. ICASSP, pp.169-172, 1995.
- (1995) Proc. ICASSP , pp. 169-172
- Deligne, S.¹ Bimbot, F.²

144
- 0029762785
- Variable-order N-gram generation by word-class splitting and consecutive word grouping
- H. Masataki and Y. Sagisaka, "Variable-order N-gram generation by word-class splitting and consecutive word grouping," Proc. ICASSP, pp.188-191, 1996.
- (1996) Proc. ICASSP , pp. 188-191
- Masataki, H.¹ Sagisaka, Y.²

145
- 0024700466
- Tree-based statistical language model for natural language speech recognition
- L.R. Bahl, P.F. Brown, P.V. Souza, and R.L. Mercer, "Tree-based statistical language model for natural language speech recognition," IEEE Trans. Acoust. Speech & Signal Process., vol.37, no.7, pp.1001-1008, 1989.
- (1989) IEEE Trans. Acoust. Speech & Signal Process. , vol.37 , Issue.7 , pp. 1001-1008
- Bahl, L.R.¹ Brown, P.F.² Souza, P.V.³ Mercer, R.L.⁴

146
- 0011464163
- Word clustering for class-based language models
- S. Mori, M. Nishimura, and N. Itoh, "Word clustering for class-based language models," Trans. Inf. Process. Soc. Japan, vol.38, no.11, pp.2200-2207, 1997.
- (1997) Trans. Inf. Process. Soc. Japan , vol.38 , Issue.11 , pp. 2200-2207
- Mori, S.¹ Nishimura, M.² Itoh, N.³

147
- 0032650074
- Variable-length category n-gram language models
- T.R. Niesler and P.C. Woodland, "Variable-length category n-gram language models," Computer Speech and Language, vol.13, pp.99-124, 1999.
- (1999) Computer Speech and Language , vol.13 , pp. 99-124
- Niesler, T.R.¹ Woodland, P.C.²

148
- 0000797420
- An estimation of an upper bound for the entropy of Japanese
- S. Mori and O. Yamaji, "An estimation of an upper bound for the entropy of Japanese," Trans. Inf. Process. Soc. Japan, vol.38, no.11, pp.2191-2199, 1997.
- (1997) Trans. Inf. Process. Soc. Japan , vol.38 , Issue.11 , pp. 2191-2199
- Mori, S.¹ Yamaji, O.²

149
- 0030181951
- A maximum entropy approach to adaptive statistical language modeling
- R. Rosenfeld, "A maximum entropy approach to adaptive statistical language modeling," Computer Speech and Language, vol.10, pp.187-228, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 187-228
- Rosenfeld, R.¹

150
- 0033106616
- Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition
- Z.G. Dong and L.K. Teng, "Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition," Computer Speech and Language, vol.13, pp.125-141, 1999.
- (1999) Computer Speech and Language , vol.13 , pp. 125-141
- Dong, Z.G.¹ Teng, L.K.²

151
- 0032165145
- A multispan language modeling framework for large vocabulary speech recognition
- J.R. Bellegarda, "A multispan language modeling framework for large vocabulary speech recognition," IEEE Trans. Speech & Audio Process., vol.6, no.5, pp.456-467, 1998.
- (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.5 , pp. 456-467
- Bellegarda, J.R.¹

152
- 0011471867
- Multispan statistical language modeling for large vocabulary speech recognition
- J.R. Bellegarda, "Multispan statistical language modeling for large vocabulary speech recognition," Proc. ICSLP, pp.2395-2398, 1998.
- (1998) Proc. ICSLP , pp. 2395-2398
- Bellegarda, J.R.¹

153
- 0032785782
- Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
- R.M. Iyer and M. Ostendorf, "Modeling long distance dependence in language: Topic mixtures versus dynamic cache models," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.31-39, 1999.
- (1999) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.1 , pp. 31-39
- Iyer, R.M.¹ Ostendorf, M.²

154
- 0002235611
- Adaptive topic-dependent language modeling using word-based varigrams
- S. Martin, J. Liermann, and H. Ney, "Adaptive topic-dependent language modeling using word-based varigrams," Proc. EuroSpeech, pp.1447-1450, 1997.
- (1997) Proc. EuroSpeech , pp. 1447-1450
- Martin, S.¹ Liermann, J.² Ney, H.³

155
- 0011408731
- Dictation of broadcast news speech using word pronunciation probability
- Spring
- K. Takagi and S. Furui, "Dictation of broadcast news speech using word pronunciation probability," Conf. Record, Acoust. Soc. Japan, pp.9-10, Spring 1998.
- (1998) Conf. Record, Acoust. Soc. Japan , pp. 9-10
- Takagi, K.¹ Furui, S.²

156
- 0011451282
- An improvement of language modeling for automatic transcription of Japanese broadcast-news speech
- Spring
- N. Sakurai and S. Furui, "An improvement of language modeling for automatic transcription of Japanese broadcast-news speech," Conf. Record, Acoust. Soc. Japan, pp.57-58, Spring 1999.
- (1999) Conf. Record, Acoust. Soc. Japan , pp. 57-58
- Sakurai, N.¹ Furui, S.²

157
- 0011408732
- A language model for recognition of continuously uttered sentences
- Spring
- T. Imai, Y. Saito, A. Ando, and S. Furui, "A language model for recognition of continuously uttered sentences," Conf. Record, Acoust. Soc. Japan, pp.63-64, Spring 1999.
- (1999) Conf. Record, Acoust. Soc. Japan , pp. 63-64
- Imai, T.¹ Saito, Y.² Ando, A.³ Furui, S.⁴

158
- 0011404832
- Time dependent language model for broadcast news transcription
- April
- A. Kobayashi, T. Imai, A. Ando, and K. Nakabayashi, "Time dependent language model for broadcast news transcription," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1421-1429, April 1999.
- (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1421-1429
- Kobayashi, A.¹ Imai, T.² Ando, A.³ Nakabayashi, K.⁴

159
- 0011402513
- The influence of morpheme analysis systems on language model for continuous speech recognition
- Autumn
- N. Yodo, K. Itoh, S. Nakamura, and K. Shikano, "The influence of morpheme analysis systems on language model for continuous speech recognition," Conf. Record, Acoust. Soc. Japan, pp.53-54, Autumn 1997.
- (1997) Conf. Record, Acoust. Soc. Japan , pp. 53-54
- Yodo, N.¹ Itoh, K.² Nakamura, S.³ Shikano, K.⁴

160
- 85024115120
- An empirical study of smoothing techniques for language modeling
- S.F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Proc. ACL, pp.310-318, 1996.
- (1996) Proc. ACL , pp. 310-318
- Chen, S.F.¹ Goodman, J.²

161
- 0030124373
- Succeeding word prediction for speech recognition based on stochastic language model
- April
- M. Zhou and S. Nakagawa, "Succeeding word prediction for speech recognition based on stochastic language model," IEICE Trans. Inf. & Syst., vol.E79-D, no.4, pp.333-341, April 1996.
- (1996) IEICE Trans. Inf. & Syst. , vol.E79-D , Issue.4 , pp. 333-341
- Zhou, M.¹ Nakagawa, S.²

162
- 0010032271
- Inside-outside reestimation from partially bracketed corpora
- F. Pereira and Y. Schabes, "Inside-outside reestimation from partially bracketed corpora," Proc. ACL, pp.31-37, 1992.
- (1992) Proc. ACL , pp. 31-37
- Pereira, F.¹ Schabes, Y.²

163
- 84894805373
- An empirical evaluation of probabilistic lexicalized tree insertion grammars
- R. Hwa, "An empirical evaluation of probabilistic lexicalized tree insertion grammars," Proc. ACL, pp.557-563, 1998.
- (1998) Proc. ACL , pp. 557-563
- Hwa, R.¹

164
- 85027133681
- Construction and evaluation of language models based on stochastic context free grammar for speech recognition
- Technical Report, SP99-37, Inst. Elect. Inf. Comm. Engrs., June
- C. Hori, M. Katoh, A. Itoh, and M. Kohda, "Construction and evaluation of language models based on stochastic context free grammar for speech recognition," Technical Report, SP99-37, Inst. Elect. Inf. Comm. Engrs., June 1999.
- (1999)
- Hori, C.¹ Katoh, M.² Itoh, A.³ Kohda, M.⁴

165
- 0032673481
- An automatic acquisition method of statistical finite-state automaton sentences
- M. Suzuki and S. Makino, "An automatic acquisition method of statistical finite-state automaton sentences," Proc. ICASSP, pp.737-740, 1999.
- (1999) Proc. ICASSP , pp. 737-740
- Suzuki, M.¹ Makino, S.²

166
- 0011403721
- Construction of language models using probabilistic GLR methods toward speech recognition
- April
- H. Imai, H. Tanaka, and T. Tokunaga, "Construction of language models using probabilistic GLR methods toward speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1404-1411, April 1999.
- (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1404-1411
- Imai, H.¹ Tanaka, H.² Tokunaga, T.³

167
- 0011449593
- Spontaneous speech understanding method based on LR parsing of keyword lattice
- Feb.
- H. Tsuboi, Y. Takebayashi, and H. Hashimoto, "Spontaneous speech understanding method based on LR parsing of keyword lattice," Trans. Inf. Process. Soc. Japan, vol.38, no.2, pp.260-268, Feb. 1997.
- (1997) Trans. Inf. Process. Soc. Japan , vol.38 , Issue.2 , pp. 260-268
- Tsuboi, H.¹ Takebayashi, Y.² Hashimoto, H.³

168
- 0025517070
- Automatic recognition of keywords in unconstrained speech using hidden Markov models
- J.G. Wilpon, L.R. Rabiner, C.-H. Lee, and E.R. Goldman, "Automatic recognition of keywords in unconstrained speech using hidden Markov models," IEEE Trans. Acoust. Speech & Signal Process., vol.38, no.11, pp.1870-1878, 1990.
- (1990) IEEE Trans. Acoust. Speech & Signal Process. , vol.38 , Issue.11 , pp. 1870-1878
- Wilpon, J.G.¹ Rabiner, L.R.² Lee, C.-H.³ Goldman, E.R.⁴

169
- 0011449594
- Processing unknown words in continuous speech recognition
- July
- K. Kita, T. Ehara, and T. Morimoto, "Processing unknown words in continuous speech recognition," IEICE Trans., vol.E74, no.7, pp.1811-1816, July 1991.
- (1991) IEICE Trans. , vol.E74 , Issue.7 , pp. 1811-1816
- Kita, K.¹ Ehara, T.² Morimoto, T.³

170
- 0011501278
- Comparison of dictation and word spotting techniques in classification of news speech articles
- IEICE Technical Report, SP98-32, June
- J. Ogata and Y. Ariki, "Comparison of dictation and word spotting techniques in classification of news speech articles," IEICE Technical Report, SP98-32, June 1998.
- (1998)
- Ogata, J.¹ Ariki, Y.²

171
- 0011498043
- Voice-operated projector using utterance verification and its application to hyper-text generation of lectures
- April
- T. Kawahara, K. Ishizuka, and S. Doshita, "Voice-operated projector using utterance verification and its application to hyper-text generation of lectures," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1491-1498, April 1999.
- (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1491-1498
- Kawahara, T.¹ Ishizuka, K.² Doshita, S.³

172
- 0011408733
- Dealing with out-of-vocabulary words and speech disfluencies in an N-gram based speech understanding system
- Dec.
- A. Kai, Y. Hirose, and S. Nakagawa, "Dealing with out-of-vocabulary words and speech disfluencies in an N-gram based speech understanding system," Proc. ICSLP, pp.2427-2430, Dec. 1999.
- (1999) Proc. ICSLP , pp. 2427-2430
- Kai, A.¹ Hirose, Y.² Nakagawa, S.³

173
- 0001079615
- A*-admissible key-phrase spotting with sub-syllable level utterance verification
- B. Chen, H. Wong, L. Chen, and L. Lee, "A*-admissible key-phrase spotting with sub-syllable level utterance verification," Proc. ICSLP, pp.783-786, 1998.
- (1998) Proc. ICSLP , pp. 783-786
- Chen, B.¹ Wong, H.² Chen, L.³ Lee, L.⁴

174
- 84902052756
- A new confidence measure based on rank-ordering subphone scores
- Q. Lin, S. Das, D. Lubensky, and M. Picheny, "A new confidence measure based on rank-ordering subphone scores," Proc. ICSLP, pp.3249-3252, 1998.
- (1998) Proc. ICSLP , pp. 3249-3252
- Lin, Q.¹ Das, S.² Lubensky, D.³ Picheny, M.⁴

175
- 0032091375
- Text-independent speaker recognition using non-linear frame likelihood transformation
- K.P. Markov and S. Nakagawa, "Text-independent speaker recognition using non-linear frame likelihood transformation," Speech Communication, vol.24, pp.193-209, 1998.
- (1998) Speech Communication , vol.24 , pp. 193-209
- Markov, K.P.¹ Nakagawa, S.²

176
- 0011408734
- Word-based approach to large-vocabulary continuous speech recognition for Japanese
- April
- M. Nishimura, N. Itoh, and K. Yamasaki, "Word-based approach to large-vocabulary continuous speech recognition for Japanese," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1395-1403, April 1999.
- (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1395-1403
- Nishimura, M.¹ Itoh, N.² Yamasaki, K.³

177
- 0011450876
- Unknown utterance rejection using likelihood normalization based on syllable recognition
- Dec.
- T. Watanabe and S. Tsukada, "Unknown utterance rejection using likelihood normalization based on syllable recognition," IEICE Trans., vol.J75-D-II, no.12, pp.2002-2009, Dec. 1992.
- (1992) IEICE Trans. , vol.J75-D-II , Issue.12 , pp. 2002-2009
- Watanabe, T.¹ Tsukada, S.²

178
- 0029323659
- Relationship among recognition rate, rejection rate and false alarm rate in a spoken word recognition system
- June
- A. Kai and S. Nakagawa, "Relationship among recognition rate, rejection rate and false alarm rate in a spoken word recognition system," IEICE Trans. Inf. & Syst., vol.E78-D, no.6, pp.698-704, June 1995.
- (1995) IEICE Trans. Inf. & Syst. , vol.E78-D , Issue.6 , pp. 698-704
- Kai, A.¹ Nakagawa, S.²

179
- 0011501675
- Large vocabulary continuous speech recognition: From laboratory systems towards real-world applications
- Dec.
- J.-L. Gauvain and L. Lamel, "Large vocabulary continuous speech recognition: From laboratory systems towards real-world applications," IEICE Trans., vol.J79-D-II, no.12, pp.2005-2021, Dec. 1996.
- (1996) IEICE Trans. , vol.J79-D-II , Issue.12 , pp. 2005-2021
- Gauvain, J.-L.¹ Lamel, L.²

180
- 4544364908
- A decoder for broadcast news transcription
- Autumn
- T. Imai, K. Onoe, A. Kobayashi, and A. Ando, "A decoder for broadcast news transcription," Acoust. Soc. Japan, pp.105-106, Autumn 1998.
- (1998) Acoust. Soc. Japan , pp. 105-106
- Imai, T.¹ Onoe, K.² Kobayashi, A.³ Ando, A.⁴

181
- 0011495826
- A new computation method of perplexity for text corpus including unknown words
- Autumn
- S. Nakagawa and H. Akamatsu, "A new computation method of perplexity for text corpus including unknown words," Conf. Record, Acoust. Soc. Japan, pp.63-64, Autumn 1998.
- (1998) Conf. Record, Acoust. Soc. Japan , pp. 63-64
- Nakagawa, S.¹ Akamatsu, H.²

182
- 0030715922
- Task adaptation using MAP estimation in N-gram language modeling
- H. Masataki, Y. Sagisaka, K. Hisaki, and T. Kawahara, "Task adaptation using MAP estimation in N-gram language modeling," Proc. ICASSP, pp.783-786, 1997.
- (1997) Proc. ICASSP , pp. 783-786
- Masataki, H.¹ Sagisaka, Y.² Hisaki, K.³ Kawahara, T.⁴

183
- 85009128031
- Relationship between phoneme recognition performance and word recognition rate
- May
- S. Nakagawa, "Relationship between phoneme recognition performance and word recognition rate," Trans. Inf. Process, Japan, vol.22, no.5, pp.488-496, May 1996.
- (1996) Trans. Inf. Process, Japan , vol.22 , Issue.5 , pp. 488-496
- Nakagawa, S.¹

184
- 0011451284
- Spontaneous speech understanding for a dialogue system
- M. Hidano, T. Itoh, M. Yamamoto, and S. Nakagawa, "Spontaneous speech understanding for a dialogue system," Proc. ESCA Workshop on Spoken Dialogue Systems, pp.25-28, 1995.
- (1995) Proc. ESCA Workshop on Spoken Dialogue Systems , pp. 25-28
- Hidano, M.¹ Itoh, T.² Yamamoto, M.³ Nakagawa, S.⁴

185
- 84989448320
- Evaluation of FFT cepstrum and LPC cepstrum for speech and speaker recognition
- Feb.
- S. Nakagawa and M. Sakamoto, "Evaluation of FFT cepstrum and LPC cepstrum for speech and speaker recognition," IEICE Trans., vol.J66-A, no.2, pp.1199-1206, Feb. 1983.
- (1983) IEICE Trans. , vol.J66-A , Issue.2 , pp. 1199-1206
- Nakagawa, S.¹ Sakamoto, M.²

186
- 84987195640
- Perception of vowels and C-V syllables segmented from connected speech
- May
- H. Kuwabara and H. Sakai, "Perception of vowels and C-V syllables segmented from connected speech," J. Acoust. Soc. Japan, vol.28, no.5, pp.225-234, May 1972.
- (1972) J. Acoust. Soc. Japan , vol.28 , Issue.5 , pp. 225-234
- Kuwabara, H.¹ Sakai, H.²

187
- 85027151219
- A study on speech recognition unit based on speech perceptual experiments
- IEICE Technical Report, SP99-43, July
- K. Yamamoto and S. Nakagawa, "A study on speech recognition unit based on speech perceptual experiments," IEICE Technical Report, SP99-43, July 1999.
- (1999)
- Yamamoto, K.¹ Nakagawa, S.²

188
- 0011501282
- Toward spoken language understanding from speech recognition
- Nov.
- S. Nakagawa, "Toward spoken language understanding from speech recognition," J. Acoust. Soc. Japan, vol.52, no.11, pp.859-856, Nov. 1996.
- (1996) J. Acoust. Soc. Japan , vol.52 , Issue.11 , pp. 859-856
- Nakagawa, S.¹

189
- 0011403914
- Evaluation of auditory front-ends in DTW word recognition system
- June
- K. Obara and T. Hirahara, "Evaluation of auditory front-ends in DTW word recognition system," J. Acoust. Soc. Japan, vol.50, no.6, pp.452-464, June 1994.
- (1994) J. Acoust. Soc. Japan , vol.50 , Issue.6 , pp. 452-464
- Obara, K.¹ Hirahara, T.²

190
- 0032677422
- Recent experiments in large vocabulary conversational speech recognition
- J. Billa, T. Colthurst, A. El-Jaroudi, R. Iyer, K. Ma, S. Matsoukas, C. Quillen, F. Richardson, M. Siu, G. Zavaliagkos, and H. Gish, "Recent experiments in large vocabulary conversational speech recognition," Proc. ICASSP, pp.41-44, 1999.
- (1999) Proc. ICASSP , pp. 41-44
- Billa, J.¹ Colthurst, T.² El-Jaroudi, A.³ Iyer, R.⁴ Ma, K.⁵ Matsoukas, S.⁶ Quillen, C.⁷ Richardson, F.⁸ Siu, M.⁹ Zavaliagkos, G.¹⁰ Gish, H.¹¹

191
- 0031643048
- Multiresolution cepstral features for phoneme recognition across speech sub-bands
- P. McCourt, S. Vaseghi, and N. Harte, "Multiresolution cepstral features for phoneme recognition across speech sub-bands," Proc. ICASSP, pp.557-560, 1998.
- (1998) Proc. ICASSP , pp. 557-560
- McCourt, P.¹ Vaseghi, S.² Harte, N.³

192
- 0032654472
- Channel and noise adaptation via HMM mixture mean transform and stochastic matching
- S. Kong and B. Shi, "Channel and noise adaptation via HMM mixture mean transform and stochastic matching," Proc. ICASSP, pp.301-304, 1999.
- (1999) Proc. ICASSP , pp. 301-304
- Kong, S.¹ Shi, B.²

193
- 0025388113
- A linear predictive HMM for vector valued observation with application to speech recognition
- P. Kenny, M. Lennig, and P. Mermelstein, "A linear predictive HMM for vector valued observation with application to speech recognition," IEEE Trans. Acoust. Speech & Signal Process., vol.38, no.1, pp.220-225, 1990.
- (1990) IEEE Trans. Acoust. Speech & Signal Process. , vol.38 , Issue.1 , pp. 220-225
- Kenny, P.¹ Lennig, M.² Mermelstein, P.³

194
- 0011406323
- Proposal of a stochastic context-free grammar for continuous observation vector sequences
- Spring
- S. Nakagawa, "Proposal of a stochastic context-free grammar for continuous observation vector sequences," Conf. Record, pp.73-74, Spring 1992.
- (1992) Conf. Record , pp. 73-74
- Nakagawa, S.¹

195
- 0026171582
- Application of the Gibbs distribution to hidden Markov modeling in speaker independent isolated word recognition
- Y. Zhao, L.E. Atlas, and X. Zhuang, "Application of the Gibbs distribution to hidden Markov modeling in speaker independent isolated word recognition," IEEE Trans. Signal Process., vol.39, no.6, pp.1291-1298, 1991.
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.6 , pp. 1291-1298
- Zhao, Y.¹ Atlas, L.E.² Zhuang, X.³

196
- 0011411822
- Probabilistic modeling with Bayesian networks for automatic speech recognition
- G. Zweig and S. Russell, "Probabilistic modeling with Bayesian networks for automatic speech recognition," Proc. ICSLP, pp.3011-3014, 1998.
- (1998) Proc. ICSLP , pp. 3011-3014
- Zweig, G.¹ Russell, S.²

197
- 0029325616
- A comparative study of output probability functions in HMMs
- June
- S. Nakagawa, L. Zhao, and H. Suzuki, "A comparative study of output probability functions in HMMs," IEICE Trans. Inf. & Syst., vol.E78-D, no.6, pp.669-675, June 1995.
- (1995) IEICE Trans. Inf. & Syst. , vol.E78-D , Issue.6 , pp. 669-675
- Nakagawa, S.¹ Zhao, L.² Suzuki, H.³

198
- 85009181766
- Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees
- H. Singer and A. Nakamura, "Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees," Proc. EuroSpeech, pp.1355-1358, 1999.
- (1999) Proc. EuroSpeech , pp. 1355-1358
- Singer, H.¹ Nakamura, A.²

199
- 85027098626
- Learning and normalizing of the talker differences in the recognition of spoken words
- Technical Report, Acoust. Soc. Japan, SP75-25, Nov.
- S. Furui, "Learning and normalizing of the talker differences in the recognition of spoken words," Technical Report, Acoust. Soc. Japan, SP75-25, Nov. 1975.
- (1975)
- Furui, S.¹

200
- 0017961869
- A real time spoken word recognition system with various learning capabilities of the speaker differences
- Scripta Publishing Co.
- S. Nakagawa and T. Sakai, "A real time spoken word recognition system with various learning capabilities of the speaker differences," Syst. Comp. Controls, vol.9, no.3, pp.63-71, Scripta Publishing Co., 1978.
- (1978) Syst. Comp. Controls , vol.9 , Issue.3 , pp. 63-71
- Nakagawa, S.¹ Sakai, T.²

201
- 85009195509
- A missing-word test comparison of human and statistical language model performance
- M. Owens, A. Kruger, P. Donnelly, F.J. Smith, and J. Ming, "A missing-word test comparison of human and statistical language model performance," Proc. EuroSpeech, pp.145-148, 1999.
- (1999) Proc. EuroSpeech , pp. 145-148
- Owens, M.¹ Kruger, A.² Donnelly, P.³ Smith, F.J.⁴ Ming, J.⁵

202
- 0011400318
- Robust language modeling for small corpus of target task using call combined word statistics and selective use of general corpus
- Nov.
- Y. Wada, N. Kobayashi, and T. Kobayashi, "Robust language modeling for small corpus of target task using call combined word statistics and selective use of general corpus," IEICE Trans., vol.J83-D-II, no.11, pp.2397-2406, Nov. 2000.
- (2000) IEICE Trans. , vol.J83-D-II , Issue.11 , pp. 2397-2406
- Wada, Y.¹ Kobayashi, N.² Kobayashi, T.³

203
- 0011404834
- Part-of-speech N-gram and word N-gram fused language model
- H. Yamamoto and Y. Sagisaka, "Part-of-speech N-gram and word N-gram fused language model," Proc. EuroSpeech, pp.1803-1806, 1999.
- (1999) Proc. EuroSpeech , pp. 1803-1806
- Yamamoto, H.¹ Sagisaka, Y.²

204
- 0030719155
- A word graph algorithm for large vocabulary continuous speech recognition
- S. Ortmanns, H. Ney, and X. Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Computer Speech and Language, vol.11, pp.43-72, 1997.
- (1997) Computer Speech and Language , vol.11 , pp. 43-72
- Ortmanns, S.¹ Ney, H.² Aubert, X.³

205
- 0001100613
- A study on a phoneme-graph-based hypothesis restriction for large vocabulary continuous speech recognition
- April
- T. Hori, N. Oka, M. Katoho, A. Itoh, and M. Kohda, "A study on a phoneme-graph-based hypothesis restriction for large vocabulary continuous speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1365-1373, April 1999.
- (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1365-1373
- Hori, T.¹ Oka, N.² Katoho, M.³ Itoh, A.⁴ Kohda, M.⁵

206
- 29144491321
- Large vocabulary continuous speech recognition based on multi-pass search using word trellis index
- Jan.
- A. Lee, T. Kawahara, and S. Doshita, "Large vocabulary continuous speech recognition based on multi-pass search using word trellis index," IEICE Trans., vol.J82-D, no.1, pp.1-9, Jan. 1999.
- (1999) IEICE Trans. , vol.J82-D , Issue.1 , pp. 1-9
- Lee, A.¹ Kawahara, T.² Doshita, S.³

207
- 85027104898
- Some problems on automatic speech recognition
- IEICE Technical Report, SP99-93, Dec.
- S. Nakagawa, "Some problems on automatic speech recognition," IEICE Technical Report, SP99-93, Dec. 1999.
- (1999)
- Nakagawa, S.¹

208
- 0031619371
- Balancing acoustic and linguistic probabilities
- A. Ogawa, K. Takeda, and F. Itakura, "Balancing acoustic and linguistic probabilities," Proc. ICASSP, pp.181-184, 1998.
- (1998) Proc. ICASSP , pp. 181-184
- Ogawa, A.¹ Takeda, K.² Itakura, F.³

209
- 0032649321
- Partly hidden Markov model and its application to speech recognition
- T. Kobayashi, J. Furuyama, and K. Masumitsu, "Partly hidden Markov model and its application to speech recognition," Proc. ICASSP, pp.121-124, 1999.
- (1999) Proc. ICASSP , pp. 121-124
- Kobayashi, T.¹ Furuyama, J.² Masumitsu, K.³

210
- 0011451288
- Comparison of SCFG and HMM based speaker independent spoken digit recognition
- Dec.
- M. Zhou and S. Nakagawa, "Comparison of SCFG and HMM based speaker independent spoken digit recognition," Proc. Int. Workshop on Automatic Speech Recognition, pp.30-31, Dec. 1993.
- (1993) Proc. Int. Workshop on Automatic Speech Recognition , pp. 30-31
- Zhou, M.¹ Nakagawa, S.²

211
- 85032751521
- Dynamic programming search for continuous speech recognition
- Sept.
- H. Ney and S. Ortmanns, "Dynamic programming search for continuous speech recognition," IEEE Signal Process. Mag., pp.64-82, Sept. 1999.
- (1999) IEEE Signal Process. Mag. , pp. 64-82
- Ney, H.¹ Ortmanns, S.²

212
- 85032751683
- Hierarchical search for large-vocabulary conversational speech recognition
- Sept.
- N. Deshmukh, A. Ganapathiraju, and J. Picone, "Hierarchical search for large-vocabulary conversational speech recognition," IEEE Signal Process. Mag., pp.84-107, Sept. 1999.
- (1999) IEEE Signal Process. Mag. , pp. 84-107
- Deshmukh, N.¹ Ganapathiraju, A.² Picone, J.³

213
- 0004164078
- Springer
- R. Kompe, Prosody in Speech Understanding Systems, Springer, 1996.
- (1996) Prosody in Speech Understanding Systems
- Kompe, R.¹

214
- 0003720687
- Kluwer Academic Publishers
- D.P. Morgan and C.L. Scofield, Neural Networks and Speech Processing, Kluwer Academic Publishers, 1994.
- (1994) Neural Networks and Speech Processing
- Morgan, D.P.¹ Scofield, C.L.²

215
- 0003573244
- Kluwer Academic Publishers
- H.A. Bourlard and N. Morgan, Connectionist Speech Recognition, Kluwer Academic Publishers, 1994.
- (1994) Connectionist Speech Recognition
- Bourlard, H.A.¹ Morgan, N.²

216
- 85007838242
- Pitch dependent phone modeling for HMM-based speech recognition
- H. Singer and S. Sagayama, "Pitch dependent phone modeling for HMM-based speech recognition," J. Acoust. Soc. Japan (E), vol.15, no.2, pp.77-86, 1994.
- (1994) J. Acoust. Soc. Japan (E) , vol.15 , Issue.2 , pp. 77-86
- Singer, H.¹ Sagayama, S.²

217
- 85067723733
- Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing
- Dec.
- N. Minematsu and S. Nakagawa, "Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing," Proc. ICSLP, pp.2427-2430, Dec. 1998.
- (1998) Proc. ICSLP , pp. 2427-2430
- Minematsu, N.¹ Nakagawa, S.²

218
- 0028392167
- An application of recurrent nets to phone probability estimation
- A.J. Robinson, "An application of recurrent nets to phone probability estimation," IEEE Trans. Neural Networks, vol.5, no.2, pp.298-304, 1994.
- (1994) IEEE Trans. Neural Networks , vol.5 , Issue.2 , pp. 298-304
- Robinson, A.J.¹

219
- 0011403725
- Speech understanding and language model
- Nov.
- S. Nakagawa, "Speech understanding and language model," J. Signal Process., vol.2, no.6, pp.434-442, Nov. 1998.
- (1998) J. Signal Process. , vol.2 , Issue.6 , pp. 434-442
- Nakagawa, S.¹

220
- 0011450878
- Introduction to the special issue-some research problems on spoken dialogue systems
- Nov.
- S. Nakagawa, "Introduction to the special issue-some research problems on spoken dialogue systems," J. Acoust. Soc. Japan, vol.54, no.11, pp.783-790, Nov. 1998.
- (1998) J. Acoust. Soc. Japan , vol.54 , Issue.11 , pp. 783-790
- Nakagawa, S.¹

221
- 0003969334
- R. De Mori, ed., Academic Press
- R. De Mori, ed., Spoken Dialogue with Computers, Academic Press, 1998.
- (1998) Spoken Dialogue with Computers

222
- 85027158035
- HMM-based speaker recognition
- IEICE Technical Report, SP95-111, Jan.
- T. Matsui, "HMM-based speaker recognition," IEICE Technical Report, SP95-111, Jan. 1996.
- (1996)
- Matsui, T.¹

223
- 0031233424
- Speaker recognition: A tutorial
- J.P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol.85, no.9, pp.1437-1462, 1997.
- (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
- Campbell, J.P.¹

224
- 0004072715
- Marcel Dekker
- S. Furui, Digital Speech Processing, Synthesis and Recognition, Marcel Dekker, 1989.
- (1989) Digital Speech Processing, Synthesis and Recognition
- Furui, S.¹

225
- 0004158985
- Morikita-shuppan
- S. Imai, Speech Signal Processing, Morikita-shuppan, 1996.
- (1996) Speech Signal Processing
- Imai, S.¹

226
- 0011495831
- Morikita-shuppan
- K. Kita, S. Nakamura, and M. Nagata, Spoken Language Processing, Morikita-shuppan, 1996.
- (1996) Spoken Language Processing
- Kita, K.¹ Nakamura, S.² Nagata, M.³

227
- 0011451289
- Shokodo
- K. Shikano, S. Nakamura, and S. Ise, Digital Signal Processing for Speech and Sound, Shokodo, 1997.
- (1997) Digital Signal Processing for Speech and Sound
- Shikano, K.¹ Nakamura, S.² Ise, S.³

228
- 0011501682
- H. Tanaka, ed.; IEICE
- H. Tanaka, ed., Natural Language Processing - Fundamental and application -, IEICE, 1999.
- (1999) Natural Language Processing - Fundamental and Application -

229
- 0003473607
- Academic Press
- K. Fukunaga, Statistical Pattern Recognition, Second Edition, Academic Press, 1990.
- (1990) Statistical Pattern Recognition, Second Edition
- Fukunaga, K.¹

230
- 0003515694
- Dekker
- S. Furui and M.M. Sondhi, Advances in Speech Signal Processing, Dekker, 1991.
- (1991) Advances in Speech Signal Processing
- Furui, S.¹ Sondhi, M.M.²

231
- 0004199188
- MIT Press
- E. Charniak, Statistical Language Learning, MIT Press, 1993.
- (1993) Statistical Language Learning
- Charniak, E.¹

232
- 0003424145
- Macmillan
- J.R. Deller, J.G. Proakis, and J.H.L. Hansen, Discrete-Time Processing of Speech Signals, Macmillan, 1993.
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.G.² Hansen, J.H.L.³

233
- 0004244302
- Prentice Hall
- L.R. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, Prentice Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.-H.²

234
- 0003770711
- Kluwer Academic Pub.
- C.H. Lee, F.K. Soong, and K.K. Paliwal, Automatic Speech and Speaker Recognition, Kluwer Academic Pub., 1996.
- (1996) Automatic Speech and Speaker Recognition
- Lee, C.H.¹ Soong, F.K.² Paliwal, K.K.³

235
- 0003786003
- MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition, MIT Press, 1997.
- (1997) Statistical Methods for Speech Recognition
- Jelinek, F.¹

236
- 84944486544
- Prediction and entropy of printed English
- C.E. Shannon, "Prediction and entropy of printed English," Bell System Tech. J., vol.30, pp.50-64, 1951.
- (1951) Bell System Tech. J. , vol.30 , pp. 50-64
- Shannon, C.E.¹

237
- 0017994420
- A convergent gambling estimate of the entropy of English
- T.M. Cover and R.C. King, "A convergent gambling estimate of the entropy of English," IEEE Trans. Inf. Theory, vol.24, no.4, pp.413-421, 1978.
- (1978) IEEE Trans. Inf. Theory , vol.24 , Issue.4 , pp. 413-421
- Cover, T.M.¹ King, R.C.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.