메뉴 건너뛰기




Volumn E85-D, Issue 3, 2002, Pages 465-486

A survey on automatic speech recognition

Author keywords

Acoustic model; HMM; Language model; Ngram; Speech recognition

Indexed keywords

ACOUSTIC NOISE; ARTIFICIAL INTELLIGENCE; INFORMATION THEORY; LINGUISTICS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH ANALYSIS; SPEECH CODING; SPEECH SYNTHESIS;

EID: 0036522866     PISSN: 09168532     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (16)

References (237)
  • 2
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machine and humans
    • R.P. Lippmann, "Speech recognition by machine and humans," Speech Communication. vol.22, pp.1-15, 1997.
    • (1997) Speech Communication. , vol.22 , pp. 1-15
    • Lippmann, R.P.1
  • 4
    • 0011510426 scopus 로고    scopus 로고
    • Capabilities and limitations of stochastic language models
    • March
    • S. Nakagawa, "Capabilities and limitations of stochastic language models," Conf. Record, Acoust. Soc. Japan, pp.23-26, March 1998.
    • (1998) Conf. Record, Acoust. Soc. Japan , pp. 23-26
    • Nakagawa, S.1
  • 5
    • 0011458455 scopus 로고    scopus 로고
    • Relationship among perplexity word accuracy and phoneme accuracy, and drawback and modification of perplexity
    • S. Nakagawa, "Relationship among perplexity word accuracy and phoneme accuracy, and drawback and modification of perplexity," Proc. First Int. Workshop East Asian Language Resources and Evaluation, pp.123-128, 1998.
    • (1998) Proc. First Int. Workshop East Asian Language Resources and Evaluation , pp. 123-128
    • Nakagawa, S.1
  • 6
    • 0011450087 scopus 로고    scopus 로고
    • Robust speech recognition using HMM's with Toplitz state covariance matrices
    • W.J.J. Roberts and Y. Ephraim, "Robust speech recognition using HMM's with Toplitz state covariance matrices," Proc. ICSLP, pp.369-372, 1998.
    • (1998) Proc. ICSLP , pp. 369-372
    • Roberts, W.J.J.1    Ephraim, Y.2
  • 11
    • 85009114626 scopus 로고    scopus 로고
    • Relationship among speaking style, inter-phoneme's distance and speech recognition performance
    • K. Yamamoto and S. Nakagawa, "Relationship among speaking style, inter-phoneme's distance and speech recognition performance," Proc. ICSLP, pp.859-862, 2000.
    • (2000) Proc. ICSLP , pp. 859-862
    • Yamamoto, K.1    Nakagawa, S.2
  • 12
    • 0000940883 scopus 로고    scopus 로고
    • Acoustic signal processing techniques for robust speech recognition
    • S. Nakagawa, "Acoustic signal processing techniques for robust speech recognition," J. Acoust. Soc. Japan, vol.53, no.11, pp.864-871, 1997.
    • (1997) J. Acoust. Soc. Japan , vol.53 , Issue.11 , pp. 864-871
    • Nakagawa, S.1
  • 13
    • 0030779363 scopus 로고    scopus 로고
    • Noise compensation methods for hidden Markov model speech recognition in adverse environments
    • S.V. Vaseghi and B.P. Molner, "Noise compensation methods for hidden Markov model speech recognition in adverse environments," IEEE Trans. Speech Audio Process., vol.5, no.1, pp.11-21, 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.1 , pp. 11-21
    • Vaseghi, S.V.1    Molner, B.P.2
  • 14
    • 0023263708 scopus 로고
    • Multi-style training for robust isolated-word speech recognition
    • R.P. Lippmann, E.A. Martin, and D.B. Paul, "Multi-style training for robust isolated-word speech recognition," Proc. ICASSP, pp.705-708, 1987.
    • (1987) Proc. ICASSP , pp. 705-708
    • Lippmann, R.P.1    Martin, E.A.2    Paul, D.B.3
  • 15
    • 0022181749 scopus 로고
    • Some acoustic-phonetic correlates of speech produced in noise
    • D. Pisoni, R. Bernacki, H. Nusbaum, and M. Yuchtman, "Some acoustic-phonetic correlates of speech produced in noise," Proc. ICASSP, pp.1581-1584, 1985.
    • (1985) Proc. ICASSP , pp. 1581-1584
    • Pisoni, D.1    Bernacki, R.2    Nusbaum, H.3    Yuchtman, M.4
  • 16
    • 0011496722 scopus 로고    scopus 로고
    • Normalizing lombard speech under different conditions
    • July
    • A. Wakao, K. Takeda, and F. Itakura, "Normalizing Lombard speech under different conditions," IEICE Trans., vol.J80-D-II, no.7, pp.1643-1650, July 1997.
    • (1997) IEICE Trans. , vol.J80-D-II , Issue.7 , pp. 1643-1650
    • Wakao, A.1    Takeda, K.2    Itakura, F.3
  • 17
    • 0029345416 scopus 로고
    • A comparison of signal processing front ends for automatic word recognition
    • C.R. Jankowski, H.-D.H. Vo, and R.P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech & Audio Process., vol.3, no.4, pp.286-292, 1995.
    • (1995) IEEE Trans. Speech & Audio Process. , vol.3 , Issue.4 , pp. 286-292
    • Jankowski, C.R.1    Vo, H.-D.H.2    Lippmann, R.P.3
  • 20
    • 0022667694 scopus 로고    scopus 로고
    • Speaker independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech & Signal Process., vol.34, no.1, pp.52-59, 1999.
    • (1999) IEEE Trans. Acoust. Speech & Signal Process. , vol.34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 21
    • 0032676337 scopus 로고    scopus 로고
    • On the relative importance of various components of the modulation spectrum for automatic speech recognition
    • N. Kanadera, T. Arai, H. Hermansky, and M. Pavel, "On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Communication, vol.28, pp.43-55, 1999.
    • (1999) Speech Communication , vol.28 , pp. 43-55
    • Kanadera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 22
    • 0031221099 scopus 로고    scopus 로고
    • Filtering the time sequences of spectral parameters for speech recognition
    • C. Nadeu, P.P. Leal, and B.-H. Juang, "Filtering the time sequences of spectral parameters for speech recognition," Speech Communication, vol.22, pp.315-332, 1997.
    • (1997) Speech Communication , vol.22 , pp. 315-332
    • Nadeu, C.1    Leal, P.P.2    Juang, B.-H.3
  • 23
    • 0011468569 scopus 로고    scopus 로고
    • An evaluation of mel-LPC cepstrum in noisy speech recognition
    • Y. Nakatoh and H. Matsumoto, "An evaluation of mel-LPC cepstrum in noisy speech recognition," Conf. Record, Acoust. Soc. Japan, pp.23-24, 1999.
    • (1999) Conf. Record, Acoust. Soc. Japan , pp. 23-24
    • Nakatoh, Y.1    Matsumoto, H.2
  • 25
    • 0011498037 scopus 로고    scopus 로고
    • A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition
    • J. Chen, B. Xu, and T. Huang, "A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition," Proc. ICASSP, pp.629-632, 1998.
    • (1998) Proc. ICASSP , pp. 629-632
    • Chen, J.1    Xu, B.2    Huang, T.3
  • 26
    • 0031176764 scopus 로고    scopus 로고
    • Hidden Markov model-based speech recognition with intermediate wavelet transform domains
    • R. Singh, K. Davis, and P.V.S. Rao, "Hidden Markov model-based speech recognition with intermediate wavelet transform domains," Computer Speech and Language, vol.11, pp.252-273, 1997.
    • (1997) Computer Speech and Language , vol.11 , pp. 252-273
    • Singh, R.1    Davis, K.2    Rao, P.V.S.3
  • 27
    • 0026189808 scopus 로고
    • Speech recognition in adverse environments
    • B.H. Juang, "Speech recognition in adverse environments," Computer Speech Language, vol.5, pp.275-294, 1991.
    • (1991) Computer Speech Language , vol.5 , pp. 275-294
    • Juang, B.H.1
  • 28
    • 33947656987 scopus 로고
    • Speech recognition in noise using a projection based likelihood measure for mixture density HMM's
    • B.A. Carlson and M.A. Clements, "Speech recognition in noise using a projection based likelihood measure for mixture density HMM's," Proc. ICASSP, vol.I, pp.237-240, 1992.
    • (1992) Proc. ICASSP , vol.1 , pp. 237-240
    • Carlson, B.A.1    Clements, M.A.2
  • 29
    • 0032116602 scopus 로고    scopus 로고
    • A novel projection-based likelihood measure for noisy speech recognition
    • J.-T. Chien, H.-C. Wang, and L.-M. Lee, "A novel projection-based likelihood measure for noisy speech recognition," Speech Communication, vol.24, pp.287-297, 1998.
    • (1998) Speech Communication , vol.24 , pp. 287-297
    • Chien, J.-T.1    Wang, H.-C.2    Lee, L.-M.3
  • 30
    • 0032203256 scopus 로고    scopus 로고
    • Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method
    • S. Katagiri, B.-H. Juang, and C.-H. Lee, "Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method," Proc. IEEE, vol.86, no.11, pp.2345-2372, 1998.
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2345-2372
    • Katagiri, S.1    Juang, B.-H.2    Lee, C.-H.3
  • 32
    • 0001286647 scopus 로고
    • Minimum classification error training algorithm for feature extractor and pattern classification in speech recognition
    • K.K. Paliwal, M. Bacchiami, and Y. Sagisaka, "Minimum classification error training algorithm for feature extractor and pattern classification in speech recognition," Proc. EuroSpeech, pp.541-545, 1995.
    • (1995) Proc. EuroSpeech , pp. 541-545
    • Paliwal, K.K.1    Bacchiami, M.2    Sagisaka, Y.3
  • 33
    • 0032674196 scopus 로고    scopus 로고
    • Feature extraction for speech recognition based on orthogonal acoustic - Feature planes and LDA
    • T. Nitta, "Feature extraction for speech recognition based on orthogonal acoustic - Feature planes and LDA," Proc. ICASSP, pp.421-424, 1999.
    • (1999) Proc. ICASSP , pp. 421-424
    • Nitta, T.1
  • 34
    • 84893207073 scopus 로고
    • Continuous speech recognition in noise using spectral subtraction and HMM adaptation
    • J.A.N. Flores and S.J. Young, "Continuous speech recognition in noise using spectral subtraction and HMM adaptation," Proc. ICASSP, vol.I, pp.409-412, 1994.
    • (1994) Proc. ICASSP , vol.1 , pp. 409-412
    • Flores, J.A.N.1    Young, S.J.2
  • 35
    • 11044237174 scopus 로고    scopus 로고
    • An evaluation of speech enhancement approach E-CMN/CSS for speech recognition
    • Jan.
    • M. Shozakai, S. Nakamura, and K. Shikano, "An evaluation of speech enhancement approach E-CMN/CSS for speech recognition," IEICE Trans., vol.J81-D, no.1, pp.1-9, Jan. 1998.
    • (1998) IEICE Trans. , vol.J81-D , Issue.1 , pp. 1-9
    • Shozakai, M.1    Nakamura, S.2    Shikano, K.3
  • 36
    • 0026882842 scopus 로고
    • Experiments with a nonlinear spectral subtractor (NSS), hidden Markov model and the projection, for robust speech recognition in cars
    • P. Lockwood and J. Boudy, "Experiments with a nonlinear spectral subtractor (NSS), hidden Markov model and the projection, for robust speech recognition in cars," Speech Communication, vol.11, pp.215-228, 1992.
    • (1992) Speech Communication , vol.11 , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 37
    • 0030711159 scopus 로고    scopus 로고
    • Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification
    • D. Hardt and K. Fellbaum, "Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification," Proc. ICASSP, pp.867-870, 1997.
    • (1997) Proc. ICASSP , pp. 867-870
    • Hardt, D.1    Fellbaum, K.2
  • 38
    • 0011498039 scopus 로고    scopus 로고
    • A smoothing method of time direction on speech recognition under noisy environments using spectral subtraction
    • N. Kitaoka, I. Akahori, and S. Nakagawa, "A smoothing method of time direction on speech recognition under noisy environments using spectral subtraction," Proc. Int. Conf. Speech Processing, pp.381-386, 1999.
    • (1999) Proc. Int. Conf. Speech Processing , pp. 381-386
    • Kitaoka, N.1    Akahori, I.2    Nakagawa, S.3
  • 39
    • 0011464161 scopus 로고    scopus 로고
    • Improved robust speech recognition considering signal correlation approximated by Tayler series
    • J.-L. Shen, J.-W. Hung, and L.-S. Lee, "Improved robust speech recognition considering signal correlation approximated by Tayler series," Proc. ICSLP, pp.1499-1502, 1998.
    • (1998) Proc. ICSLP , pp. 1499-1502
    • Shen, J.-L.1    Hung, J.-W.2    Lee, L.-S.3
  • 40
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • A.P. Varga and R.K. Moore, "Hidden Markov model decomposition of speech and noise," Proc. ICASSP, pp.845-848, 1990.
    • (1990) Proc. ICASSP , pp. 845-848
    • Varga, A.P.1    Moore, R.K.2
  • 41
    • 0027622731 scopus 로고
    • Cepstral parameter compensation for HMM recognition in noise
    • M.J.F. Gales and S.J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Communication, vol.12, pp.231-239, 1993.
    • (1993) Speech Communication , vol.12 , pp. 231-239
    • Gales, M.J.F.1    Young, S.J.2
  • 42
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • M.J.F. Gales and S.J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech & Audio Process., vol.4, pp.352-359, 1996.
    • (1996) IEEE Trans. Speech & Audio Process. , vol.4 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 43
    • 0003524869 scopus 로고
    • Recognition of noisy speech by composition of hidden Markov models
    • IEICE Technical Report, SP92-96
    • F. Martin, K. Shikano, Y. Minami, and Y. Okabe, "Recognition of noisy speech by composition of hidden Markov models," IEICE Technical Report, SP92-96, 1992.
    • (1992)
    • Martin, F.1    Shikano, K.2    Minami, Y.3    Okabe, Y.4
  • 44
    • 0011400310 scopus 로고    scopus 로고
    • Robust HMM to variation of noisy environments based on variance extension of noisy models
    • H. Matsumoto and H. Ubukata, "Robust HMM to variation of noisy environments based on variance extension of noisy models," Proc. EuroSpeech, pp.2387-2390, 1999.
    • (1999) Proc. EuroSpeech , pp. 2387-2390
    • Matsumoto, H.1    Ubukata, H.2
  • 45
    • 0032623471 scopus 로고    scopus 로고
    • Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences
    • K.H. You and H.-C. Wang, "Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences," Speech Communication, vol.28, pp.13-24, 1999.
    • (1999) Speech Communication , vol.28 , pp. 13-24
    • You, K.H.1    Wang, H.-C.2
  • 46
    • 0011448901 scopus 로고    scopus 로고
    • HMM composition of segmental unit input HMM for noisy speech recognition
    • K. Yamamoto and S. Nakagawa, "HMM composition of segmental unit input HMM for noisy speech recognition," Proc. EuroSpeech, pp.2865-2868, 1999.
    • (1999) Proc. EuroSpeech , pp. 2865-2868
    • Yamamoto, K.1    Nakagawa, S.2
  • 47
    • 0011406317 scopus 로고    scopus 로고
    • Difference in speech recognition performance caused by difference in front-end devices and its compensations
    • K. Yamamoto and S. Nakagawa, "Difference in speech recognition performance caused by difference in front-end devices and its compensations," Proc. 7th Western Pacific Regional Acoust. Conf., pp.85-88, 2000.
    • (2000) Proc. 7th Western Pacific Regional Acoust. Conf. , pp. 85-88
    • Yamamoto, K.1    Nakagawa, S.2
  • 48
    • 0011501273 scopus 로고    scopus 로고
    • Real-time cepstrum mean subtraction using the most likely partial state sequence
    • March
    • S. Kuroiwa, T. Kato, and N. Higuchi, "Real-time cepstrum mean subtraction using the most likely partial state sequence," IEICE Trans., vol.J82-D-II, no.3, pp.332-339, March 1999.
    • (1999) IEICE Trans. , vol.J82-D-II , Issue.3 , pp. 332-339
    • Kuroiwa, S.1    Kato, T.2    Higuchi, N.3
  • 49
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.190-202, 1996.
    • (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.5 , pp. 190-202
    • Sankar, A.1    Lee, C.H.2
  • 50
    • 0029369804 scopus 로고
    • Rapid environment adaptation for speech recognition
    • K. Takagi, H. Hattori, and T. Watanabe, "Rapid environment adaptation for speech recognition," J. Acoust. Soc. Japan, (E), vol.16, no.5, pp.273-281, 1995.
    • (1995) J. Acoust. Soc. Japan, (E) , vol.16 , Issue.5 , pp. 273-281
    • Takagi, K.1    Hattori, H.2    Watanabe, T.3
  • 51
    • 0011510430 scopus 로고
    • An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation
    • Y. Tsurumi and S. Nakagawa, "An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation," Proc. IC-SLP, pp.431-434, 1994.
    • (1994) Proc. IC-SLP , pp. 431-434
    • Tsurumi, Y.1    Nakagawa, S.2
  • 52
    • 0011410507 scopus 로고
    • Acoustical and environmental robustness
    • Kluwer Academic Pub., Dordrecht
    • A. Acero, "Acoustical and Environmental Robustness," in Automatic Speech Recognition, Kluwer Academic Pub., Dordrecht, 1993.
    • (1993) Automatic Speech Recognition
    • Acero, A.1
  • 53
    • 0032116601 scopus 로고    scopus 로고
    • Data-driven environmental compensation for speech recognition a unified approach
    • P.J. Moreno, B. Raj, and R.M. Stern, "Data-driven environmental compensation for speech recognition a unified approach," Speech Communication, vol.24, pp.267-285, 1998.
    • (1998) Speech Communication , vol.24 , pp. 267-285
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 54
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," Proc. ICASSP, pp.733-736, 1996.
    • (1996) Proc. ICASSP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 55
    • 0032048385 scopus 로고    scopus 로고
    • Speech recognition in noisy environments using first-order vector Taylor series
    • D.Y. Kim, C.K. Un, and N.S. Kim, "Speech recognition in noisy environments using first-order vector Taylor series," Speech Communication, vol.24, no.1, pp.39-49, 1998.
    • (1998) Speech Communication , vol.24 , Issue.1 , pp. 39-49
    • Kim, D.Y.1    Un, C.K.2    Kim, N.S.3
  • 56
    • 0011496725 scopus 로고    scopus 로고
    • HMM adaptation method for noise and distortion by maximizing likelihood
    • July
    • Y. Minami and S. Furui, "HMM adaptation method for noise and distortion by maximizing likelihood," IEICE Trans., vol.J80-A, no.7, pp.1179-1186, July 1997.
    • (1997) IEICE Trans. , vol.J80-A , Issue.7 , pp. 1179-1186
    • Minami, Y.1    Furui, S.2
  • 57
    • 0032203405 scopus 로고    scopus 로고
    • A general joint additive and convolutive bias approach applied to noisy lombard speech recognition
    • M. Afify, Y. Gong, and J.P. Haton, "A general joint additive and convolutive bias approach applied to noisy lombard speech recognition," IEEE Trans. Speech & Audio Process., vol.6, no.6, pp.524-537, 1998.
    • (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.6 , pp. 524-537
    • Afify, M.1    Gong, Y.2    Haton, J.P.3
  • 58
    • 0035249243 scopus 로고    scopus 로고
    • HMM - Separation-based speech recognition for a distant moving speaker
    • T. Takiguchi, S. Nakamura, and K. Shikano, "HMM - Separation-based speech recognition for a distant moving speaker," IEEE Trans. Speech & Audio Process., vol.9, no.3, pp.127-140, 2001.
    • (2001) IEEE Trans. Speech & Audio Process. , vol.9 , Issue.3 , pp. 127-140
    • Takiguchi, T.1    Nakamura, S.2    Shikano, K.3
  • 59
    • 0032139769 scopus 로고    scopus 로고
    • Automatic segmentation of speech recorded in unknown noisy channel characteristics
    • B.L. Pallon and J.H.L. Hansen, "Automatic segmentation of speech recorded in unknown noisy channel characteristics," Speech Communication, vol.25, no.1-3, pp.97-116, 1998.
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 97-116
    • Pallon, B.L.1    Hansen, J.H.L.2
  • 60
    • 0011448902 scopus 로고
    • Japanese phoneme recognition using continuous parameter hidden Markov models
    • June
    • S. Nakagawa, Y. Hirata, and Y. Hashimoto, "Japanese phoneme recognition using continuous parameter hidden Markov models," J. Acoust. Soc. Japan, vol.46, no.6, pp.486-496, June 1990.
    • (1990) J. Acoust. Soc. Japan , vol.46 , Issue.6 , pp. 486-496
    • Nakagawa, S.1    Hirata, Y.2    Hashimoto, Y.3
  • 61
    • 0023211284 scopus 로고
    • Integration of acoustic information in a large vocabulary word recognizer
    • V.N. Gupta, M. Lennig, and P. Mermelstein, "Integration of acoustic information in a large vocabulary word recognizer," ICASSP, vol.II, pp.697-700, 1987.
    • (1987) ICASSP , vol.2 , pp. 697-700
    • Gupta, V.N.1    Lennig, M.2    Mermelstein, P.3
  • 62
    • 20344368952 scopus 로고
    • Hidden Markov model embedded dynamic features of speech spectrum
    • Feb.
    • E. Tsuboka and J. Nakahashi, "Hidden Markov model embedded dynamic features of speech spectrum," IEICE Trans., vol.J77-A, no.2, pp.162-172, Feb. 1994.
    • (1994) IEICE Trans. , vol.J77-A , Issue.2 , pp. 162-172
    • Tsuboka, E.1    Nakahashi, J.2
  • 63
    • 0029325484 scopus 로고
    • Neural predictive hidden Markov model for speech recognition
    • June
    • E. Tsuboka and Y. Takada, "Neural predictive hidden Markov model for speech recognition," IEICE Trans., Inf. & Syst., vol.E78-D, no.6, pp.676-684, June 1995.
    • (1995) IEICE Trans., Inf. & Syst. , vol.E78-D , Issue.6 , pp. 676-684
    • Tsuboka, E.1    Takada, Y.2
  • 64
    • 84911676598 scopus 로고
    • Linear and nonlinear prediction for speech recognition with hidden Markov models
    • M. Saerens and H. Bourlard, "Linear and nonlinear prediction for speech recognition with hidden Markov models," Proc. EuroSpeech, pp.807-810, 1993.
    • (1993) Proc. EuroSpeech , pp. 807-810
    • Saerens, M.1    Bourlard, H.2
  • 65
    • 0030262262 scopus 로고    scopus 로고
    • An MLP/HMM hybrid model using linear predictors
    • Y.J. Chung and C.K. Un, "An MLP/HMM hybrid model using linear predictors," Speech Communication, vol.19, pp.307-316, 1996.
    • (1996) Speech Communication , vol.19 , pp. 307-316
    • Chung, Y.J.1    Un, C.K.2
  • 66
    • 0028516022 scopus 로고
    • Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
    • L. Deng, M. Aksmanoric, X. Sun, and C.F.J. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech Audio & Process., vol.2, no.4, pp.507-520, 1994.
    • (1994) IEEE Trans. Speech Audio & Process. , vol.2 , Issue.4 , pp. 507-520
    • Deng, L.1    Aksmanoric, M.2    Sun, X.3    Wu, C.F.J.4
  • 67
    • 0011495820 scopus 로고
    • Speech recognition by hidden Markov model using segmental statistics
    • IEICE Technical Report, SP90-69
    • Y. Hirata, I. Hayakawa, Y. Ono, and S. Nakagawa, "Speech recognition by hidden Markov model using segmental statistics," IEICE Technical Report, SP90-69, 1990.
    • (1990)
    • Hirata, Y.1    Hayakawa, I.2    Ono, Y.3    Nakagawa, S.4
  • 68
    • 0011400311 scopus 로고
    • Syllable recognition by hidden Markov model using fixed-length segmental statistics
    • May
    • S. Nakagawa, Y. Hirata, and Y. Ono, "Syllable recognition by hidden Markov model using fixed-length segmental statistics," IEICE Trans., vol.J75-D-II, no.5, pp.843-851, May 1992.
    • (1992) IEICE Trans. , vol.J75-D-II , Issue.5 , pp. 843-851
    • Nakagawa, S.1    Hirata, Y.2    Ono, Y.3
  • 69
    • 0000321310 scopus 로고
    • Explicit correlation in hidden Markov model for speech recognition
    • C.J. Wellekens, "Explicit correlation in hidden Markov model for speech recognition," Proc. ICASSP, vol.I, pp.383-386, 1987.
    • (1987) Proc. ICASSP , vol.1 , pp. 383-386
    • Wellekens, C.J.1
  • 70
    • 0030261616 scopus 로고    scopus 로고
    • Modelling of the interframe dependence in an HMM using conditional Gaussian mixtures
    • J. Ming and F.J. Smith, "Modelling of the interframe dependence in an HMM using conditional Gaussian mixtures," Computer Speech and Language, vol.10, pp.229-242, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 229-242
    • Ming, J.1    Smith, F.J.2
  • 71
    • 0027167185 scopus 로고
    • A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
    • K. Aikawa, H. Singer, H. Kawakara, and Y. Tohkura, "A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition," Proc. ICASSP, pp.668-671, 1993.
    • (1993) Proc. ICASSP , pp. 668-671
    • Aikawa, K.1    Singer, H.2    Kawakara, H.3    Tohkura, Y.4
  • 72
    • 0011453543 scopus 로고
    • Comparative evaluation of segmental unit input HMM and conditional density HMM
    • K. Yamamoto and S. Nakagawa, "Comparative evaluation of segmental unit input HMM and conditional density HMM," Proc. EuroSpeech, pp.1615-1618, 1995.
    • (1995) Proc. EuroSpeech , pp. 1615-1618
    • Yamamoto, K.1    Nakagawa, S.2
  • 73
    • 85128367481 scopus 로고    scopus 로고
    • Continuous speech recognition using segmental unit input HMM with a mixture of probability density functions and context dependency
    • K. Hanai, K. Yamamoto, N. Minematsu, and S. Nakagawa, "Continuous speech recognition using segmental unit input HMM with a mixture of probability density functions and context dependency," Proc. ICSLP, pp.2935-2938, 1998.
    • (1998) Proc. ICSLP , pp. 2935-2938
    • Hanai, K.1    Yamamoto, K.2    Minematsu, N.3    Nakagawa, S.4
  • 74
    • 0011494012 scopus 로고
    • Speaker-independent phoneme and word recognition by statistical classification methods for time-sequential patterns
    • Oct.
    • S. Nakagawa and Y. Enomoto "Speaker-independent phoneme and word recognition by statistical classification methods for time-sequential patterns," IEICE Trans., vol.J71-D, no.10, pp.1977-1983, Oct. 1988.
    • (1988) IEICE Trans. , vol.J71-D , Issue.10 , pp. 1977-1983
    • Nakagawa, S.1    Enomoto, Y.2
  • 75
    • 84926271491 scopus 로고
    • Recognition on unvoiced plosive using time spectrum pattern
    • May
    • K. Ide, S. Makino, and K. Kido, "Recognition on unvoiced plosive using time spectrum pattern," J. Acoust. Soc. Japan, vol.39, no.5, pp.321-329, May 1983.
    • (1983) J. Acoust. Soc. Japan , vol.39 , Issue.5 , pp. 321-329
    • Ide, K.1    Makino, S.2    Kido, K.3
  • 76
    • 0024900279 scopus 로고
    • A stochastic segment model for phoneme-based continuous speech recognition
    • M. Ostendorf and S. Roukos, "A stochastic segment model for phoneme-based continuous speech recognition," IEEE Trans. Acoust., Speech & Signal Process., vol.37, no.12, pp.1857-1869, 1989.
    • (1989) IEEE Trans. Acoust., Speech & Signal Process. , vol.37 , Issue.12 , pp. 1857-1869
    • Ostendorf, M.1    Roukos, S.2
  • 77
    • 0025594074 scopus 로고
    • Connectionist Viterbi training a new hybrid for continuous speech recognition
    • M. Franzini and K.-F. Lee, "Connectionist Viterbi training a new hybrid for continuous speech recognition," Proc. ICASSP, vol.I, pp.425-428, 1990.
    • (1990) Proc. ICASSP , vol.1 , pp. 425-428
    • Franzini, M.1    Lee, K.-F.2
  • 79
    • 77954383749 scopus 로고    scopus 로고
    • Data-driven extensions to HMM statistical dependencies
    • J.A. Bilmes, "Data-driven extensions to HMM statistical dependencies," Proc. ICSLP, pp.69-72, 1998.
    • (1998) Proc. ICSLP , pp. 69-72
    • Bilmes, J.A.1
  • 80
    • 0011498040 scopus 로고    scopus 로고
    • Inter-frame dependence arising from preceding and succeeding frames - Application to speech recognition
    • P. Hanna, J. Ming, and F.J. Smith, "Inter-frame dependence arising from preceding and succeeding frames - Application to speech recognition," Speech Communication, vol.31, no.4, pp.1301-1312, 1999.
    • (1999) Speech Communication , vol.31 , Issue.4 , pp. 1301-1312
    • Hanna, P.1    Ming, J.2    Smith, F.J.3
  • 82
    • 0028996957 scopus 로고
    • A unified way in incorporating segmental feature and segmental model into HMM
    • J. He and H. Leich, "A unified way in incorporating segmental feature and segmental model into HMM," Proc. ICASSP, vol.I, pp.532-535, 1995.
    • (1995) Proc. ICASSP , vol.1 , pp. 532-535
    • He, J.1    Leich, J.2
  • 83
    • 85027200620 scopus 로고    scopus 로고
    • The property of asymmetric segment
    • IEICE Technical Report, SP98-30
    • T. Ohtuki and T. Ohtomo, "The property of asymmetric segment," IEICE Technical Report, SP98-30, 1998.
    • (1998)
    • Ohtuki, T.1    Ohtomo, T.2
  • 84
    • 0030245363 scopus 로고    scopus 로고
    • From HMMs to segment models: A unified view of stochastic modeling for speech recognition
    • M. Ostendonf, V.V. Digalakis, and O.A. Kimball, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech & Audio Process., vol.4, no.5, pp.360-378, 1996.
    • (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.5 , pp. 360-378
    • Ostendonf, M.1    Digalakis, V.V.2    Kimball, O.A.3
  • 85
    • 0032048095 scopus 로고    scopus 로고
    • Assessing the importance of the segmentation probability in segment-based speech recognition
    • J. Verhasselt, I. Illina, J.P. Martens, Y. Gong, and J.-P. Haton, "Assessing the importance of the segmentation probability in segment-based speech recognition," Speech Communication, vol.24, pp.51-72, 1998.
    • (1998) Speech Communication , vol.24 , pp. 51-72
    • Verhasselt, J.1    Illina, I.2    Martens, J.P.3    Gong, Y.4    Haton, J.-P.5
  • 86
    • 0023846644 scopus 로고
    • Stochastic segment modeling using the estimate-maximize algorithm
    • S. Rocous, M. Ostendorf, H. Gish, and A. Derr, "Stochastic segment modeling using the estimate-maximize algorithm," Proc. ICASSP, pp.127-130, 1988.
    • (1988) Proc. ICASSP , pp. 127-130
    • Rocous, S.1    Ostendorf, M.2    Gish, H.3    Derr, A.4
  • 87
    • 0031185482 scopus 로고    scopus 로고
    • Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions
    • L. Deng and M. Aksmanovic, "Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions," IEEE Trans. Speech & Audio Process., vol.5, no.4, pp.319-324, 1997.
    • (1997) IEEE Trans. Speech & Audio Process. , vol.5 , Issue.4 , pp. 319-324
    • Deng, L.1    Aksmanovic, M.2
  • 88
    • 0032206267 scopus 로고    scopus 로고
    • Speech trajectory discrimination using the minimum classification error learning
    • R. Chengalvara and L. Deng, "Speech trajectory discrimination using the minimum classification error learning," IEEE Trans. Speech & Audio Process., vol.6, no.6, pp.505-515, 1998.
    • (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.6 , pp. 505-515
    • Chengalvara, R.1    Deng, L.2
  • 90
    • 0034478708 scopus 로고    scopus 로고
    • Improving phoneme classification performance using observation context-dependent segment models
    • M. Szarras and S. Matsunaga, "Improving phoneme classification performance using observation context-dependent segment models," Int. J. Speech Technology, vol.3, pp.253-262, 2000.
    • (2000) Int. J. Speech Technology , vol.3 , pp. 253-262
    • Szarras, M.1    Matsunaga, S.2
  • 91
    • 0027681974 scopus 로고
    • ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
    • V. Digalakis, J.R. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech & Audio Process., vol.1, no.4, pp.431-442, 1993.
    • (1993) IEEE Trans. Speech & Audio Process. , vol.1 , Issue.4 , pp. 431-442
    • Digalakis, V.1    Rohlicek, J.R.2    Ostendorf, M.3
  • 92
    • 0011458458 scopus 로고
    • Kalman-filter solved by personal computer
    • Maruzen
    • M. Nakano and K. Nishiyama, Kalman-filter solved by personal computer, Maruzen 1993.
    • (1993)
    • Nakano, M.1    Nishiyama, K.2
  • 93
    • 0011432608 scopus 로고
    • Time series analysis programming
    • Iwanami shoten
    • G. Kitagawa, Time series analysis programming, Iwanami shoten, 1993.
    • (1993)
    • Kitagawa, G.1
  • 94
    • 0029755019 scopus 로고    scopus 로고
    • Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition
    • M. Afify, Y. Gong, and J.-P. Haton, "Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition," Computer Speech and Language, vol.10, pp.23-36, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 23-36
    • Afify, M.1    Gong, Y.2    Haton, J.-P.3
  • 95
    • 0011450090 scopus 로고
    • Constraining model duration variance in HMM-based connected speech recognition
    • M.M. Hochberg and H.F. Silverman, "Constraining model duration variance in HMM-based connected speech recognition," Proc. EuroSpeech, pp.323-326, 1993.
    • (1993) Proc. EuroSpeech , pp. 323-326
    • Hochberg, M.M.1    Silverman, H.F.2
  • 96
    • 0029368174 scopus 로고
    • Nonstationary hidden Markov model
    • B. Sin and J.H. Kim, "Nonstationary hidden Markov model," Signal Processing, vol.46, pp.31-46, 1995.
    • (1995) Signal Processing , vol.46 , pp. 31-46
    • Sin, B.1    Kim, J.H.2
  • 97
    • 0030247529 scopus 로고    scopus 로고
    • Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities
    • Y.K. Park, C.K. Un, and O.W. Kwon, "Modeling acoustic transitions in speech by modified hidden Markov models with state duration and state duration-dependent observation probabilities" IEEE Trans. Speech & Audio Process, vol.4, no.5, pp.389-392, 1996.
    • (1996) IEEE Trans. Speech & Audio Process , vol.4 , Issue.5 , pp. 389-392
    • Park, Y.K.1    Un, C.K.2    Kwon, O.W.3
  • 100
    • 0011453546 scopus 로고
    • Recognition of spoken words based on VCV syllable unit
    • May
    • R. Nakatsu and M. Kohda, "Recognition of spoken words based on VCV syllable unit," IEICE Trans., vol.J61-A, no.5, pp.464-471, May 1978.
    • (1978) IEICE Trans. , vol.J61-A , Issue.5 , pp. 464-471
    • Nakatsu, R.1    Kohda, M.2
  • 101
    • 0022185407 scopus 로고
    • Context-dependent modeling for acoustic-phonetic recognition of continuous speech
    • R. Schawartz, Y. Chow, O. Kimball, S. Roucos, M. Krasner, and J. Makhoul, "Context-dependent modeling for acoustic-phonetic recognition of continuous speech," Proc., ICASSP, pp.1203-1208, 1985.
    • (1985) Proc., ICASSP , pp. 1203-1208
    • Schawartz, R.1    Chow, Y.2    Kimball, O.3    Roucos, S.4    Krasner, M.5    Makhoul, J.6
  • 104
    • 0011453547 scopus 로고
    • Comparison of syntax-oriented spoken Japanese understanding with semantic-oriented system
    • July
    • S. Nakagawa, Y. Hirata, I. Murase, and T. Tanoue, "Comparison of syntax-oriented spoken Japanese understanding with semantic-oriented system," IEICE Trans., vol.E74, no.7, pp.1854-1862, July 1991.
    • (1991) IEICE Trans. , vol.E74 , Issue.7 , pp. 1854-1862
    • Nakagawa, S.1    Hirata, Y.2    Murase, I.3    Tanoue, T.4
  • 105
    • 0024889251 scopus 로고    scopus 로고
    • Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data
    • T. Watanabe, "Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data," Proc. ICASSP, S1.1, 1985.
    • Proc. ICASSP, S1.1, 1985.
    • Watanabe, T.1
  • 106
    • 0011448906 scopus 로고
    • Multivariate statistical analysis of VCV syllables
    • Jan.
    • T. Sakai and K. Tabata, "Multivariate statistical analysis of VCV syllables," IEICE Trans., vol.56-D, no.1, pp.63-70, Jan. 1973.
    • (1973) IEICE Trans. , vol.56 D , Issue.1 , pp. 63-70
    • Sakai, T.1    Tabata, K.2
  • 107
    • 34248800020 scopus 로고
    • Mora or syllable? Speech segmentation in Japanese
    • T. Otake, G. Hatano, G. Culter, and J. Mehler, "Mora or syllable? Speech segmentation in Japanese," J. Mem. Lang, vol.32, pp.358-378, 1993.
    • (1993) J. Mem. Lang , vol.32 , pp. 358-378
    • Otake, T.1    Hatano, G.2    Culter, G.3    Mehler, J.4
  • 109
    • 0003462715 scopus 로고
    • Hidden Markov model for speech recognition
    • Edinburgh University Press
    • X.D. Xuang, Y. Ariki, and M.A. Jack, Hidden Markov model for speech recognition, Edinburgh University Press, 1990.
    • (1990)
    • Xuang, X.D.1    Ariki, Y.2    Jack, M.A.3
  • 110
    • 85015539783 scopus 로고
    • Subphonetic modeling with Markov states-SENONE
    • M.-Y. Hwang, and X. Huang, "Subphonetic modeling with Markov states-SENONE," Proc. ICASSP, pp.33-36, 1992.
    • (1992) Proc. ICASSP , pp. 33-36
    • Hwang, M.-Y.1    Huang, X.2
  • 111
    • 0030193422 scopus 로고    scopus 로고
    • Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers
    • V.V. Digalakis, P. Monaco, and H. Murveit, "Genones: Generalized mixture tying in continuous hidden Markov model-based speech recognizers," IEEE Trans. Speech & Audio Process., vol.4, no.4, pp.281-288, 1996.
    • (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.4 , pp. 281-288
    • Digalakis, V.V.1    Monaco, P.2    Murveit, H.3
  • 112
    • 0028530231 scopus 로고
    • State clustering in hidden Markov-based continuous speech recognition
    • S.J. Young and P.C. Woodland, "State clustering in hidden Markov-based continuous speech recognition," Computer Speech and Language, vol.8, pp.369-383, 1994.
    • (1994) Computer Speech and Language , vol.8 , pp. 369-383
    • Young, S.J.1    Woodland, P.C.2
  • 113
    • 85027105819 scopus 로고
    • Prediction about unknown phonetic context by tree-based phone modeling
    • Technical Report, SP90-64, IEICE
    • S. Hayamizu and K. Tanaka, "Prediction about unknown phonetic contexts by tree-based phone modeling," Technical Report, SP90-64, IEICE 1990.
    • (1990)
    • Hayamizu, S.1    Tanaka, K.2
  • 114
    • 85013744934 scopus 로고
    • A successive state splitting algorithm for efficient allophone modeling
    • J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," Proc. ICASSP, pp.574-577, 1992.
    • (1992) Proc. ICASSP , pp. 574-577
    • Takami, J.1    Sagayama, S.2
  • 115
    • 0011471866 scopus 로고    scopus 로고
    • A study on HM-nets using phonetic decision tree-based successive state splitting
    • Oct.
    • T. Hori, M. Katoh, A. Itoh, and M. Kohda, "A study on HM-nets using phonetic decision tree-based successive state splitting," IEICE Trans. Inf. & Syst., vol.J80-D-II, no.10, pp.2645-2654, Oct. 1997.
    • (1997) IEICE Trans. Inf. & Syst. , vol.J80-D-II , Issue.10 , pp. 2645-2654
    • Hori, T.1    Katoh, M.2    Itoh, A.3    Kohda, M.4
  • 117
    • 85007758082 scopus 로고
    • Minimum error classification training of HMMs implementation details and experimental results
    • D. Rainton, and S. Sagayama, "Minimum error classification training of HMMs implementation details and experimental results," J. Acoust. Soc. Japan, vol.13, no.6, pp.379-388, 1992.
    • (1992) J. Acoust. Soc. Japan , vol.13 , Issue.6 , pp. 379-388
    • Rainton, D.1    Sagayama, S.2
  • 118
    • 0011400313 scopus 로고
    • Estimating hidden Markov model parameters so as to maximize speech recognition accuracy
    • L.R. Bahl, P.F. Broun, P.V. Souza, and R.L. Mercer, "Estimating hidden Markov model parameters so as to maximize speech recognition accuracy," IEEE Trans. Speech & Audio Procss., vol.1, no.1, pp.77-82, 1993.
    • (1993) IEEE Trans. Speech & Audio Procss. , vol.1 , Issue.1 , pp. 77-82
    • Bahl, L.R.1    Broun, P.F.2    Souza, P.V.3    Mercer, R.L.4
  • 119
    • 0028412908 scopus 로고
    • High performance connected digit recognition using maximum mutual information estimation
    • Y. Normndin, R. Cardin, and R. de Mori, "High performance connected digit recognition using maximum mutual information estimation," IEEE Trans. Speech & Audio Process., vol.2, pp.299-311, 1994.
    • (1994) IEEE Trans. Speech & Audio Process. , vol.2 , pp. 299-311
    • Normndin, Y.1    Cardin, R.2    De Mori, R.3
  • 120
    • 0031222490 scopus 로고
    • MMIE training of large vocabulary recognition systems
    • V. Valtchev, J. Odel, P. Woodland, and S. Young, "MMIE training of large vocabulary recognition systems," Speech Communication, vol.22, pp.303-314, 1993.
    • (1993) Speech Communication , vol.22 , pp. 303-314
    • Valtchev, V.1    Odel, J.2    Woodland, P.3    Young, S.4
  • 121
    • 85128400029 scopus 로고    scopus 로고
    • Discriminative training of GMM using a modified EM algorithm for speaker recognition
    • K. Markov and S. Nakagawa, "Discriminative training of GMM using a modified EM algorithm for speaker recognition," Proc. ICSLP, vol.2, pp.177-180, 1998.
    • (1998) Proc. ICSLP , vol.2 , pp. 177-180
    • Markov, K.1    Nakagawa, S.2
  • 122
    • 0030235132 scopus 로고    scopus 로고
    • Performance of HMM-based speech recognizers with discriminative state-weights
    • O.W. Kwon and C.K. Un, "Performance of HMM-based speech recognizers with discriminative state-weights," Speech Communication, vol.19, pp.197-205, 1996.
    • (1996) Speech Communication , vol.19 , pp. 197-205
    • Kwon, O.W.1    Un, C.K.2
  • 123
    • 0032762247 scopus 로고    scopus 로고
    • Selective training for hidden Markov models with applications to speech classification
    • L.M. Arslan and H.L. Hanson, "Selective training for hidden Markov models with applications to speech classification," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.46-64, 1999.
    • (1999) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.1 , pp. 46-64
    • Arslan, L.M.1    Hanson, H.L.2
  • 124
    • 0002235014 scopus 로고    scopus 로고
    • Improved feature decorrelation for HMM-based speech recognition
    • K. Demuynck, J. Duchateau, D.V. Comernolle, and P. Wambacq, "Improved feature decorrelation for HMM-based speech recognition," Proc. ICSLP, pp.2907-2910, 1998.
    • (1998) Proc. ICSLP , pp. 2907-2910
    • Demuynck, K.1    Duchateau, J.2    Comernolle, D.V.3    Wambacq, P.4
  • 125
    • 0029725604 scopus 로고    scopus 로고
    • A parametric approach to vocal tract length normalization
    • E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," Proc. ICASSP, pp.346-349, 1996.
    • (1996) Proc. ICASSP , pp. 346-349
    • Eide, E.1    Gish, H.2
  • 126
    • 0034847002 scopus 로고    scopus 로고
    • The 1998 HTK system for transcription of conversational telephone speech
    • T. Hain, P.C. Woodland, T.R. Niesler, and E.W.D. Whittaker, "The 1998 HTK system for transcription of conversational telephone speech," Proc. ICASSP, pp.57-60. 1999.
    • (1999) Proc. ICASSP , pp. 57-60
    • Hain, T.1    Woodland, P.C.2    Niesler, T.R.3    Whittaker, E.W.D.4
  • 127
    • 0028419019 scopus 로고
    • Maximum aposteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain, and C.H. Lee, "Maximum aposteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech & Audio Process., vol.2, pp.291-298, 1994.
    • (1994) IEEE Trans. Speech & Audio Process. , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.H.2
  • 128
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M.J.F. Gales and P.C. Woodland, "Mean and variance adaptation within the MLLR framework," Computer Speech and Language, vol.10, pp.249-264, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 129
    • 0033100038 scopus 로고    scopus 로고
    • Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
    • V.D. Diakoloukas and V.V. Digalakis, "Maximum-likelihood stochastic-transformation adaptation of hidden Markov models," IEEE Trans. Speech & Audio Process., vol.7, no.2, pp.177-187, 1999.
    • (1999) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.2 , pp. 177-187
    • Diakoloukas, V.D.1    Digalakis, V.V.2
  • 130
    • 0031704151 scopus 로고    scopus 로고
    • Speaker clustering and transformation for speaker adaptation in speech recognition systems
    • M. Padmanabham, L.R. Bahl, D. Nahamoo, and M.A. Picheny, "Speaker clustering and transformation for speaker adaptation in speech recognition systems," IEEE Trans. Speech & Audio Process., vol.6, no.1, pp.71-77, 1998.
    • (1998) IEEE Trans. Speech & Audio Process. , vol.6 , Issue.1 , pp. 71-77
    • Padmanabham, M.1    Bahl, L.R.2    Nahamoo, D.3    Picheny, M.A.4
  • 131
    • 85135109228 scopus 로고
    • Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs
    • K. Ohkura, M. Sugiyama, and S. Sagayama, "Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs," Proc. ICSLP, pp.369-372, 1992.
    • (1992) Proc. ICSLP , pp. 369-372
    • Ohkura, K.1    Sugiyama, M.2    Sagayama, S.3
  • 132
    • 0011411817 scopus 로고    scopus 로고
    • Speaker adaptation of acoustic models using correlations of transfer vectors
    • March
    • S. Takahashi and S. Sagayama, "Speaker adaptation of acoustic models using correlations of transfer vectors," IEICE Trans., vol.J82-D-II, no.3, pp.324-331, March 1999.
    • (1999) IEICE Trans. , vol.J82-D-II , Issue.3 , pp. 324-331
    • Takahashi, S.1    Sagayama, S.2
  • 133
    • 0002488301 scopus 로고
    • Speaker adaptation with autonomous control using tree structure
    • K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous control using tree structure," Proc. Euro-Speech, pp.1143-1146, 1995.
    • (1995) Proc. Euro-Speech , pp. 1143-1146
    • Shinoda, K.1    Watanabe, T.2
  • 134
    • 0030189744 scopus 로고    scopus 로고
    • Speaker adaptation using combined transformation and Bayesian methods
    • V.V. Digalakis and L.G. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech & Audio Process., vol.4, no.4, pp.249-300, 1996.
    • (1996) IEEE Trans. Speech & Audio Process. , vol.4 , Issue.4 , pp. 249-300
    • Digalakis, V.V.1    Neumeyer, L.G.2
  • 135
    • 0000521080 scopus 로고    scopus 로고
    • Speaker adaptation using maximum a posteriori probability estimation and data size dependent parameter smoothing
    • March
    • M. Tonomura, T. Kosaka, and S. Matsumura, "Speaker adaptation using maximum a posteriori probability estimation and data size dependent parameter smoothing," IEICE Trans., vol.J81-D-II, no.3, pp.465-471, March 1998.
    • (1998) IEICE Trans. , vol.J81-D-II , Issue.3 , pp. 465-471
    • Tonomura, M.1    Kosaka, T.2    Matsumura, S.3
  • 136
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • K. Shinoda and C.H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech & Audio Process., vol.9, no.3, pp.276-287, 2001.
    • (2001) IEEE Trans. Speech & Audio Process. , vol.9 , Issue.3 , pp. 276-287
    • Shinoda, K.1    Lee, C.H.2
  • 137
    • 0011448907 scopus 로고
    • Automatic speech recognition by stochastic approaches
    • Feb.
    • S. Nakagawa, "Automatic speech recognition by stochastic approaches," J. Acoust. Soc. Japan, vol.50, no.2, pp.126-132, Feb. 1994.
    • (1994) J. Acoust. Soc. Japan , vol.50 , Issue.2 , pp. 126-132
    • Nakagawa, S.1
  • 138
    • 0011458461 scopus 로고    scopus 로고
    • Automatic learning of stochastic context-free grammar for spontaneous speech by integration of bigram
    • March
    • S. Nakagawa and K. Ohtani, "Automatic learning of stochastic context-free grammar for spontaneous speech by integration of bigram," Trans. Inf. Process. Soc. Japan, vol.39. no.3, pp.575-584, March 1998.
    • (1998) Trans. Inf. Process. Soc. Japan , vol.39 , Issue.3 , pp. 575-584
    • Nakagawa, S.1    Ohtani, K.2
  • 139
    • 0011509488 scopus 로고    scopus 로고
    • A study of large-vocabulary continuous speech recognition using higher order n-gram language models
    • Spring
    • K. Ohtsuki, K. Yoshida, T. Matsuoka, and S. Furui, "A study of large-vocabulary continuous speech recognition using higher order n-gram language models," Conf. Record. Acoust. Soc. Japan. pp.47-48, Spring 1997.
    • (1997) Conf. Record. Acoust. Soc. Japan. , pp. 47-48
    • Ohtsuki, K.1    Yoshida, K.2    Matsuoka, T.3    Furui, S.4
  • 140
    • 0028996884 scopus 로고
    • Phrase bigrams for continuous speech recognition
    • E.P. Giachin, "Phrase bigrams for continuous speech recognition," Proc. ICASSP, pp.225-227, 1995.
    • (1995) Proc. ICASSP , pp. 225-227
    • Giachin, E.P.1
  • 141
    • 0011496729 scopus 로고    scopus 로고
    • Effect of vocabulary extension using word sequence concatenation for large vocabulary continuous speech recognition
    • April
    • Y. Wada, N. Kobayashi, Y. Nakano and T. Kobayashi, "Effect of vocabulary extension using word sequence concatenation for large vocabulary continuous speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1413-1420, April 1999.
    • (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1413-1420
    • Wada, Y.1    Kobayashi, N.2    Nakano, Y.3    Kobayashi, T.4
  • 142
    • 0011501276 scopus 로고    scopus 로고
    • A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition
    • Jan.
    • S. Nakagawa, H. Akamatsu, and H. Nishizaki, "A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition," Natural Language Processing, vol.6, no.2. pp.97-115, Jan. 1999.
    • (1999) Natural Language Processing , vol.6 , Issue.2 , pp. 97-115
    • Nakagawa, S.1    Akamatsu, H.2    Nishizaki, H.3
  • 143
    • 0028996879 scopus 로고
    • Language modeling by variable length sequences, theoretical formulation and evaluation of multigrams
    • S. Deligned and F. Bimbot, "Language modeling by variable length sequences, theoretical formulation and evaluation of multigrams," Proc. ICASSP, pp.169-172, 1995.
    • (1995) Proc. ICASSP , pp. 169-172
    • Deligned, S.1    Bimbot, F.2
  • 144
    • 0029762785 scopus 로고    scopus 로고
    • Variable-order N-gram generation by word-class splitting and consecutive word grouping
    • H. Masataki and Y. Sagisaka, "Variable-order N-gram generation by word-class splitting and consecutive word grouping," Proc. ICASSP, pp. 188-191, 1996.
    • (1996) Proc. ICASSP , pp. 188-191
    • Masataki, H.1    Sagisaka, Y.2
  • 146
    • 0011464163 scopus 로고    scopus 로고
    • Word clustering for class-based language models
    • S. Mori, M. Nishimura, and N. Itoh, "Word clustering for class-based language models," Trans. Inf. Process. Soc. Japan, vol.38, no.11, pp.2200-2207, 1997.
    • (1997) Trans. Inf. Process. Soc. Japan , vol.38 , Issue.11 , pp. 2200-2207
    • Mori, S.1    Nishimura, M.2    Itoh, N.3
  • 147
    • 0032650074 scopus 로고    scopus 로고
    • Variable-length category n-gram language models
    • T.R. Niesler and P.C. Woodland, "Variable-length category n-gram language models," Computer Speech and Language, vol.13, pp.99-124, 1999.
    • (1999) Computer Speech and Language , vol.13 , pp. 99-124
    • Niesler, T.R.1    Woodland, P.C.2
  • 148
    • 0000797420 scopus 로고    scopus 로고
    • An estimation of an upper bound for the entropy of Japanese
    • S. Mori, and O. Yamaji, "An estimation of an upper bound for the entropy of Japanese," Trans. Inf. Process. Soc. Japan, vol.38, no.11, pp.2191-2199, 1997.
    • (1997) Trans. Inf. Process. Soc. Japan , vol.38 , Issue.11 , pp. 2191-2199
    • Mori, S.1    Yamaji, O.2
  • 149
    • 0030181951 scopus 로고    scopus 로고
    • A maximum entropy approach to adaptive statistical language modeling
    • R. Rosenfeld, "A maximum entropy approach to adaptive statistical language modeling," Computer Speech and Language, vol.10, pp.187-228, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 187-228
    • Rosenfeld, R.1
  • 150
    • 0033106616 scopus 로고    scopus 로고
    • Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition
    • Z.G. Dong, and L.K. Teng, "Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition," Computer Speech and Language, vol.13, pp.125-141, 1999.
    • (1999) Computer Speech and Language , vol.13 , pp. 125-141
    • Dong, Z.G.1    Teng, L.K.2
  • 151
    • 0032165145 scopus 로고    scopus 로고
    • A multispan language model modeling framework for large vocabulary speech recognition
    • J.R. Bellegard, "A multispan language model modeling framework for large vocabulary speech recognition," IEEE Trans. Acoust. Speech & Signal Process., vol.6, no.5, pp.456-467, 1998.
    • (1998) IEEE Trans. Acoust. Speech & Signal Process. , vol.6 , Issue.5 , pp. 456-467
    • Bellegard, J.R.1
  • 152
    • 0011471867 scopus 로고    scopus 로고
    • Multispan statistical language modeling for large vocabulary speech recognition
    • J.R. Bellegard, "Multispan statistical language modeling for large vocabulary speech recognition," Proc. ICSLP, pp.2395-2398, 1998.
    • (1998) Proc. ICSLP , pp. 2395-2398
    • Bellegard, J.R.1
  • 153
    • 0032785782 scopus 로고    scopus 로고
    • Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
    • R.M. Iyer and M. Ostendorf, "Modeling long distance dependence in language: Topic mixtures versus dynamic cache models," IEEE Trans. Speech & Audio Process., vol.7, no.1, pp.31-39, 1997.
    • (1997) IEEE Trans. Speech & Audio Process. , vol.7 , Issue.1 , pp. 31-39
    • Iyer, R.M.1    Ostendorf, M.2
  • 154
    • 0002235611 scopus 로고    scopus 로고
    • Adaptive topic-dependent language modeling using word-based varigramss
    • S. Martin, J. Liermann, and H. Ney, "Adaptive topic-dependent language modeling using word-based varigramss," Proc. EuroSpeech, pp.1447-1450, 1997.
    • (1997) Proc. EuroSpeech , pp. 1447-1450
    • Martin, S.1    Liermann, J.2    Ney, H.3
  • 155
    • 0011408731 scopus 로고    scopus 로고
    • Dictation of broadcast news speech using word pronounciation probability
    • Spring
    • K. Takagi and S. Furui, "Dictation of broadcast news speech using word pronounciation probability," Conf. Record, Acoust. Soc. Japan, pp.9-10, Spring 1998.
    • (1998) Conf. Record, Acoust. Soc. Japan , pp. 9-10
    • Takagi, K.1    Furui, S.2
  • 156
    • 0011451282 scopus 로고    scopus 로고
    • An improvement of language modeling for automatic transcription of Japanese broadcast-news speech
    • Spring
    • N. Sakurai and S. Furui, "An improvement of language modeling for automatic transcription of Japanese broadcast-news speech," Conf. Record, Acoust. Soc. Japan, pp.57-58, Spring 1999.
    • (1999) Conf. Record, Acoust. Soc. Japan , pp. 57-58
    • Sakurai, N.1    Furui, S.2
  • 157
    • 0011408732 scopus 로고    scopus 로고
    • A language model for recognition of continuously uttered sentences
    • Spring
    • T. Imai, Y. Saito, A. Ando, and S. Furui, "A language model for recognition of continuously uttered sentences," Conf. Record, Acoust. Soc. Japan, pp.63-64, Spring 1999.
    • (1999) Conf. Record, Acoust. Soc. Japan , pp. 63-64
    • Imai, T.1    Saito, Y.2    Ando, A.3    Furui, S.4
  • 158
    • 0011404832 scopus 로고    scopus 로고
    • Time dependent language model for broadcast news transcription
    • April
    • A. Kobayashi, T. Imai, A. Ando, and K. Nakabayashi, "Time dependent language model for broadcast news transcription," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1421-1429, April 1999.
    • (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1421-1429
    • Kobayashi, A.1    Imai, T.2    Ando, A.3    Nakabayashi, K.4
  • 159
    • 0011402513 scopus 로고    scopus 로고
    • The influence of morpheme analysis systems on language model for continuous speech recognition
    • Autumn
    • N. Yodo, K. Itoh, S. Nakamura, and K. Shikano, "The influence of morpheme analysis systems on language model for continuous speech recognition," Conf. Record, Acoust. Soc. Japan, pp.53-54, Autumn 1997.
    • (1997) Conf. Record, Acoust. Soc. Japan , pp. 53-54
    • Yodo, N.1    Itoh, K.2    Nakamura, S.3    Shikano, K.4
  • 160
    • 85024115120 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S.F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Proc. ACL, pp.310-318, 1996.
    • (1996) Proc. ACL , pp. 310-318
    • Chen, S.F.1    Goodman, J.2
  • 161
    • 0030124373 scopus 로고    scopus 로고
    • Succeeding word prediction for speech recognition based on stochastic language model
    • April
    • M. Zhou and S. Nakagawa, "Succeeding word prediction for speech recognition based on stochastic language model," IEICE Trans. Inf. & Syst., vol.E79-D, no.4, pp.333-341, April 1996.
    • (1996) IEICE Trans. Inf. & Syst. , vol.E79-D , Issue.4 , pp. 333-341
    • Zhou, M.1    Nakagawa, S.2
  • 162
    • 0010032271 scopus 로고
    • Inside-outside reestimation from partially bracketed corpora
    • F. Pereira and Y. Schabes, "Inside-outside reestimation from partially bracketed corpora," Proc. ACL, pp.31-37, 1992.
    • (1992) Proc. ACL , pp. 31-37
    • Pereira, F.1    Schabes, Y.2
  • 163
    • 84894805373 scopus 로고    scopus 로고
    • An empirical evaluation of probabilistic lexicalized tree insertion grammars
    • R. Hwa, "An empirical evaluation of probabilistic lexicalized tree insertion grammars," Proc. ACL, pp.557-563, 1998.
    • (1998) Proc. ACL , pp. 557-563
    • Hwa, R.1
  • 164
    • 85027133681 scopus 로고    scopus 로고
    • Construction and evaluation of language models based on stochastic context free grammar for speech recognition
    • Technical Report, SP99-37, Inst. Elect. Inf. Comm. Engrs., June
    • C. Hori, M. Katoh, A. Itoh, and M. Kohda, "Construction and evaluation of language models based on stochastic context free grammar for speech recognition," Technical Report, SP99-37, Inst. Elect. Inf. Comm. Engrs., June 1999.
    • (1999)
    • Hori, C.1    Katoh, M.2    Itoh, A.3    Kohda, M.4
  • 165
    • 0032673481 scopus 로고    scopus 로고
    • An automatic acquisition method of statistical finite-state automation sentences
    • M. Zuzuki and S. Makino, "An automatic acquisition method of statistical finite-state automation sentences," Proc. ICASSP, pp.737-740, 1999.
    • (1999) Proc. ICASSP , pp. 737-740
    • Zuzuki, M.1    Makino, S.2
  • 166
    • 0011403721 scopus 로고    scopus 로고
    • Construction of language models using probabilistic GLR methods toward speech recognition
    • April
    • H. Imai, H. Tanaka, and T. Tokunaga, "Construction of language models using probabilistic GLR methods toward speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1404-1411, April 1999.
    • (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1404-1411
    • Imai, H.1    Tanaka, H.2    Tokunaga, T.3
  • 167
    • 0011449593 scopus 로고    scopus 로고
    • Spontaneous speech understanding method based on LR parsing of keyword lattice
    • Feb.
    • H. Tsuboi, Y. Takebayashi, and H. Hashimoto, "Spontaneous speech understanding method based on LR parsing of keyword lattice," Trans. Inf. Process. Soc. Japan, vol.38, no.2, pp.260-268, Feb. 1997.
    • (1997) Trans. Inf. Process. Soc. Japan , vol.38 , Issue.2 , pp. 260-268
    • Tsuboi, H.1    Takebayashi, Y.2    Hashimoto, H.3
  • 168
  • 169
    • 0011449594 scopus 로고
    • Processing unknown words in continuous speech recognition
    • July
    • K. Kita, T. Ehara, and T. Morimoto, "Processing unknown words in continuous speech recognition," IEICE Trans, vol.E74, no.7, pp.1811-1816, July 1991.
    • (1991) IEICE Trans , vol.E74 , Issue.7 , pp. 1811-1816
    • Kita, K.1    Ehara, T.2    Morimoto, T.3
  • 170
    • 0011501278 scopus 로고    scopus 로고
    • Comparison of dictation and word spotting techniques in classification of news speech articles
    • IEICE Technical Report, SP98-32, June
    • J. Ogata and Y. Ariki, "Comparison of dictation and word spotting techniques in classification of news speech articles," IEICE Technical Report, SP98-32, June 1998.
    • (1998)
    • Ogata, J.1    Ariki, Y.2
  • 171
    • 0011498043 scopus 로고    scopus 로고
    • Voice-operated projector using utterance verification and its application to hyper-text generation of lectures
    • April
    • T. Kawahara, K. Ishizuka, and S. Doshita, "Voice-operated projector using utterance verification and its application to hyper-text generation of lectures," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1491-1498, April 1999.
    • (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1491-1498
    • Kawahara, T.1    Ishizuka, K.2    Doshita, S.3
  • 172
    • 0011408733 scopus 로고    scopus 로고
    • Dealing with out-of -vocabulary words and speech disfluencies in an N-gram based speech understanding system
    • Dec.
    • A. Kai, Y. Hirose, and S. Nakagawa, "Dealing with out-of -vocabulary words and speech disfluencies in an N-gram based speech understanding system," Proc. ICSLP, pp.2427-2430, Dec. 1999.
    • (1999) Proc. ICSLP , pp. 2427-2430
    • Kai, A.1    Hirose, Y.2    Nakagawa, S.3
  • 173
    • 0001079615 scopus 로고    scopus 로고
    • A*-admissible key-phrase spotting with sub-syllable level utterance verification
    • B. Chen, H. Wong, L. Chen, and L. Lee, "A*-admissible key-phrase spotting with sub-syllable level utterance verification," Proc. ICSLP, pp.783-786, 1998.
    • (1998) Proc. ICSLP , pp. 783-786
    • Chen, B.1    Wong, H.2    Chen, L.3    Lee, L.4
  • 174
    • 84902052756 scopus 로고    scopus 로고
    • A new confidence measure based on rank-ordering subphone scores
    • Q. Lin, S-Das, D. Lubensky, and M. Picheny, "A new confidence measure based on rank-ordering subphone scores," Proc. ICSLP, pp.3249-3252, 1998.
    • (1998) Proc. ICSLP , pp. 3249-3252
    • Lin, Q.1    S-Das2    Lubensky, D.3    Picheny, M.4
  • 175
    • 0032091375 scopus 로고    scopus 로고
    • Text-independent speaker recognition using non-linear frame likelihood transformation
    • K.P. Markov, and S. Nakagawa, "Text-independent speaker recognition using non-linear frame likelihood transformation," Speech Communication, vol.24, pp.193-209, 1998.
    • (1998) Speech Communication , vol.24 , pp. 193-209
    • Markov, K.P.1    Nakagawa, S.2
  • 176
    • 0011408734 scopus 로고    scopus 로고
    • Word-based approach to large-vocabulary continuous speech recognition for Japanese
    • April
    • M. Nishimura, N. Itoh, and K. Yamasaki, "Word-based approach to large-vocabulary continuous speech recognition for Japanese," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1395-1403, April 1999.
    • (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1395-1403
    • Nishimura, M.1    Itoh, N.2    Yamasaki, K.3
  • 177
    • 0011450876 scopus 로고
    • Unknown utterance rejection using likelihood normalization based on syllable recognition
    • Dec.
    • T. Watanabe and S. Tsukada, "Unknown utterance rejection using likelihood normalization based on syllable recognition," IEICE Trans., vol.J75-D-II, no.12, pp.2002-2009, Dec. 1992.
    • (1992) IEICE Trans. , vol.J75-D-II , Issue.12 , pp. 2002-2009
    • Watanabe, T.1    Tsukada, S.2
  • 178
    • 0029323659 scopus 로고
    • Relationship among recognition rate, rejection rate and false alarm rate in a spoken word recognition system
    • June
    • A. Kai, and S. Nakagawa, "Relationship among recognition rate, rejection rate and false alarm rate in a spoken word recognition system," IEICE Trans. Inf. & Syst., vol.E78-D, no.6, pp.698-704, June 1995.
    • (1995) IEICE Trans. Inf. & Syst. , vol.E78-D , Issue.6 , pp. 698-704
    • Kai, A.1    Nakagawa, S.2
  • 179
    • 0011501675 scopus 로고    scopus 로고
    • Large vocabulary continuous speech recognition: From laboratory systems towards real-world applications
    • Dec.
    • J.-L. Gauvain and L. Lamel, "Large vocabulary continuous speech recognition: From laboratory systems towards real-world applications," IEICE Trans., vol.J79-D-II, no.12, pp.2005-2021, Dec. 1996.
    • (1996) IEICE Trans. , vol.J79-D-II , Issue.12 , pp. 2005-2021
    • Gauvain, J.-L.1    Lamel, L.2
  • 180
  • 181
    • 0011495826 scopus 로고    scopus 로고
    • A new computation method of perplexity for text corpus including unknown words
    • Autumn
    • S. Nakagawa and H. Akamatsu, "A new computation method of perplexity for text corpus including unknown words," Conf. Record, Acoust. Soc. Japan, pp.63-64, Autumn 1998.
    • (1998) Conf. Record, Acoust. Soc. Japan , pp. 63-64
    • Nakagawa, S.1    Akamatsu, H.2
  • 182
    • 0030715922 scopus 로고    scopus 로고
    • Task adaptation using MAP estimation in N-gram language modeling
    • H. Masataki, Y. Sagisaka, K. Hisaki, and T. Kawahara, "Task adaptation using MAP estimation in N-gram language modeling," Proc. ICASSP, pp.783-786, 1997.
    • (1997) Proc. ICASSP , pp. 783-786
    • Masataki, H.1    Sagisaka, Y.2    Hisaki, K.3    Kawahara, T.4
  • 183
    • 85009128031 scopus 로고    scopus 로고
    • Relationship between phoneme recognition performance and word recognition rate
    • May
    • S. Nakagawa, "Relationship between phoneme recognition performance and word recognition rate," Trans. Inf. Process, Japan, vol.22, no.5, pp.488-496, May 1996.
    • (1996) Trans. Inf. Process, Japan , vol.22 , Issue.5 , pp. 488-496
    • Nakagawa, S.1
  • 185
    • 84989448320 scopus 로고
    • Evaluation of FFT cepstrum and LPC cepstrum for speech and speaker recognition
    • Feb.
    • S. Nakagawa and M. Sakamoto, "Evaluation of FFT cepstrum and LPC cepstrum for speech and speaker recognition," IEICE Trans., vol.J66-A, no.2, pp.1199-1206, Feb. 1983.
    • (1983) IEICE Trans. , vol.J66-A , Issue.2 , pp. 1199-1206
    • Nakagawa, S.1    Sakamoto, M.2
  • 186
    • 84987195640 scopus 로고
    • Perception of vowels and C-V syllables segmented from connected speech
    • May
    • H. Kuwabara and H. Sakai, "Perception of vowels and C-V syllables segmented from connected speech," J. Acoust. Soc. Japan, vol.28, no.5, pp.225-234, May 1972.
    • (1972) J. Acoust. Soc. Japan , vol.28 , Issue.5 , pp. 225-234
    • Kuwabara, H.1    Sakai, H.2
  • 187
    • 85027151219 scopus 로고    scopus 로고
    • A study on speech recognition unit based on speech perceptual experiments
    • IEICE Technical Report, SP99-43, July
    • K. Yamamoto and S. Nakagawa, "A study on speech recognition unit based on speech perceptual experiments," IEICE Technical Report, SP99-43, July 1999.
    • (1999)
    • Yamamoto, K.1    Nakagawa, S.2
  • 188
    • 0011501282 scopus 로고    scopus 로고
    • Toward spoken language understanding from speech recognition
    • Nov.
    • S. Nakagawa, "Toward spoken language understanding from speech recognition," J. Acoust. Soc. Japan, vol.52, no.11, pp.859-856, Nov. 1996.
    • (1996) J. Acoust. Soc. Japan , vol.52 , Issue.11 , pp. 859-856
    • Nakagawa, S.1
  • 189
    • 0011403914 scopus 로고
    • Evaluation of auditory front-ends in DTW word recognition system
    • June
    • K. Obara and T. Hirahara, "Evaluation of auditory front-ends in DTW word recognition system," J. Acoust. Soc. Japan, vol.50, no.6, pp.452-464, June 1994.
    • (1994) J. Acoust. Soc. Japan , vol.50 , Issue.6 , pp. 452-464
    • Obara, K.1    Hirahara, T.2
  • 191
    • 0031643048 scopus 로고    scopus 로고
    • Multiresolution cepstral features for phoneme recognition across speech sub-bands
    • P. McCourt, S. Vaseghi, and N. Harte, "Multiresolution cepstral features for phoneme recognition across speech sub-bands," Proc. ICASSP, pp.557-560, 1998.
    • (1998) Proc. ICASSP , pp. 557-560
    • McCourt, P.1    Vaseghi, S.2    Harte, N.3
  • 192
    • 0032654472 scopus 로고    scopus 로고
    • Channel and noise adaptation via HMM mixture mean transform and stochastic matching
    • S. Kong and B. Shi, "Channel and noise adaptation via HMM mixture mean transform and stochastic matching," Proc. ICASSP, pp. 301-304, 1999.
    • (1999) Proc. ICASSP , pp. 301-304
    • Kong, S.1    Shi, B.2
  • 193
    • 0025388113 scopus 로고
    • A linear predictive HMM for vector valued observation with application to speech recognition
    • P. Kenny, M. Lenning, and P. Mermelstein, "A linear predictive HMM for vector valued observation with application to speech recognition," IEEE Trans. Acoust. Speech & Signal Process., vol.38, no.1, pp.220-225, 1990.
    • (1990) IEEE Trans. Acoust. Speech & Signal Process. , vol.38 , Issue.1 , pp. 220-225
    • Kenny, P.1    Lenning, M.2    Mermelstein, P.3
  • 194
    • 0011406323 scopus 로고
    • Proposal of a stochastic context-free grammar for continuous observation vector sequences
    • Spring
    • S. Nakagawa, "Proposal of a stochastic context-free grammar for continuous observation vector sequences," Conf. Record, pp.73-74, Spring 1992.
    • (1992) Conf. Record , pp. 73-74
    • Nakagawa, S.1
  • 195
    • 0026171582 scopus 로고
    • Application of the Gibbs distribution to hidden Markov modeling in speaker independent isolated word recognition
    • Y. Zhao, L.E. Atlas, and X. Zhuang, "Application of the Gibbs distribution to hidden Markov modeling in speaker independent isolated word recognition," IEEE Trans. Signal Process., vol.39, no.6, pp.1291-1298, 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.6 , pp. 1291-1298
    • Zhao, Y.1    Atlas, L.E.2    Zhuang, X.3
  • 196
    • 0011411822 scopus 로고    scopus 로고
    • Probabilistic modeling with Bayesian networks for automatic speech recognition
    • G. Zweig and S. Russel, "Probabilistic modeling with Bayesian networks for automatic speech recognition," Proc. ICSLP, pp.3011-3014, 1998.
    • (1998) Proc. ICSLP , pp. 3011-3014
    • Zweig, G.1    Russel, S.2
  • 197
    • 0029325616 scopus 로고
    • A comparative study of output probability functions in HMMs
    • June
    • S. Nakagawa, L. Zhao, and H. Suzuki, "A comparative study of output probability functions in HMMs," IEICE Trans. Inf. & Syst., vol.E78-D, no.6, pp.669-675, June 1995.
    • (1995) IEICE Trans. Inf. & Syst. , vol.E78-D , Issue.6 , pp. 669-675
    • Nakagawa, S.1    Zhao, L.2    Suzuki, H.3
  • 198
    • 85009181766 scopus 로고    scopus 로고
    • Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees
    • H. Singer and A. Nakamura, "Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees," Proc. EuroSpeech, pp.1355-1358, 1999.
    • (1999) Proc. EuroSpeech , pp. 1355-1358
    • Singer, H.1    Nakamura, A.2
  • 199
    • 85027098626 scopus 로고
    • Learning and normalizing of the talker differences in the recognition of spoken words
    • Technical Report, Acoust. Soc. Japan, SP75-25, Nov.
    • S. Furui, "Learning and normalizing of the talker differences in the recognition of spoken words," Technical Report, Acoust. Soc. Japan, SP75-25, Nov. 1975.
    • (1975)
    • Furui, S.1
  • 200
    • 0017961869 scopus 로고
    • A real time spoken word recognition system with various learning capabilities of the speaker differences
    • Scripta Publishing Co.
    • S. Nakagawa and T. Sakai, "A real time spoken word recognition system with various learning capabilities of the speaker differences," Syst. Comp. Controls, vol.9, no.3, pp.63-71, Scripta Publishing Co., 1978.
    • (1978) Syst. Comp. Controls , vol.9 , Issue.3 , pp. 63-71
    • Nakagawa, S.1    Sakai, T.2
  • 201
    • 85009195509 scopus 로고    scopus 로고
    • A missing-word test comparison of human and statistical language model performance
    • M. Owens, A. Kruger, P. Donnelly, F.J. Smith, and J. Ming, "A missing-word test comparison of human and statistical language model performance," Proc. EuroSpeech, pp.145-148, 1999.
    • (1999) Proc. EuroSpeech , pp. 145-148
    • Owens, M.1    Kruger, A.2    Donnelly, P.3    Smith, F.J.4    Ming, J.5
  • 202
    • 0011400318 scopus 로고    scopus 로고
    • Robust language modeling for small corpus of target task using call combined word statistics and selective use of general corpus
    • Nov.
    • Y. Wada, N. Kobayashi, and T. Kobayashi, "Robust language modeling for small corpus of target task using call combined word statistics and selective use of general corpus," IEICE Trans., vol.J83-D-II, no.11, pp.2397-2406, Nov. 2000.
    • (2000) IEICE Trans. , vol.J83-D-II , Issue.11 , pp. 2397-2406
    • Wada, Y.1    Kobayashi, N.2    Kobayashi, T.3
  • 203
    • 0011404834 scopus 로고    scopus 로고
    • Part-of-speech N-gram and word N-gram fused language model
    • H. Yamamoto, and Y. Sagisaka, "Part-of-speech N-gram and word N-gram fused language model," Proc. Euro-Speech, pp.1803-1806, 1999.
    • (1999) Proc. Euro-Speech , pp. 1803-1806
    • Yamamoto, H.1    Sagisaka, Y.2
  • 204
    • 0030719155 scopus 로고    scopus 로고
    • A word graph algorithm for large vocabulary continuous speech recognition
    • S. Ortmanns, H. Ney, and Z. Aubert, "A word graph algorithm for large vocabulary continuous speech recognition," Computer Speech and Language, vol.11, pp.43-72, 1997.
    • (1997) Computer Speech and Language , vol.11 , pp. 43-72
    • Ortmanns, S.1    Ney, H.2    Aubert, Z.3
  • 205
    • 0001100613 scopus 로고    scopus 로고
    • A study on a phoneme-graph-based hypothesis restriction for large vocabulary continuous speech recognition
    • April
    • T. Hori, N. Oka, M. Katoho, A. Itoh, and M. Kohda, "A study on a phoneme-graph-based hypothesis restriction for large vocabulary continuous speech recognition," Trans. Inf. Process. Soc. Japan, vol.40, no.4, pp.1365-1373 April 1999.
    • (1999) Trans. Inf. Process. Soc. Japan , vol.40 , Issue.4 , pp. 1365-1373
    • Hori, T.1    Oka, N.2    Katoho, M.3    Itoh, A.4    Kohda, M.5
  • 206
    • 29144491321 scopus 로고    scopus 로고
    • Large vocabulary continuous speech recognition based on multi-pass search using word trellis index
    • Jan.
    • A. Lee, Kawahra, and S. Doshita, "Large vocabulary continuous speech recognition based on multi-pass search using word trellis index," IEICE Trans., vol.J82-D, no.1, pp.1-9, Jan. 1999.
    • (1999) IEICE Trans. , vol.J82-D , Issue.1 , pp. 1-9
    • Lee, A.1    Kawahra2    Doshita, S.3
  • 207
    • 85027104898 scopus 로고    scopus 로고
    • Some problems on automatic speech recognition
    • IEICE Technical Report, SP99-93, Dec.
    • S. Nakagawa, "Some problems on automatic speech recognition," IEICE Technical Report, SP99-93, Dec. 1999.
    • (1999)
    • Nakagawa, S.1
  • 208
    • 0031619371 scopus 로고    scopus 로고
    • Balancing acoustic and linguistic probabilities
    • A. Ogawa, K. Takeda, and F. Itakura, "Balancing acoustic and linguistic probabilities," Proc. ICASSP, pp.181-184, 1998.
    • (1998) Proc. ICASSP , pp. 181-184
    • Ogawa, A.1    Takeda, K.2    Itakura, F.3
  • 209
    • 0032649321 scopus 로고    scopus 로고
    • Partly hidden Markov model and its application to speech recognition
    • T. Kobayashi, J. Furuyama, and K. Masumitsu, "Partly hidden Markov model and its application to speech recognition," Proc. ICASSP, pp.121-124, 1999.
    • (1999) Proc. ICASSP , pp. 121-124
    • Kobayashi, T.1    Furuyama, J.2    Masumitsu, K.3
  • 210
    • 0011451288 scopus 로고
    • Comparison of SCFG and HMM based speaker independent spoken digit recognition
    • Dec.
    • M. Zhou and S. Nakagawa, "Comparison of SCFG and HMM based speaker independent spoken digit recognition," Proc. Int. Workshop on Automatic Speech Recognition, pp.30-31, Dec. 1993.
    • (1993) Proc. Int. Workshop on Automatic Speech Recognition , pp. 30-31
    • Zhou, M.1    Nakagawa, S.2
  • 211
    • 85032751521 scopus 로고    scopus 로고
    • Dynamic programming search for continuous speech recognition
    • Sept.
    • H. Ney and S. Ortmanns, "Dynamic programming search for continuous speech recognition," IEEE Signal Process. Mag., pp.64-82, Sept. 1999.
    • (1999) IEEE Signal Process. Mag. , pp. 64-82
    • Ney, H.1    Ortmanns, S.2
  • 212
    • 85032751683 scopus 로고    scopus 로고
    • Hierarchical search for large-vocabulary conversational speech recognition
    • Sept.
    • N. Deshmukh, A. Ganapathiraju, and J. Picone, "Hierarchical search for large-vocabulary conversational speech recognition," IEEE Signal Process. Mag., pp.84-107, Sept. 1999.
    • (1999) IEEE Signal Process. Mag. , pp. 84-107
    • Deshmukh, N.1    Ganapathiraju, A.2    Picone, J.3
  • 216
    • 85007838242 scopus 로고
    • Pitch dependent phone modeling for HMM-based speech recognition
    • H. Singer and S. Sagayama, "Pitch dependent phone modeling for HMM-based speech recognition," J. Acoust. Soc. Japan, (E), vol.15, no.2, pp.77-86, 1994.
    • (1994) J. Acoust. Soc. Japan, (E) , vol.15 , Issue.2 , pp. 77-86
    • Singer, H.1    Sagayama, S.2
  • 217
    • 85067723733 scopus 로고    scopus 로고
    • Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing
    • Dec.
    • N. Minematsu and S. Nakagawa, "Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing," Proc. ICSLP, pp.2427-2430, Dec. 1998.
    • (1998) Proc. ICSLP , pp. 2427-2430
    • Minematsu, N.1    Nakagawa, S.2
  • 218
    • 0028392167 scopus 로고
    • An application of recurrent nets to phone probability estimation
    • A.J. Robinson, "An application of recurrent nets to phone probability estimation," IEEE Trans. Neural Networks, vol.5, no.2, pp.298-304, 1994.
    • (1994) IEEE Trans. Neural Networks , vol.5 , Issue.2 , pp. 298-304
    • Robinson, A.J.1
  • 219
    • 0011403725 scopus 로고    scopus 로고
    • Speech understanding and language model
    • Nov.
    • S. Nakagawa, "Speech understanding and language model," J. Signal Process., vol.2, no.6, pp.434-442, Nov. 1998.
    • (1998) J. Signal Process. , vol.2 , Issue.6 , pp. 434-442
    • Nakagawa, S.1
  • 220
    • 0011450878 scopus 로고    scopus 로고
    • Introduction to the special issue-some research problems on spoken dialogue systems
    • Nov.
    • S. Nakagawa, "Introduction to the special issue-some research problems on spoken dialogue systems," J. Acoust. Soc. Japan, vol.54, no.11, pp.783-790, Nov. 1998.
    • (1998) J. Acoust. Soc. Japan , vol.54 , Issue.11 , pp. 783-790
    • Nakagawa, S.1
  • 221
  • 222
    • 85027158035 scopus 로고    scopus 로고
    • HMM-based speaker recognition
    • IEICE Technical Report, SP95-111, Jan.
    • T. Matsui, "HMM-based speaker recognition," IEICE Technical Report, SP95-111, Jan. 1996.
    • (1996)
    • Matsui, T.1
  • 223
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • J.P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol.85, no.9, 1437-1462, 1997.
    • (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 236
    • 84944486544 scopus 로고
    • Prediction and entropy of printed English
    • L.E. Shannon, "Prediction and entropy of printed English," Bell System Tech. J., vol.30, pp.50-64, 1951.
    • (1951) Bell System Tech. J. , vol.30 , pp. 50-64
    • Shannon, L.E.1
  • 237
    • 0017994420 scopus 로고
    • A covergent gambling estimate of the entropy of English
    • T.M. Cover and R.C. King, "A covergent gambling estimate of the entropy of English," IEEE Trans. Inf. Theory, vol.24, no.4, pp.413-421, 1978.
    • (1978) IEEE Trans. Inf. Theory , vol.24 , Issue.4 , pp. 413-421
    • Cover, T.M.1    King, R.C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.