메뉴 건너뛰기




Volumn 81, Issue 9, 1993, Pages 1215-1247

Signal Modeling Techniques in Speech Recognition

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CONSTRAINT THEORY; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; PARAMETER ESTIMATION; PROBABILITY; ROBUSTNESS (CONTROL SYSTEMS); SIGNAL PROCESSING; SPECTRUM ANALYSIS; STATISTICAL METHODS;

EID: 0027659197     PISSN: 00189219     EISSN: 15582256     Source Type: Journal    
DOI: 10.1109/5.237532     Document Type: Article
Times cited : (524)

References (116)
  • 3
    • 0025479826 scopus 로고
    • Speech recognition: From the laboratory to the real world
    • Oct.
    • J. G. Wilpon, R. P. Mikkilineni, D. B. Roe, and S. Gokcen, “Speech recognition: From the laboratory to the real world,” AT&T Tech. J., vol. 69, no. 5, pp. 14–24, Oct. 1990.
    • (1990) AT&T Tech. J. , vol.69 , Issue.5 , pp. 14-24
    • Wilpon, J.G.1    Mikkilineni, R.P.2    Roe, D.B.3    Gokcen, S.4
  • 5
    • 33646950057 scopus 로고
    • Voice across America: Toward robust speaker independent speech recognition for telecommunications applications
    • Apr.
    • B. Wheatley and J. Picone, “Voice across America: Toward robust speaker independent speech recognition for telecommunications applications,” Digital Signal Processing: A Rev. J., vol. 1, no. 2, pp. 45–64, Apr. 1991.
    • (1991) Digital Signal Processing: A Rev. J. , vol.1 , Issue.2 , pp. 45-64
    • Wheatley, B.1    Picone, J.2
  • 7
    • 0026382580 scopus 로고
    • Improvements in connected digit recognition using higher order spectral and energy features
    • (Toronto, Ont.Canada, May)
    • J. G. Wilpon, C. H. Lee, and L. R. Rabiner, “Improvements in connected digit recognition using higher order spectral and energy features,” in Proc. IEEE Int. Conf on Acoustics, Speech, and Signal Processing (Toronto, Ont., Canada, May 1991), pp. 349–352.
    • (1991) Proc. IEEE Int. Conf on Acoustics, Speech, and Signal Processing , vol.19 , pp. 349-352
    • Wilpon, J.G.1    Lee, C.H.2    Rabiner, L.R.3
  • 9
    • 0026103147 scopus 로고
    • Information-theoretic distortion measures for speech recognition: Theoretical considerations and experimental results
    • Feb.
    • Y. T. Lee, “Information-theoretic distortion measures for speech recognition: Theoretical considerations and experimental results,” IEEE Trans. Signal Processing, vol. 39, no. 2, pp. 330–335, Feb. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , Issue.2 , pp. 330-335
    • Lee, Y.T.1
  • 10
    • 0025623686 scopus 로고
    • Information-theoretic distortion measures for speech recognition: Theoretical considerations and experimental results
    • (Albuquerque, NM, Apr.
    • Y. T. Lee and D. Kahn, “Information-theoretic distortion measures for speech recognition: Theoretical considerations and experimental results,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Albuquerque, NM, Apr. 1990), pp. 785–788.
    • (1990) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 785-788
    • Lee, Y.T.1    Kahn, D.2
  • 11
    • 0025635250 scopus 로고
    • On the use of hierarchical spectral dynamics in speech recognition
    • (Albuquerque, NM, Apr.)
    • S. Furui, “On the use of hierarchical spectral dynamics in speech recognition,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Albuquerque, NM, Apr. 1990), pp. 789–792.
    • (1990) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 789-792
    • Furui, S.1
  • 12
    • 0024766457 scopus 로고
    • A family of distortion measures based upon projection operation for robust speech recognition
    • Nov.
    • D. Mansour and B. H. Juang, “A family of distortion measures based upon projection operation for robust speech recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, no. 11, pp. 1659–1671, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , Issue.11 , pp. 1659-1671
    • Mansour, D.1    Juang, B.H.2
  • 13
    • 0000090514 scopus 로고
    • A weighted cepstral distance measure for speech recognition
    • Oct.
    • Y. Tohkura, “A weighted cepstral distance measure for speech recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, no. 10, pp. 1414–1422, Oct. 1987.
    • (1987) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-35 , Issue.10 , pp. 1414-1422
    • Tohkura, Y.1
  • 14
    • 0024905808 scopus 로고
    • Phonetically sensitive discriminants for improved speech recognition
    • (Glasgow, Scotland) May
    • G. R. Doddington, “Phonetically sensitive discriminants for improved speech recognition,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Glasgow, Scotland, May 1989), pp. 556–559.
    • (1989) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 556-559
    • Doddington, G.R.1
  • 15
    • 0024905238 scopus 로고
    • A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
    • (Glasgow, Scotland) May
    • M. J. Hunt and C. Lefebvre, “A comparison of several acoustic representations for speech recognition with degraded and undegraded speech,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Glasgow, Scotland, May 1989), pp. 262–265.
    • (1989) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 262-265
    • Hunt, M.J.1    Lefebvre, C.2
  • 18
    • 0000768393 scopus 로고
    • Frame specific statistical features for speaker-independent speech recognition
    • Aug.
    • E. L. Bocchieri and G. R. Doddington, “Frame specific statistical features for speaker-independent speech recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 4, pp. 755–764, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.4 , pp. 755-764
    • Bocchieri, E.L.1    Doddington, G.R.2
  • 21
    • 0022667694 scopus 로고
    • Speaker-independent isolated word recognition using dynamic features of the speech spectrum
    • Feb.
    • S. Furui, “Speaker-independent isolated word recognition using dynamic features of the speech spectrum,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 1, pp. 52–59, Feb. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 22
    • 0022148098 scopus 로고
    • Vector quantization in speech coding
    • Nov.
    • J. Makhoul, S. Raucos, and H. Gish, “Vector quantization in speech coding,” in Proc. IEEE, vol. 73, no. 11, pp. 1551–1588, Nov. 1985.
    • (1985) Proc. IEEE , vol.73 , Issue.11 , pp. 1551-1588
    • Makhoul, J.1    Raucos, S.2    Gish, H.3
  • 24
    • 0021497212 scopus 로고
    • On the performance of isolated word speech recognizers using vector quantization and temporal energy contours
    • Sept.
    • L. R. Rabiner, K. C. Pan, and F. K. Soong, “On the performance of isolated word speech recognizers using vector quantization and temporal energy contours,” AT&T Bell Lab. Tech. vol. J., 63, no. 7, pp. 1245–1260, Sept. 1984.
    • (1984) AT&T Bell Lab. Tech.J. , vol.63 , Issue.7 , pp. 1245-1260
    • Rabiner, L.R.1    Pan, K.C.2    Soong, F.K.3
  • 25
    • 0020735346 scopus 로고
    • On the application of vector quantization and hidden Markov models to speaker-independent, isolated word recognition
    • Apr.
    • L. R. Rabiner, S. E. Levinson, M. M. Sondhi, “On the application of vector quantization and hidden Markov models to speaker-independent, isolated word recognition,” Bell Syst. Tech. J., vol. 62, no. 4, pp. 1075–1105, Apr. 1983.
    • (1983) Bell Syst. Tech. J. , vol.62 , Issue.4 , pp. 1075-1105
    • Rabiner, L.R.1    Levinson, S.E.2    Sondhi, M.M.3
  • 26
    • 0019053271 scopus 로고
    • Comparison of parametric representations of monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, “Comparison of parametric representations of monosyllabic word recognition in continuously spoken sentences,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, no. 4, pp. 357–366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 27
    • 0003793552 scopus 로고
    • Digital Signal Processing.
    • Englewood Cliffs, NJ Prentice-Hall
    • A. V. Oppenheim and R. W. Schafer, Digital Signal Processing. Englewood Cliffs, NJ, Prentice-Hall, 1975.
    • (1975)
    • Oppenheim, A.V.1    Schafer, R.W.2
  • 28
    • 0003424145 scopus 로고
    • Discrete Time Processing of Speech Signals.
    • New York MacMillan
    • J. R. Deller, J. G. Proakis, and J. H. L. Hansen, Discrete Time Processing of Speech Signals. New York: MacMillan, 1993.
    • (1993)
    • Deller, J.R.1    Proakis, J.G.2    Hansen, J.H.L.3
  • 29
    • 0004244302 scopus 로고
    • Fundamentals of Speech Recognition.
    • Englewood Cliffs, NJ Prentice-Hall
    • L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
    • (1993)
    • Rabiner, L.1    Juang, B.H.2
  • 30
    • 0004082156 scopus 로고
    • Digital Communications
    • 2nd ed. New York McGraw-Hill
    • J. G. Proakis, Digital Communications, 2nd ed. New York: McGraw-Hill, 1989.
    • (1989)
    • Proakis, J.G.1
  • 31
    • 0003874959 scopus 로고
    • Linear Prediction of Speech.
    • New York Springer-Verlag
    • J. Market and A. H. Gray, Jr., Linear Prediction of Speech. New York: Springer-Verlag, 1980.
    • (1980)
    • Market, J.1    Gray, A.H.2
  • 32
    • 0003425258 scopus 로고
    • Digital Processing of Speech Signals.
    • Englewood Cliffs, NJ Prentice-Hall
    • L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1978.
    • (1978)
    • Rabiner, L.R.1    Schafer, R.W.2
  • 33
    • 80052564213 scopus 로고
    • Problems of speech recognition in mobile environments
    • Kobe, Japan Nov.
    • A. M. Noll, “Problems of speech recognition in mobile environments,” in Proc. Int. Conf on Spoken Language Processing(Kobe, Japan, Nov. 1990), pp. 1133–1136.
    • (1990) Proc. Int. Conf on Spoken Language Processing( , pp. 1133-1136
    • Noll, A.M.1
  • 34
    • 80052584228 scopus 로고
    • A speech recognition method for noise environments using dual inputs
    • (Kobe, Japan) Nov.
    • Y. Nakadai and N. Sugamura, “A speech recognition method for noise environments using dual inputs,” in Proc. Int. Conf. on Spoken Language Processing (Kobe, Japan, Nov. 1990), pp. 1141–1144.
    • (1990) Proc. Int. Conf. on Spoken Language Processing , pp. 1141-1144
    • Nakadai, Y.1    Sugamura, N.2
  • 37
    • 0026382107 scopus 로고
    • A text-independent speaker recognition method robust against utterance variations
    • (Toronto, Ont., Canada Apr.
    • T. Matsui and S. Furui, “A text-independent speaker recognition method robust against utterance variations,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Toronto, Ont., Canada, Apr. 1991), pp. 377–380.
    • (1991) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 377-380
    • Matsui, T.1    Furui, S.2
  • 38
    • 0003391579 scopus 로고
    • Pitch Determination of Speech Signals.
    • New York Springer-Verlag
    • W. Hess, Pitch Determination of Speech Signals. New York: Springer-Verlag, 1983.
    • (1983)
    • Hess, W.1
  • 39
    • 0003880074 scopus 로고
    • Practical Approaches to Speech Coding. Englewood
    • Cliffs, NJ Prentice-Hall
    • P. Papamichalis, Practical Approaches to Speech Coding. Englewood Cliffs, NJ: Prentice-Hall, 1987.
    • (1987)
    • Papamichalis, P.1
  • 40
    • 0014557562 scopus 로고
    • Parallel processing techniques for estimating pitch periods of speech in the time domain
    • Aug.
    • B. Gold and L. R. Rabiner, “Parallel processing techniques for estimating pitch periods of speech in the time domain,” J. Acoust. Soc. America, vol. 46, pt. 2, no. 2, pp. 442–448, Aug. 1969.
    • (1969) J. Acoust. Soc. America , vol.46 , Issue.2 , pp. 442-448
    • Gold, B.1    Rabiner, L.R.2
  • 41
    • 0023965776 scopus 로고
    • Design and implementation of a parallel processing based pitch detector
    • Feb.
    • R. S. Sukkar, J. L. LoCicero, and J. Picone, “Design and implementation of a parallel processing based pitch detector,” IEEE J. Selected Areas Commun., vol. 6, no. 2, pp. 441–451, Feb. 1988.
    • (1988) IEEE J. Selected Areas Commun. , vol.6 , Issue.2 , pp. 441-451
    • Sukkar, R.S.1    LoCicero, J.L.2    Picone, J.3
  • 42
    • 0022879618 scopus 로고
    • Voiced/unvoiced classification of speech with applications to the U.S. Government LPC-10E algorithm
    • (Tokyo, Japan Apr.
    • J. Campbell and T. E. Tremain, “Voiced/unvoiced classification of speech with applications to the U.S. Government LPC-10E algorithm” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing(Tokyo, Japan, Apr. 1986), pp. 473–476.
    • (1986) Proc. IEEE Int.Conf. on Acoustics, Speech, and SignalProcessing , pp. 473-476
    • Campbell, J.1    Tremain, T.E.2
  • 45
    • 0014055288 scopus 로고
    • Cepstrum pitch determination
    • Feb.
    • A. M. Noll, “Cepstrum pitch determination,” J. Acoust. Soc. America, vol. 41, no. 2, pp. 293–309, Feb. 1967.
    • (1967) J. Acoust. Soc. America , vol.41 , Issue.2 , pp. 293-309
    • Noll, A.M.1
  • 46
    • 84862272073 scopus 로고
    • Experiments in Hearing.
    • New York McGraw-Hill
    • G. von Bkésy, Experiments in Hearing. New York: McGraw-Hill, 1960.
    • (1960)
    • von Bkésy, G.1
  • 47
    • 84943676274 scopus 로고
    • Illinois Institute of Technology
    • Ph.D. dissertation Chicago Dec.
    • J. Picone, “Analytic signal processing,” Ph.D. dissertation, Illinois Institute of Technology, Chicago, Dec. 1983.
    • (1983)
    • Picone, J.1
  • 48
    • 0003564548 scopus 로고
    • Modern Control Engineering.
    • Englewood Cliffs, NJ Prentice-Hall
    • K. Ogata, Modern Control Engineering. Englewood Cliffs, NJ: Prentice-Hall, 1970.
    • (1970)
    • Ogata, K.1
  • 49
    • 0004106903 scopus 로고
    • An Introduction to the Physiology of Hearing.
    • New York Academic Press
    • J. O. Pickles, An Introduction to the Physiology of Hearing. New York: Academic Press, 1988.
    • (1988)
    • Pickles, J.O.1
  • 50
    • 0040752329 scopus 로고
    • Auditory Physiology
    • New York Academic Press
    • A. R. M/o ller, Auditory Physiology, New York: Academic Press, 1983.
    • (1983)
    • M/oller, A.R.1
  • 51
    • 0003522449 scopus 로고
    • Speech Communication: Human and Machine.
    • New York Addison-Wesley
    • D. O’Shaughnessy, Speech Communication: Human and Machine. New York: Addison-Wesley, 1987.
    • (1987)
    • O’Shaughnessy, D.1
  • 52
    • 84912495580 scopus 로고
    • Analytical expressions for critical-band rate and critical bandwidth as a function of frequency
    • Dec.
    • E. Zwicker and E. Terhardt, “Analytical expressions for critical-band rate and critical bandwidth as a function of frequency,” J. Acoust. Soc. America, vol. 68, no. 5, pp. 1523–1525, Dec. 1980.
    • (1980) J. Acoust. Soc. America , vol.68 , Issue.5 , pp. 1523-1525
    • Zwicker, E.1    Terhardt, E.2
  • 53
    • 0021794508 scopus 로고
    • Cochlear modeling
    • Sept.
    • J. B. Allen, “Cochlear modeling,” IEEE ASSP Mag., vol. 3, no. 3, pp. 3–29, Sept. 1985.
    • (1985) IEEE ASSP Mag. , vol.3 , Issue.3 , pp. 3-29
    • Allen, J.B.1
  • 54
    • 84928837806 scopus 로고
    • A joint synchrony/mean-rate model of auditory speech processing
    • Jan.
    • S. Seneff, “A joint synchrony/mean-rate model of auditory speech processing,” J. Phonetics, vol. 16, no. 1, pp. 55–76, Jan. 1988.
    • (1988) J. Phonetics , vol.16 , Issue.1 , pp. 55-76
    • Seneff, S.1
  • 55
    • 0003858079 scopus 로고
    • The Fast Fourier Transform.
    • Englewood Cliffs, NJ Prentice-Hall
    • O. E. Brigham, The Fast Fourier Transform. Englewood Cliffs, NJ: Prentice-Hall, 1974.
    • (1974)
    • Brigham, O.E.1
  • 56
    • 0015112070 scopus 로고
    • Speech analysis and synthesis by linear prediction of the speech wave
    • Mar.
    • B. S. Atal and S. L. Hanauer, “Speech analysis and synthesis by linear prediction of the speech wave,” J. Acoust. Soc. America, vol. 50, no. 2, pp. 637–655, Mar. 1971.
    • (1971) J. Acoust. Soc. America , vol.50 , Issue.2 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 57
    • 33646943734 scopus 로고
    • Digital Spectral Analysis With Applications.
    • Englewood Cliffs, NJ Prentice-Hall
    • S. L. Marple, Jr., Digital Spectral Analysis With Applications. Englewood Cliffs, NJ: Prentice-Hall, 1987.
    • (1987)
    • Marple, S.L.1
  • 58
    • 0016521547 scopus 로고
    • Quantization properties of transmission parameters in linear predictive systems
    • June
    • V. R. Viswanathan and J. Makhoul, “Quantization properties of transmission parameters in linear predictive systems,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, no. 3, pp. 309–321, June 1975.
    • (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , Issue.3 , pp. 309-321
    • Viswanathan, V.R.1    Makhoul, J.2
  • 59
    • 0020114643 scopus 로고
    • Predictive coding of speech at low bit rates
    • Apr.
    • B. S. Atal, “Predictive coding of speech at low bit rates,” IEEE Trans. Commun., vol. COM-30, no. 4, pp. 600–614, Apr. 1982.
    • (1982) IEEE Trans. Commun. , vol.COM-30 , Issue.4 , pp. 600-614
    • Atal, B.S.1
  • 60
    • 84989426403 scopus 로고
    • A new model of LPC excitation for producing natural sounding speech at low bit rates
    • (Paris, France)
    • B. S. Atal and J. R. Remde, “A new model of LPC excitation for producing natural sounding speech at low bit rates,” in Proc. IEEE Int. Conf on Acoustics, Speech, and Signal Processing (Paris, France, 1982), pp. 614–617.
    • (1982) Proc. IEEE Int. Conf on Acoustics, Speech, and Signal Processing , pp. 614-617
    • Atal, B.S.1    Remde, J.R.2
  • 62
    • 0016113916 scopus 로고
    • A parametrically controlled spectral analysis system for speech
    • Oct. (not the original reference by any means, but a good reference on digital computation of the spectrogram—original references on analog techniques date back to the 1940’s).
    • H. F. Silverman and N. R. Dixon, “A parametrically controlled spectral analysis system for speech,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-22, no. 5, pp. 362–381, Oct. 1974 (not the original reference by any means, but a good reference on digital computation of the spectrogram—original references on analog techniques date back to the 1940’s).
    • (1974) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-22 , Issue.5 , pp. 362-381
    • Silverman, H.F.1    Dixon, N.R.2
  • 63
    • 0003751555 scopus 로고
    • Complex Variables and Applications.
    • New York McGraw-Hill
    • R. V. Churchill, J. W. Brown, and R. F. Verhey, Complex Variables and Applications. New York: McGraw-Hill, 1976.
    • (1976)
    • Churchill, R.V.1    Brown, J.W.2    Verhey, R.F.3
  • 64
    • 0016067897 scopus 로고
    • Linear prediction for speaker identification
    • June
    • B. S. Atal, “Linear prediction for speaker identification,” J. Acoust. Soc. America, vol. 55, no. 6, pp. 1304–1311, June 1974.
    • (1974) J. Acoust. Soc. America , vol.55 , Issue.6 , pp. 1304-1311
    • Atal, B.S.1
  • 65
    • 0001052406 scopus 로고
    • Discrete representation of signals
    • June
    • A. V. Oppenheim and D. H. Johnson, “Discrete representation of signals,” in Proc. IEEE, vol. 60, no. 6, pp. 681–691, June 1972.
    • (1972) Proc. IEEE , vol.60 , Issue.6 , pp. 681-691
    • Oppenheim, A.V.1    Johnson, D.H.2
  • 66
    • 0003770715 scopus 로고
    • Automatic Speech Recognition: The Development of the SPHINX System.
    • Boston, MA Kluwer
    • K. F. Lee, Automatic Speech Recognition: The Development of the SPHINX System. Boston, MA: Kluwer, 1989.
    • (1989)
    • Lee, K.F.1
  • 67
    • 0024916535 scopus 로고
    • Application of hidden Markov models for recognition of a limited set of words in unconstrained speech
    • (Glasgow, Scotland,)
    • J. G. Wilpon, C. H. Lee, and L. R. Rabiner, “Application of hidden Markov models for recognition of a limited set of words in unconstrained speech,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Glasgow, Scotland, 1989), pp. 254–257.
    • (1989) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 254-257
    • Wilpon, J.G.1    Lee, C.H.2    Rabiner, L.R.3
  • 68
    • 0004181307 scopus 로고
    • Digital Filters
    • 2nd ed. Cliffs, NJ Prentice-Hall
    • R. W. Hamming, Digital Filters, 2nd ed. Englewood, Cliffs, NJ: Prentice-Hall, 1989.
    • (1989)
    • Hamming, R.W.1
  • 69
    • 0025642106 scopus 로고
    • Experiments on mixture-density phoneme modelling for the speaker-independent 1000-word speech recognition DARPA task
    • (Albuquerque, NM,) Apr.
    • H. Ney, “Experiments on mixture-density phoneme modelling for the speaker-independent 1000-word speech recognition DARPA task,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Albuquerque, NM, Apr. 1990), pp. 713–716.
    • (1990) Proc. IEEE Int.Conf. on Acoustics, Speech, and Signal Processing , pp. 713-716
    • Ney, H.1
  • 72
    • 0032097263 scopus 로고
    • Introduction to Statistical Pattern Recognition.
    • New York Academic Press
    • K. Fukunaga, Introduction to Statistical Pattern Recognition. New York: Academic Press, 1972.
    • (1972)
    • Fukunaga, K.1
  • 73
    • 0025465111 scopus 로고
    • Continuous speech recognition using hidden Markov
    • July
    • J. Picone, “Continuous speech recognition using hidden Markov models IEEE ASSP Mag., vol. 7, no. 3, pp. 26–41, July 1990.
    • (1990) models IEEE ASSP Mag. , vol.7 , Issue.3 , pp. 26-41
    • Picone, J.1
  • 74
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE, vol. 77, no. 2, pp. 257–285, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-285
    • Rabiner, L.R.1
  • 76
    • 33744639469 scopus 로고
    • Cluster Analysis for Applications.
    • New York Academic Press
    • M. R. Anderberg, Cluster Analysis for Applications. New York: Academic Press, 1973.
    • (1973)
    • Anderberg, M.R.1
  • 77
    • 0018918171 scopus 로고
    • An algorithm for vector quantizer design
    • Jan.
    • Y. Linde, A. Buzo, and R. M. Gray, “An algorithm for vector quantizer design,” IEEE Trans. Commun., vol. COM-28, no. 1, pp. 84–95, Jan. 1980.
    • (1980) IEEE Trans. Commun. , vol.COM-28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3
  • 78
    • 0018455339 scopus 로고
    • Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognition
    • Apr.
    • S. E. Levinson, L. R. Rabiner, A. E. Rosenberg, and J. G. Wilpon, “Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, no. 2, pp. 134–141, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-27 , Issue.2 , pp. 134-141
    • Levinson, S.E.1    Rabiner, L.R.2    Rosenberg, A.E.3    Wilpon, J.G.4
  • 80
    • 0020098639 scopus 로고
    • On the structure of vector quantizers
    • Mar.
    • A. Gersho, “On the structure of vector quantizers,” IEEE Trans. Informat., Theory, vol. IT-28, no. 2, pp. 157–166, Mar. 1982.
    • (1982) IEEE Trans. Informat., Theory , vol.IT-28 , Issue.2 , pp. 157-166
    • Gersho, A.1
  • 81
    • 0003527079 scopus 로고
    • Self-Organization and Associative Memory
    • 3rd ed. New York Springer-Verlag
    • T. Kohonen, Self-Organization and Associative Memory, 3rd ed. New York: Springer-Verlag, 1989.
    • (1989)
    • Kohonen, T.1
  • 82
    • 0003472470 scopus 로고
    • Pattern Classification and Scene Analysis.
    • New York Academic Press
    • R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis. New York: Academic Press, 1973.
    • (1973)
    • Duda, R.O.1    Hart, P.E.2
  • 83
    • 0016467604 scopus 로고
    • Minimum prediction residual principle applied to speech recognition
    • Feb.
    • F. Itakura, “Minimum prediction residual principle applied to speech recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, no. 1, pp. 67–72, Feb. 1975.
    • (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , Issue.1 , pp. 67-72
    • Itakura, F.1
  • 84
    • 0020795461 scopus 로고
    • On the effects of varying filter bank parameters on isolated word recognition
    • Aug.
    • B. Dautrich, L. R. Rabiner, and T. B. Martin, “On the effects of varying filter bank parameters on isolated word recognition,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-31, no. 4, pp. 793–807, Aug. 1983.
    • (1983) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-31 , Issue.4 , pp. 793-807
    • Dautrich, B.1    Rabiner, L.R.2    Martin, T.B.3
  • 85
    • 0025418339 scopus 로고
    • Duration in context clustering for speech recognition
    • Apr.
    • J.Picone, “Duration in context clustering for speech recognition,” Speech Commun., vol. 9, no. 2, pp. 119—128, Apr. 1990.
    • (1990) Speech Commun. , vol.9 , Issue.2 , pp. 119-128
    • Picone, J.1
  • 89
    • 84943681417 scopus 로고
    • Speaker adaptation from limited training in the BBN BYBLOS speech recognition system
    • Palo Alto, CA Morgan Kaufmann Feb.
    • F. Kubala, M. Feng, J. Makhoul, and R. Schwartz, “Speaker adaptation from limited training in the BBN BYBLOS speech recognition system,” in Proc. DARPA Speech and Natural Language Workshop. Palo Alto, CA: Morgan Kaufmann Pub., Feb. 1989, pp. 100–105.
    • (1989) Proc. DARPA Speech and Natural Language Workshop , pp. 100-105
    • Kubala, F.1    Feng, M.2    Makhoul, J.3    Schwartz, R.4
  • 92
    • 0026385261 scopus 로고
    • Integrating time alignment and neural networks for high performance continuous speech recognition
    • (Toronto, Ont., Canada, Apr
    • P. Haffner, M. Franzini, and A. Waibel, “Integrating time alignment and neural networks for high performance continuous speech recognition,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Toronto, Ont., Canada, Apr.1991), pp.105–109.
    • (1991) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 105-109
    • Haffner, P.1    Franzini, M.2    Waibel, A.3
  • 97
    • 0022150487 scopus 로고
    • The development of an experimental discrete dictation recognizer
    • F. Jelinek, “The development of an experimental discrete dictation recognizer,” Proc. IEEE, vol. 73, no. 11, pp. 1616–1624, Nov. 1985.
    • (1985) Proc. IEEE , vol.73 , Issue.11 , pp. 1616-1624
    • Jelinek, F.1
  • 99
    • 84943678218 scopus 로고
    • A recognition time reduction algorithm for large-vocabulary speech recognition
    • (Kobe, Japan, Nov.)
    • J. Koo, C. K. Un, H. S. Lee, H. R. Kim, and M. W. Koo, “A recognition time reduction algorithm for large-vocabulary speech recognition,” in Proc. Int. Conf. on Spoken Language Processing (Kobe, Japan, Nov. 1990), pp. 253–256.
    • (1990) Proc. Int. Conf. on Spoken Language Processing , pp. 253-256
    • Koo, J.1    Un, C.K.2    Lee, H.S.3    Kim, H.R.4    Koo, M.W.5
  • 101
    • 0346407544 scopus 로고
    • An optimal discriminative training method for continuous mixture density HMMs
    • (Kobe, Japan, Nov.)
    • S. Mizuta and K. Kakajima, “An optimal discriminative training method for continuous mixture density HMMs,” inProc. Int. Conf. on Spoken Language Processing (Kobe, Japan, Nov.1990), pp.245–248.
    • (1990) Proc. Int. Conf. on Spoken Language Processing , pp. 245-248
    • Mizuta, S.1    Kakajima, K.2
  • 102
    • 0024889251 scopus 로고
    • Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data
    • (Glasgow, Scotland,)
    • K. Yoshida, T. Watanabe, and S. Koga, “Large vocabulary word recognition based on demisyllable hidden Markov model using small amount of training data,” in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (Glasgow, Scotland, 1989), pp. 1–4.
    • (1989) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing , pp. 1-4
    • Yoshida, K.1    Watanabe, T.2    Koga, S.3
  • 103
    • 0026404742 scopus 로고
    • Word recognition using neural nets, multi-state Gaussian and K-nearest neighbor classifiers
    • (Toronto, Ont., Canada, Apr.)
    • D. Lubensky, “Word recognition using neural nets, multi-state Gaussian and K-nearest neighbor classifiers,” inProc. IEEE Int.Conf. on Acoustics, Speech, and Signal Processing (Toronto, Ont., Canada, Apr. 1991), pp. 141–144.
    • (1991) Proc. IEEE Int.Conf. on Acoustics, Speech, and Signal Processing , pp. 141-144
    • Lubensky, D.1
  • 106
    • 84942213880 scopus 로고
    • Experiments with a speakerindependent continuous speech recognition system on the TIMIT database
    • (Kobe, Japan, Nov.)
    • Y. Zhao and H. Wakita, “Experiments with a speakerindependent continuous speech recognition system onthe TIMIT database,” in Proc. Int. Conf. on Spoken Language Processing (Kobe, Japan, Nov. 1990), pp. 697–700.
    • (1990) Proc. Int. Conf. on Spoken Language Processing , pp. 697-700
    • Zhao, Y.1    Wakita, H.2
  • 110
    • 0023833734 scopus 로고
    • 1000-word speaker-independent continuous-speech recognition using hidden Markovmodels
    • (New York, Apr.)
    • H. Murveit and M. Weintraub, “1000-word speaker-independent continuous-speech recognition using hidden Markovmodels,” in Proc. IEEE Int. Conf. on Acoustics, Speech, andSignal Processing (New York, Apr. 1988), pp. 115–118.
    • (1988) Proc. IEEE Int. Conf. on Acoustics, Speech, andSignal Processing , pp. 115-118
    • Murveit, H.1    Weintraub, M.2
  • 112
    • 84943681507 scopus 로고
    • A Japanesetext dictation system based on phoneme recognition and a dependency grammar
    • (Toronto. Ont., Canada. Apr.)
    • S. Makino, A. Ito, M. Endo, and K. Kido, “A Japanese text dictation system based on phoneme recognition and a dependency grammar,” in Proc. IEEE Int. Conf. on Acoustics, Speech. and Simal Processing (Toronto. Ont., Canada. Apr.1989), pp. 699–702.
    • (1989) Proc. IEEE Int. Conf. on Acoustics, Speech. and Simal Processing , pp. 699-702
    • Makino, S.1    Ito, A.2    Endo, M.3    Kido, K.4
  • 113
    • 80052556850 scopus 로고
    • Speaker adaptable phoneme recognition selecting reliable acoustic features based on mutual information
    • (Kobe, Japan, Nov.)
    • K. Shirai, N. Hosaka, E. Kitagawa, and T. Endou, “Speaker adaptable phoneme recognition selecting reliable acoustic features based on mutual information,” in Proc. Int. Conf. on Spoken Language Processing (Kobe, Japan, Nov. 1990). pp.353–356.
    • (1990) Proc. Int. Conf. on Spoken Language Processing , pp. 353-356
    • Shirai, K.1    Hosaka, N.2    Kitagawa, E.3    Endou, T.4
  • 114
    • 30244460981 scopus 로고
    • A comparative study of acoustic representations of speech for vowel classificationusing multi-layer preceptrons
    • (Kobe, Japan, Nov.)
    • H. M. Meng and V. W. Zue, “A comparative study of acoustic representations of speech for vowel classificationusing multi-layer preceptrons,” in Proc. Int. Conf.SpokenLanguage Processing (Kobe, Japan, Nov. 1990), pp. 1053–1056.
    • (1990) Proc. Int. Conf. SpokenLanguage Processing , pp. 1053-1056
    • Meng, H.M.1    Zue, V.W.2
  • 116
    • 84889335871 scopus 로고    scopus 로고
    • Towards handling the acoustic environment in spoken language processing
    • (Banff, Alta., Canada, Oct.) 1992
    • H. Hermansky and N. Morgan, “Towards handling the acoustic environment in spoken language processing,” in Proc. Int. Conf.on Spoken Language Processing (Banff, Alta., Canada, Oct.1992), pp. 85–88.
    • Proc. Int. Conf.on Spoken Language Processing , pp. 85-88
    • Hermansky, H.1    Morgan, N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.