SCOPUS 정보 검색 플랫폼

Volumn 26, Issue 4, 2009, Pages 78-85

Updated MINDS report on speech recognition and understanding, part 2

(7) Baker, Janet M a Deng, Li b Khudanpur, Sanjeev c Lee, Chin Hui d Glass, James R e Morgan, Nelson f O'Shaughnessy, Douglas g

a Saras Institute (United States)

b UNIVERSITY OF WASHINGTON (United States)

c Johns Hopkins University (United States)

d GEORGIA INSTITUTE OF TECHNOLOGY (United States)

e MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

f UNIVERSITY OF CALIFORNIA (United States)

g INRS EMT (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTICS; AUDIO SIGNAL PROCESSING; DATA MINING; DEEP NEURAL NETWORKS; SEARCH ENGINES; SPEECH;

AUTOMATIC SPEECH RECOGNITION; COMPUTATIONAL ARCHITECTURE; HETEROGENEOUS KNOWLEDGE; HUMAN LANGUAGE TECHNOLOGIES; HUMAN SPEECH PERCEPTION; ROBUST SPEECH RECOGNITION; SPEAKER CHARACTERISTICS; SPEECH RECOGNITION SYSTEMS;

SPEECH RECOGNITION;

EID: 85032759066 PISSN: 10535888 EISSN: None Source Type: Journal
DOI: 10.1109/MSP.2009.932707 Document Type: Article

Times cited : (49)

References (68)

1
- 0030677475
- Speaker adaptive training: A maximum likelihood approach to speaker normalization
- Apr
- T. Anastasakos, J. McDonough, and J. Makhoul, "Speaker adaptive training: A maximum likelihood approach to speaker normalization," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 1997, pp. 1043-1046.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 1043-1046
- Anastasakos, T.¹ McDonough, J.² Makhoul, J.³

2
- 33745203699
- Improving speech recognition using a data-driven approach
- Sept
- G. Aradilla, J. Vepa, and H. Bourlard, "Improving speech recognition using a data-driven approach," in Proc. Eurospeech, pp. 3333-3336, Sept. 2005.
- (2005) Proc. Eurospeech , pp. 3333-3336
- Aradilla, G.¹ Vepa, J.² Bourlard, H.³

3
- 85032768965
- K. Asanovic, R. Bodik, B. C. Catanzaro, J. Gebis, P. Husbands, K. Keutzer, D. Patterson, W. Plishker, J. Shalf, S. Williams, and K. Yelick, The landscape of parallel computing research: A view from Berkeley, EECS Dept., Univ.California at Berkeley, Tech. Rep. UCB/ EECS-2006-183, Dec. 2006.
- K. Asanovic, R. Bodik, B. C. Catanzaro, J. Gebis, P. Husbands, K. Keutzer, D. Patterson, W. Plishker, J. Shalf, S. Williams, and K. Yelick, "The landscape of parallel computing research: A view from Berkeley," EECS Dept., Univ.California at Berkeley, Tech. Rep. UCB/ EECS-2006-183, Dec. 2006.

4
- 4544323356
- Combination of hidden Markov models with dynamic time warping for speech recognition
- S. Axelrod and B. Maison, "Combination of hidden Markov models with dynamic time warping for speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 2004, vol. 1, pp. 173-176.
- (2004) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 173-176
- Axelrod, S.¹ Maison, B.²

5
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- Apr
- L. Bahl, P. Brown, P. de Souza, and R. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 1986, pp. 49-52.
- (1986) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 49-52
- Bahl, L.¹ Brown, P.² de Souza, P.³ Mercer, R.⁴

6
- 0004123567
- New York: Holt
- L. Bloomfield, Language. New York: Holt, 1933.
- (1933) Language
- Bloomfield, L.¹

7
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Oct
- H. Bourlard and S. Dupont, "A new ASR approach based on independent processing and recombination of partial frequency bands," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Oct. 1996, vol. 1, pp. 426-429.
- (1996) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 426-429
- Bourlard, H.¹ Dupont, S.²

8
- 0032677683
- An efficient probabilistically sound-algorithm for segmentation and word discovery
- Feb
- M. R. Brent, "An efficient probabilistically sound-algorithm for segmentation and word discovery," Mach. Learn., vol. 34, no. 1-3, pp. 71-105, Feb. 1999.
- (1999) Mach. Learn , vol.34 , Issue.1-3 , pp. 71-105
- Brent, M.R.¹

9
- 84860169561
- A corpus-based approach to language learning,
- Ph.D. dissertation, Univ. Pennsylvania, Philadelphia, PA
- E. Brill, "A corpus-based approach to language learning," Ph.D. dissertation, Univ. Pennsylvania, Philadelphia, PA, 1993.
- (1993)
- Brill, E.¹

10
- 85032761378
- Decoding semantic category from MEG and intracranial EEG in humans
- Washington, DC
- A. Chan, S. Cash, E. Eskandar, J. M. Baker, C. Carlson, O. Devinsky, W. Doyle, R. Kuzniecky, T. Thesen, C. Wang, K. Marinkovic, and E. Halgren, "Decoding semantic category from MEG and intracranial EEG in humans," in Proc. Neuroscience 2008 Conf., Washington, DC.
- Proc. Neuroscience 2008 Conf
- Chan, A.¹ Cash, S.² Eskandar, E.³ Baker, J.M.⁴ Carlson, C.⁵ Devinsky, O.⁶ Doyle, W.⁷ Kuzniecky, R.⁸ Thesen, T.⁹ Wang, C.¹⁰ Marinkovic, K.¹¹ Halgren, E.¹²

11
- 0003793394
- New York: Praeger
- N. A. Chomsky, Knowledge of Language: Is Nature, Origin, and Use. New York: Praeger, 1986.
- (1986) Knowledge of Language: Is Nature, Origin, and Use
- Chomsky, N.A.¹

12
- 85032756635
- A. Clark, Unsupervised language acquisition: Theory and practice, Ph.D. dissertation, Univ. Sussex, Brighton, U.K., 2001.
- A. Clark, "Unsupervised language acquisition: Theory and practice," Ph.D. dissertation, Univ. Sussex, Brighton, U.K., 2001.

13
- 33745224873
- Vocal tract normalization in speech recognition: Compensating for systematic speaker variability
- May
- J. Cohen, T. Kamm, and A. G. Andreou, "Vocal tract normalization in speech recognition: Compensating for systematic speaker variability," J. Acoust. Soc. Amer., vol. 97, no. 5, pp. 3246-3247, May 1995.
- (1995) J. Acoust. Soc. Amer , vol.97 , Issue.5 , pp. 3246-3247
- Cohen, J.¹ Kamm, T.² Andreou, A.G.³

14
- 0346594072
- Language acquisition in the absence of experience
- Dec
- S. Crain, "Language acquisition in the absence of experience," Behav. Brain Sci., vol. 14, no. 4, pp. 601-699, Dec. 1991.
- (1991) Behav. Brain Sci , vol.14 , Issue.4 , pp. 601-699
- Crain, S.¹

15
- 0035312570
- Spatiotemporal mapping of brain activity by integration of multiple imaging modalities
- A. M. Dale and E. Halgren, "Spatiotemporal mapping of brain activity by integration of multiple imaging modalities," Curr. Opin. Neurobiol. vol. 11, no. 2, pp. 202-208, 2001.
- (2001) Curr. Opin. Neurobiol , vol.11 , Issue.2 , pp. 202-208
- Dale, A.M.¹ Halgren, E.²

16
- 0004241790
- Unsupervised language acquisition,
- Ph.D. dissertation, MIT, Cambridge, MA
- C. G. de Marcken, "Unsupervised language acquisition," Ph.D. dissertation, MIT, Cambridge, MA, 1996.
- (1996)
- de Marcken, C.G.¹

17
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Aksmanovic, D. Sun, and J. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 507-520, 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, D.³ Wu, J.⁴

18
- 34047266395
- L. Deng, D. Yu, and A. Acero, Structured speech modeling, IEEE Trans. Audio, Speech Lang. Process. (Special Issue on Rich Transcription), 14, no. 5, pp. 1492-1504, Sept. 2006.
- L. Deng, D. Yu, and A. Acero, "Structured speech modeling," IEEE Trans. Audio, Speech Lang. Process. (Special Issue on Rich Transcription), vol. 14, no. 5, pp. 1492-1504, Sept. 2006.

19
- 4243117872
- New York: Marcel Dekker
- L. Deng and D. O'Shaughnessy, Speech Processing - A Dynamic and Optimization-oriented Approach. New York: Marcel Dekker, 2003.
- (2003) Speech Processing - A Dynamic and Optimization-oriented Approach
- Deng, L.¹ O'Shaughnessy, D.²

20
- 85008543637
- Beamforming microphone arrays for speech enhancement
- K. Farrell, R. Mammone, and J. Flanagan, "Beamforming microphone arrays for speech enhancement," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, 1992, pp. 285-288.
- (1992) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing , pp. 285-288
- Farrell, K.¹ Mammone, R.² Flanagan, J.³

21
- 84949826552
- The frame net database and software tools
- Las Palmas
- C. J. Fillmore, C. F. Baker, and H. Sato, "The frame net database and software tools," in Proc. 3rd Int. Conf. Language Resources and Evaluation (LREC), Las Palmas, 2002, pp. 1157-1160.
- (2002) Proc. 3rd Int. Conf. Language Resources and Evaluation (LREC) , pp. 1157-1160
- Fillmore, C.J.¹ Baker, C.F.² Sato, H.³

22
- 34547549792
- Speech recognition using linear dynamic models
- J. Frankel and S. King, "Speech recognition using linear dynamic models," IEEE Trans. Audio, Speech Lang. Process., vol. 15, no. 1, pp. 246-256, 2007.
- (2007) IEEE Trans. Audio, Speech Lang. Process , vol.15 , Issue.1 , pp. 246-256
- Frankel, J.¹ King, S.²

23
- 58849145971
- ASR - Articulatory Speech Recognition
- Aalborg, Denmark
- J. Frankel and S. King, "ASR - Articulatory Speech Recognition," in Proc. Eurospeech, Aalborg, Denmark, 2001, pp. 599-602.
- (2001) Proc. Eurospeech , pp. 599-602
- Frankel, J.¹ King, S.²

24
- 85032775863
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains," IEEE Trans. Speech Audio Process., no. 7, pp. 711-720, 1997.
- (1997) IEEE Trans. Speech Audio Process , Issue.7 , pp. 711-720
- Gauvain, J.-L.¹ Lee, C.-H.²

25
- 0029114364
- Mapping function in the brain with magnetoencephalography, anatomical magnetic resonance imaging, and functional magnetic resonance imaging
- J. S. George, C. J. Aine, J. C. Mosher, D. M. Schmidt, D. M. Ranken, and H. A. Schlitt, "Mapping function in the brain with magnetoencephalography, anatomical magnetic resonance imaging, and functional magnetic resonance imaging," J. Clin. Neurophysiol., vol. 12, no. 5, pp. 406-431, 1995.
- (1995) J. Clin. Neurophysiol , vol.12 , Issue.5 , pp. 406-431
- George, J.S.¹ Aine, C.J.² Mosher, J.C.³ Schmidt, D.M.⁴ Ranken, D.M.⁵ Schlitt, H.A.⁶

26
- 0038359548
- J. R. Glass, A probabilistic framework for segment-based speech recognition, Comput., Speech Lang., 17, no. 2-3, pp. 137-152, 2003 (Eds.: M. Russell and J. Bilmes, Special Issue).
- J. R. Glass, "A probabilistic framework for segment-based speech recognition," Comput., Speech Lang., vol. 17, no. 2-3, pp. 137-152, 2003 (Eds.: M. Russell and J. Bilmes, Special Issue).

27
- 0002068513
- Cambridge, MA: Blackwell Publishers
- H. Goodluck, Language Acquisition. Cambridge, MA: Blackwell Publishers, 1991.
- (1991) Language Acquisition
- Goodluck, H.¹

28
- 0344147463
- Contribution of fine phonetic detail to speech understanding
- Barcelona, Spain
- S. Hawkins, "Contribution of fine phonetic detail to speech understanding," in Proc. 15th Int. Congress of Phonetic Sciences (ICPhS-03), Barcelona, Spain, 2003, pp. 293-296.
- (2003) Proc. 15th Int. Congress of Phonetic Sciences (ICPhS-03) , pp. 293-296
- Hawkins, S.¹

29
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

30
- 0030355778
- Using accent-specific pronunciation modeling for robust speech recognition
- J. J. Humphries, P. C. Woodland, and D. Pearce, "Using accent-specific pronunciation modeling for robust speech recognition," in Proc. Int. Conf. Spoken Language Processing, 1996, pp. 2324-2327.
- (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2324-2327
- Humphries, J.J.¹ Woodland, P.C.² Pearce, D.³

31
- 68349094559
- Speech recognition on vector architectures,
- Ph.D. dissertation, Univ. California, Berkeley
- A. Janin, "Speech recognition on vector architectures," Ph.D. dissertation, Univ. California, Berkeley, 2004.
- (2004)
- Janin, A.¹

32
- 0003786003
- Cambridge, MA: MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition. Cambridge, MA: MIT Press, 1997.
- (1997) Statistical Methods for Speech Recognition
- Jelinek, F.¹

33
- 0016939124
- Continuous speech recognition by statistical methods
- F. Jelinek, "Continuous speech recognition by statistical methods," Proc. IEEE, vol. 64, no. 4, pp. 532-557, 1976.
- (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 532-557
- Jelinek, F.¹

34
- 15844399848
- Vocabulary independent word confidence measure using subword features
- Sydney, Australia
- L. Jiang and X. D. Huang, "Vocabulary independent word confidence measure using subword features," in Proc. Int. Conf. Spoken Language Processing, Sydney, Australia, 1998, pp. 401-404.
- (1998) Proc. Int. Conf. Spoken Language Processing , pp. 401-404
- Jiang, L.¹ Huang, X.D.²

35
- 0003914808
- Cambridge MA: MIT Press/Bradford Books
- P. W. Jusczyk, The Discovery of Spoken Language. Cambridge MA: MIT Press/Bradford Books, 1997.
- (1997) The Discovery of Spoken Language
- Jusczyk, P.W.¹

36
- 0029351511
- Infants' detection of sound patterns of words in fluent speech
- Aug
- P. W. Jusczyk and R. N. Aslin, "Infants' detection of sound patterns of words in fluent speech," Cogn. Psychol., vol. 29, no. 1, pp. 1-23, Aug. 1995.
- (1995) Cogn. Psychol , vol.29 , Issue.1 , pp. 1-23
- Jusczyk, P.W.¹ Aslin, R.N.²

37
- 85032779194
- Identifying unexpected words using in-context and out-of-context phoneme posteriors,
- Tech. Rep, IDIAPRR 06-68
- H. Ketabdar and H. Hermansky, "Identifying unexpected words using in-context and out-of-context phoneme posteriors," Tech. Rep., IDIAPRR 06-68, 2006.
- (2006)
- Ketabdar, H.¹ Hermansky, H.²

38
- 84964379003
- From tree bank to prop bank
- Canary Islands, Spain
- P. Kingsbury and M. Palmer, "From tree bank to prop bank," in Proc. LREC, Las Palmas, Canary Islands, Spain, 2002.
- (2002) Proc. LREC, Las Palmas
- Kingsbury, P.¹ Palmer, M.²

39
- 0141703242
- K. Kirchhoff, J. Bilmes, S. Das, N. Duta, M. Egan, J. Gang, H. Feng, J. Henderson, L. Daben, M. Noamany, P. Schone, R. Schwartz, and D. Vergyri, Novel approaches to Arabic speech recognition: Report from the 2002 Johns-Hopkins summer workshop, in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 2003, pp. 344-347.
- K. Kirchhoff, J. Bilmes, S. Das, N. Duta, M. Egan, J. Gang, H. Feng, J. Henderson, L. Daben, M. Noamany, P. Schone, R. Schwartz, and D. Vergyri, "Novel approaches to Arabic speech recognition: Report from the 2002 Johns-Hopkins summer workshop," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 2003, pp. 344-347.

40
- 33746094611
- The unsupervised learning of natural language structure,
- Ph.D. dissertation, Stanford Univ, Palo Alto, CA
- D. Klein, "The unsupervised learning of natural language structure," Ph.D. dissertation, Stanford Univ., Palo Alto, CA, 2005.
- (2005)
- Klein, D.¹

41
- 84864010278
- Speaker adaptation of continuous density HMMs using multivariate linear regression
- C. Leggetter and P. Woodland, "Speaker adaptation of continuous density HMMs using multivariate linear regression," in Proc. Int. Conf. Spoken Language Processing, 1994, pp. 451-454.
- (1994) Proc. Int. Conf. Spoken Language Processing , pp. 451-454
- Leggetter, C.¹ Woodland, P.²

42
- 0002502431
- Languages and language
- K. Gunderson, Ed. Minneapolis, MN: Univ. Minnesota Press
- D. Lewis, "Languages and language," in Language, Mind, and Knowledge, K. Gunderson, Ed. Minneapolis, MN: Univ. Minnesota Press, 1975, pp. 3-35.
- (1975) Language, Mind, and Knowledge , pp. 3-35
- Lewis, D.¹

43
- 33745220761
- An investigation into a simulation of episodic memory for automatic speech recognition
- Lisbon, Portugal, 5-9 Sept
- V. Maier and R. K. Moore, "An investigation into a simulation of episodic memory for automatic speech recognition," in Proc. Interspeech 2005, Lisbon, Portugal, 5-9 Sept. 2005, pp. 1245-1248.
- (2005) Proc. Interspeech 2005 , pp. 1245-1248
- Maier, V.¹ Moore, R.K.²

44
- 1642276395
- Spatiotemporal dynamics of word processing in the human cortex
- K. Marinkovic, "Spatiotemporal dynamics of word processing in the human cortex," Neuroscientist, vol. 10, no. 2, pp. 142-152, 2004.
- (2004) Neuroscientist , vol.10 , Issue.2 , pp. 142-152
- Marinkovic, K.¹

45
- 45749091592
- Predicting human brain activity associated with the meanings of nouns
- T. M. Mitchell, S. V. Shinkareva, A. Carlson, K.-M. Chang, V. L. Malave, R. A. Mason, and M. A. Just, "Predicting human brain activity associated with the meanings of nouns," Science, vol. 320, no. 5880, pp. 1191-1195, 2008.
- (2008) Science , vol.320 , Issue.5880 , pp. 1191-1195
- Mitchell, T.M.¹ Shinkareva, S.V.² Carlson, A.³ Chang, K.-M.⁴ Malave, V.L.⁵ Mason, R.A.⁶ Just, M.A.⁷

46
- 85032751546
- Pushing the envelope-aside
- Sept
- N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos, "Pushing the envelope-aside," IEEE Signal Processing Mag., vol. 22, no. 5, pp. 81-88, Sept. 2005.
- (2005) IEEE Signal Processing Mag , vol.22 , Issue.5 , pp. 81-88
- Morgan, N.¹ Zhu, Q.² Stolcke, A.³ Sonmez, K.⁴ Sivadas, S.⁵ Shinozaki, T.⁶ Ostendorf, M.⁷ Jain, P.⁸ Hermansky, H.⁹ Ellis, D.¹⁰ Doddington, G.¹¹ Chen, B.¹² Cetin, O.¹³ Bourlard, H.¹⁴ Athineos, M.¹⁵

47
- 84892163293
- Combining multiple estimators of speaking rate
- N. Morgan and E. Fosler-Lussier, "Combining multiple estimators of speaking rate," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, 1998, pp. 729-732.
- (1998) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing , pp. 729-732
- Morgan, N.¹ Fosler-Lussier, E.²

48
- 0033676801
- Denoising of human speech using combined acoustic and EM sensor signal processing
- 5-9 June, Istanbul, Turkey, pp
- L. C. Ng, G. C. Burnett, J. F. Holzrichter, and T. J. Gable, "Denoising of human speech using combined acoustic and EM sensor signal processing," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, 5-9 June 2000, Istanbul, Turkey, pp. 229-232.
- (2000) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing , pp. 229-232
- Ng, L.C.¹ Burnett, G.C.² Holzrichter, J.F.³ Gable, T.J.⁴

49
- 34250826265
- A conversation with John Hennessy and David Patterson
- Dec./Jan
- K. Olukotun, "A conversation with John Hennessy and David Patterson," ACM Queue Mag., vol. 4, no. 10, pp. 14-22, Dec./Jan. 2006-2007.
- (2006) ACM Queue Mag , vol.4 , Issue.10 , pp. 14-22
- Olukotun, K.¹

50
- 0022794148
- Speaker recognition
- D. O'Shaughnessy, "Speaker recognition," IEEE Acoust. Speech Signal Process. Mag., vol. 3, no. 4, pp. 4-17, 1986.
- (1986) IEEE Acoust. Speech Signal Process. Mag , vol.3 , Issue.4 , pp. 4-17
- O'Shaughnessy, D.¹

51
- 0030245363
- From HMMs to segment models: A unified view of stochastic modeling for speech recognition
- M. Ostendorf, V. Digalakis, and J. Rohlicek, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 360-378, 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.² Rohlicek, J.³

52
- 64849086376
- Unsupervised pattern discovery in speech: Applications to word acquisition and speaker segmentation,
- Ph.D. dissertation, MIT, Cambridge, MA
- A. Park, "Unsupervised pattern discovery in speech: Applications to word acquisition and speaker segmentation," Ph.D. dissertation, MIT, Cambridge, MA, 2006.
- (2006)
- Park, A.¹

53
- 85059598488
- Inside-outside re-estimation from partially bracketed corpora
- Newark, DE
- F. Pereira and Y. Schabes, "Inside-outside re-estimation from partially bracketed corpora," in 30th Annu. Meeting of the Association for Computational Linguistics, Newark, DE, 1992, pp. 128-135.
- (1992) 30th Annu. Meeting of the Association for Computational Linguistics , pp. 128-135
- Pereira, F.¹ Schabes, Y.²

54
- 0004263661
- New York: William Morrow and Co
- S. Pinker. The Language Instinct. New York: William Morrow and Co., 1994.
- (1994) The Language Instinct
- Pinker, S.¹

55
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. Rabiner and B. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.²

56
- 0028996967
- Lattice-based search strategies for large vocabulary speech recognition
- May
- F. Richardson, M. Ostendorf, and J. R. Rohlicek, "Lattice-based search strategies for large vocabulary speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 1995, pp. 576-579.
- (1995) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing , pp. 576-579
- Richardson, F.¹ Ostendorf, M.² Rohlicek, J.R.³

57
- 84881675408
- Cepstral channel normalization techniques for HMM-based speaker verification
- A. E. Rosenberg, C. H. Lee, and F. K. Soong, "Cepstral channel normalization techniques for HMM-based speaker verification," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, 1994, pp. 1835-1838.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing , pp. 1835-1838
- Rosenberg, A.E.¹ Lee, C.H.² Soong, F.K.³

58
- 0036152936
- Learning words from sights and sounds: A computational model
- Jan
- D. Roy and A. Pentland, "Learning words from sights and sounds: A computational model," Cogn. Sci., vol. 26, no. 1, pp. 113-146, Jan. 2002.
- (2002) Cogn. Sci , vol.26 , Issue.1 , pp. 113-146
- Roy, D.¹ Pentland, A.²

59
- 0036629220
- Constraints on statistical language learning
- July
- J. R. Saffran, "Constraints on statistical language learning," J. Mem. Lang., vol. 47, no. 1, pp. 172-196, July 2002.
- (2002) J. Mem. Lang , vol.47 , Issue.1 , pp. 172-196
- Saffran, J.R.¹

60
- 33244496414
- Unsupervised context sensitive language acquisition from a large corpus
- L. Saul, Ed. Cambridge, MA: MIT Press
- Z. Solan, D. Horn, E. Ruppin, and S. Edelman, "Unsupervised context sensitive language acquisition from a large corpus," in Advances in Neural Information Processing Systems, L. Saul, Ed. Cambridge, MA: MIT Press, vol. 16, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.16
- Solan, Z.¹ Horn, D.² Ruppin, E.³ Edelman, S.⁴

61
- 56249109227
- How to handle pronunciation variation in ASR: By storing episodes in memory?
- Toulouse, France, May
- H. Strik, "How to handle pronunciation variation in ASR: By storing episodes in memory?," in Proc. ITRW on Speech Recognition and Intrinsic Variation (SRIV2006), Toulouse, France, May 2006, pp. 33-38.
- (2006) Proc. ITRW on Speech Recognition and Intrinsic Variation (SRIV2006) , pp. 33-38
- Strik, H.¹

62
- 0036165806
- An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition
- Feb
- J. Sun and L. Deng, "An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition," J. Acoust. Soc. Amer., vol. 111, no. 2, pp. 1086-1101, Feb. 2002.
- (2002) J. Acoust. Soc. Amer , vol.111 , Issue.2 , pp. 1086-1101
- Sun, J.¹ Deng, L.²

63
- 0008501167
- A statistical model for word discovery in transcribed speech
- Sept
- A. Venkataraman, "A statistical model for word discovery in transcribed speech," Comput. Linguist., vol. 27, no. 3, pp. 352-372, Sept. 2001.
- (2001) Comput. Linguist , vol.27 , Issue.3 , pp. 352-372
- Venkataraman, A.¹

64
- 85009227403
- Data-driven example based continuous speech recognition
- Geneva, Sept
- M. Wachter, K. Demuynck, D. Van Compernolle, and P. Wambacq, "Data-driven example based continuous speech recognition," in Proc. EUROSPEECH, Geneva, Sept. 2003, pp. 1133-1136.
- (2003) Proc. EUROSPEECH , pp. 1133-1136
- Wachter, M.¹ Demuynck, K.² Van Compernolle, D.³ Wambacq, P.⁴

65
- 34547512577
- Boosting HMM performance with a memory upgrade
- Pittsburgh, PA, Sept
- M. Wachter, K. Demuynck, and D. Van Compernolle, "Boosting HMM performance with a memory upgrade," in Proc. Interspeech, Pittsburgh, PA, Sept. 2006, pp. 1730-1733.
- (2006) Proc. Interspeech , pp. 1730-1733
- Wachter, M.¹ Demuynck, K.² Van Compernolle, D.³

66
- 0141814617
- Comparison of acoustic model adaptation techniques on non-native speech
- Z. Wang, T. Schultz, and A. Waibel, "Comparison of acoustic model adaptation techniques on non-native speech," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2003, pp. 540-543.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing , pp. 540-543
- Wang, Z.¹ Schultz, T.² Waibel, A.³

67
- 0343249600
- Performance improvements through combining phone-and syllable-scale information in automatic speech recognition
- Sydney, Australia
- S. Wu, B. Kingsbury, N. Morgan, and S. Greenberg, "Performance improvements through combining phone-and syllable-scale information in automatic speech recognition," in Proc. Int. Conf. Spoken Language Processing, Sydney, Australia, 1998, pp. 854-857.
- (1998) Proc. Int. Conf. Spoken Language Processing , pp. 854-857
- Wu, S.¹ Kingsbury, B.² Morgan, N.³ Greenberg, S.⁴

68
- 0029733178
- Comparison of four approaches to automatic language identification of telephone speech
- Jan
- M. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 31-44, Jan. 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.1 , pp. 31-44
- Zissman, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.