메뉴 건너뛰기




Volumn 26, Issue 4, 2009, Pages 78-85

Updated MINDS report on speech recognition and understanding, part 2

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTICS; AUDIO SIGNAL PROCESSING; DATA MINING; DEEP NEURAL NETWORKS; SEARCH ENGINES; SPEECH;

EID: 85032759066     PISSN: 10535888     EISSN: None     Source Type: Journal    
DOI: 10.1109/MSP.2009.932707     Document Type: Article
Times cited : (49)

References (68)
  • 2
    • 33745203699 scopus 로고    scopus 로고
    • Improving speech recognition using a data-driven approach
    • Sept
    • G. Aradilla, J. Vepa, and H. Bourlard, "Improving speech recognition using a data-driven approach," in Proc. Eurospeech, pp. 3333-3336, Sept. 2005.
    • (2005) Proc. Eurospeech , pp. 3333-3336
    • Aradilla, G.1    Vepa, J.2    Bourlard, H.3
  • 3
    • 85032768965 scopus 로고    scopus 로고
    • K. Asanovic, R. Bodik, B. C. Catanzaro, J. Gebis, P. Husbands, K. Keutzer, D. Patterson, W. Plishker, J. Shalf, S. Williams, and K. Yelick, The landscape of parallel computing research: A view from Berkeley, EECS Dept., Univ.California at Berkeley, Tech. Rep. UCB/ EECS-2006-183, Dec. 2006.
    • K. Asanovic, R. Bodik, B. C. Catanzaro, J. Gebis, P. Husbands, K. Keutzer, D. Patterson, W. Plishker, J. Shalf, S. Williams, and K. Yelick, "The landscape of parallel computing research: A view from Berkeley," EECS Dept., Univ.California at Berkeley, Tech. Rep. UCB/ EECS-2006-183, Dec. 2006.
  • 4
  • 7
    • 0030355935 scopus 로고    scopus 로고
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • Oct
    • H. Bourlard and S. Dupont, "A new ASR approach based on independent processing and recombination of partial frequency bands," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Oct. 1996, vol. 1, pp. 426-429.
    • (1996) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 426-429
    • Bourlard, H.1    Dupont, S.2
  • 8
    • 0032677683 scopus 로고    scopus 로고
    • An efficient probabilistically sound-algorithm for segmentation and word discovery
    • Feb
    • M. R. Brent, "An efficient probabilistically sound-algorithm for segmentation and word discovery," Mach. Learn., vol. 34, no. 1-3, pp. 71-105, Feb. 1999.
    • (1999) Mach. Learn , vol.34 , Issue.1-3 , pp. 71-105
    • Brent, M.R.1
  • 9
    • 84860169561 scopus 로고
    • A corpus-based approach to language learning,
    • Ph.D. dissertation, Univ. Pennsylvania, Philadelphia, PA
    • E. Brill, "A corpus-based approach to language learning," Ph.D. dissertation, Univ. Pennsylvania, Philadelphia, PA, 1993.
    • (1993)
    • Brill, E.1
  • 12
    • 85032756635 scopus 로고    scopus 로고
    • A. Clark, Unsupervised language acquisition: Theory and practice, Ph.D. dissertation, Univ. Sussex, Brighton, U.K., 2001.
    • A. Clark, "Unsupervised language acquisition: Theory and practice," Ph.D. dissertation, Univ. Sussex, Brighton, U.K., 2001.
  • 13
    • 33745224873 scopus 로고
    • Vocal tract normalization in speech recognition: Compensating for systematic speaker variability
    • May
    • J. Cohen, T. Kamm, and A. G. Andreou, "Vocal tract normalization in speech recognition: Compensating for systematic speaker variability," J. Acoust. Soc. Amer., vol. 97, no. 5, pp. 3246-3247, May 1995.
    • (1995) J. Acoust. Soc. Amer , vol.97 , Issue.5 , pp. 3246-3247
    • Cohen, J.1    Kamm, T.2    Andreou, A.G.3
  • 14
    • 0346594072 scopus 로고
    • Language acquisition in the absence of experience
    • Dec
    • S. Crain, "Language acquisition in the absence of experience," Behav. Brain Sci., vol. 14, no. 4, pp. 601-699, Dec. 1991.
    • (1991) Behav. Brain Sci , vol.14 , Issue.4 , pp. 601-699
    • Crain, S.1
  • 15
    • 0035312570 scopus 로고    scopus 로고
    • Spatiotemporal mapping of brain activity by integration of multiple imaging modalities
    • A. M. Dale and E. Halgren, "Spatiotemporal mapping of brain activity by integration of multiple imaging modalities," Curr. Opin. Neurobiol. vol. 11, no. 2, pp. 202-208, 2001.
    • (2001) Curr. Opin. Neurobiol , vol.11 , Issue.2 , pp. 202-208
    • Dale, A.M.1    Halgren, E.2
  • 16
    • 0004241790 scopus 로고    scopus 로고
    • Unsupervised language acquisition,
    • Ph.D. dissertation, MIT, Cambridge, MA
    • C. G. de Marcken, "Unsupervised language acquisition," Ph.D. dissertation, MIT, Cambridge, MA, 1996.
    • (1996)
    • de Marcken, C.G.1
  • 17
    • 0028516022 scopus 로고
    • Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
    • L. Deng, M. Aksmanovic, D. Sun, and J. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 507-520, 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 507-520
    • Deng, L.1    Aksmanovic, M.2    Sun, D.3    Wu, J.4
  • 18
    • 34047266395 scopus 로고    scopus 로고
    • L. Deng, D. Yu, and A. Acero, Structured speech modeling, IEEE Trans. Audio, Speech Lang. Process. (Special Issue on Rich Transcription), 14, no. 5, pp. 1492-1504, Sept. 2006.
    • L. Deng, D. Yu, and A. Acero, "Structured speech modeling," IEEE Trans. Audio, Speech Lang. Process. (Special Issue on Rich Transcription), vol. 14, no. 5, pp. 1492-1504, Sept. 2006.
  • 22
    • 34547549792 scopus 로고    scopus 로고
    • Speech recognition using linear dynamic models
    • J. Frankel and S. King, "Speech recognition using linear dynamic models," IEEE Trans. Audio, Speech Lang. Process., vol. 15, no. 1, pp. 246-256, 2007.
    • (2007) IEEE Trans. Audio, Speech Lang. Process , vol.15 , Issue.1 , pp. 246-256
    • Frankel, J.1    King, S.2
  • 23
    • 58849145971 scopus 로고    scopus 로고
    • ASR - Articulatory Speech Recognition
    • Aalborg, Denmark
    • J. Frankel and S. King, "ASR - Articulatory Speech Recognition," in Proc. Eurospeech, Aalborg, Denmark, 2001, pp. 599-602.
    • (2001) Proc. Eurospeech , pp. 599-602
    • Frankel, J.1    King, S.2
  • 24
    • 85032775863 scopus 로고    scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains," IEEE Trans. Speech Audio Process., no. 7, pp. 711-720, 1997.
    • (1997) IEEE Trans. Speech Audio Process , Issue.7 , pp. 711-720
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 25
    • 0029114364 scopus 로고
    • Mapping function in the brain with magnetoencephalography, anatomical magnetic resonance imaging, and functional magnetic resonance imaging
    • J. S. George, C. J. Aine, J. C. Mosher, D. M. Schmidt, D. M. Ranken, and H. A. Schlitt, "Mapping function in the brain with magnetoencephalography, anatomical magnetic resonance imaging, and functional magnetic resonance imaging," J. Clin. Neurophysiol., vol. 12, no. 5, pp. 406-431, 1995.
    • (1995) J. Clin. Neurophysiol , vol.12 , Issue.5 , pp. 406-431
    • George, J.S.1    Aine, C.J.2    Mosher, J.C.3    Schmidt, D.M.4    Ranken, D.M.5    Schlitt, H.A.6
  • 26
    • 0038359548 scopus 로고    scopus 로고
    • J. R. Glass, A probabilistic framework for segment-based speech recognition, Comput., Speech Lang., 17, no. 2-3, pp. 137-152, 2003 (Eds.: M. Russell and J. Bilmes, Special Issue).
    • J. R. Glass, "A probabilistic framework for segment-based speech recognition," Comput., Speech Lang., vol. 17, no. 2-3, pp. 137-152, 2003 (Eds.: M. Russell and J. Bilmes, Special Issue).
  • 27
  • 28
    • 0344147463 scopus 로고    scopus 로고
    • Contribution of fine phonetic detail to speech understanding
    • Barcelona, Spain
    • S. Hawkins, "Contribution of fine phonetic detail to speech understanding," in Proc. 15th Int. Congress of Phonetic Sciences (ICPhS-03), Barcelona, Spain, 2003, pp. 293-296.
    • (2003) Proc. 15th Int. Congress of Phonetic Sciences (ICPhS-03) , pp. 293-296
    • Hawkins, S.1
  • 31
    • 68349094559 scopus 로고    scopus 로고
    • Speech recognition on vector architectures,
    • Ph.D. dissertation, Univ. California, Berkeley
    • A. Janin, "Speech recognition on vector architectures," Ph.D. dissertation, Univ. California, Berkeley, 2004.
    • (2004)
    • Janin, A.1
  • 33
    • 0016939124 scopus 로고
    • Continuous speech recognition by statistical methods
    • F. Jelinek, "Continuous speech recognition by statistical methods," Proc. IEEE, vol. 64, no. 4, pp. 532-557, 1976.
    • (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 532-557
    • Jelinek, F.1
  • 34
    • 15844399848 scopus 로고    scopus 로고
    • Vocabulary independent word confidence measure using subword features
    • Sydney, Australia
    • L. Jiang and X. D. Huang, "Vocabulary independent word confidence measure using subword features," in Proc. Int. Conf. Spoken Language Processing, Sydney, Australia, 1998, pp. 401-404.
    • (1998) Proc. Int. Conf. Spoken Language Processing , pp. 401-404
    • Jiang, L.1    Huang, X.D.2
  • 36
    • 0029351511 scopus 로고
    • Infants' detection of sound patterns of words in fluent speech
    • Aug
    • P. W. Jusczyk and R. N. Aslin, "Infants' detection of sound patterns of words in fluent speech," Cogn. Psychol., vol. 29, no. 1, pp. 1-23, Aug. 1995.
    • (1995) Cogn. Psychol , vol.29 , Issue.1 , pp. 1-23
    • Jusczyk, P.W.1    Aslin, R.N.2
  • 37
    • 85032779194 scopus 로고    scopus 로고
    • Identifying unexpected words using in-context and out-of-context phoneme posteriors,
    • Tech. Rep, IDIAPRR 06-68
    • H. Ketabdar and H. Hermansky, "Identifying unexpected words using in-context and out-of-context phoneme posteriors," Tech. Rep., IDIAPRR 06-68, 2006.
    • (2006)
    • Ketabdar, H.1    Hermansky, H.2
  • 38
    • 84964379003 scopus 로고    scopus 로고
    • From tree bank to prop bank
    • Canary Islands, Spain
    • P. Kingsbury and M. Palmer, "From tree bank to prop bank," in Proc. LREC, Las Palmas, Canary Islands, Spain, 2002.
    • (2002) Proc. LREC, Las Palmas
    • Kingsbury, P.1    Palmer, M.2
  • 39
    • 0141703242 scopus 로고    scopus 로고
    • K. Kirchhoff, J. Bilmes, S. Das, N. Duta, M. Egan, J. Gang, H. Feng, J. Henderson, L. Daben, M. Noamany, P. Schone, R. Schwartz, and D. Vergyri, Novel approaches to Arabic speech recognition: Report from the 2002 Johns-Hopkins summer workshop, in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 2003, pp. 344-347.
    • K. Kirchhoff, J. Bilmes, S. Das, N. Duta, M. Egan, J. Gang, H. Feng, J. Henderson, L. Daben, M. Noamany, P. Schone, R. Schwartz, and D. Vergyri, "Novel approaches to Arabic speech recognition: Report from the 2002 Johns-Hopkins summer workshop," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 2003, pp. 344-347.
  • 40
    • 33746094611 scopus 로고    scopus 로고
    • The unsupervised learning of natural language structure,
    • Ph.D. dissertation, Stanford Univ, Palo Alto, CA
    • D. Klein, "The unsupervised learning of natural language structure," Ph.D. dissertation, Stanford Univ., Palo Alto, CA, 2005.
    • (2005)
    • Klein, D.1
  • 41
    • 84864010278 scopus 로고
    • Speaker adaptation of continuous density HMMs using multivariate linear regression
    • C. Leggetter and P. Woodland, "Speaker adaptation of continuous density HMMs using multivariate linear regression," in Proc. Int. Conf. Spoken Language Processing, 1994, pp. 451-454.
    • (1994) Proc. Int. Conf. Spoken Language Processing , pp. 451-454
    • Leggetter, C.1    Woodland, P.2
  • 42
    • 0002502431 scopus 로고
    • Languages and language
    • K. Gunderson, Ed. Minneapolis, MN: Univ. Minnesota Press
    • D. Lewis, "Languages and language," in Language, Mind, and Knowledge, K. Gunderson, Ed. Minneapolis, MN: Univ. Minnesota Press, 1975, pp. 3-35.
    • (1975) Language, Mind, and Knowledge , pp. 3-35
    • Lewis, D.1
  • 43
    • 33745220761 scopus 로고    scopus 로고
    • An investigation into a simulation of episodic memory for automatic speech recognition
    • Lisbon, Portugal, 5-9 Sept
    • V. Maier and R. K. Moore, "An investigation into a simulation of episodic memory for automatic speech recognition," in Proc. Interspeech 2005, Lisbon, Portugal, 5-9 Sept. 2005, pp. 1245-1248.
    • (2005) Proc. Interspeech 2005 , pp. 1245-1248
    • Maier, V.1    Moore, R.K.2
  • 44
    • 1642276395 scopus 로고    scopus 로고
    • Spatiotemporal dynamics of word processing in the human cortex
    • K. Marinkovic, "Spatiotemporal dynamics of word processing in the human cortex," Neuroscientist, vol. 10, no. 2, pp. 142-152, 2004.
    • (2004) Neuroscientist , vol.10 , Issue.2 , pp. 142-152
    • Marinkovic, K.1
  • 49
    • 34250826265 scopus 로고    scopus 로고
    • A conversation with John Hennessy and David Patterson
    • Dec./Jan
    • K. Olukotun, "A conversation with John Hennessy and David Patterson," ACM Queue Mag., vol. 4, no. 10, pp. 14-22, Dec./Jan. 2006-2007.
    • (2006) ACM Queue Mag , vol.4 , Issue.10 , pp. 14-22
    • Olukotun, K.1
  • 51
    • 0030245363 scopus 로고    scopus 로고
    • From HMMs to segment models: A unified view of stochastic modeling for speech recognition
    • M. Ostendorf, V. Digalakis, and J. Rohlicek, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 360-378, 1996.
    • (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.5 , pp. 360-378
    • Ostendorf, M.1    Digalakis, V.2    Rohlicek, J.3
  • 52
    • 64849086376 scopus 로고    scopus 로고
    • Unsupervised pattern discovery in speech: Applications to word acquisition and speaker segmentation,
    • Ph.D. dissertation, MIT, Cambridge, MA
    • A. Park, "Unsupervised pattern discovery in speech: Applications to word acquisition and speaker segmentation," Ph.D. dissertation, MIT, Cambridge, MA, 2006.
    • (2006)
    • Park, A.1
  • 58
    • 0036152936 scopus 로고    scopus 로고
    • Learning words from sights and sounds: A computational model
    • Jan
    • D. Roy and A. Pentland, "Learning words from sights and sounds: A computational model," Cogn. Sci., vol. 26, no. 1, pp. 113-146, Jan. 2002.
    • (2002) Cogn. Sci , vol.26 , Issue.1 , pp. 113-146
    • Roy, D.1    Pentland, A.2
  • 59
    • 0036629220 scopus 로고    scopus 로고
    • Constraints on statistical language learning
    • July
    • J. R. Saffran, "Constraints on statistical language learning," J. Mem. Lang., vol. 47, no. 1, pp. 172-196, July 2002.
    • (2002) J. Mem. Lang , vol.47 , Issue.1 , pp. 172-196
    • Saffran, J.R.1
  • 60
    • 33244496414 scopus 로고    scopus 로고
    • Unsupervised context sensitive language acquisition from a large corpus
    • L. Saul, Ed. Cambridge, MA: MIT Press
    • Z. Solan, D. Horn, E. Ruppin, and S. Edelman, "Unsupervised context sensitive language acquisition from a large corpus," in Advances in Neural Information Processing Systems, L. Saul, Ed. Cambridge, MA: MIT Press, vol. 16, 2004.
    • (2004) Advances in Neural Information Processing Systems , vol.16
    • Solan, Z.1    Horn, D.2    Ruppin, E.3    Edelman, S.4
  • 61
    • 56249109227 scopus 로고    scopus 로고
    • How to handle pronunciation variation in ASR: By storing episodes in memory?
    • Toulouse, France, May
    • H. Strik, "How to handle pronunciation variation in ASR: By storing episodes in memory?," in Proc. ITRW on Speech Recognition and Intrinsic Variation (SRIV2006), Toulouse, France, May 2006, pp. 33-38.
    • (2006) Proc. ITRW on Speech Recognition and Intrinsic Variation (SRIV2006) , pp. 33-38
    • Strik, H.1
  • 62
    • 0036165806 scopus 로고    scopus 로고
    • An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition
    • Feb
    • J. Sun and L. Deng, "An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition," J. Acoust. Soc. Amer., vol. 111, no. 2, pp. 1086-1101, Feb. 2002.
    • (2002) J. Acoust. Soc. Amer , vol.111 , Issue.2 , pp. 1086-1101
    • Sun, J.1    Deng, L.2
  • 63
    • 0008501167 scopus 로고    scopus 로고
    • A statistical model for word discovery in transcribed speech
    • Sept
    • A. Venkataraman, "A statistical model for word discovery in transcribed speech," Comput. Linguist., vol. 27, no. 3, pp. 352-372, Sept. 2001.
    • (2001) Comput. Linguist , vol.27 , Issue.3 , pp. 352-372
    • Venkataraman, A.1
  • 64
    • 85009227403 scopus 로고    scopus 로고
    • Data-driven example based continuous speech recognition
    • Geneva, Sept
    • M. Wachter, K. Demuynck, D. Van Compernolle, and P. Wambacq, "Data-driven example based continuous speech recognition," in Proc. EUROSPEECH, Geneva, Sept. 2003, pp. 1133-1136.
    • (2003) Proc. EUROSPEECH , pp. 1133-1136
    • Wachter, M.1    Demuynck, K.2    Van Compernolle, D.3    Wambacq, P.4
  • 65
    • 34547512577 scopus 로고    scopus 로고
    • Boosting HMM performance with a memory upgrade
    • Pittsburgh, PA, Sept
    • M. Wachter, K. Demuynck, and D. Van Compernolle, "Boosting HMM performance with a memory upgrade," in Proc. Interspeech, Pittsburgh, PA, Sept. 2006, pp. 1730-1733.
    • (2006) Proc. Interspeech , pp. 1730-1733
    • Wachter, M.1    Demuynck, K.2    Van Compernolle, D.3
  • 67
    • 0343249600 scopus 로고    scopus 로고
    • Performance improvements through combining phone-and syllable-scale information in automatic speech recognition
    • Sydney, Australia
    • S. Wu, B. Kingsbury, N. Morgan, and S. Greenberg, "Performance improvements through combining phone-and syllable-scale information in automatic speech recognition," in Proc. Int. Conf. Spoken Language Processing, Sydney, Australia, 1998, pp. 854-857.
    • (1998) Proc. Int. Conf. Spoken Language Processing , pp. 854-857
    • Wu, S.1    Kingsbury, B.2    Morgan, N.3    Greenberg, S.4
  • 68
    • 0029733178 scopus 로고    scopus 로고
    • Comparison of four approaches to automatic language identification of telephone speech
    • Jan
    • M. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 31-44, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.