



2012, Pages 229-250

Feature Compensation

Author keywords

Acoustic and signals using microphone, linear nonlinear effects; AURORA digital speech recognition, and system evaluation tasks; Discriminative SPLICE and MMI, Aurora 4 and Rprop iterations; Discriminative SPLICE in shortcomings, noise into features; Feature compensation, ideal from observed noisy speech features; Feature enhancement, information and distortion removal; Joint distribution of clean noisy speech model, GMM; MBFE utterance by utterance, SLDM and speech or noise cepstra; MMSE SPLICE in noisy speech, word error rate on Aurora 2; VTS feature enhancement, for processing power and distortion

EID: 84886120743     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9781118392683.ch9     Document Type: Chapter
Times cited: 3

References (24)
  • 1
    • A. Acero and R. M. Stern, "Robust speech recognition by normalization of the acoustic space," in Proceedings of the IEEE ICASSP, vol. 2, pp. 893-896, 1991.
  • 2
    • L. R. Bahl, P. F. Brown, P. V. D. Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in International Conference on Acoustics, Speech and Signal Processing, 1997.
  • 3
    • I. Cohen and B. Berdugo, "Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Processing Letters, vol. 9, no. 1, pp. 12-15, 2002.
  • 4
    • L. Deng, J. Droppo, and A. Acero, "Log-domain speech feature enhancement using sequential MAP noise estimate and a nonlinear model of acoustic environment," in Proceedings of the ICSLP, Denver, CO, September 2002.
  • 5
    • L. Deng, J. Droppo, and A. Acero, "Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 3, pp. 218-233, May 2004.
  • 7
    • J. Droppo and A. Acero, "Joint discriminative front end and back end training for improved speech recognition accuracy," in International Conference on Acoustics, Speech and Signal Processing, Toulouse, France, May 2006.
  • 8
    • J. Droppo and A. Acero, "Noise robust speech recognition with a switching linear dynamic model," in International Conference on Acoustics, Speech and Signal Processing, Montreal, Canada, May 2004.
  • 9
    • J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with SPLICE for noise robust speech recognition," in Proceedings of the 2002 ICASSP, Orlando, Florida, May 2002.
  • 10
    • J. Droppo, L. Deng, and A. Acero, "Evaluation of SPLICE on the Aurora 2 and 3 tasks," in Proceedings of the ICSLP, pp. 29-32, 2002.
  • 11
    • J. Droppo, L. Deng, and A. Acero, "A comparison of three non-linear observation models for noisy speech features," in Proceedings of the 2003 Eurospeech, Geneva, Switzerland, September 2003, pp. 681-684.
  • 12
    • J. Droppo, M. Mahajan, A. Gunawardana, and A. Acero, "How to train a discriminative front end with stochastic gradient descent and maximum mutual information," in Proceedings of the IEEE ASRU, 2005.
  • 13
    • D. Ellis and M. Gomez, "Investigations into tandem acoustic modeling for the Aurora task," in Proceedings of the Eurospeech, 2001, pp. 189-192.
  • 14
    • B. Frey, L. Deng, A. Acero, and T. Kristjansson, "ALGONQUIN: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition," in Proceedings of the 2001 Eurospeech, Aalborg, Denmark, September 2001.
  • 16
    • E. McDermott and S. Katagiri, "Prototype-based minimum classification error/generalized probabilistic descent training for various speech units," Computer Speech & Language, vol. 8, pp. 351-368, 1994.
  • 17
    • P. Moreno, "Speech recognition in noisy environments," PhD dissertation, Carnegie Mellon University, 1996.
  • 18
    • P. J. Moreno, B. Raj, E. Gouvea, and R. M. Stern, "Multivariate-Gaussian-based cepstral normalization for robust speech recognition," in International Conference on Acoustics, Speech and Signal Processing, 1995, pp. 137-140.
  • 19
    • Y. Normandin, "Hidden Markov models, maximum mutual information estimation and the speech recognition problem," PhD dissertation, McGill University, 1991.
  • 20
    • D. Povey, "Discriminative training for large vocabulary speech recognition," PhD dissertation, Cambridge University, 2003.
  • 21
    • B. Raj, R. Singh, and R. Stern, "On tracking noise with linear dynamical system models," in International Conference on Acoustics, Speech and Signal Processing, Montreal, Canada, May 2004.
  • 22
    • M. Riedmiller and H. Braun, "A direct adaptive method for faster backpropagation learning: The RPROP algorithm," in IEEE International Conference on Neural Networks, vol. 1, 1993, pp. 586-591.
  • 23
    • B. Schuller, M. Wollmer, T. Moosmayr, and G. Rigoll, "Recognition of noisy speech: A comparative survey of robust model architecture and feature enhancement," EURASIP Journal on Audio, Speech and Music Processing, 2009.
  • 24
    • V. Stouten, H. Van hamme, K. Demuynck, and P. Wambacq, "Robust speech recognition using model-based feature enhancement," in Proceedings of the 2003 Eurospeech, Geneva, Switzerland, September 2003, pp. 17-20.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.