Volume 29, Issue 6, 2012, Pages 127-140

Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTICS; ARRAY PROCESSING; DEEP NEURAL NETWORKS; DISTRIBUTED COMPUTER SYSTEMS; HUMAN COMPUTER INTERACTION; MICROPHONES; SPEECH; SPEECH PROCESSING; SPHERES;

EID: 85032750883     PISSN: 10535888     EISSN: None     Source Type: Journal    
DOI: 10.1109/MSP.2012.2205285     Document Type: Article
Times cited: (122)

References (43)
  • 1
    • 0032142014
    • Environmental conditions and acoustic transduction in hands-free speech recognition
    • PII S0167639398000302
    • M. Omologo, M. Matassoni, and P. Svaizer, "Environmental conditions and acoustic transduction in hands-free speech recognition," Speech Commun., vol. 25, no. 1-3, pp. 75-95, 1998. (Pubitemid 128413635)
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 75-95
    • Omologo, M.1    Svaizer, P.2    Matassoni, M.3
  • 7
    • 50449092852
    • Bridging the gap: Towards a unified framework for hands-free speech recognition using microphone arrays
    • M. Seltzer, "Bridging the gap: Towards a unified framework for hands-free speech recognition using microphone arrays," in Proc. HSCMA, Trento, Italy, 2008, pp. 104-107.
    • (2008) Proc. HSCMA, Trento, Italy , pp. 104-107
    • Seltzer, M.1
  • 8
    • 79959845286
    • The CHiME corpus: A resource and a challenge for computational hearing in multisource environments
    • H. Christensen, J. Barker, N. Ma, and P. Green, "The CHiME corpus: A resource and a challenge for computational hearing in multisource environments," in Proc. Interspeech, Makuhari, Japan, 2010, pp. 1918-1921.
    • (2010) Proc. Interspeech, Makuhari, Japan , pp. 1918-1921
    • Christensen, H.1    Barker, J.2    Ma, N.3    Green, P.4
  • 9
    • 84867591985
    • Logmax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise
    • Kyoto, Japan
    • T. Nakatani, T. Yoshioka, S. Araki, M. Delcroix, and M. Fujimoto, "Logmax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise," in Proc. ICASSP 2012, Kyoto, Japan, pp. 4029-4032.
    • (2012) Proc. ICASSP , pp. 4029-4032
    • Nakatani, T.1    Yoshioka, T.2    Araki, S.3    Delcroix, M.4    Fujimoto, M.5
  • 11
    • 84867602659
    • Integration of beamforming and automatic speech recognition through propagation of the Wiener posterior
    • Kyoto, Japan
    • R. F. Astudillo, A. Abad, and J. P. S. Neto, "Integration of beamforming and automatic speech recognition through propagation of the Wiener posterior," in Proc. ICASSP 2012, Kyoto, Japan, pp. 4909-4912.
    • (2012) Proc. ICASSP , pp. 4909-4912
    • Astudillo, R.F.1    Abad, A.2    Neto, J.P.S.3
  • 12
    • 51449086836
    • A microphone array beamforming approach to blind speech separation
    • I. McCowan, I. Himawan, and M. Lincoln, "A microphone array beamforming approach to blind speech separation," in Proc. MLMI, 2007, pp. 295-305.
    • (2007) Proc. MLMI , pp. 295-305
    • McCowan, I.1    Himawan, I.2    Lincoln, M.3
  • 13
    • 77956766546
    • Audio-visual fusion and tracking with multilevel iterative decoding: Framework and experimental evaluation
    • S. T. Shivappa, B. D. Rao, and M. M. Trivedi, "Audio-visual fusion and tracking with multilevel iterative decoding: Framework and experimental evaluation," J. Sel. Topics Signal Processing, vol. 4, no. 5, pp. 882-894, 2010.
    • (2010) J. Sel. Topics Signal Processing , vol.4 , Issue.5 , pp. 882-894
    • Shivappa, S.T.1    Rao, B.D.2    Trivedi, M.M.3
  • 14
    • 34250174176
    • Microphone array driven speech recognition: Influence of localization on the word error rate
    • M. Wölfel, K. Nickel, and J. W. McDonough, "Microphone array driven speech recognition: Influence of localization on the word error rate," in Proc. MLMI, 2005, pp. 320-331.
    • (2005) Proc. MLMI , pp. 320-331
    • Wölfel, M.1    Nickel, K.2    McDonough, J.W.3
  • 18
    • 33746653380
    • Time delay estimation in room acoustic environments: An overview
    • J. Chen, J. Benesty, and Y. Huang, "Time delay estimation in room acoustic environments: An overview," EURASIP J. Adv. Signal Processing, vol. 2006, no. AD-26503, pp. 1-19, 2006.
    • (2006) EURASIP J. Adv. Signal Processing , vol.2006 , Issue.AD26503 , pp. 1-19
    • Chen, J.1    Benesty, J.2    Huang, Y.3
  • 19
    • 50449084235
    • Comparison between different sound source localization techniques based on a real data collection
    • A. Brutti, M. Omologo, and P. Svaizer, "Comparison between different sound source localization techniques based on a real data collection," in Proc. HSCMA, Trento, Italy, 2008, pp. 69-72.
    • (2008) Proc. HSCMA, Trento, Italy , pp. 69-72
    • Brutti, A.1    Omologo, M.2    Svaizer, P.3
  • 20
    • 33645696863
    • Kalman filters for time delay of arrival-based source localization
    • U. Klee, T. Gehrig, and J. McDonough, "Kalman filters for time delay of arrival-based source localization," EURASIP J. Adv. Signal Processing, vol. 2006, no. AD-12378, pp. 1-15, 2006.
    • (2006) EURASIP J. Adv. Signal Processing , vol.2006 , Issue.AD12378 , pp. 1-15
    • Klee, U.1    Gehrig, T.2    McDonough, J.3
  • 22
    • 40249109687
    • Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters
    • T. Gehrig, U. Klee, J. McDonough, S. Ikbal, M. Wölfel, and C. Fügen, "Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters," in Proc. Interspeech, 2006, pp. 2594-2597.
    • (2006) Proc. Interspeech , pp. 2594-2597
    • Gehrig, T.1    Klee, U.2    McDonough, J.3    Ikbal, S.4    Wölfel, M.5    Fügen, C.6
  • 25
    • 0346707504
    • Microphone array post-filter based on noise field coherence
    • I. A. McCowan and H. Bourlard, "Microphone array post-filter based on noise field coherence," IEEE Trans. Speech Audio Processing, vol. 11, no. 6, pp. 709-716, 2003.
    • (2003) IEEE Trans. Speech Audio Processing , vol.11 , Issue.6 , pp. 709-716
    • McCowan, I.A.1    Bourlard, H.2
  • 27
    • 0032072917
    • Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering
    • PII S1063667698029034
    • C. Marro, Y. Mahieux, and K. U. Simmer, "Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering," IEEE Trans. Speech Audio Processing, vol. 6, no. 3, pp. 240-259, 1998. (Pubitemid 128720650)
    • (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.3 , pp. 240-259
    • Marro, C.1    Mahieux, Y.2    Simmer, K.U.3
  • 31
    • 79958015105
    • Maximum negentropy beamforming using complex generalized Gaussian distribution model
    • K. Kumatani, J. McDonough, B. Rauch, and D. Klakow, "Maximum negentropy beamforming using complex generalized Gaussian distribution model," in Proc. ASILOMAR, Pacific Grove, CA, 2010, pp. 1420-1424.
    • (2010) Proc. ASILOMAR, Pacific Grove, CA , pp. 1420-1424
    • Kumatani, K.1    McDonough, J.2    Rauch, B.3    Klakow, D.4
  • 32
    • 84867218783
    • Maximum kurtosis beamforming with the generalized sidelobe canceller
    • Brisbane, Australia, Sept.
    • K. Kumatani, J. McDonough, B. Rauch, P. N. Garner, W. Li, and J. Dines, "Maximum kurtosis beamforming with the generalized sidelobe canceller," in Proc. Interspeech, Brisbane, Australia, Sept. 2008, pp. 423-426.
    • (2008) Proc. Interspeech , pp. 423-426
    • Kumatani, K.1    McDonough, J.2    Rauch, B.3    Garner, P.N.4    Li, W.5    Dines, J.6
  • 33
    • 84858959884
    • Maximum kurtosis beamforming with a subspace filter for distant speech recognition
    • K. Kumatani, J. McDonough, and B. Raj, "Maximum kurtosis beamforming with a subspace filter for distant speech recognition," in Proc. ASRU, 2011, pp. 179-184.
    • (2011) Proc. ASRU , pp. 179-184
    • Kumatani, K.1    McDonough, J.2    Raj, B.3
  • 34
    • 79961162572
    • Channel selection based on multichannel cross-correlation coefficients for distant speech recognition
    • K. Kumatani, J. McDonough, J. Lehman, and B. Raj, "Channel selection based on multichannel cross-correlation coefficients for distant speech recognition," in Proc. HSCMA, Edinburgh, UK, 2011, pp. 1-6.
    • (2011) Proc. HSCMA, Edinburgh, UK , pp. 1-6
    • Kumatani, K.1    McDonough, J.2    Lehman, J.3    Raj, B.4
  • 35
    • 51449110329
    • Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller
    • E. Warsitz, A. Krueger, and R. Haeb-Umbach, "Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller," in Proc. ICASSP, Las Vegas, NV, 2008, pp. 73-76.
    • (2008) Proc. ICASSP, Las Vegas, NV , pp. 73-76
    • Warsitz, E.1    Krueger, A.2    Haeb-Umbach, R.3
  • 38
    • 34948844018
    • Flexible and optimal design of spherical microphone arrays for beamforming
    • Z. Li and R. Duraiswami, "Flexible and optimal design of spherical microphone arrays for beamforming," IEEE Trans. Speech Audio Processing, vol. 15, no. 2, pp. 702-714, 2007.
    • (2007) IEEE Trans. Speech Audio Processing , vol.15 , Issue.2 , pp. 702-714
    • Li, Z.1    Duraiswami, R.2
  • 39
    • 11144229405
    • Analysis and design of spherical microphone arrays
    • DOI 10.1109/TSA.2004.839244
    • B. Rafaely, "Analysis and design of spherical microphone arrays," IEEE Trans. Speech Audio Processing, vol. 13, no. 1, pp. 135-143, 2005. (Pubitemid 40049946)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.1 , pp. 135-143
    • Rafaely, B.1
  • 42
    • 78651227376
    • Bessel functions
    • F. W. J. Olver, D. W. Lozier, R. F. Boisvert, and C. W. Clark, Eds. New York, NY: Cambridge Univ. Press
    • F. W. J. Olver and L. C. Maximon, "Bessel functions," in NIST Handbook of Mathematical Functions, F. W. J. Olver, D. W. Lozier, R. F. Boisvert, and C. W. Clark, Eds. New York, NY: Cambridge Univ. Press, 2010.
    • (2010) NIST Handbook of Mathematical Functions
    • Olver, F.W.J.1    Maximon, L.C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.