메뉴 건너뛰기




Volumn , Issue , 2010, Pages

Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding

Author keywords

Fragment decoding; Missing data; Noise robust speech recognition; Reverberation

Indexed keywords

ACOUSTIC NOISE; DECODING; FLOORS; MICROPHONES; SIGNAL TO NOISE RATIO;

EID: 84940458837     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1)

References (18)
  • 1
    • 0026882842 scopus 로고
    • Experiments with nonlinear spectral subtractor, Hidden Markov Models and the projection for robust speech recognition in cars
    • P. Lockwood and J. Boudy, “Experiments with nonlinear spectral subtractor, Hidden Markov Models and the projection for robust speech recognition in cars,” Speech Communication, vol. 11, 1992.
    • (1992) Speech Communication , vol.11
    • Lockwood, P.1    Boudy, J.2
  • 2
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and uncertain acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, “Robust automatic speech recognition with missing and uncertain acoustic data,” Speech Commun., vol. 34, no. 3, pp. 267–285, 2001.
    • (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 3
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • A. Varga and R. Moore, “Hidden Markov model decomposition of speech and noise,” in Proc. IEEE ICASSP’90, 1990, pp. 845–848.
    • (1990) Proc. IEEE ICASSP’90 , pp. 845-848
    • Varga, A.1    Moore, R.2
  • 4
    • 85135375893 scopus 로고
    • HMM recognition in noise using parallel model combination
    • Berlin
    • M. Gales and S. Young, “HMM recognition in noise using parallel model combination,” in Proc. Eurospeech’93, Berlin, 1993.
    • (1993) Proc. Eurospeech’93
    • Gales, M.1    Young, S.2
  • 5
    • 85009074657 scopus 로고    scopus 로고
    • ALGONQUIN: Iterating Laplace’s method to remove multiple types of distortion for robust speech recognition
    • Aalborg, Denmark
    • B. Frey, L. Deng, A. Acero, and T. Kristjansson, “ALGONQUIN: Iterating Laplace’s method to remove multiple types of distortion for robust speech recognition,” in Proc. Eurospeech’01, Aalborg, Denmark, 2001, pp. 901–904.
    • (2001) Proc. Eurospeech’01 , pp. 901-904
    • Frey, B.1    Deng, L.2    Acero, A.3    Kristjansson, T.4
  • 6
    • 69249202377 scopus 로고    scopus 로고
    • Monaural speech separation and recognition challenge
    • M. Cooke, J. Hershey, and S. Rennie, “Monaural speech separation and recognition challenge,” Comput. Speech. Lang., vol. 24, no. 1, pp. 1–15, 2010.
    • (2010) Comput. Speech. Lang , vol.24 , Issue.1 , pp. 1-15
    • Cooke, M.1    Hershey, J.2    Rennie, S.3
  • 7
    • 79959845286 scopus 로고    scopus 로고
    • The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments
    • Makuhari
    • H. Christensen, J. Barker, N. Ma, and P. Green, “The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments,” in Proc. Interspeech’10, Makuhari, 2010.
    • (2010) Proc. Interspeech’10
    • Christensen, H.1    Barker, J.2    Ma, N.3    Green, P.4
  • 8
    • 33750368310 scopus 로고    scopus 로고
    • An audiovisual corpus for speech perception and automatic speech recognition
    • M. Cooke, J. Barker, S. Cunningham, and X. Shao, “An audiovisual corpus for speech perception and automatic speech recognition,” J. Acoust. Soc. Am., vol. 120, pp. 2421–2424, 2006.
    • (2006) J. Acoust. Soc. Am , vol.120 , pp. 2421-2424
    • Cooke, M.1    Barker, J.2    Cunningham, S.3    Shao, X.4
  • 9
    • 69249231059 scopus 로고    scopus 로고
    • Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
    • J. Barker, N. Ma, A. Coy, and M. Cooke, “Speech fragment decoding techniques for simultaneous speaker identification and speech recognition,” Comput. Speech. Lang., vol. 24, no. 1, pp. 94–111, 2010.
    • (2010) Comput. Speech. Lang , vol.24 , Issue.1 , pp. 94-111
    • Barker, J.1    Ma, N.2    Coy, A.3    Cooke, M.4
  • 10
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • G. Brown and M. Cooke, “Computational auditory scene analysis,” Comput. Speech. Lang., vol. 8, no. 4, pp. 297–336, 1994.
    • (1994) Comput. Speech. Lang , vol.8 , Issue.4 , pp. 297-336
    • Brown, G.1    Cooke, M.2
  • 11
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B. Glasberg and B. Moore, “Derivation of auditory filter shapes from notched-noise data,” Hearing Res., vol. 47, pp. 103–138, 1990.
    • (1990) Hearing Res , vol.47 , pp. 103-138
    • Glasberg, B.1    Moore, B.2
  • 12
    • 85009063707 scopus 로고    scopus 로고
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • Beijing
    • J. Barker, L. Josifovski, M. Cooke, and P. Green, “Soft decisions in missing data techniques for robust automatic speech recognition,” in Proc. ICSLP’00, Beijing, 2000, pp. 373–376.
    • (2000) Proc. ICSLP’00 , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.3    Green, P.4
  • 13
    • 85009106519 scopus 로고    scopus 로고
    • Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise
    • Aalborg
    • J. Barker, M. Cooke, and P. Green, “Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise,” in Proc. Eurospeech’01, Aalborg, 2001, pp. 213–216.
    • (2001) Proc. Eurospeech’01 , pp. 213-216
    • Barker, J.1    Cooke, M.2    Green, P.3
  • 14
    • 11144316019 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sources
    • J. Barker, M. Cooke, and D. Ellis, “Decoding speech in the presence of other sources,” Speech Commun., vol. 45, no. 1, pp. 5–25, 2005.
    • (2005) Speech Commun , vol.45 , Issue.1 , pp. 5-25
    • Barker, J.1    Cooke, M.2    Ellis, D.3
  • 16
    • 44949104414 scopus 로고    scopus 로고
    • Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source
    • Pittsburgh, PA
    • N. Ma, P. Green, and A. Coy, “Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source,” in Proc. Interspeech’06, Pittsburgh, PA, 2006, pp. 669–672.
    • (2006) Proc. Interspeech’06 , pp. 669-672
    • Ma, N.1    Green, P.2    Coy, A.3
  • 17
    • 34748817500 scopus 로고    scopus 로고
    • Exploiting correlogram structure for robust speech recognition with multiple speech sources
    • N. Ma, P. Green, J. Barker, and A. Coy, “Exploiting correlogram structure for robust speech recognition with multiple speech sources,” Speech Commun., vol. 49, no. 12, pp. 874–891, 2007.
    • (2007) Speech Commun , vol.49 , Issue.12 , pp. 874-891
    • Ma, N.1    Green, P.2    Barker, J.3    Coy, A.4
  • 18
    • 57849093600 scopus 로고    scopus 로고
    • Integrating pitch and localisation cues at a speech fragment level
    • Antwerp
    • H. Christensen, N. Ma, S. Wrigley, and J. Barker, “Integrating pitch and localisation cues at a speech fragment level,” in Proc. Interspeech’07, Antwerp, 2007, pp. 2769–2772.
    • (2007) Proc. Interspeech’07 , pp. 2769-2772
    • Christensen, H.1    Ma, N.2    Wrigley, S.3    Barker, J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.