



Volume 2, 2012, Pages 1202-1205

Improved model selection for the ASR-driven binary mask

Author keywords

Binary mask estimation; Speech recognition

Indexed keywords

BINARY MASKS; FRAME-BASED; LINGUISTIC INFORMATION; MASK ESTIMATIONS; MODEL SELECTION; SEQUENCE MODELS;

EID: 84878392281     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (3)

References (14)
  • 1
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech Recognition in Noisy Environments: A Survey," Speech Communication, vol. 16, pp. 261-291, 1995.
    • (1995) Speech Communication , vol.16 , pp. 261-291
    • Gong, Y.1
  • 2
    • 84892233308
    • On ideal binary mask as the computational goal of auditory scene analysis
    • D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer Academic, 2005, pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 3
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Communication, vol. 34, pp. 267-285, 2001.
    • (2001) Speech Communication , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 4
    • 79956289561
    • A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition
    • W. Kim and J. H. L. Hansen, "A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 5, pp. 1434-1443, July 2011.
    • (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.5 , pp. 1434-1443
    • Kim, W.1    Hansen, J.H.L.2
  • 5
    • 11144316019
    • Decoding speech in the presence of other sources
    • J. Barker, M. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Communication, vol. 45, pp. 5-25, 2005.
    • (2005) Speech Communication , vol.45 , pp. 5-25
    • Barker, J.1    Cooke, M.2    Ellis, D.P.W.3
  • 6
    • 70350038037
    • Robust speech recognition by integrating speech separation and hypothesis testing
    • S. Srinivasan and D. L. Wang, "Robust speech recognition by integrating speech separation and hypothesis testing," Speech Communication, vol. 52, pp. 72-81, 2010.
    • (2010) Speech Communication , vol.52 , pp. 72-81
    • Srinivasan, S.1    Wang, D.L.2
  • 8
    • 85127836544
    • Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms
    • M. Collins, "Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms," in Proceedings of EMNLP, 2002.
    • (2002) Proceedings of EMNLP
    • Collins, M.1
  • 9
    • 85009227702
    • Analysis of the Aurora large vocabulary extensions
    • N. Parihar and J. Picone, "Analysis of the Aurora large vocabulary extensions," in Proceedings of Eurospeech, vol. 4, Geneva, Switzerland, September 2003, pp. 337-340.
    • (2003) Proceedings of Eurospeech , vol.4 , pp. 337-340
    • Parihar, N.1    Picone, J.2
  • 11
    • 80051633766
    • Investigations into the incorporation of the ideal binary mask in ASR
    • W. Hartmann and E. Fosler-Lussier, "Investigations into the incorporation of the ideal binary mask in ASR," in Proceedings of IEEE ICASSP, Prague, Czech Republic, May 2011.
    • (2011) Proceedings of IEEE ICASSP
    • Hartmann, W.1    Fosler-Lussier, E.2
  • 12
    • 71049180205
    • Computational auditory scene analysis: Principles, algorithms, and applications
    • D. L. Wang and G. Brown, Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. Wiley-IEEE Press, 2006.
    • (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
    • Wang, D.L.1    Brown, G.2
  • 13
    • 0004319968
    • The NOISEX-92 study on the effect of additive noise on automatic speech recognition
    • A. P. Varga, H. J. M. Steeneken, M. Tomlinson, and D. Jones, "The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition," Speech Research Unit, Defense Research Agency, Malvern, UK, Tech. Rep., 1992.
    • (1992) Speech Research Unit
    • Varga, A.P.1    Steeneken, H.J.M.2    Tomlinson, M.3    Jones, D.4
  • 14
    • 0032626792
    • Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
    • D. P. W. Ellis, "Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures," Speech Communication, vol. 27, pp. 281-298, 1999.
    • (1999) Speech Communication , vol.27 , pp. 281-298
    • Ellis, D.P.W.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.