



Volume , Issue , 2013, Pages 6817-6821

Coupling binary masking and robust ASR

Author keywords

Aurora 4; bidirectional speech decoder; Computational Auditory Scene Analysis; noise robust ASR

Indexed keywords

AURORA-4; BIDIRECTIONAL SPEECH DECODER; CEPSTRAL FEATURES; COMPUTATIONAL AUDITORY SCENE ANALYSIS; IDEAL BINARY MASK (IBM); NOISE ROBUST ASR; ROBUST AUTOMATIC SPEECH RECOGNITIONS (ASR); SPEECH SEPARATION

EID: 84890475416     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6638982     Document Type: Conference Paper
Times cited : (7)

References (34)
  • 3
    • 85135375893
    • HMM recognition in noise using parallel model combination
    • M. J. F. Gales and S. J. Young, "HMM recognition in noise using parallel model combination," in Proc. Eurospeech, 1993, vol. 2, pp. 837-840
    • (1993) Proc. Eurospeech , vol.2 , pp. 837-840
    • Gales, M.J.F.1    Young, S.J.2
  • 4
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 5
    • 62249130045
    • A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
    • J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Comput. Speech Lang., vol. 23, pp. 389-405, 2009
    • (2009) Comput. Speech Lang. , vol.23 , pp. 389-405
    • Li, J.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5
  • 7
    • 84867584623
    • Improvements to VTS feature enhancement
    • J. Droppo, L. Deng, and A. Acero, "Improvements to VTS feature enhancement," in Proc. IEEE ICASSP, 2012, pp. 4677-4680
    • (2012) Proc. IEEE ICASSP , pp. 4677-4680
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 8
    • 33750376174
    • Model-based feature enhancement with uncertainty decoding for noise robust ASR
    • V. Stouten, H. Van Hamme, and P. Wambacq, "Model-based feature enhancement with uncertainty decoding for noise robust ASR," Speech Commun., vol. 48, pp. 1502-1514, 2006
    • (2006) Speech Commun. , vol.48 , pp. 1502-1514
    • Stouten, V.1    Van Hamme, H.2    Wambacq, P.3
  • 9
    • 84878567715
    • Advances in noise robust digit recognition using hybrid exemplar-based techniques
    • J. F. Gemmeke and H. Van Hamme, "Advances in noise robust digit recognition using hybrid exemplar-based techniques," in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Gemmeke, J.F.1    Van Hamme, H.2
  • 10
    • 56249136428
    • Transforming binary uncertainties for robust speech recognition
    • S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, pp. 2130-2140, 2007
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , pp. 2130-2140
    • Srinivasan, S.1    Wang, D.L.2
  • 11
    • 0031187171
    • Speech recognition by machines and humans
    • R. P. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, pp. 1-16, 1997
    • (1997) Speech Commun. , vol.22 , pp. 1-16
    • Lippmann, R.P.1
  • 14
    • 84892233308
    • On ideal binary masks as the computational goal of auditory scene analysis
    • P. Divenyi, Ed. Kluwer Academic, Boston, MA
    • D. L. Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed., pp. 181-197. Kluwer Academic, Boston, MA, 2005
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 15
    • 70349161218
    • Role of mask pattern in intelligibility of ideal binary-masked noisy speech
    • U. Kjems, J. B. Boldt, M. S. Pedersen, T. Lunner, and D. L. Wang, "Role of mask pattern in intelligibility of ideal binary-masked noisy speech," J. Acoust. Soc. Am., vol. 126, pp. 1415-1426, 2009
    • (2009) J. Acoust. Soc. Am. , vol.126 , pp. 1415-1426
    • Kjems, U.1    Boldt, J.B.2    Pedersen, M.S.3    Lunner, T.4    Wang, D.L.5
  • 16
    • 0035342414
    • Robust automatic speech recognition with missing and uncertain acoustic data
    • M. P. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and uncertain acoustic data," Speech Commun., vol. 34, pp. 141-177, 2001
    • (2001) Speech Commun. , vol.34 , pp. 141-177
    • Cooke, M.P.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 17
    • 85032752225
    • Missing-feature approaches in speech recognition
    • B. Raj and R. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Process. Mag., vol. 22, pp. 101-116, 2005
    • (2005) IEEE Signal Process. Mag. , vol.22 , pp. 101-116
    • Raj, B.1    Stern, R.2
  • 18
    • 84877594942
    • Tech. Rep. OSU-CISRC-7/11-TR21, Depart. Comput. Sc. Eng., The Ohio State University, Columbus, Ohio, USA
    • W. Hartmann, A. Narayanan, E. Fosler-Lussier, and D. L. Wang, "Nothing doing: Re-evaluating missing feature ASR," Tech. Rep. OSU-CISRC-7/11-TR21, Depart. Comput. Sc. Eng., The Ohio State University, Columbus, Ohio, USA, 2011, Available: ftp://ftp.cse.ohio-state.edu/pub/tech-report/2011
    • (2011) Nothing Doing: Re-evaluating Missing Feature ASR
    • Hartmann, W.1    Narayanan, A.2    Fosler-Lussier, E.3    Wang, D.L.4
  • 19
    • 85008054377
    • Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction
    • K. Hu and D. L. Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, pp. 1600-1609, 2011
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , pp. 1600-1609
    • Hu, K.1    Wang, D.L.2
  • 20
    • 11144316019
    • Decoding speech in the presence of other sources
    • J. Barker, M. P. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Commun., vol. 45, pp. 5-25, 2005
    • (2005) Speech Commun. , vol.45 , pp. 5-25
    • Barker, J.1    Cooke, M.P.2    Ellis, D.P.W.3
  • 21
    • 33750311718
    • Binary and ratio time-frequency masks for robust speech recognition
    • S. Srinivasan, N. Roman, and D. L. Wang, "Binary and ratio time-frequency masks for robust speech recognition," Speech Commun., vol. 48, pp. 1486-1501, 2006
    • (2006) Speech Commun. , vol.48 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.L.3
  • 22
    • 84878618681
    • Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition
    • N. Ma and J. Barker, "Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition," in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Ma, N.1    Barker, J.2
  • 23
    • 70350038037
    • Robust speech recognition by integrating speech separation and hypothesis testing
    • S. Srinivasan and D. L. Wang, "Robust speech recognition by integrating speech separation and hypothesis testing," Speech Commun., vol. 52, pp. 72-81, 2010
    • (2010) Speech Commun. , vol.52 , pp. 72-81
    • Srinivasan, S.1    Wang, D.L.2
  • 24
    • 84878392281
    • Improved model selection for the ASR-driven binary mask
    • W. Hartmann and E. Fosler-Lussier, "Improved model selection for the ASR-driven binary mask," in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Hartmann, W.1    Fosler-Lussier, E.2
  • 25
    • 77957739976
    • Advances in missing feature techniques for robust large-vocabulary continuous speech recognition
    • M. Van Segbroeck and H. Van Hamme, "Advances in missing feature techniques for robust large-vocabulary continuous speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, pp. 123-137, 2011
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , pp. 123-137
    • Van Segbroeck, M.1    Van Hamme, H.2
  • 26
    • 84877621926
    • The role of binary mask pattern in automatic speech recognition in background noise
    • in press
    • A. Narayanan and D. L. Wang, "The role of binary mask pattern in automatic speech recognition in background noise," J. Acoust. Soc. Am., 2013, in press
    • (2013) J. Acoust. Soc. Am.
    • Narayanan, A.1    Wang, D.L.2
  • 27
    • 78649580634
    • Robust speech recognition from binary masks
    • A. Narayanan and D. L. Wang, "Robust speech recognition from binary masks," J. Acoust. Soc. Am., vol. 128, pp. EL217-222, 2010
    • (2010) J. Acoust. Soc. Am. , vol.128
    • Narayanan, A.1    Wang, D.L.2
  • 28
    • 85009227702
    • Analysis of the Aurora large vocabulary evaluations
    • N. Parihar and J. Picone, "Analysis of the Aurora large vocabulary evaluations," in Proc. ECSCT, 2003, pp. 337-340
    • (2003) Proc. ECSCT , pp. 337-340
    • Parihar, N.1    Picone, J.2
  • 31
  • 32
    • 78049364397
    • MMSE based noise PSD tracking with low complexity
    • R. C. Hendriks, R. Heusdens, and J. Jensen, "MMSE based noise PSD tracking with low complexity," in Proc. IEEE ICASSP, 2010, pp. 4266-4269
    • (2010) Proc. IEEE ICASSP , pp. 4266-4269
    • Hendriks, R.C.1    Heusdens, R.2    Jensen, J.3
  • 33
    • 70349093614
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • G. Kim, Y. Lu, Y. Hu, and P. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Am., vol. 126, pp. 1486-1494, 2009
    • (2009) J. Acoust. Soc. Am. , vol.126 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.4
  • 34
    • 84870477511
    • Exploring monaural features for classification-based speech segregation
    • Y. Wang, K. Han, and D. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, pp. 270-279, 2013
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , pp. 270-279
    • Wang, Y.1    Han, K.2    Wang, D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.