SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2012, Pages 4685-4688

ASR-driven top-down binary mask estimation using spectral priors

(2) Hartmann, William a Fosler Lussier, Eric a

a The Ohio State University (United States)

Author keywords

ideal binary mask; mask estimation; robust automatic speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; BASELINE RECOGNITION SYSTEMS; BINARY MASKS; ESTIMATION ALGORITHM; IDEAL BINARY MASK; LINGUISTIC INFORMATION; LOW-LEVEL FEATURES; MODEL SELECTION; PILOT STUDIES; SNR IMPROVEMENT; TOP-DOWN APPROACH; TOPDOWN;

SIGNAL PROCESSING; SIGNAL TO NOISE RATIO; SPEECH ENHANCEMENT; SPEECH RECOGNITION;

ESTIMATION;

EID: 84867589172 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6288964 Document Type: Conference Paper

Times cited : (3)

References (13)

1
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- P. Divenyi, Ed., Kluwer Academic, Norwell MA
- D. L.Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech separation by humans and machines, P. Divenyi, Ed., pp. 181-197. Kluwer Academic, Norwell MA, 2005.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

2
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Communication, vol. 34, pp. 267-285, 2001.
- (2001) Speech Communication , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

3
- 78049364397
- Mmse based noise psd tracking with low complexity
- R. C. Hendriks, R. Heusdens, and J. Jensen, "Mmse based noise psd tracking with low complexity," in Proceedings of IEEE ICASSP, 2010, pp. 4266-4269.
- Proceedings of IEEE ICASSP, 2010 , pp. 4266-4269
- Hendriks, R.C.¹ Heusdens, R.² Jensen, J.³

4
- 79956289561
- A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition
- July
- W. Kim and J. H. L. Hansen, "A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 5, pp. 1434-1443, July 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.5 , pp. 1434-1443
- Kim, W.¹ Hansen, J.H.L.²

5
- 11144316019
- Decoding speech in the presence of other sources
- J. Barker, M. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Communication, vol. 45, pp. 5-25, 2005.
- (2005) Speech Communication , vol.45 , pp. 5-25
- Barker, J.¹ Cooke, M.² Ellis, D.P.W.³

6
- 70350038037
- Robust speech recognition by integrating speech separation and hypothesis testing
- S. Srinivasan and D. L. Wang, "Robust speech recognition by integrating speech separation and hypothesis testing," Speech Communication, vol. 52, pp. 72-81, 2010.
- (2010) Speech Communication , vol.52 , pp. 72-81
- Srinivasan, S.¹ Wang, D.L.²

7
- 82255178542
- Wiley-IEEE Press
- D. L. Wang and G. Brown, Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, Wiley-IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
- Wang, D.L.¹ Brown, G.²

8
- 0003822743
- Cambridge University Publishing Department
- S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book, Cambridge University Publishing Department, 2002.
- (2002) The HTK Book
- Young, S.¹ Evermann, G.² Hain, T.³ Kershaw, D.⁴ Moore, G.⁵ Odell, J.⁶ Ollason, D.⁷ Povey, D.⁸ Valtchev, V.⁹ Woodland, P.¹⁰

9
- 4644336054
- Reconstruction of missing features for robust speech recognition
- B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of missing features for robust speech recognition," Speech Communication, vol. 43, pp. 275-296, 2004.
- (2004) Speech Communication , vol.43 , pp. 275-296
- Raj, B.¹ Seltzer, M.L.² Stern, R.M.³

10
- 80051633766
- Investigations into the incorporation of the ideal binary mask in asr
- W. Hartmann and E. Fosler-Lussier, "Investigations into the incorporation of the ideal binary mask in asr," in Proceedings of IEEE ICASSP, Prague, Czech Republic, May 2011.
- Proceedings of IEEE ICASSP, Prague, Czech Republic, May 2011
- Hartmann, W.¹ Fosler-Lussier, E.²

11
- 85009227702
- Analysis of the aurora large vocabulary extensions
- N. Parihar and J. Picone, "Analysis of the aurora large vocabulary extensions," in Proceedings of Eurospeech, Geneva, Switzerland, September 2003, vol. 4, pp. 337-340.
- Proceedings of Eurospeech, Geneva, Switzerland, September 2003 , vol.4 , pp. 337-340
- Parihar, N.¹ Picone, J.²

12
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 27, pp. 113- 120, 1979.
- (1979) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.27 , pp. 113-120
- Boll, S.F.¹

13
- 0032626792
- Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
- D. P. W. Ellis, "Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures," Speech Communication, vol. 27, pp. 281-298, 1999.
- (1999) Speech Communication , vol.27 , pp. 281-298
- Ellis, D.P.W.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.