SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn 1, Issue , 2006, Pages 73-76

A computational auditory scene analysis system for robust speech recognition

(4) Srinivasan, Soundararajan a Shao, Yang b Jin, Zhaozhang b Wang, DeLiang a,b

a OHIO STATE UNIVERSITY (United States)

b The Ohio State University (United States)

Author keywords

Binary time frequency mask; Computational auditory scene analysis; Robust speech recognition; Speech segregation

Indexed keywords

BINS; DEEP NEURAL NETWORKS; PATIENT REHABILITATION; SPEECH; SPEECH COMMUNICATION;

BASE-LINE PERFORMANCE; COMPUTATIONAL AUDITORY SCENE ANALYSIS; MISSING DATA METHODS; ROBUST SPEECH RECOGNITION; SPEAKER CHARACTERISTICS; SPEECH SEGREGATION; SYSTEMATIC EVALUATION; TIME FREQUENCY;

SPEECH RECOGNITION;

EID: 40749137520 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (11)

References (19)

1
- 0004056285
- Upper Saddle River, NJ: Prentice Hall PTR
- X. Huang, A. Acero, and H. Hon, Spoken Language Processing. Upper Saddle River, NJ: Prentice Hall PTR, 2001.
- (2001) Spoken Language Processing
- Huang, X.¹ Acero, A.² Hon, H.³

2
- 4644257621
- Single microphone source separation using high resolution signal reconstruction
- T. Kristjansson, H. Attias, and J. Hershey, "Single microphone source separation using high resolution signal reconstruction," in Proc. ICASSP '04, vol. 2, 2004, pp. 817-820.
- (2004) Proc. ICASSP '04 , vol.2 , pp. 817-820
- Kristjansson, T.¹ Attias, H.² Hershey, J.³

3
- 84899014722
- A probabilistic approach to single channel blind signal separation
- S. Becker, S. Thrun, and K. Obermayer, Eds. Cambridge, MA: MIT Press
- G-J Jang and T-W Lee, "A probabilistic approach to single channel blind signal separation," in Advances in Neural Information Processing Systems 15, S. Becker, S. Thrun, and K. Obermayer, Eds. Cambridge, MA: MIT Press, 2003, pp. 1173-1180.
- (2003) Advances in Neural Information Processing Systems 15 , pp. 1173-1180
- Jang, G.-J.¹ Lee, T.-W.²

4
- 33745190244
- Recognizing speech from simultaneous speakers
- B. Raj, R. Singh, and P. Smaragdis, "Recognizing speech from simultaneous speakers," in Proc. Interspeech '05, 2005, pp. 3317-3320.
- (2005) Proc. Interspeech '05 , pp. 3317-3320
- Raj, B.¹ Singh, R.² Smaragdis, P.³

5
- 4544369701
- A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel
- A. N. Deoras and M. Hasegawa-Johnson, "A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel," in Proc. ICASSP '04, vol. 1, 2004, pp. 861-864.
- (2004) Proc. ICASSP '04 , vol.1 , pp. 861-864
- Deoras, A.N.¹ Hasegawa-Johnson, M.²

6
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- P. Divenyi, Ed, Norwell, MA
- D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech separation by humans and machines, P. Divenyi, Ed., Norwell, MA, 2005, pp. 181-197.
- (2005) Speech separation by humans and machines , pp. 181-197
- Wang, D.L.¹

7
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Comm., vol. 34, pp. 267-285, 2001.
- (2001) Speech Comm , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

8
- 0142026377
- Speech segregation based on sound localization
- N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Am., vol. 114, pp. 2236-2252, 2003.
- (2003) J. Acoust. Soc. Am , vol.114 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

9
- 85022115206
- A. S. Bregman, Auditory scene analysis. Cambridge, MA: The MIT Press, 1990.
- A. S. Bregman, Auditory scene analysis. Cambridge, MA: The MIT Press, 1990.

10
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. on Neural Networks, vol. 10, no. 3, pp. 684-697, 1999.
- (1999) IEEE Trans. on Neural Networks , vol.10 , Issue.3 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

11
- 44949099721
- Auditory segmentation based on onset and offset analysis
- in press
- G. Hu and D. L. Wang, "Auditory segmentation based on onset and offset analysis," IEEE Trans. on Audio, Speech and Language Processing, 2006, in press.
- (2006) IEEE Trans. on Audio, Speech and Language Processing
- Hu, G.¹ Wang, D.L.²

12
- 4644336054
- Reconstruction of missing features for robust speech recognition
- B. Raj, M. L. Seltzer, and R. M. Stem, "Reconstruction of missing features for robust speech recognition," Speech Communication, vol. 43, pp. 275-296, 2004.
- (2004) Speech Communication , vol.43 , pp. 275-296
- Raj, B.¹ Seltzer, M.L.² Stem, R.M.³

13
- 34547539772
- Available at
- M. Cooke and T-W. Lee. Speech separation and recognition competition. Available at http://www.dcs.shef.ac.uk/~martin/ SpeechSeparationChallenge.htm
- Speech separation and recognition competition
- Cooke, M.¹ Lee, T.-W.²

14
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. on Neural Networks, vol. 15, pp. 1135-1150, 2004.
- (2004) IEEE Trans. on Neural Networks , vol.15 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

15
- 85045165251
- Monaural speech organization and segregation,
- Ph.D. dissertation, Biophysics Program, The Ohio State University
- G. Hu, "Monaural speech organization and segregation," Ph.D. dissertation, Biophysics Program, The Ohio State University, 2006.
- (2006)
- Hu, G.¹

16
- 33744996003
- Model-based sequential organization in cochannel speech
- Y. Shao and D. L. Wang, "Model-based sequential organization in cochannel speech," IEEE Trans. on Audio, Speech and Language Processing, vol. 14, pp. 289-298, 2006.
- (2006) IEEE Trans. on Audio, Speech and Language Processing , vol.14 , pp. 289-298
- Shao, Y.¹ Wang, D.L.²

17
- 33947649051
- Robust speaker recognition using binary time-frequency masks
- _, "Robust speaker recognition using binary time-frequency masks," in Proc. ICASSP '06, vol. I, 2006, pp. 645-648.
- (2006) Proc. ICASSP '06 , vol.1 , pp. 645-648
- Shao, Y.¹ Wang, D.L.²

18
- 0026172104
- Watersheds in digital spaces: An efficient algorithm based on immersion simulations
- L. Vincent and P. Soille, "Watersheds in digital spaces: An efficient algorithm based on immersion simulations," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 13, no. 6, pp. 583-598, 1991.
- (1991) IEEE Trans. on Pattern Analysis and Machine Intelligence , vol.13 , Issue.6 , pp. 583-598
- Vincent, L.¹ Soille, P.²

19
- 33947644911
- A supervised learning approach to uncertainty decoding for robust speech recognition
- S. Srinivasan and D. L. Wang, "A supervised learning approach to uncertainty decoding for robust speech recognition," in Proc. ICASSP '06, vol. I, 2006, pp. 297-300.
- (2006) Proc. ICASSP '06 , vol.1 , pp. 297-300
- Srinivasan, S.¹ Wang, D.L.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.