SCOPUS 정보 검색 플랫폼

Volumn 24, Issue 1, 2010, Pages 94-111

Speech fragment decoding techniques for simultaneous speaker identification and speech recognition

(4) Barker, Jon a Ma, Ning a Coy, André a Cooke, Martin b

a UNIVERSITY OF SHEFFIELD (United Kingdom)

b UNIVERSITY OF THE BASQUE COUNTRY UPV EHU (Spain)

Author keywords

Auditory scene analysis; Noise robustness; Simultaneous speech; Speaker identification; Speech recognition; Speech separation

Indexed keywords

AUDITORY SCENE ANALYSIS; NOISE ROBUSTNESS; SIMULTANEOUS SPEECH; SPEAKER IDENTIFICATION; SPEECH SEPARATION;

ACOUSTIC NOISE; DECODING; ERROR CORRECTION; LOUDSPEAKERS; PATIENT REHABILITATION; SPEECH ANALYSIS; SPEECH PROCESSING; TARGETS;

SPEECH RECOGNITION;

EID: 69249231059 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2008.05.003 Document Type: Article

Times cited : (30)

References (23)

1
- 0025003184
- Modeling the perception of concurrent vowels: vowels with different fundamental frequencies
- Assmann P., and Summerfield Q. Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. Journal of the Acoustical Society of America 88 2 (1990) 680-697
- (1990) Journal of the Acoustical Society of America , vol.88 , Issue.2 , pp. 680-697
- Assmann, P.¹ Summerfield, Q.²

2
- 11144316019
- Decoding speech in the presence of other sources
- Barker J., Cooke M., and Ellis D. Decoding speech in the presence of other sources. Speech Communication 45 1 (2005) 5-25
- (2005) Speech Communication , vol.45 , Issue.1 , pp. 5-25
- Barker, J.¹ Cooke, M.² Ellis, D.³

3
- 44949219122
- Recent advances in speech fragment decoding techniques
- Pittsburgh, pp
- Barker, J., Coy, A., Ma, N., Cooke, M., 2006. Recent advances in speech fragment decoding techniques. In: Proceedings of Interspeech 2006, Pittsburgh, pp. 85-88.
- (2006) Proceedings of Interspeech , pp. 85-88
- Barker, J.¹ Coy, A.² Ma, N.³ Cooke, M.⁴

4
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- Beijing, China, pp
- Barker, J., Josifovski, L., Cooke, M., Green, P., 2000. Soft decisions in missing data techniques for robust automatic speech recognition. In: Proceedings of ICSLP 2000, Beijing, China, pp. 373-376.
- (2000) Proceedings of ICSLP , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.³ Green, P.⁴

5
- 0003684441
- MIT Press, Cambridge MA
- Bregman A. Auditory Scene Analysis (1990), MIT Press, Cambridge MA
- (1990) Auditory Scene Analysis
- Bregman, A.¹

6
- 0035169173
- Informational and energetic masking effects in the perception of multiple simultaneous talkers
- Brungart D.S., Simpson B.D., Ericson M.A., and Scott K.R. Informational and energetic masking effects in the perception of multiple simultaneous talkers. Journal of the Acoustical Society of America 100 (2001) 2527-2538
- (2001) Journal of the Acoustical Society of America , vol.100 , pp. 2527-2538
- Brungart, D.S.¹ Simpson, B.D.² Ericson, M.A.³ Scott, K.R.⁴

7
- 33644661135
- A glimpsing model of speech perception in noise
- Cooke M. A glimpsing model of speech perception in noise. Journal of the Acoustical Society of America 119 (2006) 1562-1573
- (2006) Journal of the Acoustical Society of America , vol.119 , pp. 1562-1573
- Cooke, M.¹

8
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- Cooke M., Barker J., Cunningham S., and Shao X. An audio-visual corpus for speech perception and automatic speech recognition. Journal of the Acoustical Society of America 120 (2006) 2421-2424
- (2006) Journal of the Acoustical Society of America , vol.120 , pp. 2421-2424
- Cooke, M.¹ Barker, J.² Cunningham, S.³ Shao, X.⁴

9
- 37849011878
- The foreign language cocktail party problem: energetic and informational masking effects in non-native speech perception
- Cooke M., Garcia Lecumberri M., and Barker J. The foreign language cocktail party problem: energetic and informational masking effects in non-native speech perception. Journal of the Acoustical Society of America 123 (2008) 414-427
- (2008) Journal of the Acoustical Society of America , vol.123 , pp. 414-427
- Cooke, M.¹ Garcia Lecumberri, M.² Barker, J.³

10
- 0035342414
- Robust automatic speech recognition with missing and uncertain acoustic data
- Cooke M., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and uncertain acoustic data. Speech Communication 34 3 (2001) 267-285
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

11
- 69249202377
- Monaural speech separation and recognition challenge
- Cooke M., Hershey J., and Rennie S. Monaural speech separation and recognition challenge. Computer Speech and Language 24 1 (2010) 1-15
- (2010) Computer Speech and Language , vol.24 , Issue.1 , pp. 1-15
- Cooke, M.¹ Hershey, J.² Rennie, S.³

12
- 34247623029
- An automatic speech recognition system based on the scene analysis account of auditory perception
- Coy A., and Barker J. An automatic speech recognition system based on the scene analysis account of auditory perception. Speech Communication 49 5 (2007) 384-401
- (2007) Speech Communication , vol.49 , Issue.5 , pp. 384-401
- Coy, A.¹ Barker, J.²

13
- 33846957558
- Auditory grouping and attention to speech (keynote paper)
- Darwin, C., 2001. Auditory grouping and attention to speech (keynote paper). In: Proceedings of the Institute of Acoustics, vol. 23, pp. 165-172.
- (2001) Proceedings of the Institute of Acoustics , vol.23 , pp. 165-172
- Darwin, C.¹

14
- 0027298253
- Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing
- de Cheveigné A. Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing. Journal of the Acoustical Society of America 93 6 (1993) 3271-3290
- (1993) Journal of the Acoustical Society of America , vol.93 , Issue.6 , pp. 3271-3290
- de Cheveigné, A.¹

15
- 0012323283
- Note on informational masking
- Durlach N., Mason C., Kidd Jr. G., Arbogast T., Colburn H., and Shinn-Cunningham B. Note on informational masking. Journal of the Acoustical Society of America 113 6 (2003) 2984-2987
- (2003) Journal of the Acoustical Society of America , vol.113 , Issue.6 , pp. 2984-2987
- Durlach, N.¹ Mason, C.² Kidd Jr., G.³ Arbogast, T.⁴ Colburn, H.⁵ Shinn-Cunningham, B.⁶

16
- 84987702417
- The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Hirsch, H., Pearce, D., 2000. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proceedings of ICSLP 2000, vol. 4, pp. 29-32.
- (2000) Proceedings of ICSLP , vol.4 , pp. 29-32
- Hirsch, H.¹ Pearce, D.²

17
- 44949258898
- Super-human multi-talker speech recognition: The IBM 2006 Speech Separation Challenge system
- Pittsburgh
- Kristjansson, T., Hershey, J., Olsen, P., Rennie, S., Gopinath, R., 2006. Super-human multi-talker speech recognition: the IBM 2006 Speech Separation Challenge system. In: Proceedings of Interspeech 2006, Pittsburgh.
- (2006) Proceedings of Interspeech
- Kristjansson, T.¹ Hershey, J.² Olsen, P.³ Rennie, S.⁴ Gopinath, R.⁵

18
- 34748817500
- Exploiting correlogram structure for robust speech recognition with multiple speech sources
- Ma N., Green P., Barker J., and Coy A. Exploiting correlogram structure for robust speech recognition with multiple speech sources. Speech Communication 49 (2007) 874-891
- (2007) Speech Communication , vol.49 , pp. 874-891
- Ma, N.¹ Green, P.² Barker, J.³ Coy, A.⁴

19
- 0026654967
- Modeling the identification of concurrent vowels with different fundamental frequencies
- Meddis R., and Hewitt M. Modeling the identification of concurrent vowels with different fundamental frequencies. Journal of the Acoustical Society of America 91 1 (1992) 233-245
- (1992) Journal of the Acoustical Society of America , vol.91 , Issue.1 , pp. 233-245
- Meddis, R.¹ Hewitt, M.²

20
- 0026818496
- Data driven search organization for continuous speech recognition
- Noll A., Ney H., Mergel D., and Paeseler A. Data driven search organization for continuous speech recognition. IEEE Transactions on Speech and Audio Processing 40 (1992) 272-281
- (1992) IEEE Transactions on Speech and Audio Processing , vol.40 , pp. 272-281
- Noll, A.¹ Ney, H.² Mergel, D.³ Paeseler, A.⁴

21
- 0000950331
- The watershed transform: definitions, algorithms and parallelization strategies
- Roerdink J., and Meijster A. The watershed transform: definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41 12 (2001) 187-228
- (2001) Fundamenta Informaticae , vol.41 , Issue.12 , pp. 187-228
- Roerdink, J.¹ Meijster, A.²

22
- 44849140301
- Speech recognition using factorial hidden markov models for separation in the feature space
- Pittsburgh
- Virtanen, T., 2006. Speech recognition using factorial hidden markov models for separation in the feature space. In: Proceedings of Interspeech 2006, Pittsburgh.
- (2006) Proceedings of Interspeech
- Virtanen, T.¹

23
- 0029249228
- Spectral redundancy: intelligibility of sentences heard through narrow spectral slits
- Warren R., Riener K., Bashford J., and Brubaker B. Spectral redundancy: intelligibility of sentences heard through narrow spectral slits. Perception and Pyschophysics 57 2 (1995) 175-182
- (1995) Perception and Pyschophysics , vol.57 , Issue.2 , pp. 175-182
- Warren, R.¹ Riener, K.² Bashford, J.³ Brubaker, B.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.