Volume 24, Issue 1, 2010, Pages 94-111

Speech fragment decoding techniques for simultaneous speaker identification and speech recognition

Author keywords

Auditory scene analysis; Noise robustness; Simultaneous speech; Speaker identification; Speech recognition; Speech separation

Indexed keywords

AUDITORY SCENE ANALYSIS; NOISE ROBUSTNESS; SIMULTANEOUS SPEECH; SPEAKER IDENTIFICATION; SPEECH SEPARATION;

EID: 69249231059     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2008.05.003     Document Type: Article
Times cited: 30

References (23)
  • 1
    • 0025003184
    • Modeling the perception of concurrent vowels: vowels with different fundamental frequencies
    • Assmann P., and Summerfield Q. Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. Journal of the Acoustical Society of America 88 2 (1990) 680-697
    • (1990) Journal of the Acoustical Society of America , vol.88 , Issue.2 , pp. 680-697
    • Assmann, P.1    Summerfield, Q.2
  • 2
    • 11144316019
    • Decoding speech in the presence of other sources
    • Barker J., Cooke M., and Ellis D. Decoding speech in the presence of other sources. Speech Communication 45 1 (2005) 5-25
    • (2005) Speech Communication , vol.45 , Issue.1 , pp. 5-25
    • Barker, J.1    Cooke, M.2    Ellis, D.3
  • 3
    • 44949219122
    • Recent advances in speech fragment decoding techniques
    • Barker, J., Coy, A., Ma, N., Cooke, M., 2006. Recent advances in speech fragment decoding techniques. In: Proceedings of Interspeech 2006, Pittsburgh, pp. 85-88.
    • (2006) Proceedings of Interspeech , pp. 85-88
    • Barker, J.1    Coy, A.2    Ma, N.3    Cooke, M.4
  • 4
    • 85009063707
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • Barker, J., Josifovski, L., Cooke, M., Green, P., 2000. Soft decisions in missing data techniques for robust automatic speech recognition. In: Proceedings of ICSLP 2000, Beijing, China, pp. 373-376.
    • (2000) Proceedings of ICSLP , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.3    Green, P.4
  • 7
    • 33644661135
    • A glimpsing model of speech perception in noise
    • Cooke M. A glimpsing model of speech perception in noise. Journal of the Acoustical Society of America 119 (2006) 1562-1573
    • (2006) Journal of the Acoustical Society of America , vol.119 , pp. 1562-1573
    • Cooke, M.1
  • 9
    • 37849011878
    • The foreign language cocktail party problem: energetic and informational masking effects in non-native speech perception
    • Cooke M., Garcia Lecumberri M., and Barker J. The foreign language cocktail party problem: energetic and informational masking effects in non-native speech perception. Journal of the Acoustical Society of America 123 (2008) 414-427
    • (2008) Journal of the Acoustical Society of America , vol.123 , pp. 414-427
    • Cooke, M.1    Garcia Lecumberri, M.2    Barker, J.3
  • 10
    • 0035342414
    • Robust automatic speech recognition with missing and uncertain acoustic data
    • Cooke M., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and uncertain acoustic data. Speech Communication 34 3 (2001) 267-285
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 11
    • 69249202377
    • Monaural speech separation and recognition challenge
    • Cooke M., Hershey J., and Rennie S. Monaural speech separation and recognition challenge. Computer Speech and Language 24 1 (2010) 1-15
    • (2010) Computer Speech and Language , vol.24 , Issue.1 , pp. 1-15
    • Cooke, M.1    Hershey, J.2    Rennie, S.3
  • 12
    • 34247623029
    • An automatic speech recognition system based on the scene analysis account of auditory perception
    • Coy A., and Barker J. An automatic speech recognition system based on the scene analysis account of auditory perception. Speech Communication 49 5 (2007) 384-401
    • (2007) Speech Communication , vol.49 , Issue.5 , pp. 384-401
    • Coy, A.1    Barker, J.2
  • 13
    • 33846957558
    • Auditory grouping and attention to speech (keynote paper)
    • Darwin, C., 2001. Auditory grouping and attention to speech (keynote paper). In: Proceedings of the Institute of Acoustics, vol. 23, pp. 165-172.
    • (2001) Proceedings of the Institute of Acoustics , vol.23 , pp. 165-172
    • Darwin, C.1
  • 14
    • 0027298253
    • Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing
    • de Cheveigné A. Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing. Journal of the Acoustical Society of America 93 6 (1993) 3271-3290
    • (1993) Journal of the Acoustical Society of America , vol.93 , Issue.6 , pp. 3271-3290
    • de Cheveigné, A.1
  • 16
    • 84987702417
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Hirsch, H., Pearce, D., 2000. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proceedings of ICSLP 2000, vol. 4, pp. 29-32.
    • (2000) Proceedings of ICSLP , vol.4 , pp. 29-32
    • Hirsch, H.1    Pearce, D.2
  • 17
    • 44949258898
    • Super-human multi-talker speech recognition: The IBM 2006 Speech Separation Challenge system
    • Kristjansson, T., Hershey, J., Olsen, P., Rennie, S., Gopinath, R., 2006. Super-human multi-talker speech recognition: the IBM 2006 Speech Separation Challenge system. In: Proceedings of Interspeech 2006, Pittsburgh.
    • (2006) Proceedings of Interspeech
    • Kristjansson, T.1    Hershey, J.2    Olsen, P.3    Rennie, S.4    Gopinath, R.5
  • 18
    • 34748817500
    • Exploiting correlogram structure for robust speech recognition with multiple speech sources
    • Ma N., Green P., Barker J., and Coy A. Exploiting correlogram structure for robust speech recognition with multiple speech sources. Speech Communication 49 (2007) 874-891
    • (2007) Speech Communication , vol.49 , pp. 874-891
    • Ma, N.1    Green, P.2    Barker, J.3    Coy, A.4
  • 19
    • 0026654967
    • Modeling the identification of concurrent vowels with different fundamental frequencies
    • Meddis R., and Hewitt M. Modeling the identification of concurrent vowels with different fundamental frequencies. Journal of the Acoustical Society of America 91 1 (1992) 233-245
    • (1992) Journal of the Acoustical Society of America , vol.91 , Issue.1 , pp. 233-245
    • Meddis, R.1    Hewitt, M.2
  • 21
    • 0000950331
    • The watershed transform: definitions, algorithms and parallelization strategies
    • Roerdink J., and Meijster A. The watershed transform: definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41 1-2 (2001) 187-228
    • (2001) Fundamenta Informaticae , vol.41 , Issue.1-2 , pp. 187-228
    • Roerdink, J.1    Meijster, A.2
  • 22
    • 44849140301
    • Speech recognition using factorial hidden markov models for separation in the feature space
    • Virtanen, T., 2006. Speech recognition using factorial hidden markov models for separation in the feature space. In: Proceedings of Interspeech 2006, Pittsburgh.
    • (2006) Proceedings of Interspeech
    • Virtanen, T.1
  • 23
    • 0029249228
    • Spectral redundancy: intelligibility of sentences heard through narrow spectral slits
    • Warren R., Riener K., Bashford J., and Brubaker B. Spectral redundancy: intelligibility of sentences heard through narrow spectral slits. Perception and Psychophysics 57 2 (1995) 175-182
    • (1995) Perception and Psychophysics , vol.57 , Issue.2 , pp. 175-182
    • Warren, R.1    Riener, K.2    Bashford, J.3    Brubaker, B.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.