SCOPUS Information Retrieval Platform

ISCA Workshop on Statistical and Perceptual Audio Processing, SAPA 2010

Volume , Issue , 2010, Pages

Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding

(4) Ma, Ning (a); Barker, Jon (a); Christensen, Heidi (a); Green, Phil (a)

(a) University of Sheffield (United Kingdom)

Author keywords

Fragment decoding; Missing data; Noise robust speech recognition; Reverberation

Indexed keywords

ACOUSTIC NOISE; DECODING; FLOORS; MICROPHONES; SIGNAL TO NOISE RATIO;

ACOUSTIC EVENTS; ADAPTIVE NOISE; FRAGMENT DECODING; HOME ENVIRONMENT; INDOOR ENVIRONMENT; MICROPHONE SPEECH; MISSING DATA; NOISE FLOOR; NOISE ROBUST SPEECH RECOGNITION; TARGET SPEECH;

SPEECH RECOGNITION;

EID: 84940458837
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (1)

References (18)

1
- 0026882842
- Experiments with nonlinear spectral subtractor, Hidden Markov Models and the projection for robust speech recognition in cars
- P. Lockwood and J. Boudy, “Experiments with nonlinear spectral subtractor, Hidden Markov Models and the projection for robust speech recognition in cars,” Speech Communication, vol. 11, 1992.
- (1992) Speech Communication , vol.11
- Lockwood, P.¹ Boudy, J.²

2
- 0035342414
- Robust automatic speech recognition with missing and uncertain acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, “Robust automatic speech recognition with missing and uncertain acoustic data,” Speech Commun., vol. 34, no. 3, pp. 267–285, 2001.
- (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

3
- 0025681008
- Hidden Markov model decomposition of speech and noise
- A. Varga and R. Moore, “Hidden Markov model decomposition of speech and noise,” in Proc. IEEE ICASSP’90, 1990, pp. 845–848.
- (1990) Proc. IEEE ICASSP’90 , pp. 845-848
- Varga, A.¹ Moore, R.²

4
- 85135375893
- HMM recognition in noise using parallel model combination
- Berlin
- M. Gales and S. Young, “HMM recognition in noise using parallel model combination,” in Proc. Eurospeech’93, Berlin, 1993.
- (1993) Proc. Eurospeech’93
- Gales, M.¹ Young, S.²

5
- 85009074657
- ALGONQUIN: Iterating Laplace’s method to remove multiple types of distortion for robust speech recognition
- Aalborg, Denmark
- B. Frey, L. Deng, A. Acero, and T. Kristjansson, “ALGONQUIN: Iterating Laplace’s method to remove multiple types of distortion for robust speech recognition,” in Proc. Eurospeech’01, Aalborg, Denmark, 2001, pp. 901–904.
- (2001) Proc. Eurospeech’01 , pp. 901-904
- Frey, B.¹ Deng, L.² Acero, A.³ Kristjansson, T.⁴

6
- 69249202377
- Monaural speech separation and recognition challenge
- M. Cooke, J. Hershey, and S. Rennie, “Monaural speech separation and recognition challenge,” Comput. Speech. Lang., vol. 24, no. 1, pp. 1–15, 2010.
- (2010) Comput. Speech. Lang , vol.24 , Issue.1 , pp. 1-15
- Cooke, M.¹ Hershey, J.² Rennie, S.³

7
- 79959845286
- The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments
- Makuhari
- H. Christensen, J. Barker, N. Ma, and P. Green, “The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments,” in Proc. Interspeech’10, Makuhari, 2010.
- (2010) Proc. Interspeech’10
- Christensen, H.¹ Barker, J.² Ma, N.³ Green, P.⁴

8
- 33750368310
- An audiovisual corpus for speech perception and automatic speech recognition
- M. Cooke, J. Barker, S. Cunningham, and X. Shao, “An audiovisual corpus for speech perception and automatic speech recognition,” J. Acoust. Soc. Am., vol. 120, pp. 2421–2424, 2006.
- (2006) J. Acoust. Soc. Am , vol.120 , pp. 2421-2424
- Cooke, M.¹ Barker, J.² Cunningham, S.³ Shao, X.⁴

9
- 69249231059
- Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
- J. Barker, N. Ma, A. Coy, and M. Cooke, “Speech fragment decoding techniques for simultaneous speaker identification and speech recognition,” Comput. Speech. Lang., vol. 24, no. 1, pp. 94–111, 2010.
- (2010) Comput. Speech. Lang , vol.24 , Issue.1 , pp. 94-111
- Barker, J.¹ Ma, N.² Coy, A.³ Cooke, M.⁴

10
- 0028531926
- Computational auditory scene analysis
- G. Brown and M. Cooke, “Computational auditory scene analysis,” Comput. Speech. Lang., vol. 8, no. 4, pp. 297–336, 1994.
- (1994) Comput. Speech. Lang , vol.8 , Issue.4 , pp. 297-336
- Brown, G.¹ Cooke, M.²

11
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B. Glasberg and B. Moore, “Derivation of auditory filter shapes from notched-noise data,” Hearing Res., vol. 47, pp. 103–138, 1990.
- (1990) Hearing Res , vol.47 , pp. 103-138
- Glasberg, B.¹ Moore, B.²

12
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- Beijing
- J. Barker, L. Josifovski, M. Cooke, and P. Green, “Soft decisions in missing data techniques for robust automatic speech recognition,” in Proc. ICSLP’00, Beijing, 2000, pp. 373–376.
- (2000) Proc. ICSLP’00 , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.³ Green, P.⁴

13
- 85009106519
- Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise
- Aalborg
- J. Barker, M. Cooke, and P. Green, “Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise,” in Proc. Eurospeech’01, Aalborg, 2001, pp. 213–216.
- (2001) Proc. Eurospeech’01 , pp. 213-216
- Barker, J.¹ Cooke, M.² Green, P.³

14
- 11144316019
- Decoding speech in the presence of other sources
- J. Barker, M. Cooke, and D. Ellis, “Decoding speech in the presence of other sources,” Speech Commun., vol. 45, no. 1, pp. 5–25, 2005.
- (2005) Speech Commun , vol.45 , Issue.1 , pp. 5-25
- Barker, J.¹ Cooke, M.² Ellis, D.³

15
- 0003684441
- Cambridge, MA: MIT Press
- A. Bregman, Auditory Scene Analysis. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis
- Bregman, A.¹

16
- 44949104414
- Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source
- Pittsburgh, PA
- N. Ma, P. Green, and A. Coy, “Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source,” in Proc. Interspeech’06, Pittsburgh, PA, 2006, pp. 669–672.
- (2006) Proc. Interspeech’06 , pp. 669-672
- Ma, N.¹ Green, P.² Coy, A.³

17
- 34748817500
- Exploiting correlogram structure for robust speech recognition with multiple speech sources
- N. Ma, P. Green, J. Barker, and A. Coy, “Exploiting correlogram structure for robust speech recognition with multiple speech sources,” Speech Commun., vol. 49, no. 12, pp. 874–891, 2007.
- (2007) Speech Commun , vol.49 , Issue.12 , pp. 874-891
- Ma, N.¹ Green, P.² Barker, J.³ Coy, A.⁴

18
- 57849093600
- Integrating pitch and localisation cues at a speech fragment level
- Antwerp
- H. Christensen, N. Ma, S. Wrigley, and J. Barker, “Integrating pitch and localisation cues at a speech fragment level,” in Proc. Interspeech’07, Antwerp, 2007, pp. 2769–2772.
- (2007) Proc. Interspeech’07 , pp. 2769-2772
- Christensen, H.¹ Ma, N.² Wrigley, S.³ Barker, J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.