SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 22, Issue 2, 2014, Pages 368-380

Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition

(3) Kallasjoki, Heikki a Gemmeke, Jort F b Palomäki, Kalle J a

a AALTO UNIVERSITY (Finland)

b UNIVERSITY OF LEUVEN (Belgium)

Author keywords

Exemplar based; Noise robustness; Observation uncertainty; Speech recognition; Uncertainty estimation

Indexed keywords

ESTIMATION; SPEECH RECOGNITION; ACOUSTIC NOISE; GAUSSIAN DISTRIBUTION; HUMAN COMPUTER INTERACTION; SEPARATION; SOURCE SEPARATION; SPEECH; UNCERTAINTY ANALYSIS;

AUTOMATIC SPEECH RECOGNITION; EXEMPLAR-BASED; FEATURE ENHANCEMENT; GAUSSIAN MIXTURE MODEL; NOISE ROBUST SPEECH RECOGNITION; NOISE ROBUSTNESS; OBSERVATION UNCERTAINTIES; UNCERTAINTY ESTIMATION;

UNCERTAINTY ANALYSIS; SPEECH RECOGNITION;

EID: 84897933562 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2013.2292328 Document Type: Article

Times cited : (8)

References (28)

1
- 18744401086
- Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
- DOI 10.1109/TSA.2005.845814
- L. Deng, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 412-421, May 2005. (Pubitemid 40666175)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.3 , pp. 412-421
- Deng, L.¹ Droppo, J.² Acero, A.³

2
- 79959814198
- Observation uncertainty measures for sparse imputation
- J. F. Gemmeke, U. Remes, and K. J. Palomäki, "Observation uncertainty measures for sparse imputation," in Proc. INTERSPEECH, Makuhari, Japan, 2010, pp. 2262-2265.
- Proc. INTERSPEECH, Makuhari, Japan, 2010 , pp. 2262-2265
- Gemmeke, J.F.¹ Remes, U.² Palomäki, K.J.³

3
- 40249103761
- Issues with uncertainty decoding for noise robust automatic speech recognition
- H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol. 50, no. 4, pp. 265-277, 2008.
- (2008) Speech Commun. , vol.50 , Issue.4 , pp. 265-277
- Liao, H.¹ Gales, M.J.F.²

4
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- [Online]. Available
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, 2001 [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0167639300000340
- (2001) Speech Commun. , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

5
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- Sep.
- S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

6
- 77954583785
- Independent component analysis and time-frequency masking for speech recognition in multitalker conditions
- [Online]. Available
- D. Kolossa, R. Fernandez Astudillo, E. Hoffmann, and R. Orglmeister, "Independent component analysis and time-frequency masking for speech recognition in multitalker conditions," EURASIP J. Audio, Speech, Music Process., vol. 2010, no. 1, p. 651420, 2010 [Online]. Available: http://asmp.eurasipjournals.com/content/2010/1/651420
- (2010) EURASIP J. Audio, Speech, Music Process. , vol.2010 , Issue.1 , pp. 651420
- Kolossa, D.¹ Fernandez Astudillo, R.² Hoffmann, E.³ Orglmeister, R.⁴

7
- 69249159165
- A computational auditory scene analysis system for speech segregation and robust speech recognition
- [Online]. Available: Speech Separation and Recognition Challenge
- Y. Shao, S. Srinivasan, Z. Jin, and D. Wang, "A computational auditory scene analysis system for speech segregation and robust speech recognition," Comput. Speech Lang., vol. 24, no. 1, pp. 77-93, 2010 [Online]. Available: http://www.sciencedirect.com/science/article/pii/ S088523080800020X, Speech Separation and Recognition Challenge
- (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 77-93
- Shao, Y.¹ Srinivasan, S.² Jin, Z.³ Wang, D.⁴

8
- 84940448917
- Uncertainty-based learning of acoustic models from noisy data
- Jul. [Online]. Available
- A. Ozerov, M. Lagrange, and E. Vincent, "Uncertainty-based learning of acoustic models from noisy data," Comput. Speech Lang., Jul. 2012 [Online]. Available: http://www.sciencedirect.com/science/article/pii/ S0885230812000502
- (2012) Comput. Speech Lang.
- Ozerov, A.¹ Lagrange, M.² Vincent, E.³

9
- 84893328634
- Integration of beamforming and uncertainty-of-observation techniques for robust asr in multi-source environments
- [Online]. Available: special on Speech Separation and Recognition in Multisource Environments
- R. F. Astudillo, D. Kolossa, A. Abad, S. Zeiler, R. Saeidi, P. Mowlaee, J. P. da Silva Neto, and R.Martin, "Integration of beamforming and uncertainty-of-observation techniques for robust asr in multi-source environments," Comput. Speech Lang., vol. 27, no. 3, pp. 837-850, 2013 [Online]. Available: http://www.sciencedirect.com/science/article/pii/ S0885230812000575, special on Speech Separation and Recognition in Multisource Environments
- (2013) Comput. Speech Lang. , vol.27 , Issue.3 , pp. 837-850
- Astudillo, R.F.¹ Kolossa, D.² Abad, A.³ Zeiler, S.⁴ Saeidi, R.⁵ Mowlaee, P.⁶ Da Silva Neto, J.P.⁷ Martin, R.⁸

10
- 84865766789
- Uncertainty measures for improving exemplar-based source separation
- H. Kallasjoki, U. Remes, J. F. Gemmeke, T. Virtanen, and K. J. Palomäki, "Uncertainty measures for improving exemplar-based source separation," in Proc. INTERSPEECH, Florence, Italy, Sep. 2011.
- Proc. INTERSPEECH, Florence, Italy, Sep. 2011
- Kallasjoki, H.¹ Remes, U.² Gemmeke, J.F.³ Virtanen, T.⁴ Palomäki, K.J.⁵

11
- 79959819066
- Ph.D. dissertation, Technische Univ. Berlin, Berlin, Germany
- R. F. Astudillo, "Integration of short-time Fourier domain speech enhancement and observation uncertainty techniques for robust automatic speech recognition," Ph.D. dissertation, Technische Univ. Berlin, Berlin, Germany, 2010.
- (2010) Integration of Short-time Fourier Domain Speech Enhancement and Observation Uncertainty Techniques for Robust Automatic Speech Recognition
- Astudillo, R.F.¹

12
- 79960657803
- Exemplar-based sparse representations for noise robust automatic speech recognition
- Sep.
- J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, Sep. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.7 , pp. 2067-2080
- Gemmeke, J.¹ Virtanen, T.² Hurmalainen, A.³

13
- 85032752215
- Exemplar-based processing for speech recognition: An overview
- Nov.
- T. Sainath, B. Ramabhadran, D. Nahamoo, D. Kanevsky, D. Van Compernolle, K. Demuynck, J. Gemmeke, J. Bellegarda, and S. Sundaram, "Exemplar-based processing for speech recognition: An overview," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 98-113, Nov. 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 98-113
- Sainath, T.¹ Ramabhadran, B.² Nahamoo, D.³ Kanevsky, D.⁴ Van Compernolle, D.⁵ Demuynck, K.⁶ Gemmeke, J.⁷ Bellegarda, J.⁸ Sundaram, S.⁹

14
- 0036291376
- Uncertainty decoding with SPLICE for noise robust speech recognition
- J. Droppo, A. Acero, and L.Deng, "Uncertainty decoding with SPLICE for noise robust speech recognition," in Proc. ICASSP, Orlando, FL, USA, May 2002, vol. 1, pp. I-57-I-60.
- Proc. ICASSP, Orlando, FL, USA, May 2002 , vol.1
- Droppo, J.¹ Acero, A.² Deng, L.³

15
- 79959818117
- Non-negative matrix factorization based compensation of music for automatic speech recognition
- B. Raj, T. Virtanen, S. Chaudhure, and R. Singh, "Non-negative matrix factorization based compensation of music for automatic speech recognition," in Proc. INTERSPEECH, 2010, pp. 717-720.
- Proc. INTERSPEECH, 2010 , pp. 717-720
- Raj, B.¹ Virtanen, T.² Chaudhure, S.³ Singh, R.⁴

16
- 2942539074
- Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
- [Online]. Available
- K. J. Palomäki, G. J. Brown, and J. P. Barker, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition," Speech Commun., vol. 43, no. 12, pp. 123-142, 2004 [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0167639304000238
- (2004) Speech Commun. , vol.43 , Issue.12 , pp. 123-142
- Palomäki, K.J.¹ Brown, G.J.² Barker, J.P.³

17
- 34547517122
- Ph.D. dissertation, Georgia Inst. of Technol., Atlanta, GA, USA
- J. A. Arrowood, "Using observation uncertainty for robust speech recognition," Ph.D. dissertation, Georgia Inst. of Technol., Atlanta, GA, USA, 2003.
- (2003) Using Observation Uncertainty for Robust Speech Recognition
- Arrowood, J.A.¹

18
- 84890541336
- Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments
- H.Kallasjoki, S. Keronen, G. J. Brown, J. F. Gemmeke, U. Remes, and K. J. Palomäki, "Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments," in Proc. CHiME 2011 Workshop Mach. Listening in Multisource Environ., Florence, Italy, 2011, pp. 58-63.
- Proc. CHiME 2011 Workshop Mach. Listening in Multisource Environ., Florence, Italy, 2011 , pp. 58-63
- Kallasjoki, H.¹ Keronen, S.² Brown, G.J.³ Gemmeke, J.F.⁴ Remes, U.⁵ Palomäki, K.J.⁶

19
- 84910032186
- SPEECON - Speech databases for consumer devices: Database specification and validation
- D. Iskra, B. Grosskopf, K. Marasek, H. van den Heuvel, F. Diehl, and A. Kiessling, "SPEECON - speech databases for consumer devices: Database specification and validation," in Proc. LREC, 2002, pp. 329-333.
- Proc. LREC, 2002 , pp. 329-333
- Iskra, D.¹ Grosskopf, B.² Marasek, K.³ Van Den Heuvel, H.⁴ Diehl, F.⁵ Kiessling, A.⁶

20
- 79959834868
- Artificial and online acquired noise dictionaries for noise robust ASR
- J. F. Gemmeke and T. Virtanen, "Artificial and online acquired noise dictionaries for noise robust ASR," in Proc. INTERSPEECH, 2010, pp. 2082-2085.
- Proc. INTERSPEECH, 2010 , pp. 2082-2085
- Gemmeke, J.F.¹ Virtanen, T.²

21
- 0034842343
- Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system
- J. Droppo, A. Acero, and L. Deng, "Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system," in Proc. ICASSP, Salt Lake City, UT, USA, May 2001.
- Proc. ICASSP, Salt Lake City, UT, USA, May 2001
- Droppo, J.¹ Acero, A.² Deng, L.³

22
- 85009089669
- Duration modeling techniques for continuous speech recognition
- J. Pylkkönen and M. Kurimo, "Duration modeling techniques for continuous speech recognition," in Proc. INTERSPEECH, 2004, pp. 385-388.
- Proc. INTERSPEECH, 2004 , pp. 385-388
- Pylkkönen, J.¹ Kurimo, M.²

23
- 33746524944
- Unlimited vocabulary speech recognition with morph language models applied to Finnish
- DOI 10.1016/j.csl.2005.07.002, PII S0885230805000331
- T. Hirsimäki, M. Creutz, V. Siivola, M. Kurimo, S. Virpioja, and S. Pylkkönen, "Unlimited vocabulary speech recognition with morph language models applied to Finnish," Comput. Speech Lang., vol. 20, no. 4, pp. 515-541, 2006. (Pubitemid 44142005)
- (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 515-541
- Hirsimaki, T.¹ Creutz, M.² Siivola, V.³ Kurimo, M.⁴ Virpioja, S.⁵ Pylkkonen, J.⁶

24
- 33745202806
- Joint uncertainty decoding for noise robust speech recognition
- ISCA
- H. Liao and M. J. F. Gales, "Joint uncertainty decoding for noise robust speech recognition.," in Proc. INTERSPEECH, Lisbon, Portugal, Sep. 2005, pp. 3129-3132, ISCA.
- Proc. INTERSPEECH, Lisbon, Portugal, Sep. 2005 , pp. 3129-3132
- Liao, H.¹ Gales, M.J.F.²

25
- 85009154856
- Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement
- V. Stouten, H. V. Hamme, and P. Wambacq, "Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement," in Proc. ICSLP, Jeju Island, Korea, Oct. 2004.
- Proc. ICSLP, Jeju Island, Korea, Oct. 2004
- Stouten, V.¹ Hamme, H.V.² Wambacq, P.³

26
- 85009154399
- Including uncertainty of speech observations in robust speech recognition
- M. C. Benítez, J. C. Segura, A. Torre, J. Ramrez, and A. Rubio, "Including uncertainty of speech observations in robust speech recognition," in Proc. ICSLP, Jeju Island, Korea, Oct. 2004, pp. 137-140.
- Proc. ICSLP, Jeju Island, Korea, Oct. 2004 , pp. 137-140
- Benítez, M.C.¹ Segura, J.C.² Torre, A.³ Ramrez, J.⁴ Rubio, A.⁵

27
- 51449111646
- Bayesian extensions to non-negative matrix factorisation for audio signal modelling
- T. Virtanen, A. Cemgil, and S. Godsill, "Bayesian extensions to non-negative matrix factorisation for audio signal modelling," in Proc. ICASSP, Las Vegas, NV, USA, 2008, pp. 1825-1828.
- Proc. ICASSP, Las Vegas, NV, USA, 2008 , pp. 1825-1828
- Virtanen, T.¹ Cemgil, A.² Godsill, S.³

28
- 79959845286
- The CHiME corpus: A resource and challenge for Computational Hearing in Multisource Environments
- H. Christensen, J. Barker, N. Ma, and P. Green, "The CHiME corpus: A resource and challenge for Computational Hearing in Multisource Environments," in Proc. INTERSPEECH, Makuhari, Japan, 2010, pp. 1918-1921.
- Proc. INTERSPEECH, Makuhari, Japan, 2010 , pp. 1918-1921
- Christensen, H.¹ Barker, J.² Ma, N.³ Green, P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.