SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 17, Issue 7, 2009, Pages 1325-1334

Stereo-based stochastic mapping for robust speech recognition

(3) Afify, Mohamed a Cui, Xiaodong b Gao, Yuqing b

a Orange Lab (Egypt)

b IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

Noise robustness; Nonlinear mapping; Speech recognition; Stereo data

Indexed keywords

CLEAN SPEECH; DIGIT RECOGNITION; FIELD DATA; GAUSSIAN MIXTURE MODEL; JOINT DISTRIBUTIONS; LARGE VOCABULARY; LINEAR TRANSFORM; LINEAR TRANSFORMATION; MAXIMUM A POSTERIORI CRITERIONS; MINIMUM MEAN SQUARE ERROR CRITERION; NOISE ROBUSTNESS; NONLINEAR MAPPING; REAL ENVIRONMENTS; ROBUST SPEECH RECOGNITION; STEREO-BASED; STEREO-DATA; STOCHASTIC MAPPING; WORD ERROR RATE;

ACOUSTIC NOISE; ERROR COMPENSATION; MAPPING; MATHEMATICAL TRANSFORMATIONS;

SPEECH RECOGNITION;

EID: 68549125183 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2018017 Document Type: Article

Times cited : (34)

References (37)

1
- 0004319970
- Acoustical and environmental robustness for automatic speech recognition,
- Ph.D. dissertation, Elect. Comput. Eng. Dept, Carnegie Mellon Univ, Pittsburgh, PA, Sep
- A. Acero, "Acoustical and environmental robustness for automatic speech recognition," Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Sep. 1990.
- (1990)
- Acero, A.¹

2
- 34547550766
- Stereo-based stochastic mapping for robust speech recognition
- Honolulu, hi, Apr
- M. Afify, X. Cui, and Y. Gao, "Stereo-based stochastic mapping for robust speech recognition," in Proc. ICASSP'07, Honolulu, hi, Apr. 2007, pp. 377-380.
- (2007) Proc. ICASSP'07 , pp. 377-380
- Afify, M.¹ Cui, X.² Gao, Y.³

3
- 18744396687
- Accurate compensation in the log-spectral domain for noisy speech recognition
- May
- M. Afify, "Accurate compensation in the log-spectral domain for noisy speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 388-398, May 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 388-398
- Afify, M.¹

4
- 0030643240
- Subband-based speech recognition
- Munich, Germany, Apr
- H. Bourlard and S. Dupont, "Subband-based speech recognition," in Proc. ICASSP'97, Munich, Germany, Apr. 1997, pp. 1251-1254.
- (1997) Proc. ICASSP'97 , pp. 1251-1254
- Bourlard, H.¹ Dupont, S.²

5
- 84867220211
- N-best based stochastic mapping on stereo HMM for noise robust speech recognition
- Brisbane, Australia, Sep
- X. Cui, M. Afify, and Y. Gao, "N-best based stochastic mapping on stereo HMM for noise robust speech recognition," in Proc. Interspeech'08, Brisbane, Australia, Sep. 2008.
- (2008) Proc. Interspeech'08
- Cui, X.¹ Afify, M.² Gao, Y.³

6
- 0029375590
- Speaker adaptation by constrained estimation of Gaussian mixtures
- Sep
- V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation by constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.¹ Rtischev, D.² Neumeyer, L.³

7
- 85006734596
- Evaluation of the SPLICE algorithm on the AURORA 2 database
- Aalborg, Denmark, Sep
- J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the AURORA 2 database," in Proc. Eurospeech'01, Aalborg, Denmark, Sep. 2001.
- (2001) Proc. Eurospeech'01
- Droppo, J.¹ Deng, L.² Acero, A.³

8
- 0036291376
- Uncertainty decoding with splice for noise robust speech recognition
- Orlando, FL, May
- J. Droppo, L. Deng, and A. Acero, "Uncertainty decoding with splice for noise robust speech recognition," in Proc. ICASSP'02, Orlando, FL, May 2002, pp. 57-80.
- (2002) Proc. ICASSP'02 , pp. 57-80
- Droppo, J.¹ Deng, L.² Acero, A.³

9
- 85009074657
- ALGONQUIN: Iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition
- Aalborg, Denmark, Sep
- B. Frey, L. Deng, A. Acero, and T. Kristjanson, "ALGONQUIN: Iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition," in Proc. Eurospeech'01, Aalborg, Denmark, Sep. 2001.
- (2001) Proc. Eurospeech'01
- Frey, B.¹ Deng, L.² Acero, A.³ Kristjanson, T.⁴

10
- 0030245128
- Robust continuous speech recognition using parallel model combination
- Sep
- M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.5 , pp. 352-359
- Gales, M.¹ Young, S.²

11
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., pp. 193-228, 1998.
- (1998) Comput. Speech Lang , pp. 193-228
- Gales, M.¹

12
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- May
- M. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
- (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.3 , pp. 272-281
- Gales, M.¹

13
- 33947700425
- IBM MASTOR: Multilingual automatic speech-to-speech translator
- Tolouse, France
- Y. Gao, B. Zhou, L. Gu, R. Sarikaya, H.-K. Kuo., A.-V.I. Rosti, M. Afify, and W. Zhu, "IBM MASTOR: Multilingual automatic speech-to-speech translator," in Proc. ICASSP'06, Tolouse, France, 2006, pp. 1205, 1208.
- (2006) Proc. ICASSP'06
- Gao, Y.¹ Zhou, B.² Gu, L.³ Sarikaya, R.⁴ Kuo, H.-K.⁵ Rosti, A.-V.I.⁶ Afify, M.⁷ Zhu, W.⁸

14
- 0029288202
- Speech recognition in noisy environments: A survey
- Apr
- Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, Apr. 1995.
- (1995) Speech Commun , vol.16 , pp. 261-291
- Gong, Y.¹

15
- 68549096935
- Model-based fusion ofbone and air sensors for speech enhancement and robust speech recognition
- J. Hershey, T.Kristjansson, and Z. Zhang, "Model-based fusion ofbone and air sensors for speech enhancement and robust speech recognition," in Proc. ISCA Workshop Statist. Percept. Audio Process., 2004.
- (2004) Proc. ISCA Workshop Statist. Percept. Audio Process
- Hershey, J.¹ Kristjansson, T.² Zhang, Z.³

16
- 0004056285
- Upper Saddle River, NJ: Prentice-Hall
- X. Huang, A. Acero, and H. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Upper Saddle River, NJ: Prentice-Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
- Huang, X.¹ Acero, A.² Hon, H.³

17
- 34547526633
- A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations
- Pittsburgh, PA, Sep
- Q. Huo and D. Zhu, "A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations," in Proc. Interspeech'06, Pittsburgh, PA, Sep. 2006.
- (2006) Proc. Interspeech'06
- Huo, Q.¹ Zhu, D.²

18
- 0023167174
- Signal restoration by spectral mapping
- Apr
- B. H. Juang and L. R. Rabiner, "Signal restoration by spectral mapping," in Proc. ICASSP'87, Apr. 1987, pp. 2368-2372.
- (1987) Proc. ICASSP'87 , pp. 2368-2372
- Juang, B.H.¹ Rabiner, L.R.²

19
- 33947669945
- Feature adaptation based on Gaussian posteriors
- Tolouse, France, Apr
- S. Kozat, K. Visweswariah, andR. Gopinath, "Feature adaptation based on Gaussian posteriors," in Proc. ICASSP'06, Tolouse, France, Apr. 2006, pp. 221-224.
- (2006) Proc. ICASSP'06 , pp. 221-224
- Kozat, S.¹ Visweswariah, K.² andR³ Gopinath⁴

20
- 0036293930
- Accounting for uncertainity in observations: A new paradigm for robust speech recognition
- Orlando, FL, May
- T. Kristjansson and B. Frey, "Accounting for uncertainity in observations: A new paradigm for robust speech recognition," in Proc. ICASSP'02, Orlando, FL, May 2002.
- (2002) Proc. ICASSP'02
- Kristjansson, T.¹ Frey, B.²

21
- 0032140546
- On stochastic feature and model compensation approaches to robust speech recognition
- C. H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Commun., vol. 25, pp. 29-47, 1998.
- (1998) Speech Commun , vol.25 , pp. 29-47
- Lee, C.H.¹

22
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- Aug
- C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
- Lee, C.H.¹ Huo, Q.²

23
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput. Speech Lang , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

24
- 68549095140
- High performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
- Kyoto, Japan
- J. Li, L. Deng, Y.Gong, and A. Acero, "High performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU'07, Kyoto, Japan, 2007.
- (2007) Proc. ASRU'07
- Li, J.¹ Deng, L.² Gong, Y.³ Acero, A.⁴

25
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- Honolulu, HI, Apr
- H. Liao and M. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP'07, Honolulu, HI, Apr. 2007, pp. 389-392.
- (2007) Proc. ICASSP'07 , pp. 389-392
- Liao, H.¹ Gales, M.²

26
- 33947630738
- Joint uncertainity decoding for noise robust speech recognition
- Lisbone, Portugal, Sep
- H. Liao and M. Gales, "Joint uncertainity decoding for noise robust speech recognition," in Proc. Eurospeech'05, Lisbone, Portugal, Sep. 2005.
- (2005) Proc. Eurospeech'05
- Liao, H.¹ Gales, M.²

27
- 68549134095
- Multi-style training for robust isolated-word recognition
- Mar. 24-26
- R. Lippmann, E. Martin, and D. Paul, "Multi-style training for robust isolated-word recognition," in Proc. DARPA Speech Recognition Workshop, Mar. 24-26, 1987, pp. 96-99.
- (1987) Proc. DARPA Speech Recognition Workshop , pp. 96-99
- Lippmann, R.¹ Martin, E.² Paul, D.³

28
- 0026370318
- Word recognition in the car: Speech enhancement/spectral transformations
- Toronto
- C. Mokbel and G. Chollet, "Word recognition in the car: Speech enhancement/spectral transformations," in Proc. ICASSP'91, Toronto, 1991, pp. 925-928.
- (1991) Proc. ICASSP'91 , pp. 925-928
- Mokbel, C.¹ Chollet, G.²

29
- 0029725301
- A vector taylor series approach for environment-independent speech recognition
- Atlanta, GA, May
- P. J. Moreno, B. Raj, and R. M. Stern, "A vector taylor series approach for environment-independent speech recognition," in Proc. ICASSP, Atlanta, GA, May 1996, pp. 733-736.
- (1996) Proc. ICASSP , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

30
- 0002127129
- Probabilistic optimal filtering for robust speech recognition
- Adelaide, Australia, Apr
- L. Neumeyer and M. Weintraub, "Probabilistic optimal filtering for robust speech recognition," in Proc. ICASSP'94, Adelaide, Australia, Apr. 1994, pp. 417-420.
- (1994) Proc. ICASSP'94 , pp. 417-420
- Neumeyer, L.¹ Weintraub, M.²

31
- 68549086009
- personal communication, May
- M. K. Omar, personal communication, May 2004.
- (2004)
- Omar, M.K.¹

32
- 0035278964
- Time-frequency distributions for automatic speech recognition
- Mar
- A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 9, pp. 196-200, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , pp. 196-200
- Potamianos, A.¹ Maragos, P.²

33
- 0034841234
- Linear feature space projections for speaker adaptation
- Salt lake City, UT, Apr
- G. Saon, G. Zweig, and M. Padmanabhan, "Linear feature space projections for speaker adaptation," in Proc. ICASSP'01, Salt lake City, UT, Apr. 2001, pp. 325-328.
- (2001) Proc. ICASSP'01 , pp. 325-328
- Saon, G.¹ Zweig, G.² Padmanabhan, M.³

34
- 85009154856
- Accounting for the uncertainity of speech estimates in the context of model-based feature enhancement
- Jeju, Korea, Sep
- V. Stouten, H. Van Hamme, and P. Wambacq, "Accounting for the uncertainity of speech estimates in the context of model-based feature enhancement," in Proc. ICSLP'04, Jeju, Korea, Sep. 2004.
- (2004) Proc. ICSLP'04
- Stouten, V.¹ Van Hamme, H.² Wambacq, P.³

35
- 0032026483
- Continuous probabilistic transform for voice conversion
- Jan
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, pp. 131-142, Jan. 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

36
- 33947637126
- Feature adaptation using projection of Gaussian posteriors
- Lisbone, Portugal, Sep
- K. Visweswariah and P. Olsen, "Feature adaptation using projection of Gaussian posteriors," in Proc. Interspeech'05, Lisbone, Portugal, Sep. 2005.
- (2005) Proc. Interspeech'05
- Visweswariah, K.¹ Olsen, P.²

37
- 68549086008
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book for HTK Version 3.1, Dec. 2001
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book (for HTK Version 3.1). Dec. 2001.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.