SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 2, 2010, Pages 296-309

Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition

(5) Buera, Luis a Miguel, Antonio b Saz, Óscar b Ortega, Alfonso b Lleida, Eduardo b

Author keywords

Acoustic model adaptation; data driven feature vector normalization; linear transformation matrices; robust speech recognition; unsupervised

Indexed keywords

EID: 85008564998 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2009.2026441 Document Type: Article

Times cited : (17)

References (28)

1
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong “Speech recognition in noisy environments: A survey,” Speech Commun., vol. 3, no. 16, pp. 261–291, 1995.
- (1995) Speech Commun. , vol.3 , Issue.16 , pp. 261-291
- Gong, Y.¹

2
- 0001972937
- Compensation for environmental degradation in automatic speech recognition
- Pont-au-Mousson, France, Apr.
- R. M. Stern, B. Raj, and P. J. Moreno, “Compensation for environmental degradation in automatic speech recognition,” in Proc. ESCA Tutorial Research Workshop Robust Speech Recognition for Unknown Communication Channels, Pont-au-Mousson, France, Apr. 1997, pp. 33–42.
- (1997) Proc. ESCA Tutorial Research Workshop Robust Speech Recognition for Unknown Communication Channels , pp. 33-42
- Stern, R.M.¹ Raj, B.² Moreno, P.J.³

3
- 64149116705
- Robust speech recognition in the automobile
- Yokohama, Japan, Sep.
- N. Hanai and R. M. Stern, “Robust speech recognition in the automobile,” in Proc. ICSLP, Yokohama, Japan, Sep. 1994, pp. 1339–1342.
- (1994) Proc. ICSLP , pp. 1339-1342
- Hanai, N.¹ Stern, R.M.²

4
- 85009266810
- High performance digit recognition in real car environments
- Denver, CO, Sep.
- U. Yapanel, X. Zhang, and J. Hansen, “High performance digit recognition in real car environments,” in Proc. ICSLP, Denver, CO, Sep. 2002, pp. 793–796.
- (2002) Proc. ICSLP , pp. 793-796
- Yapanel, U.¹ Zhang, X.² Hansen, J.³

5
- 0028517164
- RASTA processing for speech
- Oct.
- H. Hermansky and N. Morgan “RASTA processing for speech,” IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578–589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

6
- 85008584719
- Speech recognition in noisy environments using first-order vector Taylor series
- Mar.
- D. Y. Kim, C. K. Un, and N. S. Kim “Speech recognition in noisy environments using first-order vector Taylor series,” IEEE Trans. Signal Process., vol. 5, no. 3, pp. 57–59, Mar. 1998.
- (1998) IEEE Trans. Signal Process. , vol.5 , Issue.3 , pp. 57-59
- Kim, D.Y.¹ Un, C.K.² Kim, N.S.³

7
- 0004319970
- Acoustical and environmental robustness in automatic speech recognition
- Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Sep.
- A. Acero, “Acoustical and environmental robustness in automatic speech recognition,” Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Sep. 1990.
- (1990)
- Acero, A.¹

8
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. F. Boll “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113–120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

9
- 65549153550
- Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie-Mellon Univ., Apr.
- P. Moreno, “Speech recognition in noisy environments,” Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie-Mellon Univ., Apr. 1996.
- (1996) Speech recognition in noisy environments
- Moreno, P.¹

10
- 44849120851
- Cepstral vector normalization based on stereo data for robust speech recognition
- Mar.
- L. Buera, E. Lleida, A. Miguel, A. Ortega, and O. Saz, “Cepstral vector normalization based on stereo data for robust speech recognition,” IEEE Trans. Speech Audio Process., vol. 15, no. 3, pp. 1098–1113, Mar. 2007.
- (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.3 , pp. 1098-1113
- Buera, L.¹ Lleida, E.² Miguel, A.³ Ortega, A.⁴ Saz, O.⁵

11
- 85006734596
- Evaluation of the SPLICE algorithm on the AURORA2 database
- Aalborg, Denmark
- J. Droppo, L. Deng, and A. Acero, “Evaluation of the SPLICE algorithm on the AURORA2 database,” in Proc. Eurospeech, Aalborg, Denmark, 2001, pp. 217–220.
- (2001) Proc. Eurospeech , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

12
- 0002127129
- Probabilistic optimum filtering for robust speech recognition
- Adelaide, Australia, Apr.
- L. Neumeyer and M. Weintraub, “Probabilistic optimum filtering for robust speech recognition,” in Proc. ICASSP, Adelaide, Australia, Apr. 1994, vol. 1, pp. 417–420.
- (1994) Proc. ICASSP , vol.1 , pp. 417-420
- Neumeyer, L.¹ Weintraub, M.²

13
- 0028996866
- Robust speech recognition in noise using adaptation and mapping techniques
- Detroit, MI, May
- L. Neumeyer and M. Weintraub, “Robust speech recognition in noise using adaptation and mapping techniques,” in Proc. ICASSP, Detroit, MI, May 1995, vol. 1, pp. 141–144.
- (1995) Proc. ICASSP , vol.1 , pp. 141-144
- Neumeyer, L.¹ Weintraub, M.²

14
- 0003778679
- Lattice-based unsupervised MLLR for speaker adaptation
- M. Padmanabhan, G. Saon, and G. Zweig, “Lattice-based unsupervised MLLR for speaker adaptation,” in Proc. ASR, 2000, vol. 2, pp. 128–132.
- (2000) Proc. ASR , vol.2 , pp. 128-132
- Padmanabhan, M.¹ Saon, G.² Zweig, G.³

15
- 44949162200
- Time-dependent cross-probability model for multi-environment model based linear normalization
- Sep.
- L. Buera, E. Lleida, J. Nolazco, A. Miguel, and A. Ortega, “Time-dependent cross-probability model for multi-environment model based linear normalization,” in Proc. ICSLP, Sep. 2006, pp. 1555–1558.
- (2006) Proc. ICSLP , pp. 1555-1558
- Buera, L.¹ Lleida, E.² Nolazco, J.³ Miguel, A.⁴ Ortega, A.⁵

16
- 44949166839
- Local transformation models for speech recognition
- Pittsburgh, PA
- A. Miguel, E. Lleida, A. Juan, L. Buera, A. Ortega, and O. Saz, “Local transformation models for speech recognition,” in Proc. ICSLP, Pittsburgh, PA, 2006, pp. 1598–1601.
- (2006) Proc. ICSLP , pp. 1598-1601
- Miguel, A.¹ Lleida, E.² Juan, A.³ Buera, L.⁴ Ortega, A.⁵ Saz, O.⁶

17
- 33745197687
- Normalization in the acoustic feature space for improved speech recognition
- Ph.D. dissertation, Univ. of Aachen, Aachen, Germany, Feb.
- S. Molau, “Normalization in the acoustic feature space for improved speech recognition,” Ph.D. dissertation, Univ. of Aachen, Aachen, Germany, Feb. 2003.
- (2003)
- Molau, S.¹

18
- 85009223874
- Speechdat-car. A large speech database for automotive environments
- Athens, Greece
- A. Moreno, B. Lindberg, C. Draxler, G. Richard, K. Choukri, S. Euler, and J. Allen, “Speechdat-car. A large speech database for automotive environments,” in Proc. LREC, Athens, Greece, 2000, vol. 2, pp. 895–900.
- (2000) Proc. LREC , vol.2 , pp. 895-900
- Moreno, A.¹ Lindberg, B.² Draxler, C.³ Richard, G.⁴ Choukri, K.⁵ Euler, S.⁶ Allen, J.⁷

19
- 85135275880
- The speechdat-car multilingual speech databases for in-car applications: Some first validation results
- Budapest, Hungary, Sep.
- H. van den Heuvel, J. Boudy, R. Comeyne, S. Euler, A. Moreno, and G. Richard, “The speechdat-car multilingual speech databases for in-car applications: Some first validation results,” in Proc. Eurospeech, Budapest, Hungary, Sep. 1999, vol. 5, pp. 2279–2282.
- (1999) Proc. Eurospeech , vol.5 , pp. 2279-2282
- van den Heuvel, H.¹ Boudy, J.² Comeyne, R.³ Euler, S.⁴ Moreno, A.⁵ Richard, G.⁶

20
- 0038669544
- The aurora experimental framework for the performance evaluations of speech recognition systems under noisy conditions
- Paris, France, Sep.
- H. G. Hirsch and D. Pearce, “The aurora experimental framework for the performance evaluations of speech recognition systems under noisy conditions,” in Proc. ISCA ITRW ASR2000, Paris, France, Sep. 2000, pp. 29–32.
- (2000) Proc. ISCA ITRW ASR2000 , pp. 29-32
- Hirsch, H.G.¹ Pearce, D.²

21
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. Laird, and D. Rubin “Maximum likelihood from incomplete data via the EM algorithm,” J. R. Statist. Soc., vol. 9, no. 1, pp. 1–37, 1977.
- (1977) J. R. Statist. Soc. , vol.9 , Issue.1 , pp. 1-37
- Dempster, A.P.¹ Laird, N.² Rubin, D.³

22
- 65549153550
- Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Apr.
- P. Moreno, “Speech Recognition in Noisy Environments,” Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Apr. 1996.
- (1996) Speech Recognition in Noisy Environments
- Moreno, P.¹

23
- 85006734596
- Evaluation of the splice algorithm on the Aurora2 database
- Sep.
- J. Droppo, L. Deng, and A. Acero, “Evaluation of the splice algorithm on the Aurora2 database,” in Proc. Eurospeech, Sep. 2001, vol. 1, pp. 217–220.
- (2001) Proc. Eurospeech , vol.1 , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

24
- 0025681008
- Hidden Markov model decomposition of speech and noise
- A. P. Varga and R. K. Moore, “Hidden Markov model decomposition of speech and noise,” in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1990, pp. 845–848.
- (1990) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 845-848
- Varga, A.P.¹ Moore, R.K.²

25
- 0009589650
- Speech processing transmission and quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm; Compression algorithms
- Apr. 2000, ETSI ES 201 108 version 1.1.2, Tech. Rep.
- ETSI, “Speech processing transmission and quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm; Compression algorithms,” Apr. 2000, ETSI ES 201 108 version 1.1.2, Tech. Rep.

26
- 85008531089
- Speech processing, transmission and quality aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms
- Oct. 2002, ETSI ES 202 050 version 1.1.1, Tech. Rep.
- ETSI, “Speech processing, transmission and quality aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms,” Oct. 2002, ETSI ES 202 050 version 1.1.1, Tech. Rep.

27
- 85032751521
- Dynamic programming search for continuous speech recognition
- Sep.
- H. Ney and S. Ortmanns “Dynamic programming search for continuous speech recognition,” IEEE Signal Process. Mag., vol. 16, no. 5, pp. 64–83, Sep. 1999.
- (1999) IEEE Signal Process. Mag. , vol.16 , Issue.5 , pp. 64-83
- Ney, H.¹ Ortmanns, S.²

28
- 85008533741
- HTK Book (for HTK Version 3.3)
- Apr.
- S. Young, G. Evermann, M. Gales, T. Hain, D. Tershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woolland, The “HTK Book (for HTK Version 3.3),” Cambridge Univ. Eng. Dept., Apr. 2005.
- (2005) Cambridge Univ. Eng. Dept.
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Tershaw, D.⁵ Moore, G.⁶ Odell, J.⁷ Ollason, D.⁸ Povey, D.⁹ Valtchev, V.¹⁰ Woolland, P.¹¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.