SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Computer Speech and Language

Volumn 20, Issue 1, 2006, Pages 107-123

Improved automatic speech recognition through speaker normalization

(3) Giuliani, Diego a Gerosa, Matteo a,b Brugnara, Fabio a

a ITC IRST (Italy)

b UNIVERSITY OF TRENTO (Italy)

Author keywords

[No Author keywords available]

Indexed keywords

ERROR ANALYSIS; ESTIMATION; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; PERSONNEL TRAINING;

ACOUSTIC MODELING; ACOUSTIC VARIABILITY; MARKOV MODELS; SPEAKER NORMALIZATION;

SPEECH RECOGNITION;

EID: 27744595137 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2005.05.002 Document Type: Article

Times cited : (52)

References (27)

1
- 0030362995
- A compact model for speaker-adaptive training
- Philadelphia, PA
- Anastasakos, T., McDonough, J., Schwartz, R., Makhoul, J., 1996. A compact model for speaker-adaptive training. In: Proc. of ICSLP, Philadelphia, PA, pp. 1137-1140.
- (1996) Proc. of ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

2
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- L.E. Baum, T. Petrie, G. Soules, and N. Weiss A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains Ann. Math. Stat. 41 1970 167 171
- (1970) Ann. Math. Stat. , vol.41 , pp. 167-171
- Baum, L.E.¹ Petrie, T.² Soules, G.³ Weiss, N.⁴

3
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- V. Digalakis, D. Rtischev, and L.G. Neumeyer Speaker adaptation using constrained estimation of Gaussian mixtures IEEE Trans. Speech and Audio Process. 3 5 1995 357 366
- (1995) IEEE Trans. Speech and Audio Process. , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.¹ Rtischev, D.² Neumeyer, L.G.³

4
- 0029725604
- A parametric approach to vocal tract lenght normalization
- Atlanta, GA
- Eide, E., Gish, H., 1996. A parametric approach to vocal tract lenght normalization. In: Proc. of ICASSP, Atlanta, GA, pp. 346-349.
- (1996) Proc. of ICASSP , pp. 346-349
- Eide, E.¹ Gish, H.²

5
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales Maximum likelihood linear transformations for HMM-based speech recognition Computer Speech and Language 12 1998 75 98
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.J.F.¹

6
- 0034227757
- Cluster adaptive training of hidden Markov models
- M.J.F. Gales Cluster adaptive training of hidden Markov models IEEE Trans. Speech and Audio Process. 8 4 2000 417 428
- (2000) IEEE Trans. Speech and Audio Process. , vol.8 , Issue.4 , pp. 417-428
- Gales, M.J.F.¹

7
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- J.-L. Gauvain, and C.-H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Trans. Speech and Audio Process. 2 2 1994 291 298
- (1994) IEEE Trans. Speech and Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

8
- 0024909979
- Some statistical issues in the comparison of speech recognition algorithms
- Glasgow, Scotland
- Gillick, L., Cox, S., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proc. of ICASSP, Glasgow, Scotland, pp. I-532-535.
- (1989) Proc. of ICASSP
- Gillick, L.¹ Cox, S.²

9
- 0141702066
- Investigating recognition of children's speech
- Hong Kong, China
- Giuliani, D., Gerosa, M., 2003. Investigating recognition of children's speech. In: Proc. of ICASSP, Hong Kong, China, pp. II-137-140.
- (2003) Proc. of ICASSP
- Giuliani, D.¹ Gerosa, M.²

10
- 85009110337
- Speaker normalization through constrained MLLR based transforms
- Jeju Island, Korea
- Giuliani, D., Gerosa, M., Brugnara, F., 2004. Speaker normalization through constrained MLLR based transforms. In: Proc. of INTERSPEECH/ICSLP, Jeju Island, Korea, pp. 2893-2897.
- (2004) Proc. of INTERSPEECH/ICSLP , pp. 2893-2897
- Giuliani, D.¹ Gerosa, M.² Brugnara, F.³

11
- 0027578837
- On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition
- X. Huang, and K.-F. Lee On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition IEEE Trans. Speech and Audio Process. 1 2 1993 150 157
- (1993) IEEE Trans. Speech and Audio Process. , vol.1 , Issue.2 , pp. 150-157
- Huang, X.¹ Lee, K.-F.²

12
- 0036124301
- A robust compensation strategy against extraneous acoustic variations for spontaneous speech recognition
- H. Jiang, and L. Deng A robust compensation strategy against extraneous acoustic variations for spontaneous speech recognition IEEE Trans. Speech and Audio Process. 10 1 2002 9 17
- (2002) IEEE Trans. Speech and Audio Process. , vol.10 , Issue.1 , pp. 9-17
- Jiang, H.¹ Deng, L.²

13
- 0025681028
- Speaker adaptation from speaker-independent training corpus
- Albuquerque, NM
- Kubala, F., Schwartz, R., Barry, C., 1990. Speaker adaptation from speaker-independent training corpus. In: Proc. of ICASSP. Albuquerque, NM, pp. I-137-140.
- (1990) Proc. of ICASSP
- Kubala, F.¹ Schwartz, R.² Barry, C.³

14
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski Rapid speaker adaptation in eigenvoice space IEEE Trans. Speech and Audio Process. 8 6 2000 695 707
- (2000) IEEE Trans. Speech and Audio Process. , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

15
- 0029747183
- Speaker normalization using efficient frequency warping procedure
- Atlanta, GA
- Lee, L., Rose, R., 1996. Speaker normalization using efficient frequency warping procedure. In: Proc. of ICASSP, Atlanta, GA, pp. I-353-356.
- (1996) Proc. of ICASSP
- Lee, L.¹ Rose, R.²

16
- 0032969462
- Acoustic of children's speech: Developmental changes of temporal and spectral parameters
- S. Lee, A. Potamianos, and S. Narayanan Acoustic of children's speech: developmental changes of temporal and spectral parameters J. Acoust. Soc. Am. 105 3 1999 1455 1468
- (1999) J. Acoust. Soc. Am. , vol.105 , Issue.3 , pp. 1455-1468
- Lee, S.¹ Potamianos, A.² Narayanan, S.³

17
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Computer Speech and Language 9 1995 171 185
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

18
- 85128432219
- Speaker normalization with all-pass transforms
- Sydney, Australia
- McDonough, J., Byrne, W., Luo, X., 1998. Speaker normalization with all-pass transforms. In: Proc. of ICSLP, vol. VI, Sydney, Australia, pp. 2307-2310.
- (1998) Proc. of ICSLP , vol.6 , pp. 2307-2310
- McDonough, J.¹ Byrne, W.² Luo, X.³

19
- 0036475971
- Creating conversational interfaces for children
- S. Narayanan, and A. Potamianos Creating conversational interfaces for children IEEE Trans. Speech and Audio Process. 10 2 2002 65 78
- (2002) IEEE Trans. Speech and Audio Process. , vol.10 , Issue.2 , pp. 65-78
- Narayanan, S.¹ Potamianos, A.²

20
- 0141760645
- 1993 Benchmark tests for the ARPA spoken language program
- Plainsboro, NJ
- Pallet, D., Fiscus, J., Fisher, W., Garofolo, J., Lund, B., Pryzbocki, M., 1994. 1993 Benchmark tests for the ARPA spoken language program. In: Proc. of ARPA HLT Workshop, Plainsboro, NJ, p. 51.
- (1994) Proc. of ARPA HLT Workshop , pp. 51
- Pallet, D.¹ Fiscus, J.² Fisher, W.³ Garofolo, J.⁴ Lund, B.⁵ Pryzbocki, M.⁶

21
- 85009174854
- Vocal tract normalization as linear transformation of MFCC
- Geneva, Switzerland
- Pitz, M., Ney, H., 2003. Vocal tract normalization as linear transformation of MFCC. In: Proc. of EUROSPEECH, Geneva, Switzerland, pp. 1445-1448.
- (2003) Proc. of EUROSPEECH , pp. 1445-1448
- Pitz, M.¹ Ney, H.²

22
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L. Rabiner A tutorial on hidden Markov models and selected applications in speech recognition Proc. IEEE 77 2 1989 257 285
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-285
- Rabiner, L.¹

23
- 33646759965
- Adaptive training using simple target models
- Philadelphia, PA
- Stemmer, G., Brugnara, F., Giuliani, D., 2005. Adaptive training using simple target models. In: Proc. of ICASSP, Philadelphia, PA.
- (2005) Proc. of ICASSP
- Stemmer, G.¹ Brugnara, F.² Giuliani, D.³

24
- 85135261079
- An investigation into vocal tract length normalisation
- Budapest, Hungary
- Uebel, L., Woodland, P., 1999. An investigation into vocal tract length normalisation. In: Proc. of EUROSPEECH, Budapest, Hungary, pp. 2527-2530.
- (1999) Proc. of EUROSPEECH , pp. 2527-2530
- Uebel, L.¹ Woodland, P.²

25
- 0029764708
- Speaker normalisation on conversational telephone speech
- Atlanta, GA
- Wegmann, S., McAllaster, D., Orloff., J., Peskin, B., 1996. Speaker normalisation on conversational telephone speech. In: Proc. of ICASSP, Atlanta, GA, pp. I-339-341.
- (1996) Proc. of ICASSP
- Wegmann, S.¹ McAllaster, D.² Orloff, J.³ Peskin, B.⁴

26
- 0032629626
- Improved methods for vocal tract normalization
- Phoenix, AZ
- Welling, L., Kanthak, S., Ney, H., 1999. Improved methods for vocal tract normalization. In: Proc. of ICASSP, vol. 2, Phoenix, AZ, pp. 761-764.
- (1999) Proc. of ICASSP , vol.2 , pp. 761-764
- Welling, L.¹ Kanthak, S.² Ney, H.³

27
- 0029747582
- A study of speech recognition for children and elderly
- Atlanta, GA
- Wilpon, J., Jacobsen, C., 1996. A study of speech recognition for children and elderly. In: Proc. of ICASSP, Atlanta, GA, pp. I-349-352.
- (1996) Proc. of ICASSP
- Wilpon, J.¹ Jacobsen, C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.