SCOPUS Information Retrieval Platform - Article View

Speech Communication

Volume 48, Issue 11, 2006, Pages 1556-1572

MAP prediction of formant frequencies and voicing class from MFCC vectors in noise

Authors (3): Darch, Jonathan (a); Milner, Ben (a); Vaseghi, Saeed (b)

a UNIVERSITY OF EAST ANGLIA (United Kingdom)

b BRUNEL UNIVERSITY (United Kingdom)

Author keywords

DSR; Formant estimation; Formant prediction; GMM; HMM; MAP prediction

Indexed keywords

FREQUENCIES; MARKOV PROCESSES; MATHEMATICAL MODELS; SIGNAL TO NOISE RATIO; SPEECH COMMUNICATION; SPEECH RECOGNITION; DSR; FORMANT ESTIMATION; FORMANT PREDICTION; GMM; HMM; MAP PREDICTION; VECTORS

EID: 33750293417 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2006.06.001 Document Type: Article

Times cited: 22

References (24)

1
- 0015112070
- Atal, B.S., Hanauer, S.L., 1971. Speech analysis and synthesis by linear prediction of the speech wave. J. Acoust. Soc. Amer. 50 (2), 637-655.

2
- 17344378368
- Bruce, I.C., Karkhanis, N.V., Young, E.D., Sachs, M.B., 2002. Robust formant tracking in noise. In: ICASSP, Orlando, FL, May, Vol. 1, pp. 281-284.

3
- 4544315994
- Chen, B., Loizou, P.C., 2004. Formant frequency estimation in noise. In: ICASSP, Montreal, Canada, May, Vol. 1, pp. 581-584.

4
- 33646763209
- Darch, J., Milner, B., Shao, X., Vaseghi, S., Yan, Q., 2005a. Predicting formant frequencies from MFCC vectors. In: ICASSP, Philadelphia, PA, March, Vol. 1, pp. 941-944.

5
- 33745198685
- Darch, J., Milner, B., Vaseghi, S., 2005b. Formant frequency prediction from MFCC vectors in noisy environments. In: Eurospeech, Lisbon, Portugal, September, pp. 1129-1132.

6
- 33750335159
- Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S., 1994. WSJCAM0 corpus and recording description. Tech. Rep. CUED/F-INFENG/TR.192, Cambridge University Engineering Department, September.

7
- 0004225947
- Kent, R.D., Read, C., 2002. Acoustic Analysis of Speech, second ed. Thomson Learning, 0-7693-0112-6.

8
- 0018986665
- Klatt, D.H., 1980. Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Amer. 67 (3), 971-995.

9
- 0016049328
- McCandless, S., 1974. An algorithm for automatic formant extraction using linear prediction spectra. IEEE Trans. Acoust. Speech Signal Process. 22 (2), 135-141.

10
- 33745203982
- Milner, B., Shao, X., Darch, J., 2005. Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech. In: Eurospeech, Lisbon, Portugal, September, pp. 321-324.

11
- 33750327908
- Niederjohn, R.J., Svoren, T.J., Heinen, J.A., 1992. Intelligibility enhancement of noise-corrupted speech based on formant tracking involving prefiltering. In: IEEE Industrial Electronics, Control, Instrumentation, and Automation, San Diego, CA, November, Vol. 3, pp. 1336-1341.

12
- 84987702417
- Pearce, D., Hirsch, H.-G., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: ICSLP, Beijing, China, October, Vol. 4, pp. 29-32.

13
- 0003425258
- Rabiner, L.R., Schafer, R.W., 1978. Digital Processing of Speech Signals. Prentice-Hall, 0-13-213603-1.

14
- 4644336054
- Raj, B., Seltzer, M.L., Stern, R.M., 2004. Reconstruction of missing features for robust speech recognition. Speech Comm. 43 (4), 275-296.

15
- 0014730929
- Schafer, R.W., Rabiner, L.R., 1970. System for automatic formant analysis of voiced speech. J. Acoust. Soc. Amer. 47 (2), 634-648.

16
- 23744446244
- Shao, X., Milner, B., 2005. Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction. J. Acoust. Soc. Amer. 118 (2), 1134-1143.

17
- 0027579682
- Snell, R.C., Milinazzo, F., 1993. Formant location from LPC analysis data. IEEE Trans. Speech Audio Process. 1 (2), 129-134.

18
- 33750371975
- Sorin, A., Ramabadran, T., 2003. Extended advanced front end algorithm description, Version 1.1. Tech. Rep. ES 202 212, ETSI STQ-Aurora DSR Working Group, April.

19
- 0038797944
- Webb, A.R., 2002. Statistical Pattern Recognition, second ed. Wiley, 0-470-84514-7.

20
- 0029746535
- Welling, L., Ney, H., 1996. A model for efficient formant estimation. In: ICASSP, Atlanta, GA, May, Vol. 2, pp. 797-800.

21
- 0031647965
- Welling, L., Ney, H., 1998. Formant estimation for speech recognition. IEEE Trans. Speech Audio Process. 6 (1), 36-48.

22
- 84864724283
- Wilkinson, N., Russell, M.J., 2002. Improved phone recognition on TIMIT using formant frequency data and confidence measures. In: ICSLP, Denver, CO, September, pp. 2121-2124.

23
- 33745193696
- Yan, Q., Vaseghi, S., Zavarehei, E., Milner, B., 2005. Formant-tracking linear prediction models for speech processing in noisy environments. In: Eurospeech, Lisbon, Portugal, September, pp. 2081-2084.

24
- 0003571976
- Young, S., Evermann, G., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P., 2002. The HTK Book, Version 3.2. Cambridge University Engineering Department.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.