SCOPUS Information Search Platform

Volume 48, Issue 6, 2006, Pages 697-715

Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end

Author keywords

Auditory model; Distributed speech recognition; Fundamental frequency estimation; Sinusoidal model; Source filter model; Speech reconstruction

Indexed keywords

DATABASE SYSTEMS; ESTIMATION; MATHEMATICAL MODELS; SIGNAL PROCESSING; SPECTRUM ANALYZERS; TIME DOMAIN ANALYSIS; VECTORS;

AUDITORY MODEL; DISTRIBUTED SPEECH RECOGNITION; FUNDAMENTAL FREQUENCY ESTIMATION; SINUSOIDAL MODEL; SOURCE-FILTER MODEL; SPEECH RECONSTRUCTION;

SPEECH SYNTHESIS;

EID: 33646236798 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2005.10.004 Document Type: Article

Times cited : (28)

References (24)

1
- 0033677157
- Speech reconstruction from mel-frequency cepstral coefficients and pitch
- Chazan D., Hoory R., Cohen G., and Zibulski M. Speech reconstruction from mel-frequency cepstral coefficients and pitch. Proc. ICASSP (2000)
- (2000) Proc. ICASSP
- Chazan, D.¹ Hoory, R.² Cohen, G.³ Zibulski, M.⁴

2
- 85009110579
- Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals
- Chazan D., Zibulski M., Hoory R., and Cohen G. Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals. Proc. Eurospeech (2001)
- (2001) Proc. Eurospeech
- Chazan, D.¹ Zibulski, M.² Hoory, R.³ Cohen, G.⁴

3
- 33646265363
- ETSI document-ES 201 108-STQ: DSR, 2000. Front-end feature extraction algorithm; compression algorithm.

4
- 33646244344
- ETSI document-ES 202 212-STQ: DSR, 2003. Extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm.

5
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- Glasberg B.R., and Moore B.C.J. Derivation of auditory filter shapes from notched-noise data. Hear. Res. 47 (1990) 103-138
- (1990) Hear. Res. , vol.47 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

6
- 0343685305
- Speech recognition from GSM codec parameters
- Huerta J.M., and Stern R.M. Speech recognition from GSM codec parameters. Proc. ICSLP (1998) 1463-1466
- (1998) Proc. ICSLP , pp. 1463-1466
- Huerta, J.M.¹ Stern, R.M.²

7
- 33646242606
- ITU-T Recommendation P.800, 1996. Methods for subjective determination of transmission quality.

8
- 0027210171
- Some useful properties of Teager's energy operators
- Kaiser J.F. Some useful properties of Teager's energy operators. Proc. ICASSP (1993) 149-152
- (1993) Proc. ICASSP , pp. 149-152
- Kaiser, J.F.¹

9
- 0035396207
- A bitstream-based front-end for wireless speech recognition on IS-136 communication systems
- Kim H.K., and Cox R.V. A bitstream-based front-end for wireless speech recognition on IS-136 communication systems. IEEE Trans. Speech Audio Process. 9 5 (2001) 558-568
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 558-568
- Kim, H.K.¹ Cox, R.V.²

10
- 0003637864
- Elsevier
- Kleijn W.B., and Paliwal K.K. Speech Coding and Synthesis (1995), Elsevier
- (1995) Speech Coding and Synthesis
- Kleijn, W.B.¹ Paliwal, K.K.²

11
- 0026882842
- Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection for robust speech recognition in cars
- Lockwood P., and Boudy J. Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection for robust speech recognition in cars. Speech Commun. (1992) 215-228
- (1992) Speech Commun. , pp. 215-228
- Lockwood, P.¹ Boudy, J.²

12
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- McAulay R.J., and Quatieri T.F. Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans. ASSP 34 (1986) 744-754
- (1986) IEEE Trans. ASSP , vol.34 , pp. 744-754
- McAulay, R.J.¹ Quatieri, T.F.²

13
- 0141628275
- Speech reconstruction from MFCCs using a source-filter model
- Milner B.P., and Shao X. Speech reconstruction from MFCCs using a source-filter model. Proc. ICSLP (2002)
- (2002) Proc. ICSLP
- Milner, B.P.¹ Shao, X.²

14
- 33646230385
- Patterson, R.D., Holdsworth, J., Nimmo-Smith, I., Rice, P., 1988. SVOS final report: The auditory filterbank, APU Report 2341.

16
- 0003425258
- Prentice-Hall
- Rabiner L.R., and Schafer R.W. Digital Processing of Speech Signals (1978), Prentice-Hall
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

17
- 0017097478
- A comparative performance study of several pitch detection algorithms
- Rabiner L.R., Cheng M.J., Rosenberg A.E., and McGonegal C.A. A comparative performance study of several pitch detection algorithms. IEEE Trans. ASSP 24 5 (1976) 399-418
- (1976) IEEE Trans. ASSP , vol.24 , Issue.5 , pp. 399-418
- Rabiner, L.R.¹ Cheng, M.J.² Rosenberg, A.E.³ McGonegal, C.A.⁴

18
- 33748595676
- Distributed speech recognition with codec parameters
- Raj B., Migdal J., and Singh R. Distributed speech recognition with codec parameters. Proc. ASRU (2001)
- (2001) Proc. ASRU
- Raj, B.¹ Migdal, J.² Singh, R.³

19
- 0031124228
- A pitch determination and voiced/unvoiced algorithm for noisy speech
- Rouat J., Liu Y.C., and Morissette D. A pitch determination and voiced/unvoiced algorithm for noisy speech. Speech Commun. J. (1997) 191-207
- (1997) Speech Commun. J. , pp. 191-207
- Rouat, J.¹ Liu, Y.C.² Morissette, D.³

20
- 33646261237
- Slaney, M., 1993. An efficient implementation of the Patterson-Holdsworth auditory filterbank. Apple Computer Technical Report #35, Perception Group, Advanced Technology Group, Apple Computer, Inc.

21
- 0010571306
- Compression of acoustic features-are perceptual quality and recognition performance incompatible goals?
- Tucker R., Robinson T., Christie J., and Seymour C. Compression of acoustic features-are perceptual quality and recognition performance incompatible goals? Proc. Eurospeech (1999)
- (1999) Proc. Eurospeech
- Tucker, R.¹ Robinson, T.² Christie, J.³ Seymour, C.⁴

22
- 0026635515
- Pitch and voiced/unvoiced determination with an auditory model
- Van Immerseel L., and Martens J.P. Pitch and voiced/unvoiced determination with an auditory model. JASA 91 (1992) 3511-3526
- (1992) JASA , vol.91 , pp. 3511-3526
- Van Immerseel, L.¹ Martens, J.P.²

23
- 0030779363
- Noise compensation methods for hidden Markov model speech recognition in adverse environments
- Vaseghi S.V., and Milner B.P. Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Trans. Speech Audio Process. 5 1 (1997) 11-21
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.1 , pp. 11-21
- Vaseghi, S.V.¹ Milner, B.P.²

24
- 0036296012
- A multi-pitch tracking algorithm for noisy speech
- Wu M., Wang D.L., and Brown G.J. A multi-pitch tracking algorithm for noisy speech. Proc. ICASSP (2002)
- (2002) Proc. ICASSP
- Wu, M.¹ Wang, D.L.² Brown, G.J.³

* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.