Volume 48, Issue 6, 2006, Pages 697-715

Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end

Author keywords

Auditory model; Distributed speech recognition; Fundamental frequency estimation; Sinusoidal model; Source filter model; Speech reconstruction

Indexed keywords

DATABASE SYSTEMS; ESTIMATION; MATHEMATICAL MODELS; SIGNAL PROCESSING; SPECTRUM ANALYZERS; TIME DOMAIN ANALYSIS; VECTORS

EID: 33646236798     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.10.004     Document Type: Article
Times cited: 28

References (24)
  • 1
    • 0033677157
    • Speech reconstruction from mel-frequency cepstral coefficients and pitch
    • Chazan D., Hoory R., Cohen G., and Zibulski M. Speech reconstruction from mel-frequency cepstral coefficients and pitch. Proc. ICASSP (2000)
    • (2000) Proc. ICASSP
    • Chazan, D.1    Hoory, R.2    Cohen, G.3    Zibulski, M.4
  • 2
    • 85009110579
    • Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals
    • Chazan D., Zibulski M., Hoory R., and Cohen G. Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals. Proc. Eurospeech (2001)
    • (2001) Proc. Eurospeech
    • Chazan, D.1    Zibulski, M.2    Hoory, R.3    Cohen, G.4
  • 3
    • 33646265363
    • ETSI document-ES 201 108-STQ: DSR, 2000. Front-end feature extraction algorithm; compression algorithm.
  • 4
    • 33646244344
    • ETSI document-ES 202 212-STQ: DSR, 2003. Extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm.
  • 5
    • 0025110885
    • Derivation of auditory filter shapes from notched-noise data
    • Glasberg B.R., and Moore B.C.J. Derivation of auditory filter shapes from notched-noise data. Hear. Res. 47 (1990) 103-138
    • (1990) Hear. Res. , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 6
    • 0343685305
    • Speech recognition from GSM codec parameters
    • Huerta J.M., and Stern R.M. Speech recognition from GSM codec parameters. Proc. ICSLP (1998) 1463-1466
    • (1998) Proc. ICSLP , pp. 1463-1466
    • Huerta, J.M.1    Stern, R.M.2
  • 7
    • 33646242606
    • ITU-T Recommendation P.800, 1996. Methods for subjective determination of transmission quality.
  • 8
    • 0027210171
    • Some useful properties of Teager's energy operators
    • Kaiser J.F. Some useful properties of Teager's energy operators. Proc. ICASSP (1993) 149-152
    • (1993) Proc. ICASSP , pp. 149-152
    • Kaiser, J.F.1
  • 9
    • 0035396207
    • A bitstream-based front-end for wireless speech recognition on IS-136 communication systems
    • Kim H.K., and Cox R.V. A bitstream-based front-end for wireless speech recognition on IS-136 communication systems. IEEE Trans. Speech Audio Process. 9 5 (2001) 558-568
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 558-568
    • Kim, H.K.1    Cox, R.V.2
  • 11
    • 0026882842
    • Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection for robust speech recognition in cars
    • Lockwood P., and Boudy J. Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection for robust speech recognition in cars. Speech Commun. (1992) 215-228
    • (1992) Speech Commun. , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 12
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • McAulay R.J., and Quatieri T.F. Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans. ASSP 34 (1986) 744-754
    • (1986) IEEE Trans. ASSP , vol.34 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 13
    • 0141628275
    • Speech reconstruction from MFCCs using a source-filter model
    • Milner B.P., and Shao X. Speech reconstruction from MFCCs using a source-filter model. Proc. ICSLP (2002)
    • (2002) Proc. ICSLP
    • Milner, B.P.1    Shao, X.2
  • 14
    • 33646230385
    • Patterson, R.D., Holdsworth, J., Nimmo-Smith, I., Rice, P., 1988. SVOS final report: The auditory filterbank, APU Report 2341.
  • 17
    • 0017097478
    • A comparative performance study of several pitch detection algorithms
    • Rabiner L.R., Cheng M.J., Rosenberg A.E., and McGonegal C.A. A comparative performance study of several pitch detection algorithms. IEEE Trans. ASSP 24 5 (1976) 399-418
    • (1976) IEEE Trans. ASSP , vol.24 , Issue.5 , pp. 399-418
    • Rabiner, L.R.1    Cheng, M.J.2    Rosenberg, A.E.3    McGonegal, C.A.4
  • 18
    • 33748595676
    • Distributed speech recognition with codec parameters
    • Raj B., Migdal J., and Singh R. Distributed speech recognition with codec parameters. Proc. ASRU (2001)
    • (2001) Proc. ASRU
    • Raj, B.1    Migdal, J.2    Singh, R.3
  • 19
    • 0031124228
    • A pitch determination and voiced/unvoiced algorithm for noisy speech
    • Rouat J., Liu Y.C., and Morissette D. A pitch determination and voiced/unvoiced algorithm for noisy speech. Speech Commun. J. (1997) 191-207
    • (1997) Speech Commun. J. , pp. 191-207
    • Rouat, J.1    Liu, Y.C.2    Morissette, D.3
  • 20
    • 33646261237
    • Slaney, M., 1993. An efficient implementation of the Patterson-Holdsworth auditory filterbank. Apple Computer Technical Report #35, Perception Group, Advanced Technology Group, Apple Computer, Inc.
  • 21
    • 0010571306
    • Compression of acoustic features-are perceptual quality and recognition performance incompatible goals?
    • Tucker R., Robinson T., Christie J., and Seymour C. Compression of acoustic features-are perceptual quality and recognition performance incompatible goals?. Proc. Eurospeech (1999)
    • (1999) Proc. Eurospeech
    • Tucker, R.1    Robinson, T.2    Christie, J.3    Seymour, C.4
  • 22
    • 0026635515
    • Pitch and voiced/unvoiced determination with an auditory model
    • Van Immerseel L., and Martens J.P. Pitch and voiced/unvoiced determination with an auditory model. JASA 91 (1992) 3511-3526
    • (1992) JASA , vol.91 , pp. 3511-3526
    • Van Immerseel, L.1    Martens, J.P.2
  • 23
    • 0030779363
    • Noise compensation methods for hidden Markov model speech recognition in adverse environments
    • Vaseghi S.V., and Milner B.P. Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Trans. Speech Audio Process. 5 1 (1997) 11-21
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.1 , pp. 11-21
    • Vaseghi, S.V.1    Milner, B.P.2
  • 24
    • 0036296012
    • A multi-pitch tracking algorithm for noisy speech
    • Wu M., Wang D.L., and Brown G.J. A multi-pitch tracking algorithm for noisy speech. Proc. ICASSP (2002)
    • (2002) Proc. ICASSP
    • Wu, M.1    Wang, D.L.2    Brown, G.J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.