Volume 27, Issue 3, 1999, Pages 187-207

Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH ANALYSIS; SPEECH SYNTHESIS;

EID: 0032673049     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(98)00085-5     Document Type: Article
Times cited: 1804

References (33)
  • 1
    • 0029369322
    • Harmonics estimation based on instantaneous frequency and its application to pitch determination
    • Abe T., Kobayashi T., Imai S. Harmonics estimation based on instantaneous frequency and its application to pitch determination. IEICE Trans. Information and Systems. E78-D(9):1995;1188-1194.
    • (1995) IEICE Trans. Information and Systems , vol.E78-D , Issue.9 , pp. 1188-1194
    • Abe, T.1    Kobayashi, T.2    Imai, S.3
  • 2
    • 0030371135
    • Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency
    • Philadelphia
    • Abe, T., Kobayashi, T., Imai, S., 1996. Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency. In: Proc. ICSLP 96, Philadelphia, pp. 1277-1280.
    • (1996) In: Proc. ICSLP 96 , pp. 1277-1280
    • Abe, T.1    Kobayashi, T.2    Imai, S.3
  • 3
    • 30244435612
    • Hybrid sinusoidal modeling of speech without voicing decision
    • Paris
    • Abrantes, A.J., Marques, J.S., Trancoso, I.M., 1991. Hybrid sinusoidal modeling of speech without voicing decision. In: Proc. Eurospeech 91, Paris, pp. 231-234.
    • (1991) In: Proc. Eurospeech 91 , pp. 231-234
    • Abrantes, A.J.1    Marques, J.S.2    Trancoso, I.M.3
  • 4
    • 0015112070
    • Speech analysis and synthesis by linear prediction of speech wave
    • Atal B.S., Hanauer S.L. Speech analysis and synthesis by linear prediction of speech wave. J. Acoust. Soc. Amer. 50(2 pt.2):1971;637-655.
    • (1971) J. Acoust. Soc. Amer. , vol.50 , Issue.2 PART 2 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 5
    • 0017964427
    • Group delay distortion in electroacoustical systems
    • Blauert J., Laws P. Group delay distortion in electroacoustical systems. J. Acoust. Soc. Amer. 63(5):1978;1478-1483.
    • (1978) J. Acoust. Soc. Amer. , vol.63 , Issue.5 , pp. 1478-1483
    • Blauert, J.1    Laws, P.2
  • 6
    • 84937035392
    • Estimating and interpreting the instantaneous frequency of a signal - Part 1: Fundamentals
    • Boashash B. Estimating and interpreting the instantaneous frequency of a signal - part 1: Fundamentals. Proc. IEEE. 80(4):1992;520-538.
    • (1992) Proc. IEEE , vol.80 , Issue.4 , pp. 520-538
    • Boashash, B.1
  • 7
    • 84937035392
    • Estimating and interpreting the instantaneous frequency of a signal - Part 2: Algorithms and applications
    • Boashash B. Estimating and interpreting the instantaneous frequency of a signal - part 2: Algorithms and applications. Proc. IEEE. 80(4):1992;550-568.
    • (1992) Proc. IEEE , vol.80 , Issue.4 , pp. 550-568
    • Boashash, B.1
  • 10
    • 0024705330
    • Time-frequency distributions - A review
    • Cohen L. Time-frequency distributions - a review. Proc. IEEE. 77(7):1989;941-981.
    • (1989) Proc. IEEE , vol.77 , Issue.7 , pp. 941-981
    • Cohen, L.1
  • 13
    • 0031942341
    • Cancellation model of pitch perception
    • de Cheveigné A. Cancellation model of pitch perception. J. Acoust. Soc. Amer. 103(3):1998;1261-1271.
    • (1998) J. Acoust. Soc. Amer. , vol.103 , Issue.3 , pp. 1261-1271
    • De Cheveigné, A.1
  • 14
    • 84942494747
    • Remaking speech
    • Dudley H. Remaking speech. J. Acoust. Soc. Amer. 11(2):1939;169-177.
    • (1939) J. Acoust. Soc. Amer. , vol.11 , Issue.2 , pp. 169-177
    • Dudley, H.1
  • 15
    • 0344607972
    • An analysis of the performance of the MBE model when used in the context of a text-to-speech system
    • Berlin
    • Dutoit, T., Leich, H., 1993. An analysis of the performance of the MBE model when used in the context of a text-to-speech system. In: Proc. Eurospeech 93, Berlin, pp. 531-534.
    • (1993) In: Proc. Eurospeech 93 , pp. 531-534
    • Dutoit, T.1    Leich, H.2
  • 18
    • 0014704814
    • A statistical method for estimation of speech spectral density and formant frequencies
    • 53-A in Japanese
    • Itakura, F., Saito, S., 1970. A statistical method for estimation of speech spectral density and formant frequencies. Trans. IECE Japan, 53-A, 36-43 (in Japanese).
    • (1970) Trans. IECE Japan , pp. 36-43
    • Itakura, F.1    Saito, S.2
  • 19
    • 0030677481
    • Speech representation and transformation using adaptive interpolation of weighted spectrum: Vocoder revisited
    • Munich
    • Kawahara, H., 1997. Speech representation and transformation using adaptive interpolation of weighted spectrum: Vocoder revisited. In: Proc. IEEE Internat. Conf. Acoust. Speech and Signal Processing 2, Munich, 1303-1306.
    • (1997) In: Proc. IEEE Internat. Conf. Acoust. Speech and Signal Processing , vol.2 , pp. 1303-1306
    • Kawahara, H.1
  • 20
    • 0007955889
    • Speech representation and transformation based on adaptive time-frequency interpolation
    • EA96-28, (in Japanese)
    • Kawahara, H., Masuda, I., 1996. Speech representation and transformation based on adaptive time-frequency interpolation. Technical Report of IEICE, EA96-28, pp. 9-16 (in Japanese).
    • (1996) Technical Report of IEICE , pp. 9-16
    • Kawahara, H.1    Masuda, I.2
  • 21
    • 0010353240
    • Effects of auditory feedback on voice pitch
    • In: Davis, P.J., Fletcher, N.H. (Eds.), Singular, Munich, Chapter 18
    • Kawahara, H., Williams, J.C., 1996. Effects of auditory feedback on voice pitch. In: Davis, P.J., Fletcher, N.H. (Eds.), Vocal Fold Physiology. Singular, Munich, Chapter 18, pp. 263-278.
    • (1996) Vocal Fold Physiology , pp. 263-278
    • Kawahara, H.1    Williams, J.C.2
  • 24
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • McAulay R.J., Quatieri T.F. Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans. ASSP. 34:1986;744-754.
    • (1986) IEEE Trans. ASSP , vol.34 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 27
    • 0023448388
    • A pulse ribbon model of monaural phase perception
    • Patterson R.D. A pulse ribbon model of monaural phase perception. J. Acoust. Soc. Amer. 82(5):1987;1560-1586.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , Issue.5 , pp. 1560-1586
    • Patterson, R.D.1
  • 29
    • 0020497760
    • An integrated pitch tracking algorithm for speech systems
    • Secrest, B.G., Doddington, G.R., 1983. An integrated pitch tracking algorithm for speech systems. In: Proc. IEEE ICASSP83, pp. 1352-1355.
    • (1983) In: Proc. IEEE ICASSP83 , pp. 1352-1355
    • Secrest, B.G.1    Doddington, G.R.2
  • 31
    • 85135177301
    • High-quality speech modification based on a harmonic + noise model
    • Madrid
    • Stylianou, Y., Laroche, J., Moulines, E., 1995. High-quality speech modification based on a harmonic + noise model. In: Proc. Eurospeech 95, Madrid, pp. 451-454.
    • (1995) In: Proc. Eurospeech 95 , pp. 451-454
    • Stylianou, Y.1    Laroche, J.2    Moulines, E.3
  • 32
    • 0030145771
    • Time-scale and pitch modifications of speech signals and resynthesis from the discrete short-time Fourier transform
    • Veldhuis R., He H. Time-scale and pitch modifications of speech signals and resynthesis from the discrete short-time Fourier transform. Speech Communication. 18:1996;257-279.
    • (1996) Speech Communication , vol.18 , pp. 257-279
    • Veldhuis, R.1    He, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.