Volume 15, Issue 3, 2007, Pages 851-861

Perceptual long-term variable-rate sinusoidal modeling of speech

Author keywords

Perceptual models; Sinusoidal model; Speech modeling; Speech processing; Variable rate

Indexed keywords

A-FRAMES; DISCRETE COSINE FUNCTIONS; FIRST HARMONICS; ITERATIVE ALGORITHMS; LT MODELS; MODELING ACCURACIES; OPTIMAL FITTINGS; PERCEPTUAL MODELS; PHASE MODELING; PHASE PARAMETERS; SINUSOIDAL COMPONENTS; SINUSOIDAL MODEL; SINUSOIDAL MODELING; SPEECH MODELING; SPEECH WATERMARKING; SYNTHESIS PROCESS; VARIABLE RATE; VOICED SPEECH;

EID: 51449108696     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.885928     Document Type: Article
Times cited: 13

References (40)
  • 1
    • 0343137810
    • Signal processing aspects of computer music-A survey
    • J. A. Moorer, "Signal processing aspects of computer music-A survey," Comput. Music J., vol. 1, no. 1, pp. 4-37, 1977.
    • (1977) Comput. Music J , vol.1 , Issue.1 , pp. 4-37
    • Moorer, J.A.1
  • 2
    • 0017923449
    • The use of the phase vocoder in computer music applications
    • J. A. Moorer, "The use of the phase vocoder in computer music applications," J. Audio Eng. Soc., vol. 26, no. 1/2, pp. 42-45, 1978.
    • (1978) J. Audio Eng. Soc , vol.26 , Issue.1-2 , pp. 42-45
    • Moorer, J.A.1
  • 3
    • 0018982701
    • Time-frequency representation of digital signals and systems based on short-time Fourier transform
    • Feb
    • M. R. Portnoff, "Time-frequency representation of digital signals and systems based on short-time Fourier transform," IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-28, no. 1, pp. 55-69, Feb. 1980.
    • (1980) IEEE Trans. Acoust. Speech, Signal Process , vol.ASSP-28 , Issue.1 , pp. 55-69
    • Portnoff, M.R.1
  • 4
    • 0022909412
    • The phase vocoder: A tutorial
    • M. B. Dolson, "The phase vocoder: A tutorial," Comput. Music J., vol. 10, no. 4, pp. 14-27, 1986.
    • (1986) Comput. Music J , vol.10 , Issue.4 , pp. 14-27
    • Dolson, M.B.1
  • 5
    • 0017626192
    • Short-term spectral analysis, synthesis, and modification by the discrete Fourier transform
    • Jun
    • J. B. Allen, "Short-term spectral analysis, synthesis, and modification by the discrete Fourier transform," IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-25, no. 3, pp. 235-238, Jun. 1977.
    • (1977) IEEE Trans. Acoust. Speech, Signal Process , vol.ASSP-25 , Issue.3 , pp. 235-238
    • Allen, J.B.1
  • 6
    • 84955033070
    • Analysis of musical instrument tones
    • J.-C. Risset and M. V. Mathews, "Analysis of musical instrument tones," Phys. Today, vol. 22, no. 2, pp. 23-30, 1969.
    • (1969) Phys. Today , vol.22 , Issue.2 , pp. 23-30
    • Risset, J.-C.1    Mathews, M.V.2
  • 7
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • Aug
    • R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug. 1986.
    • (1986) IEEE Trans. Acoust. Speech, Signal Process , vol.ASSP-34 , Issue.4 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 8
    • 0025544510
    • Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition
    • X. Serra and J. O. Smith, "Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition," Comput. Music J., vol. 14, no. 4, pp. 12-24, 1990.
    • (1990) Comput. Music J , vol.14 , Issue.4 , pp. 12-24
    • Serra, X.1    Smith, J.O.2
  • 9
    • 64149124871
    • "AudioSculpt User's Manual," 2nd ed. IRCAM, Paris, France, 1996.
  • 10
    • 0012355466
    • Sinusoidal modeling and manipulation using lemur
    • K. Fitz and L. Haken, "Sinusoidal modeling and manipulation using lemur," Comput. Music J., vol. 20, no. 4, pp. 44-59, 1996.
    • (1996) Comput. Music J , vol.20 , Issue.4 , pp. 44-59
    • Fitz, K.1    Haken, L.2
  • 11
    • 84906261797
    • PARSHL: An analysis/synthesis program for non-harmonic sounds based on a sinusoidal representation
    • San Francisco, CA
    • J. O. Smith and X. Serra, "PARSHL: An analysis/synthesis program for non-harmonic sounds based on a sinusoidal representation," in Proc. Int. Comput. Music Conf., San Francisco, CA, 1987, pp. 290-297.
    • (1987) Proc. Int. Comput. Music Conf , pp. 290-297
    • Smith, J.O.1    Serra, X.2
  • 12
    • 64149106046
    • X. Serra, Musical Signal Processing. Lisse, The Netherlands: Swets & Zeitlinger, 1997, ch. Musical Sound Modeling with Sinusoids plus Noise, pp. 91-122.
  • 13
    • 0343573689
    • InSpect and respect: Spectral modeling, analysis and real-time synthesis software tools for researchers and composers
    • Beijing, China
    • S. Marchand and R. Strandh, "InSpect and respect: Spectral modeling, analysis and real-time synthesis software tools for researchers and composers," in Proc. Int. Comput. Music Conf. (ICMC'99), Beijing, China, 1999, pp. 341-344.
    • (1999) Proc. Int. Comput. Music Conf. (ICMC'99) , pp. 341-344
    • Marchand, S.1    Strandh, R.2
  • 14
    • 2342653418
    • Performance, synthesis and control of additive synthesis on a desktop computer using FFT-1
    • Tokyo, Japan
    • A. Freed, X. Rodet, and P. Depalle, "Performance, synthesis and control of additive synthesis on a desktop computer using FFT-1," in Proc. Int. Computer Music Conf. (ICMC'93), Tokyo, Japan, 1993, pp. 98-101.
    • (1993) Proc. Int. Computer Music Conf. (ICMC'93) , pp. 98-101
    • Freed, A.1    Rodet, X.2    Depalle, P.3
  • 15
    • 84976780172
    • A sine generation algorithm for VLSI applications
    • Vancouver, BC, Canada
    • J. W. Gordon and J. O. Smith, "A sine generation algorithm for VLSI applications," in Proc. Int. Comput. Music Conf. (ICMC'85), Vancouver, BC, Canada, 1985, pp. 165-168.
    • (1985) Proc. Int. Comput. Music Conf. (ICMC'85) , pp. 165-168
    • Gordon, J.W.1    Smith, J.O.2
  • 16
  • 17
    • 0001935942
    • Sinusoidal coding
    • W. B. Kleijn and K. K. Paliwal, Eds. New York: Elsevier, ch. 4
    • R. J. McAulay and T. F. Quatieri, "Sinusoidal coding," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. New York: Elsevier, 1995, ch. 4.
    • (1995) Speech Coding and Synthesis
    • McAulay, R.J.1    Quatieri, T.F.2
  • 18
    • 0026830163
    • Shape invariant time-scale and pitch modification of speech
    • Mar
    • T. F. Quatieri and R. J. McAulay, "Shape invariant time-scale and pitch modification of speech," IEEE Trans. Signal Process., vol. 40, no. 3, pp. 497-510, Mar. 1992.
    • (1992) IEEE Trans. Signal Process , vol.40 , Issue.3 , pp. 497-510
    • Quatieri, T.F.1    McAulay, R.J.2
  • 19
    • 0031232722
    • Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model
    • Sep
    • E. B. George and M. J. T. Smith, "Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 389-406, Sep. 1997.
    • (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.5 , pp. 389-406
    • George, E.B.1    Smith, M.J.T.2
  • 20
    • 0005252733
    • An exploration of musical timbre
    • Ph.D. dissertation, Dept. Music, Stanford Univ, Stanford, CA
    • J. M. Grey, "An exploration of musical timbre," Ph.D. dissertation, Dept. Music, Stanford Univ., Stanford, CA, 1975.
    • (1975)
    • Grey, J.M.1
  • 21
    • 0031187549
    • Processing of musical tones using a combined quadratic polynomial phase sinusoid and residual signal model
    • Y. Ding and X. Qian, "Processing of musical tones using a combined quadratic polynomial phase sinusoid and residual signal model," J. Audio Eng. Soc., vol. 45, no. 7/8, pp. 571-585, 1997.
    • (1997) J. Audio Eng. Soc , vol.45 , Issue.7-8 , pp. 571-585
    • Ding, Y.1    Qian, X.2
  • 23
    • 0030232135
    • Analysis/synthesis and modification of the speech aperiodic component
    • G. Richard and C. d'Alessandro, "Analysis/synthesis and modification of the speech aperiodic component," Speech Commun., vol. 19, pp. 221-244, 1996.
    • (1996) Speech Commun , vol.19 , pp. 221-244
    • Richard, G.1    d'Alessandro, C.2
  • 25
    • 0031276676
    • Sinusoidal modeling and modification of unvoiced speech
    • Nov
    • M. W. Macon and M. A. Clements, "Sinusoidal modeling and modification of unvoiced speech," IEEE Trans. Speech Audio Process., vol. 5, no. 6, pp. 557-560, Nov. 1997.
    • (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.6 , pp. 557-560
    • Macon, M.W.1    Clements, M.A.2
  • 27
    • 85009106654
    • Long term modeling of phase trajectories within the speech sinusoidal model framework
    • Jeju, Korea, CD-ROM
    • L. Girin, M. Firouzmand, and S. Marchand, "Long term modeling of phase trajectories within the speech sinusoidal model framework," in Proc. Int. Conf. on Speech & Language Proc., Jeju, Korea, 2004, CD-ROM.
    • (2004) Proc. Int. Conf. on Speech & Language Proc
    • Girin, L.1    Firouzmand, M.2    Marchand, S.3
  • 28
    • 85137465653
    • An improved cepstral method for deconvolution of source-filter systems with discrete spectra: Application to musical sound signals
    • Glasgow, U.K
    • T. Galas and X. Rodet, "An improved cepstral method for deconvolution of source-filter systems with discrete spectra: Application to musical sound signals," in Proc. Int. Comput. Music Conf. (ICMC), Glasgow, U.K., 1990, pp. 82-84.
    • (1990) Proc. Int. Comput. Music Conf. (ICMC) , pp. 82-84
    • Galas, T.1    Rodet, X.2
  • 30
    • 64149130400
    • Information Technology-Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to About 1.5 Mbits/s, Part 3: Audio, ISO/IEC JTC1/SC29/WG11 MPEG, IS11172-3, 1992.
  • 31
    • 0034172308
    • Perceptual coding of digital audio
    • Apr
    • T. Painter and A. Spanias, "Perceptual coding of digital audio," Proc. IEEE, vol. 88, no. 4, pp. 451-513, Apr. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.4 , pp. 451-513
    • Painter, T.1    Spanias, A.2
  • 34
    • 0043069843
    • Squared error as a measure of perceived phase distortion
    • H. Pobloth and W. B. Kleijn, "Squared error as a measure of perceived phase distortion," J. Acoust. Soc. Amer., vol. 114, no. 2, pp. 1081-1094, 2003.
    • (2003) J. Acoust. Soc. Amer , vol.114 , Issue.2 , pp. 1081-1094
    • Pobloth, H.1    Kleijn, W.B.2
  • 35
    • 0035509497
    • On the perceptually irrelevant phase information in sinusoidal representation of speech
    • Nov
    • D. S. Kim, "On the perceptually irrelevant phase information in sinusoidal representation of speech," IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 900-905, Nov. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.8 , pp. 900-905
    • Kim, D.S.1
  • 36
    • 0043095309
    • Perceptual phase quantization of speech
    • Jul
    • D. S. Kim, "Perceptual phase quantization of speech," IEEE Trans. Speech Audio Process., vol. 11, no. 4, pp. 355-364, Jul. 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.4 , pp. 355-364
    • Kim, D.S.1
  • 37
    • 0032639698
    • Dispersion phase vector quantization for enhancement of waveform interpolative coder
    • Phoenix, AZ
    • O. Gottesman, "Dispersion phase vector quantization for enhancement of waveform interpolative coder," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., Phoenix, AZ, 1999, pp. 269-272.
    • (1999) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 269-272
    • Gottesman, O.1
  • 38
    • 0024505689
    • Detection thresholds for sinusoidal frequency modulation
    • L. Demany and C. Semal, "Detection thresholds for sinusoidal frequency modulation," J. Acoust. Soc. Amer., vol. 85, no. 3, pp. 1295-1301, 1989.
    • (1989) J. Acoust. Soc. Amer , vol.85 , Issue.3 , pp. 1295-1301
    • Demany, L.1    Semal, C.2
  • 39
    • 4544259304
    • Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials
    • Montréal, QC, Canada
    • L. Girin and S. Marchand, "Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., Montréal, QC, Canada, 2004, pp. 633-636.
    • (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 633-636
    • Girin, L.1    Marchand, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.