메뉴 건너뛰기




Volumn 13, Issue 2, 2005, Pages 149-161

Perceptual segmentation and component selection for sinusoidal representations of audio

Author keywords

Audio coding; Psychoacoustics; Segmentation; Sinusoidal models

Indexed keywords

ERROR ANALYSIS; FAST FOURIER TRANSFORMS; FREQUENCIES; INTERPOLATION; MATHEMATICAL MODELS; OPTIMIZATION; PARAMETER ESTIMATION; RANDOM PROCESSES; SIGNAL PROCESSING;

EID: 14644436350     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2004.841050     Document Type: Article
Times cited : (19)

References (41)
  • 1
    • 0029725684 scopus 로고    scopus 로고
    • Speech analysis and coding using a multi-resolution sinusoidal transform
    • May
    • D. V. Anderson, "Speech analysis and coding using a multi-resolution sinusoidal transform," in Proc. ICASSP, May 1996, pp. 1045-1048.
    • (1996) Proc. ICASSP , pp. 1045-1048
    • Anderson, D.V.1
  • 2
    • 0031635035 scopus 로고    scopus 로고
    • On the harmonic analysis of speech
    • Monterey, CA, May/Jun. 31-3
    • Y. Stylianou, "On the harmonic analysis of speech," in Proc. IEEE Int. Symp. Circuits Systems ISCAS, Monterey, CA, May/Jun. 31-3, 1998, pp. 5-8.
    • (1998) Proc. IEEE Int. Symp. Circuits Systems ISCAS , pp. 5-8
    • Stylianou, Y.1
  • 3
    • 0002445128 scopus 로고
    • A wavelet-based sinusoid model of sound for auditory signal separation
    • D. Ellis and B. Vercoe, "A wavelet-based sinusoid model of sound for auditory signal separation," in Proc. Int. Comp. Mus. Conf., 1991, pp. 86-89.
    • (1991) Proc. Int. Comp. Mus. Conf. , pp. 86-89
    • Ellis, D.1    Vercoe, B.2
  • 4
    • 0029745882 scopus 로고    scopus 로고
    • Residual modeling in music analysis-synthesis
    • May
    • M. Goodwin, "Residual modeling in music analysis-synthesis," in Proc. ICASSP, May 1996.
    • (1996) Proc. ICASSP
    • Goodwin, M.1
  • 5
    • 84863772450 scopus 로고
    • Speech analysis synthesis based on a sinusoidal representation
    • Aug.
    • R. McAulay and T. Quatieri, "Speech analysis synthesis based on a sinusoidal representation," IEEE Trans. Acoust. Speech Signal Process., pp. 744-754, Aug. 1986.
    • (1986) IEEE Trans. Acoust. Speech Signal Process , pp. 744-754
    • McAulay, R.1    Quatieri, T.2
  • 6
    • 0032165792 scopus 로고    scopus 로고
    • A new phase model for sinusoidal transform coding
    • Sep.
    • S. Ahmadi and A. S. Spanias, "A new phase model for sinusoidal transform coding," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 495-501, Sep. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 495-501
    • Ahmadi, S.1    Spanias, A.S.2
  • 7
    • 0026202612 scopus 로고
    • A hybrid transform method for speech analysis and synthesis
    • A. S. Spanias, "A hybrid transform method for speech analysis and synthesis," Signal Process., vol. 24, pp. 217-229, 1991.
    • (1991) Signal Process , vol.24 , pp. 217-229
    • Spanias, A.S.1
  • 9
    • 0020764310 scopus 로고
    • Nonstationary spectral modeling of voiced speech
    • Jun.
    • L. Almeida, "Nonstationary spectral modeling of voiced speech," in Proc. IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-31, Jun. 1983, pp. 374-390.
    • (1983) Proc. IEEE Trans. Acoust. Speech, Signal Process , vol.ASSP-31 , pp. 374-390
    • Almeida, L.1
  • 10
    • 0025675356 scopus 로고
    • Perceptual considerations in a low bit rate sinusoidal vocoder
    • Mar.
    • E. B. George and M. J. T. Smith, "Perceptual considerations in a low bit rate sinusoidal vocoder," in Proc. IEEE Int. Phoenix Conf. Comp. Comm., Mar. 1990, pp. 268-275.
    • (1990) Proc. IEEE Int. Phoenix Conf. Comp. Comm. , pp. 268-275
    • George, E.B.1    Smith, M.J.T.2
  • 11
    • 0003285531 scopus 로고    scopus 로고
    • A sines + transients + noise audio representation for data compression and time/pitch scale modifications
    • Sep. , preprint #4781
    • S. Levine and J. Smith, "A sines + transients + noise audio representation for data compression and time/pitch scale modifications," in Proc. 105th Conv. Aud. Eng. Soc., Sep. 1998, preprint #4781.
    • (1998) Proc. 105th Conv. Aud. Eng. Soc.
    • Levine, S.1    Smith, J.2
  • 14
    • 0034172308 scopus 로고    scopus 로고
    • Perceptual coding of digital audio
    • Apr.
    • T. Painter and A. Spanias, "Perceptual coding of digital audio," Proc. IEEE, vol. 88, pp. 451-513, Apr. 2000.
    • (2000) Proc. IEEE , vol.88 , pp. 451-513
    • Painter, T.1    Spanias, A.2
  • 15
    • 0031119324 scopus 로고    scopus 로고
    • A model for the pre-diction of thresholds, loudness, and partial loudness
    • Apr.
    • B. C. J. Moore, B. Glasberg, and T. Baer, "A model for the pre-diction of thresholds, loudness, and partial loudness," J. Aud. Eng. Soc., vol. 45, no. 4, pp. 224-240, Apr. 1997.
    • (1997) J. Aud. Eng. Soc. , vol.45 , Issue.4 , pp. 224-240
    • Moore, B.C.J.1    Glasberg, B.2    Baer, T.3
  • 16
    • 0025544510 scopus 로고
    • Spectral modeling and synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition
    • Winter
    • X. Serra and J. O. Smith III, "Spectral modeling and synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition," Comput. Mus. J., pp. 12-24, Winter 1990.
    • (1990) Comput. Mus. J. , pp. 12-24
    • Serra, X.1    Smith III, J.O.2
  • 17
    • 0023206590 scopus 로고
    • A new speech coding model based on a least-squares sinusoidal representation
    • Apr.
    • E. B. George and M. J. T. Smith, "A new speech coding model based on a least-squares sinusoidal representation," in Proc. ICASSP-87, Apr. 1987, pp. 1641-1644.
    • (1987) Proc. ICASSP-87 , pp. 1641-1644
    • George, E.B.1    Smith, M.J.T.2
  • 18
    • 0001654096 scopus 로고
    • Analysis-by-Synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones
    • Jun.
    • _, "Analysis-by-Synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones," J. Aud. Eng. Soc., pp. 497-516, Jun. 1992.
    • (1992) J. Aud. Eng. Soc. , pp. 497-516
  • 21
    • 0023869370 scopus 로고
    • Computationally efficient sine-wave synthesis and its application to sinusoidal transform coding
    • Apr.
    • R. McAulay and T. Quatieri, "Computationally efficient sine-wave synthesis and its application to sinusoidal transform coding," in Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP), Apr. 1988, pp. 370-373.
    • (1988) Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP) , pp. 370-373
    • McAulay, R.1    Quatieri, T.2
  • 24
    • 0029763793 scopus 로고    scopus 로고
    • Low bit rate high quality audio coding with combined harmonic and wavelet representations
    • May
    • K. N. Hamdy et al., "Low bit rate high quality audio coding with combined harmonic and wavelet representations," in Proc. ICASSP, May 1996, pp. 1045-1048.
    • (1996) Proc. ICASSP , pp. 1045-1048
    • Hamdy, K.N.1
  • 25
    • 0031628825 scopus 로고    scopus 로고
    • An analysis/synthesis tool for transient signals the allows a flexible sines + transients + noise model for audio
    • May
    • T. Verma and T. Meng, "An analysis/synthesis tool for transient signals the allows a flexible sines + transients + noise model for audio," in Proc. ICASSP, May 1998.
    • (1998) Proc. ICASSP
    • Verma, T.1    Meng, T.2
  • 26
    • 85159475599 scopus 로고    scopus 로고
    • Transient modeling synthesis: A flexible analysis/synthesis tool for transient signals
    • T. Verma et al., "Transient modeling synthesis: A flexible analysis/synthesis tool for transient signals," in Proc. Int. Comp. Mus. Conf., 1997.
    • (1997) Proc. Int. Comp. Mus. Conf.
    • Verma, T.1
  • 28
    • 0026686048 scopus 로고
    • Entropy based algorithms for best basis selection
    • Mar.
    • R. Coifman and M. Wickerhauser, "Entropy based algorithms for best basis selection," IEEE Trans. Info. Theory, vol. 38, no. 2, pp. 712-718, Mar. 1992.
    • (1992) IEEE Trans. Info. Theory , vol.38 , Issue.2 , pp. 712-718
    • Coifman, R.1    Wickerhauser, M.2
  • 29
    • 0027842081 scopus 로고
    • Matching Pursuits with time-frequency dictionaries
    • Dec.
    • S. Mallat and Z. Zhang, "Matching Pursuits with time-frequency dictionaries," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3397-3415, Dec. 1993.
    • (1993) IEEE Trans. Signal Process , vol.41 , Issue.12 , pp. 3397-3415
    • Mallat, S.1    Zhang, Z.2
  • 30
    • 14644394002 scopus 로고
    • Improving time-scale modification of audio signals using wavelets
    • M. Rodriguez-Hernandez and F. Casajus-Quiros, "Improving time-scale modification of audio signals using wavelets," in Proc. ICSPAT, 1994, pp. 1573-1577.
    • (1994) Proc. ICSPAT , pp. 1573-1577
    • Rodriguez-Hernandez, M.1    Casajus-Quiros, F.2
  • 31
    • 0031619389 scopus 로고    scopus 로고
    • Multiresolution sinusoidal modeling using adaptive segmentation
    • May
    • M. Goodwin, "Multiresolution sinusoidal modeling using adaptive segmentation," in Proc. ICASSP, May 1998.
    • (1998) Proc. ICASSP
    • Goodwin, M.1
  • 32
    • 84953658827 scopus 로고
    • On the masking pattern of a simple auditory stimulus
    • J. Egan and H. Hake, "On the masking pattern of a simple auditory stimulus," J. Acoust. Soc. Amer., vol. 22, pp. 622-630, 1950.
    • (1950) J. Acoust. Soc. Amer. , vol.22 , pp. 622-630
    • Egan, J.1    Hake, H.2
  • 33
    • 0003273322 scopus 로고    scopus 로고
    • ASAC - Analysis/synthesis audio codec for very low bit rates
    • May preprint #4179
    • B. E. Edler et al., "ASAC - Analysis/synthesis audio codec for very low bit rates," in Proc. 100th Conv. Audio Engineering Soc., May 1996, preprint #4179.
    • (1996) Proc. 100th Conv. Audio Engineering Soc.
    • Edler, B.E.1
  • 35
    • 0026624560 scopus 로고
    • PERCEVAL: Perceptual evaluation of the quality of audio signals
    • Jan./Feb.
    • B. Paillard et al., "PERCEVAL: Perceptual evaluation of the quality of audio signals," J. Aud. Eng. Soc., vol. 40, no. 1/2, pp. 21-31, Jan./Feb. 1992.
    • (1992) J. Aud. Eng. Soc. , vol.40 , Issue.1-2 , pp. 21-31
    • Paillard, B.1
  • 36
    • 0011255924 scopus 로고
    • A perceptual model applied to audio bit-rate reduction
    • Apr.
    • C. Colomes et al., "A perceptual model applied to audio bit-rate reduction," J. Aud. Eng. Soc., vol. 43, no. 4, pp. 233-240, Apr. 1995.
    • (1995) J. Aud. Eng. Soc. , vol.43 , Issue.4 , pp. 233-240
    • Colomes, C.1
  • 39
    • 0031214234 scopus 로고    scopus 로고
    • Speech enhancement using state-based estimation and sinusoidal modeling
    • Aug.
    • M. Deisher and A. S. Spanias, "Speech enhancement using state-based estimation and sinusoidal modeling," J. Acoust. Soc. Amer., vol. 102, no. 2, pp. 1141-1148, Aug. 1997.
    • (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.2 , pp. 1141-1148
    • Deisher, M.1    Spanias, A.S.2
  • 40
    • 0035400321 scopus 로고    scopus 로고
    • Algorithms for low-bit rate sinusoidal coding
    • June
    • S. Ahmadi and A. Spanias, "Algorithms for low-bit rate sinusoidal coding," Speech Commun., vol. 34, no. 2001, pp. 369-390, June 2001.
    • (2001) Speech Commun. , vol.34 , Issue.2001 , pp. 369-390
    • Ahmadi, S.1    Spanias, A.2
  • 41
    • 84866492988 scopus 로고
    • Optimising digital speech coders by exploiting masking properties of the human ear
    • M. R. Schroeder, B. S. Atal, and J. L. Hall, "Optimising digital speech coders by exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.66
    • Schroeder, M.R.1    Atal, B.S.2    Hall, J.L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.