SCOPUS Information Search Platform

IEEE Transactions on Speech and Audio Processing

Volume 13, Issue 2, 2005, Pages 149-161

Perceptual segmentation and component selection for sinusoidal representations of audio

(2) Painter, Ted a Spanias, Andreas b

a INTEL CORPORATION (United States)

b Arizona State University (United States)

Author keywords

Audio coding; Psychoacoustics; Segmentation; Sinusoidal models

Indexed keywords

ERROR ANALYSIS; FAST FOURIER TRANSFORMS; FREQUENCIES; INTERPOLATION; MATHEMATICAL MODELS; OPTIMIZATION; PARAMETER ESTIMATION; RANDOM PROCESSES; SIGNAL PROCESSING;

AUDIO CODING; PSYCHOACOUSTICS; SEGMENTATION; SINUSOIDAL MODELS;

AUDIO ACOUSTICS;

EID: 14644436350 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2004.841050 Document Type: Article

Times cited : (19)

References (41)

1
- 0029725684
- Speech analysis and coding using a multi-resolution sinusoidal transform
- May
- D. V. Anderson, "Speech analysis and coding using a multi-resolution sinusoidal transform," in Proc. ICASSP, May 1996, pp. 1045-1048.
- (1996) Proc. ICASSP , pp. 1045-1048
- Anderson, D.V.¹

2
- 0031635035
- On the harmonic analysis of speech
- Monterey, CA, May/Jun. 31-3
- Y. Stylianou, "On the harmonic analysis of speech," in Proc. IEEE Int. Symp. Circuits Systems ISCAS, Monterey, CA, May/Jun. 31-3, 1998, pp. 5-8.
- (1998) Proc. IEEE Int. Symp. Circuits Systems ISCAS , pp. 5-8
- Stylianou, Y.¹

3
- 0002445128
- A wavelet-based sinusoid model of sound for auditory signal separation
- D. Ellis and B. Vercoe, "A wavelet-based sinusoid model of sound for auditory signal separation," in Proc. Int. Comp. Mus. Conf., 1991, pp. 86-89.
- (1991) Proc. Int. Comp. Mus. Conf. , pp. 86-89
- Ellis, D.¹ Vercoe, B.²

4
- 0029745882
- Residual modeling in music analysis-synthesis
- May
- M. Goodwin, "Residual modeling in music analysis-synthesis," in Proc. ICASSP, May 1996.
- (1996) Proc. ICASSP
- Goodwin, M.¹

5
- 84863772450
- Speech analysis synthesis based on a sinusoidal representation
- Aug.
- R. McAulay and T. Quatieri, "Speech analysis synthesis based on a sinusoidal representation," IEEE Trans. Acoust. Speech Signal Process., pp. 744-754, Aug. 1986.
- (1986) IEEE Trans. Acoust. Speech Signal Process , pp. 744-754
- McAulay, R.¹ Quatieri, T.²

6
- 0032165792
- A new phase model for sinusoidal transform coding
- Sep.
- S. Ahmadi and A. S. Spanias, "A new phase model for sinusoidal transform coding," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 495-501, Sep. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 495-501
- Ahmadi, S.¹ Spanias, A.S.²

7
- 0026202612
- A hybrid transform method for speech analysis and synthesis
- A. S. Spanias, "A hybrid transform method for speech analysis and synthesis," Signal Process., vol. 24, pp. 217-229, 1991.
- (1991) Signal Process , vol.24 , pp. 217-229
- Spanias, A.S.¹

8
- 0019655870
- A tone-oriented voice-excited vocoder
- Mar.
- P. Hedelin, "A tone-oriented voice-excited vocoder," in Proc. IEEE Int. Conf. Acoustic Speech, Signal ICASSP, Mar. 1981, pp. 205-208.
- (1981) Proc. IEEE Int. Conf. Acoustic Speech, Signal ICASSP , pp. 205-208
- Hedelin, P.¹

9
- 0020764310
- Nonstationary spectral modeling of voiced speech
- Jun.
- L. Almeida, "Nonstationary spectral modeling of voiced speech," in Proc. IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-31, Jun. 1983, pp. 374-390.
- (1983) Proc. IEEE Trans. Acoust. Speech, Signal Process , vol.ASSP-31 , pp. 374-390
- Almeida, L.¹

10
- 0025675356
- Perceptual considerations in a low bit rate sinusoidal vocoder
- Mar.
- E. B. George and M. J. T. Smith, "Perceptual considerations in a low bit rate sinusoidal vocoder," in Proc. IEEE Int. Phoenix Conf. Comp. Comm., Mar. 1990, pp. 268-275.
- (1990) Proc. IEEE Int. Phoenix Conf. Comp. Comm. , pp. 268-275
- George, E.B.¹ Smith, M.J.T.²

11
- 0003285531
- A sines + transients + noise audio representation for data compression and time/pitch scale modifications
- Sep. , preprint #4781
- S. Levine and J. Smith, "A sines + transients + noise audio representation for data compression and time/pitch scale modifications," in Proc. 105th Conv. Aud. Eng. Soc., Sep. 1998, preprint #4781.
- (1998) Proc. 105th Conv. Aud. Eng. Soc.
- Levine, S.¹ Smith, J.²

12
- 0003983976
- Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA
- S. Levine, "Audio representations for data compression and compressed domain processing," Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA, 1998.
- (1998) Audio Representations for Data Compression and Compressed Domain Processing
- Levine, S.¹

13
- 0003994562
- Ph.D. dissertation, Dept. Elect. Eng., Arizona State Univ., Tempe
- T. Painter, "Scalable perceptual audio coding with a hybrid adaptive sinusoidal signal model," Ph.D. dissertation, Dept. Elect. Eng., Arizona State Univ., Tempe, 2000.
- (2000) Scalable Perceptual Audio Coding with A Hybrid Adaptive Sinusoidal Signal Model
- Painter, T.¹

14
- 0034172308
- Perceptual coding of digital audio
- Apr.
- T. Painter and A. Spanias, "Perceptual coding of digital audio," Proc. IEEE, vol. 88, pp. 451-513, Apr. 2000.
- (2000) Proc. IEEE , vol.88 , pp. 451-513
- Painter, T.¹ Spanias, A.²

15
- 0031119324
- A model for the prediction of thresholds, loudness, and partial loudness
- Apr.
- B. C. J. Moore, B. Glasberg, and T. Baer, "A model for the prediction of thresholds, loudness, and partial loudness," J. Aud. Eng. Soc., vol. 45, no. 4, pp. 224-240, Apr. 1997.
- (1997) J. Aud. Eng. Soc. , vol.45 , Issue.4 , pp. 224-240
- Moore, B.C.J.¹ Glasberg, B.² Baer, T.³

16
- 0025544510
- Spectral modeling and synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition
- Winter
- X. Serra and J. O. Smith III, "Spectral modeling and synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition," Comput. Mus. J., pp. 12-24, Winter 1990.
- (1990) Comput. Mus. J. , pp. 12-24
- Serra, X.¹ Smith III, J.O.²

17
- 0023206590
- A new speech coding model based on a least-squares sinusoidal representation
- Apr.
- E. B. George and M. J. T. Smith, "A new speech coding model based on a least-squares sinusoidal representation," in Proc. ICASSP-87, Apr. 1987, pp. 1641-1644.
- (1987) Proc. ICASSP-87 , pp. 1641-1644
- George, E.B.¹ Smith, M.J.T.²

18
- 0001654096
- Analysis-by-Synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones
- Jun.
- _, "Analysis-by-Synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones," J. Aud. Eng. Soc., pp. 497-516, Jun. 1992.
- (1992) J. Aud. Eng. Soc. , pp. 497-516

19
- 0003962735
- Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci., Univ. California, Berkeley
- M. Goodwin, "Adaptive signal models: theory, algorithms, and audio applications," Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci., Univ. California, Berkeley, 1997.
- (1997) Adaptive Signal Models: Theory, Algorithms, and Audio Applications
- Goodwin, M.¹

20
- 0007848147
- Ph.D. dissertation, Dept. of Elect. Eng., Stanford Univ., Stanford, CA
- T. Verma, "A perceptually based audio signal model with application to scalable audio compression," Ph.D. dissertation, Dept. of Elect. Eng., Stanford Univ., Stanford, CA, 1999.
- (1999) A Perceptually Based Audio Signal Model with Application to Scalable Audio Compression
- Verma, T.¹

21
- 0023869370
- Computationally efficient sine-wave synthesis and its application to sinusoidal transform coding
- Apr.
- R. McAulay and T. Quatieri, "Computationally efficient sine-wave synthesis and its application to sinusoidal transform coding," in Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP), Apr. 1988, pp. 370-373.
- (1988) Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP) , pp. 370-373
- McAulay, R.¹ Quatieri, T.²

22
- 14644432132
- Spectral envelopes and inverse FFT synthesis
- Oct. , preprint #3393
- X. Rodet and P. Depalle, "Spectral envelopes and inverse FFT synthesis," in Proc. 93rd Conv. Audio Engineering Soc., Oct. 1992, preprint #3393.
- (1992) Proc. 93rd Conv. Audio Engineering Soc.
- Rodet, X.¹ Depalle, P.²

23
- 0003837108
- Ph.D. dissertation, Dept. Comput. Music, Stanford Univ., Stanford, CA
- X. Serra, "A system for sound analysis/transformation/synthesis based on a deterministic plus stochastic decomposition," Ph.D. dissertation, Dept. Comput. Music, Stanford Univ., Stanford, CA, 1989.
- (1989) A System for Sound Analysis/transformation/synthesis Based on A Deterministic Plus Stochastic Decomposition
- Serra, X.¹

24
- 0029763793
- Low bit rate high quality audio coding with combined harmonic and wavelet representations
- May
- K. N. Hamdy et al., "Low bit rate high quality audio coding with combined harmonic and wavelet representations," in Proc. ICASSP, May 1996, pp. 1045-1048.
- (1996) Proc. ICASSP , pp. 1045-1048
- Hamdy, K.N.¹

25
- 0031628825
- An analysis/synthesis tool for transient signals that allows a flexible sines + transients + noise model for audio
- May
- T. Verma and T. Meng, "An analysis/synthesis tool for transient signals that allows a flexible sines + transients + noise model for audio," in Proc. ICASSP, May 1998.
- (1998) Proc. ICASSP
- Verma, T.¹ Meng, T.²

26
- 85159475599
- Transient modeling synthesis: A flexible analysis/synthesis tool for transient signals
- T. Verma et al., "Transient modeling synthesis: A flexible analysis/synthesis tool for transient signals," in Proc. Int. Comp. Mus. Conf., 1997.
- (1997) Proc. Int. Comp. Mus. Conf.
- Verma, T.¹

27
- 0003555096
- Wellesley, MA: A. K. Peters
- M. Wickerhauser, Adapted Wavelet Analysis from Theory to Software. Wellesley, MA: A. K. Peters, 1994.
- (1994) Adapted Wavelet Analysis from Theory to Software
- Wickerhauser, M.¹

28
- 0026686048
- Entropy based algorithms for best basis selection
- Mar.
- R. Coifman and M. Wickerhauser, "Entropy based algorithms for best basis selection," IEEE Trans. Info. Theory, vol. 38, no. 2, pp. 712-718, Mar. 1992.
- (1992) IEEE Trans. Info. Theory , vol.38 , Issue.2 , pp. 712-718
- Coifman, R.¹ Wickerhauser, M.²

29
- 0027842081
- Matching Pursuits with time-frequency dictionaries
- Dec.
- S. Mallat and Z. Zhang, "Matching Pursuits with time-frequency dictionaries," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3397-3415, Dec. 1993.
- (1993) IEEE Trans. Signal Process , vol.41 , Issue.12 , pp. 3397-3415
- Mallat, S.¹ Zhang, Z.²

30
- 14644394002
- Improving time-scale modification of audio signals using wavelets
- M. Rodriguez-Hernandez and F. Casajus-Quiros, "Improving time-scale modification of audio signals using wavelets," in Proc. ICSPAT, 1994, pp. 1573-1577.
- (1994) Proc. ICSPAT , pp. 1573-1577
- Rodriguez-Hernandez, M.¹ Casajus-Quiros, F.²

31
- 0031619389
- Multiresolution sinusoidal modeling using adaptive segmentation
- May
- M. Goodwin, "Multiresolution sinusoidal modeling using adaptive segmentation," in Proc. ICASSP, May 1998.
- (1998) Proc. ICASSP
- Goodwin, M.¹

32
- 84953658827
- On the masking pattern of a simple auditory stimulus
- J. Egan and H. Hake, "On the masking pattern of a simple auditory stimulus," J. Acoust. Soc. Amer., vol. 22, pp. 622-630, 1950.
- (1950) J. Acoust. Soc. Amer. , vol.22 , pp. 622-630
- Egan, J.¹ Hake, H.²

33
- 0003273322
- ASAC - Analysis/synthesis audio codec for very low bit rates
- May preprint #4179
- B. E. Edler et al., "ASAC - Analysis/synthesis audio codec for very low bit rates," in Proc. 100th Conv. Audio Engineering Soc., May 1996, preprint #4179.
- (1996) Proc. 100th Conv. Audio Engineering Soc.
- Edler, B.E.¹

34
- 0004236521
- New York: Springer-Verlag
- E. Zwicker and H. Fastl, Psychoacoustics Facts and Models. New York: Springer-Verlag, 1990.
- (1990) Psychoacoustics Facts and Models
- Zwicker, E.¹ Fastl, H.²

35
- 0026624560
- PERCEVAL: Perceptual evaluation of the quality of audio signals
- Jan./Feb.
- B. Paillard et al., "PERCEVAL: Perceptual evaluation of the quality of audio signals," J. Aud. Eng. Soc., vol. 40, no. 1/2, pp. 21-31, Jan./Feb. 1992.
- (1992) J. Aud. Eng. Soc. , vol.40 , Issue.1-2 , pp. 21-31
- Paillard, B.¹

36
- 0011255924
- A perceptual model applied to audio bit-rate reduction
- Apr.
- C. Colomes et al., "A perceptual model applied to audio bit-rate reduction," J. Aud. Eng. Soc., vol. 43, no. 4, pp. 233-240, Apr. 1995.
- (1995) J. Aud. Eng. Soc. , vol.43 , Issue.4 , pp. 233-240
- Colomes, C.¹

37
- 0348042738
- ISO/IEC JTC1/SC29/WG11 MPEG97/2480
- H. Purnhagen et al., "Proposal of a Core Experiment for Extended 'Harmonic and Individual Lines Plus Noise' Tools for the Parametric Audio Coder Core," ISO/IEC JTC1/SC29/WG11 MPEG97/2480, 1997.
- (1997) Proposal of A Core Experiment for Extended 'Harmonic and Individual Lines Plus Noise' Tools for the Parametric Audio Coder Core
- Purnhagen, H.¹

38
- 0026941365
- A mixed fourier/walsh transform scheme for speech coding at 4 kbps
- Oct.
- A. S. Spanias and P. Loizou, "A mixed fourier/walsh transform scheme for speech coding at 4 kbps," Proc. Inst. Elec. Eng.-Part I Communications, Speech, Vision, vol. 139, no. 5, pp. 473-481, Oct. 1992.
- (1992) Proc. Inst. Elec. Eng.-Part I Communications, Speech, Vision , vol.139 , Issue.5 , pp. 473-481
- Spanias, A.S.¹ Loizou, P.²

39
- 0031214234
- Speech enhancement using state-based estimation and sinusoidal modeling
- Aug.
- M. Deisher and A. S. Spanias, "Speech enhancement using state-based estimation and sinusoidal modeling," J. Acoust. Soc. Amer., vol. 102, no. 2, pp. 1141-1148, Aug. 1997.
- (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.2 , pp. 1141-1148
- Deisher, M.¹ Spanias, A.S.²

40
- 0035400321
- Algorithms for low-bit rate sinusoidal coding
- June
- S. Ahmadi and A. Spanias, "Algorithms for low-bit rate sinusoidal coding," Speech Commun., vol. 34, no. 2001, pp. 369-390, June 2001.
- (2001) Speech Commun. , vol.34 , Issue.2001 , pp. 369-390
- Ahmadi, S.¹ Spanias, A.²

41
- 84866492988
- Optimising digital speech coders by exploiting masking properties of the human ear
- M. R. Schroeder, B. S. Atal, and J. L. Hall, "Optimising digital speech coders by exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, 1979.
- (1979) J. Acoust. Soc. Amer. , vol.66
- Schroeder, M.R.¹ Atal, B.S.² Hall, J.L.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.