메뉴 건너뛰기




Volumn 54, Issue 3, 2006, Pages 167-188

Bit-rate scalable intraframe sinusoidal audio coding based on rate-distortion optimization

Author keywords

[No Author keywords available]

Indexed keywords

OPTIMIZATION; SIGNAL DISTORTION; SIGNAL ENCODING; SPEECH CODING; SPEECH PROCESSING;

EID: 33646166772     PISSN: 15494950     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (10)

References (48)
  • 1
    • 84863772450 scopus 로고
    • "Speech Analysis/Synthesis Based on a Sinusoidal Representation"
    • (Aug.)
    • R. J. McAulay and T. F. Quatieri, "Speech Analysis/Synthesis Based on a Sinusoidal Representation," IEEE Trans. Acoust., Speech, Signal Process., vol. 34, pp. 744-754 (1986 Aug.).
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 2
    • 0022883476 scopus 로고
    • "Speech Transformations Based on a Sinusoidal Representation"
    • (Dec.)
    • T. F. Quatieri and R. J. McAulay, "Speech Transformations Based on a Sinusoidal Representation," IEEE Trans. Acoust., Speech, Signal Process., vol. 34, pp. 1449-1464 (1986 Dec.).
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , pp. 1449-1464
    • Quatieri, T.F.1    McAulay, R.J.2
  • 3
    • 0001935942 scopus 로고
    • "Sinusoidal Coding"
    • W. B. Kleijn and K. K. Paliwal, Eds. (Elsevier, Amsterdam), chap. 4
    • R. J. McAulay and T. F. Quatieri, "Sinusoidal Coding," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. (Elsevier, Amsterdam, 1995), chap. 4, pp. 121-174.
    • (1995) Speech Coding and Synthesis , pp. 121-174
    • McAulay, R.J.1    Quatieri, T.F.2
  • 4
    • 33646205916 scopus 로고    scopus 로고
    • "ASAC - Analysis/Synthesis Audio Codec for Very Low Bit Rates"
    • presented at the 100th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (July/Aug.), preprint 4179
    • B. Edler, H. Purnhagen, and C. Ferekidis, "ASAC - Analysis/Synthesis Audio Codec for Very Low Bit Rates," presented at the 100th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (Abstracts), vol. 44, p. 636 (1996 July/Aug.), preprint 4179.
    • (1996) , vol.44 , pp. 636
    • Edler, B.1    Purnhagen, H.2    Ferekidis, C.3
  • 5
    • 0030682314 scopus 로고    scopus 로고
    • "Matching Pursuit with Damped Sinusoids"
    • (Munich, Germany, May)
    • M. Goodwin, "Matching Pursuit with Damped Sinusoids," in Proc. ICASSP '97 (Munich, Germany, 1997 May), vol. 3, pp. 2037-2040.
    • (1997) Proc. ICASSP '97 , vol.3 , pp. 2037-2040
    • Goodwin, M.1
  • 6
    • 0031644336 scopus 로고    scopus 로고
    • "Robust Exponential Modeling of Audio Signals"
    • (Seattle, WA, USA, May)
    • J. Nieuwenhuijse, R. Heusdens, and E. F. Deprettere, "Robust Exponential Modeling of Audio Signals," in Proc. ICASSP '98 (Seattle, WA, USA, 1998 May), vol. 6, pp. 3581-3584.
    • (1998) Proc. ICASSP '98 , vol.6 , pp. 3581-3584
    • Nieuwenhuijse, J.1    Heusdens, R.2    Deprettere, E.F.3
  • 7
    • 0000934038 scopus 로고    scopus 로고
    • "High-Quality Consistent Analysis-Synthesis in Sinusoidal Coding"
    • (Florence, Italy, Sept.)
    • K. Vos, R. Vafin, R. Heusdens, and W. B. Kleijn, "High-Quality Consistent Analysis-Synthesis in Sinusoidal Coding," in Proc. AES 17th Int. Conf. (Florence, Italy, 1999 Sept.), pp. 244-250.
    • (1999) Proc. AES 17th Int. Conf. , pp. 244-250
    • Vos, K.1    Vafin, R.2    Heusdens, R.3    Kleijn, W.B.4
  • 8
    • 0032627241 scopus 로고    scopus 로고
    • "Sinusoidal Modeling Using Frame-Based Perceptually Weighted Matching Pursuits"
    • (Phoenix, AZ, USA, May)
    • T. S. Verma and T. H. Y. Meng, "Sinusoidal Modeling Using Frame-Based Perceptually Weighted Matching Pursuits," in Proc. ICASSP '99 (Phoenix, AZ, USA, 1999 May), vol. 2, pp. 981-984.
    • (1999) Proc. ICASSP '99 , vol.2 , pp. 981-984
    • Verma, T.S.1    Meng, T.H.Y.2
  • 9
    • 0033707067 scopus 로고    scopus 로고
    • "A 6 kbps to 85 kbps Scalable Audio Coder"
    • (Istanbul, Turkey, May)
    • T. S. Verma and T. H. Y. Meng, "A 6 kbps to 85 kbps Scalable Audio Coder," in Proc. ICASSP '00 (Istanbul, Turkey, 2000 May), pp. 877-880.
    • (2000) Proc. ICASSP '00 , pp. 877-880
    • Verma, T.S.1    Meng, T.H.Y.2
  • 10
    • 33646188989 scopus 로고    scopus 로고
    • "HILN, Harmonic and Individual Lines Plus Noise"
    • ISO/IEC JTC1/SC29/WG11 (MPEG) Committee, ISO/IEC 14496-3:1999/AMD1:2000
    • ISO/IEC JTC1/SC29/WG11 (MPEG) Committee, "HILN, Harmonic and Individual Lines Plus Noise," ISO/IEC 14496-3:1999/AMD1:2000 (2001).
    • (2001)
  • 11
    • 33646193323 scopus 로고    scopus 로고
    • "Parametric Coding for High-Quality Audio"
    • ISO/MPEG Committee, ISO/IEC 14496-3:2001/AMD2 (July)
    • ISO/MPEG Committee, "Parametric Coding for High-Quality Audio," ISO/IEC 14496-3:2001/AMD2 (2004 July).
    • (2004)
  • 12
    • 0141623881 scopus 로고    scopus 로고
    • "Psycho-acoustic Modeling of Audio with Exponentially Damped Sinusoids"
    • (Orlando, FL, USA, May 13-17)
    • K. Hermus, W. Verhelst, and P. Wambacq, "Psycho-acoustic Modeling of Audio with Exponentially Damped Sinusoids," in Proc. ICASSP '02 (Orlando, FL, USA, 2002 May 13-17), pp. 1821-1824.
    • (2002) Proc. ICASSP '02 , pp. 1821-1824
    • Hermus, K.1    Verhelst, W.2    Wambacq, P.3
  • 13
    • 0038044722 scopus 로고    scopus 로고
    • "Schemes for Optimal Frequency-Differential Encoding of Sinusoidal Model Parameters"
    • (Aug.)
    • J. Jensen and R. Heusdens, "Schemes for Optimal Frequency-Differential Encoding of Sinusoidal Model Parameters," Signal Process., vol. 83, pp. 1721-1735 (2003 Aug.).
    • (2003) Signal Process. , vol.83 , pp. 1721-1735
    • Jensen, J.1    Heusdens, R.2
  • 14
    • 0027842081 scopus 로고
    • "Matching Pursuits with Time-Frequency Dictionaries"
    • (Dec.)
    • S. G. Mallat and Z. Zhang, "Matching Pursuits with Time-Frequency Dictionaries," IEEE Trans. Signal Process., vol. 41, pp. 3397-3415 (1993 Dec.).
    • (1993) IEEE Trans. Signal Process. , vol.41 , pp. 3397-3415
    • Mallat, S.G.1    Zhang, Z.2
  • 15
    • 0031232722 scopus 로고    scopus 로고
    • "Speech Analysis/Synthesis and Modification Using an Analysis-by-Synthesis/Overlap-Add Sinusoidal Model"
    • (Sept.)
    • E. B. George and M. J. T. Smith, "Speech Analysis/Synthesis and Modification Using an Analysis-by-Synthesis/Overlap-Add Sinusoidal Model," IEEE Trans. Speech Audio Process., vol. 5, pp. 389-406 (1997 Sept.).
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , pp. 389-406
    • George, E.B.1    Smith, M.J.T.2
  • 16
    • 0034848222 scopus 로고    scopus 로고
    • "Sinusoidal Modeling of Audio and Speech Signals Using Psychoacoustic-Adaptive Matching Pursuits"
    • (Salt Lake City, UT, USA, May)
    • R. Heusdens, R. Vafin, and W. B. Kleijn, "Sinusoidal Modeling of Audio and Speech Signals Using Psychoacoustic-Adaptive Matching Pursuits," in Proc. ICASSP '01 (Salt Lake City, UT, USA, 2001 May), vol. 5, pp. 3281-3284.
    • (2001) Proc. ICASSP '01 , vol.5 , pp. 3281-3284
    • Heusdens, R.1    Vafin, R.2    Kleijn, W.B.3
  • 17
    • 33646200294 scopus 로고    scopus 로고
    • "Sinusoidal Coding for Audio and Speech (SiCAS)"
    • STW Project Proposal DET.4625, Delft University of Technology (Oct.)
    • R. Heusdens and E. F. Deprettere, "Sinusoidal Coding for Audio and Speech (SiCAS)," STW Project Proposal, DET.4625, Delft University of Technology (1998 Oct.).
    • (1998)
    • Heusdens, R.1    Deprettere, E.F.2
  • 18
    • 0034854444 scopus 로고    scopus 로고
    • "Modifying Transients for Efficient Coding of Audio"
    • (Salt Lake City, UT, USA, May)
    • R. Vafin, R. Heusdens, and W. B. Kleijn, "Modifying Transients for Efficient Coding of Audio," in Proc. ICASSP '01 (Salt Lake City, UT, USA, 2001 May), vol. 5, pp. 3285-3288.
    • (2001) Proc. ICASSP '01 , vol.5 , pp. 3285-3288
    • Vafin, R.1    Heusdens, R.2    Kleijn, W.B.3
  • 21
    • 0030643261 scopus 로고    scopus 로고
    • "Optimal Time Segmentation for Signal Modeling and Compression"
    • (Munich, Germany, Apr.)
    • P. Prandoni, M. Goodwin, and M. Vetterli, "Optimal Time Segmentation for Signal Modeling and Compression," in Proc. ICASSP '97 (Munich, Germany, 1997 Apr.), pp. 2029-2032.
    • (1997) Proc. ICASSP '97 , pp. 2029-2032
    • Prandoni, P.1    Goodwin, M.2    Vetterli, M.3
  • 22
    • 0031077325 scopus 로고    scopus 로고
    • "Flexible Tree-Structured Signal Expansions Using Time-Varying Wavelet Packets"
    • (Feb.)
    • Z. Xiong, K. Ramchandran, C. Herley, and M. T. Orchard, "Flexible Tree-Structured Signal Expansions Using Time-Varying Wavelet Packets," IEEE Trans. Signal Process., vol. 45, pp. 333-345 (1997 Feb.).
    • (1997) IEEE Trans. Signal Process. , vol.45 , pp. 333-345
    • Xiong, Z.1    Ramchandran, K.2    Herley, C.3    Orchard, M.T.4
  • 23
    • 84960909873 scopus 로고    scopus 로고
    • "A Comparison of Sinusoidal Model Variants for Speech and Audio Representation"
    • (Toulouse, France, Sept. 3-6)
    • J. Jensen and R. Heusdens, "A Comparison of Sinusoidal Model Variants for Speech and Audio Representation," in Proc. XI Eur. Signal Processing Conf. (Toulouse, France, 2002 Sept. 3-6), pp. 479-482.
    • (2002) Proc. XI Eur. Signal Processing Conf. , pp. 479-482
    • Jensen, J.1    Heusdens, R.2
  • 24
    • 0036699533 scopus 로고    scopus 로고
    • "Sinusoidal Modeling Using Psychoacoustic-Adaptive Matching Pursuits"
    • (Aug.)
    • R. Heusdens, R. Vafin, and W. B. Kleijn, "Sinusoidal Modeling Using Psychoacoustic-Adaptive Matching Pursuits," IEEE Signal Process. Lett., vol. 9, pp. 262-265, (2002 Aug.).
    • (2002) IEEE Signal Process. Lett. , vol.9 , pp. 262-265
    • Heusdens, R.1    Vafin, R.2    Kleijn, W.B.3
  • 25
    • 0141847242 scopus 로고    scopus 로고
    • "A New Psycho-Acoustical Masking Model for Audio Coding Applications"
    • (Orlando, FL, USA, May 13-17)
    • S. van de Par, A. Kohlrausch, G. Charestan, and R. Heusdens, "A New Psycho-Acoustical Masking Model for Audio Coding Applications," in Proc. ICASSP '02 (Orlando, FL, USA, 2002 May 13-17), pp. 1805-1808.
    • (2002) Proc. ICASSP '02 , pp. 1805-1808
    • van de Par, S.1    Kohlrausch, A.2    Charestan, G.3    Heusdens, R.4
  • 28
    • 34249819475 scopus 로고    scopus 로고
    • "Audio Subband Coding with Improved Representation of Transient Signal Segments"
    • (Rhodos, Greece)
    • J. Kliewer and A. Mertins, "Audio Subband Coding with Improved Representation of Transient Signal Segments," in Proc. XI Eur. Signal Processing Conf. (Rhodos, Greece, 1998), pp. 2345-2348.
    • (1998) Proc. XI Eur. Signal Processing Conf. , pp. 2345-2348
    • Kliewer, J.1    Mertins, A.2
  • 30
    • 0029952425 scopus 로고    scopus 로고
    • "A Quantitative Model of the "Effective" Signal Processing in the Auditory System, Part I: Model Structure"
    • T. Dau, D. Püschel, and A. Kohlrausch, "A Quantitative Model of the "Effective" Signal Processing in the Auditory System, Part I: Model Structure," J. Acoust. Soc. Am., vol. 99, pp. 3615-3622 (1996).
    • (1996) J. Acoust. Soc. Am. , vol.99 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 31
    • 0003821625 scopus 로고
    • "Coding of Moving Pictures and Associated Audio for Storage at up to about 1.5 Mbit/s, Part 3: Audio"
    • ISO/MPEG Committee, ISO/IEC 11172-3 (Nov.)
    • ISO/MPEG Committee, "Coding of Moving Pictures and Associated Audio for Storage at up to about 1.5 Mbit/s, Part 3: Audio," ISO/IEC 11172-3 (1993 Nov.).
    • (1993)
  • 32
    • 27844573198 scopus 로고    scopus 로고
    • "Rate-Distortion Optimal Exponential Modeling of Audio and Speech Signals"
    • (Wassenaar, The Netherlands, May)
    • K. Vos and R. Heusdens, "Rate-Distortion Optimal Exponential Modeling of Audio and Speech Signals," in Proc. 21st Symp. on Information Theory in the Benelux (Wassenaar, The Netherlands, 2000 May), pp. 77-84.
    • (2000) Proc. 21st Symp. on Information Theory in the Benelux , pp. 77-84
    • Vos, K.1    Heusdens, R.2
  • 33
    • 33646176610 scopus 로고    scopus 로고
    • "Estimation of Sinusoidal Model Parameters Using Newton Optimization and a Perceptual Distortion Measure"
    • (Hilvarenbeek, The Netherlands, April 15-16)
    • D. Kloosterman, R. Heusdens, and J. Jensen, "Estimation of Sinusoidal Model Parameters Using Newton Optimization and a Perceptual Distortion Measure," in Proc. IEEE Benelux SPS 2004 (Hilvarenbeek, The Netherlands, 2004 April 15-16), pp. 199-202.
    • (2004) Proc. IEEE Benelux SPS 2004 , pp. 199-202
    • Kloosterman, D.1    Heusdens, R.2    Jensen, J.3
  • 34
    • 0036297213 scopus 로고    scopus 로고
    • "Rate-Distortion Optimal Sinusoidal Modeling of Audio and Speech Using Psychoacoustical Matching Pursuits"
    • (Orlando, FL, USA, May 13-17)
    • R. Heusdens and S. van der Par, "Rate-Distortion Optimal Sinusoidal Modeling of Audio and Speech Using Psychoacoustical Matching Pursuits," in Proc. ICASSP '02 (Orlando, FL, USA, 2002 May 13-17), pp. 1809-1812.
    • (2002) Proc. ICASSP '02 , pp. 1809-1812
    • Heusdens, R.1    van der Par, S.2
  • 35
    • 0003983976 scopus 로고    scopus 로고
    • "Audio Representations for Data Compression and Compressed Domain Processing"
    • Ph.D. thesis, Stanford University, Stanford, CA (Dec.)
    • S. C. Levine, "Audio Representations for Data Compression and Compressed Domain Processing," Ph.D. thesis, Stanford University, Stanford, CA (1998 Dec.).
    • (1998)
    • Levine, S.C.1
  • 36
    • 0029763793 scopus 로고    scopus 로고
    • "Low Bit-Rate High-Quality Audio Coding with Combined Harmonic and Wavelet Representation"
    • (Atlanta, GA, USA, May)
    • K. N. Hamdy, M. Ali, and A. H. Tewfik, "Low Bit-Rate High-Quality Audio Coding with Combined Harmonic and Wavelet Representation," in Proc. ICASSP '96 (Atlanta, GA, USA, 1996 May), pp. 1045-1048.
    • (1996) Proc. ICASSP '96 , pp. 1045-1048
    • Hamdy, K.N.1    Ali, M.2    Tewfik, A.H.3
  • 37
  • 38
    • 0023173192 scopus 로고
    • "A Shortest Path Algorithm for Dense and Sparse Linear Assignment Problems"
    • R. Jonker and A. Volgenant, "A Shortest Path Algorithm for Dense and Sparse Linear Assignment Problems," Computing, vol. 38, pp. 325-340 (1987).
    • (1987) Computing , vol.38 , pp. 325-340
    • Jonker, R.1    Volgenant, A.2
  • 41
    • 0025110885 scopus 로고
    • "Derivation of Auditory Filter Shapes from Notched-Noise Data"
    • B. R. Glasberg and B. C. J. Moore, "Derivation of Auditory Filter Shapes from Notched-Noise Data," Hearing Res., vol. 47, pp. 103-138 (1990).
    • (1990) Hearing Res. , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 42
    • 33646193323 scopus 로고    scopus 로고
    • "Parametric Coding for High-Quality Audio"
    • ISO/IEC JTCl/SC29/WGl1 (MPEG) Committee. ISO/IEC 14496-3
    • ISO/IEC JTCl/SC29/WGl1 (MPEG) Committee. "Parametric Coding for High-Quality Audio," ISO/IEC 14496-3 (2001).
    • (2001)
  • 43
    • 33646201331 scopus 로고    scopus 로고
    • ISO/IEC JTC1/SC29/WG11 (MPEG) Committee, 2nd ed. ISO/IEC 13818-7
    • ISO/IEC JTC1/SC29/WG11 (MPEG) Committee "MPEG-2 AAC," 2nd ed. ISO/IEC 13818-7 (2003).
    • (2003) "MPEG-2 AAC"
  • 44
    • 0004124275 scopus 로고    scopus 로고
    • ISO/MPEG Committee, ISO/IEC JTC1/SC 29/WG11 N4030 (Mar.)
    • ISO/MPEG Committee, "MPEG-4 Overview (V.18 - Singapore Version)," ISO/IEC JTC1/SC 29/WG11 N4030 (2001 Mar.).
    • (2001) "MPEG-4 Overview (V.18 - Singapore Version)"
  • 46
    • 14644404848 scopus 로고    scopus 로고
    • "Advances in Parametric Coding for High-Quality Audio"
    • presented at the 114th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (Abstracts), 440, (May), convention paper 5852
    • E. Schuijers, W. Oomen, B. den Brinker, and J. Breebaart, "Advances in Parametric Coding for High-Quality Audio," presented at the 114th Convention of the Audio Engineering Society, J. Audio Eng. Soc. (Abstracts), vol. 51, pp. 440, 441 (2003 May), convention paper 5852.
    • (2003) , vol.51 , pp. 441
    • Schuijers, E.1    Oomen, W.2    den Brinker, B.3    Breebaart, J.4
  • 47
    • 13344250603 scopus 로고    scopus 로고
    • "Method for the Subjective Assessment of Intermediate Quality Level of Coding Systems
    • ITU-R BS.1534, (Geneva,Switzerland)
    • ITU-R BS.1534, "Method for the Subjective Assessment of Intermediate Quality Level of Coding Systems (Geneva, Switzerland, 2001).
    • (2001)
  • 48
    • 0003553472 scopus 로고    scopus 로고
    • 2nd ed. Springer ser. in Information Sciences (Springer, Berlin Heidelberg)
    • E. Zwicker and H. Fastl, Psychoacoustics, 2nd ed. Springer ser. in Information Sciences (Springer, Berlin Heidelberg, 1999).
    • (1999) Psychoacoustics
    • Zwicker, E.1    Fastl, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.