메뉴 건너뛰기




Volumn 15, Issue 1, 2007, Pages 310-319

The sensitivity matrix: Using advanced auditory models in speech and audio processing

Author keywords

Auditory model; Distortion; Perception; Quantization; Sensitivity; Speech and audio coding

Indexed keywords

AUDITORY MODEL; DISTORTION; PERCEPTION; QUANTIZATION; SENSITIVITY; SPEECH AND AUDIO CODING;

EID: 47649083103     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876722     Document Type: Article
Times cited : (21)

References (34)
  • 1
    • 84866492988 scopus 로고
    • Optimizing digital speech coders by exploiting masking properties of the human ear
    • Dec
    • M. R. Schroeder, B. S. Atal, and J. Hall, "Optimizing digital speech coders by exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, no. 6, pp. 1647-1652, Dec. 1979.
    • (1979) J. Acoust. Soc. Amer , vol.66 , Issue.6 , pp. 1647-1652
    • Schroeder, M.R.1    Atal, B.S.2    Hall, J.3
  • 2
    • 0018478298 scopus 로고
    • Optimizing predictive coders for minimum audible noise
    • Jun
    • B. S. Atal and M. R. Schroeder, "Optimizing predictive coders for minimum audible noise," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 3, pp. 247-254, Jun. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-27 , Issue.3 , pp. 247-254
    • Atal, B.S.1    Schroeder, M.R.2
  • 4
    • 0034225434 scopus 로고    scopus 로고
    • On time-frequency masking in voiced speech
    • Jul
    • J. Skoglund and W. B. Kleijn, "On time-frequency masking in voiced speech," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 361-369, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 361-369
    • Skoglund, J.1    Kleijn, W.B.2
  • 5
    • 0023963510 scopus 로고
    • Transform coding of audio signals using perceptual noise criteria
    • Feb
    • J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria," IEEE J. Select. Areas Commun., vol. 6, no. 2, pp. 314-323, Feb. 1988.
    • (1988) IEEE J. Select. Areas Commun , vol.6 , Issue.2 , pp. 314-323
    • Johnston, J.D.1
  • 6
    • 0024736787 scopus 로고
    • Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen
    • B. Edler, "Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen," Frequenz, vol. 43, no. 9, pp. 252-256, 1989.
    • (1989) Frequenz , vol.43 , Issue.9 , pp. 252-256
    • Edler, B.1
  • 7
    • 0141603789 scopus 로고    scopus 로고
    • Enhancing the performance of perceptual audio coders by using Temporal Noise Shaping (TNS)
    • preprint 4384
    • J. Herre and J. D. Johnston, "Enhancing the performance of perceptual audio coders by using Temporal Noise Shaping (TNS)," in Proc. 101st Conv. Audio Engineering Society, 1996, preprint 4384.
    • (1996) Proc. 101st Conv. Audio Engineering Society
    • Herre, J.1    Johnston, J.D.2
  • 8
    • 0022219187 scopus 로고
    • Code-Excited Linear Prediction (CELP): High-quality speech at very low bit rates
    • Tampa, FL
    • M. R. Schroeder and B. S. Atal, "Code-Excited Linear Prediction (CELP): high-quality speech at very low bit rates," in Proc. IEEE Int. Conf. Acoust. Speech Sign. Process., Tampa, FL, 1985, vol. 10, pp. 937-940.
    • (1985) Proc. IEEE Int. Conf. Acoust. Speech Sign. Process , vol.10 , pp. 937-940
    • Schroeder, M.R.1    Atal, B.S.2
  • 10
    • 0023448388 scopus 로고
    • A pulse ribbon model of monaural phase perception
    • Nov
    • R. D. Patterson, "A pulse ribbon model of monaural phase perception," J. Acoust. Soc. Am., vol. 82, no. 5, pp. 1560-1586, Nov. 1987.
    • (1987) J. Acoust. Soc. Am , vol.82 , Issue.5 , pp. 1560-1586
    • Patterson, R.D.1
  • 11
    • 0029132067 scopus 로고
    • Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform
    • Oct
    • R. D. Patterson, M. Allerhand, and C. Giguère, "Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform," J. Acoust. Soc. Amer., vol. 98, no. 4, pp. 1890-1894, Oct. 1995.
    • (1995) J. Acoust. Soc. Amer , vol.98 , Issue.4 , pp. 1890-1894
    • Patterson, R.D.1    Allerhand, M.2    Giguère, C.3
  • 12
    • 64149131134 scopus 로고    scopus 로고
    • R. D. Patterson and J. Holdsworth, A functional model of neural activity patterns and auditory images, in Advances in Speech, Hearing and Language Processing. Greenwich, CT: JAI, 1996, 3, pt. B, pp. 547-563.
    • R. D. Patterson and J. Holdsworth, "A functional model of neural activity patterns and auditory images," in Advances in Speech, Hearing and Language Processing. Greenwich, CT: JAI, 1996, vol. 3, pt. B, pp. 547-563.
  • 13
    • 0029952425 scopus 로고    scopus 로고
    • A quantitative model of the effective signal processing in the auditory system. I. Model structure
    • Jun
    • T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I. Model structure," J. Acoust. Soc. Amer., vol. 99, no. 6, pp. 3615-3622, Jun. 1996.
    • (1996) J. Acoust. Soc. Amer , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 14
    • 0029952451 scopus 로고    scopus 로고
    • A quantitative model of the effective signal processing in the auditory system. II. Simulations and measurements
    • Jun
    • -, "A quantitative model of the effective signal processing in the auditory system. II. Simulations and measurements," J. Acoust. Soc. Amer., vol. 99, no. 6, pp. 3623-3631, Jun. 1996.
    • (1996) J. Acoust. Soc. Amer , vol.99 , Issue.6 , pp. 3623-3631
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 15
    • 0030691985 scopus 로고    scopus 로고
    • Modeling auditory processing of amplitude modulation. I. Spectral and temporal integration
    • Nov
    • T. Dau, B. D. Kollmeier, and A. Kohlrausch, "Modeling auditory processing of amplitude modulation. I. Spectral and temporal integration," J. Acoust. Soc. Amer., vol. 102, no. 5, pp. 2892-2905, Nov. 1997.
    • (1997) J. Acoust. Soc. Amer , vol.102 , Issue.5 , pp. 2892-2905
    • Dau, T.1    Kollmeier, B.D.2    Kohlrausch, A.3
  • 16
    • 0030699329 scopus 로고    scopus 로고
    • Modeling auditory processing of amplitude modulation. II. Detection and masking with narrow-band carriers
    • Nov
    • -, "Modeling auditory processing of amplitude modulation. II. Detection and masking with narrow-band carriers," J. Acoust. Soc. Amer., vol. 102, no. 5, pp. 2906-2919, Nov. 1997.
    • (1997) J. Acoust. Soc. Amer , vol.102 , Issue.5 , pp. 2906-2919
    • Dau, T.1    Kollmeier, B.D.2    Kohlrausch, A.3
  • 17
    • 27844433476 scopus 로고    scopus 로고
    • Ein psychophysiologisches Gehörmodell zur Nachbildung von Wahrnehmungsschwellen für die Audiocodierung,
    • Ph.D. dissertation, Univ. Hannover, Hannover, Germany
    • F. Baumgarte, "Ein psychophysiologisches Gehörmodell zur Nachbildung von Wahrnehmungsschwellen für die Audiocodierung," Ph.D. dissertation, Univ. Hannover, Hannover, Germany, 2000.
    • (2000)
    • Baumgarte, F.1
  • 18
    • 0026980861 scopus 로고
    • A perceptual audio quality measure based on a psycho-acoustical sound representation
    • Dec
    • J. Beerends and J. Stemerdink, "A perceptual audio quality measure based on a psycho-acoustical sound representation," J. Audio Eng. Soc., vol. 40, no. 12, pp. 963-978, Dec. 1992.
    • (1992) J. Audio Eng. Soc , vol.40 , Issue.12 , pp. 963-978
    • Beerends, J.1    Stemerdink, J.2
  • 19
    • 0018845170 scopus 로고
    • Asymptotic performance of block quantizers with difference distortion measures
    • Jan
    • Y. Yamada, S. Tazaki, and R. M. Gray, "Asymptotic performance of block quantizers with difference distortion measures," IEEE Trans. Inform. Theory, vol. IT-26, no. 1, pp. 6-14, Jan. 1980.
    • (1980) IEEE Trans. Inform. Theory , vol.IT-26 , Issue.1 , pp. 6-14
    • Yamada, Y.1    Tazaki, S.2    Gray, R.M.3
  • 20
    • 0029375948 scopus 로고
    • Theoretical analysis of the high-rate vector quantization of LPC parameters
    • Sep
    • W. R. Gardner and B. D. Rao, "Theoretical analysis of the high-rate vector quantization of LPC parameters," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 367-381, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 367-381
    • Gardner, W.R.1    Rao, B.D.2
  • 21
    • 0033099498 scopus 로고    scopus 로고
    • High-resolution source coding for non-difference distortion measures: The rate-distortion function
    • Mar
    • T. Linder and R. Zamir, "High-resolution source coding for non-difference distortion measures: the rate-distortion function," IEEE Trans. Inform. Theory, vol. 45, no. 2, pp. 533-547, Mar. 1999.
    • (1999) IEEE Trans. Inform. Theory , vol.45 , Issue.2 , pp. 533-547
    • Linder, T.1    Zamir, R.2
  • 22
    • 0033101246 scopus 로고    scopus 로고
    • High-resolution source coding for non-difference distortion measures: Multidimensional companding
    • Mar
    • T. Linder, R. Zamir, and K. Zeger, "High-resolution source coding for non-difference distortion measures: multidimensional companding," IEEE Trans. Inform. Theory, vol. 45, no. 2, pp. 548-561, Mar. 1999.
    • (1999) IEEE Trans. Inform. Theory , vol.45 , Issue.2 , pp. 548-561
    • Linder, T.1    Zamir, R.2    Zeger, K.3
  • 23
    • 0032643189 scopus 로고    scopus 로고
    • Asymptotic performance of vector quantizers with a perceptual distortion measure
    • May
    • J. Li, N. Chaddha, and R. M. Gray, "Asymptotic performance of vector quantizers with a perceptual distortion measure," IEEE Trans. Inform. Theory, vol. 45, no. 4, pp. 1082-1091, May 1999.
    • (1999) IEEE Trans. Inform. Theory , vol.45 , Issue.4 , pp. 1082-1091
    • Li, J.1    Chaddha, N.2    Gray, R.M.3
  • 26
    • 0033729018 scopus 로고    scopus 로고
    • Objective modeling of speech quality with a psychoacoustically validated auditory model
    • May
    • M. Hansen and B. Kollmeier, "Objective modeling of speech quality with a psychoacoustically validated auditory model," J. Audio Eng. Soc, vol. 48, no. 5, pp. 395-409, May 2000.
    • (2000) J. Audio Eng. Soc , vol.48 , Issue.5 , pp. 395-409
    • Hansen, M.1    Kollmeier, B.2
  • 27
    • 0018008291 scopus 로고
    • Tree-encoding of speech using (M,L)-algorithm and adaptive quantization
    • N. S. Jayant and S. A. Christensen, "Tree-encoding of speech using (M,L)-algorithm and adaptive quantization," IEEE Trans. Commun., vol. COM-26, no. 9, pp. 1376-1379, 1978.
    • (1978) IEEE Trans. Commun , vol.COM-26 , Issue.9 , pp. 1376-1379
    • Jayant, N.S.1    Christensen, S.A.2
  • 28
    • 0026155550 scopus 로고
    • A low delay 16 kb/sec speech coder
    • May
    • V. Iyengar and P. Kabal, "A low delay 16 kb/sec speech coder," IEEE Trans. Signal Process., vol. 39, no. 5, pp. 1049-1057, May 1991.
    • (1991) IEEE Trans. Signal Process , vol.39 , Issue.5 , pp. 1049-1057
    • Iyengar, V.1    Kabal, P.2
  • 30
    • 27844479443 scopus 로고
    • Delayed decision coding of pitch and innovation signals in code-excited linear prediction coding of speech
    • Boston, MA: Kluwer
    • H. Y. Su and P. Mermelstein, "Delayed decision coding of pitch and innovation signals in code-excited linear prediction coding of speech," in Speech and Audio Coding for Wireless and Network Applications. Boston, MA: Kluwer, 1993, pp. 69-76.
    • (1993) Speech and Audio Coding for Wireless and Network Applications , pp. 69-76
    • Su, H.Y.1    Mermelstein, P.2
  • 32
    • 0020322234 scopus 로고
    • Inverse frequency dependence of simultaneous tone-on-tone masking patterns at low levels
    • Jun
    • E. Zwicker and A. Jaroszewski, "Inverse frequency dependence of simultaneous tone-on-tone masking patterns at low levels," J. Acoust. Soc. Amer., vol. 71, no. 6, pp. 1508-1512, Jun. 1982.
    • (1982) J. Acoust. Soc. Amer , vol.71 , Issue.6 , pp. 1508-1512
    • Zwicker, E.1    Jaroszewski, A.2
  • 33
    • 0033940591 scopus 로고    scopus 로고
    • On the role of envelope fluctuation processing in spectral masking
    • Jul
    • R. P. Derleth and T. Dau, "On the role of envelope fluctuation processing in spectral masking," J. Acoust. Soc. Amer., vol. 108, no. 1, pp. 285-296, Jul. 2000.
    • (2000) J. Acoust. Soc. Amer , vol.108 , Issue.1 , pp. 285-296
    • Derleth, R.P.1    Dau, T.2
  • 34
    • 84944788380 scopus 로고
    • Improving the performance of the 16 kb/s LD-CELP speech coder
    • San Francisco, CA, Mar
    • J. H. Chen, N. Jayant, and R. V. Cox, "Improving the performance of the 16 kb/s LD-CELP speech coder," in Proc. IEEE Int. Conf Acoust. Speech Sign. Process., San Francisco, CA, Mar. 1992, vol. 1, pp. 69-72.
    • (1992) Proc. IEEE Int. Conf Acoust. Speech Sign. Process , vol.1 , pp. 69-72
    • Chen, J.H.1    Jayant, N.2    Cox, R.V.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.