Volume 2005, Issue 9, 2005, Pages 1334-1349

Anthropomorphic coding of speech and audio: A model inversion approach

Author keywords

Auditory model inversion; Auditory representation; Auditory synthesis; Multiple description coding; Perceptual domain coding; Speech and audio coding

Indexed keywords

AUDITORY MODEL INVERSION; AUDITORY REPRESENTATION; AUDITORY SYNTHESIS; MULTIPLE DESCRIPTION CODING; PERCEPTUAL DOMAIN CODING; SPEECH AND AUDIO CODING;

EID: 27844544161     PISSN: 11108657     EISSN: None     Source Type: Journal    
DOI: 10.1155/ASP.2005.1334     Document Type: Article
Times cited : (15)

References (50)
  • 1
    • 0028529233
    • ISO-MPEG-1 Audio: A generic standard for coding of high-quality digital audio
    • K. Brandenburg and G. Stoll, "ISO-MPEG-1 Audio: a generic standard for coding of high-quality digital audio," Journal of the Audio Engineering Society, vol. 42, no. 10, pp. 780-792, 1994.
    • (1994) Journal of the Audio Engineering Society , vol.42 , Issue.10 , pp. 780-792
    • Brandenburg, K.1    Stoll, G.2
  • 3
    • 27844448428
    • Waveform coding and auditory masking
    • W. B. Kleijn and K. K. Paliwal, Eds., Elsevier Science, Amsterdam, The Netherlands
    • R. Veldhuis and A. Kohlrausch, "Waveform coding and auditory masking," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds., pp. 427-428, Elsevier Science, Amsterdam, The Netherlands, 1995.
    • (1995) Speech Coding and Synthesis , pp. 427-428
    • Veldhuis, R.1    Kohlrausch, A.2
  • 4
    • 0029952425
    • A quantitative model of the 'effective' signal processing in the auditory system. I. Model structure
    • T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the 'effective' signal processing in the auditory system. I. Model structure," Journal of the Acoustical Society of America, vol. 99, no. 6, pp. 3615-3622, 1996.
    • (1996) Journal of the Acoustical Society of America , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 5
    • 0021285890
    • Dependence of post-masking on masker duration and its relation to temporal effects in loudness
    • E. Zwicker, "Dependence of post-masking on masker duration and its relation to temporal effects in loudness," Journal of the Acoustical Society of America, vol. 75, no. 1, pp. 219-223, 1984.
    • (1984) Journal of the Acoustical Society of America , vol.75 , Issue.1 , pp. 219-223
    • Zwicker, E.1
  • 7
    • 0030677483
    • Using a quantitative psychoacoustical signal representation for objective speech quality measurement
    • Munich, Germany, April
    • M. Hansen and B. Kollmeier, "Using a quantitative psychoacoustical signal representation for objective speech quality measurement," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '97), vol. 2, pp. 1387-1390, Munich, Germany, April 1997.
    • (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '97) , vol.2 , pp. 1387-1390
    • Hansen, M.1    Kollmeier, B.2
  • 8
    • 27844479443
    • Delayed decision coding of pitch and innovation signals in code-excited linear prediction coding of speech
    • B. S. Atal, V. Cuperman, and A. Gersho, Eds., Kluwer Academic Publishers, Boston, Mass, USA
    • H. Su and P. Mermelstein, "Delayed decision coding of pitch and innovation signals in code-excited linear prediction coding of speech," in Speech and Audio Coding for Wireless and Network Applications, B. S. Atal, V. Cuperman, and A. Gersho, Eds., pp. 69-76, Kluwer Academic Publishers, Boston, Mass, USA, 1993.
    • (1993) Speech and Audio Coding for Wireless and Network Applications , pp. 69-76
    • Su, H.1    Mermelstein, P.2
  • 11
    • 0021794508
    • Cochlear modeling
    • J. B. Allen, "Cochlear modeling," IEEE ASSP Mag., vol. 2, no. 1, pp. 3-29, 1985.
    • (1985) IEEE ASSP Mag. , vol.2 , Issue.1 , pp. 3-29
    • Allen, J.B.1
  • 12
    • 84928841914
    • Acoustic transduction in the auditory periphery
    • S. Greenberg, "Acoustic transduction in the auditory periphery," Journal of Phonetics, vol. 16, pp. 3-17, 1988.
    • (1988) Journal of Phonetics , vol.16 , pp. 3-17
    • Greenberg, S.1
  • 13
    • 0001957983
    • Physiology and coding of sound in the auditory nerve
    • A. Popper and R. Fay, Eds., Springer-Verlag, New York, NY, USA
    • M. A. Ruggero, "Physiology and coding of sound in the auditory nerve," in The Mammalian Auditory Pathway: Neurophysiology, A. Popper and R. Fay, Eds., pp. 34-93, Springer-Verlag, New York, NY, USA, 1992.
    • (1992) The Mammalian Auditory Pathway: Neurophysiology , pp. 34-93
    • Ruggero, M.A.1
  • 15
    • 0025110885
    • Derivation of auditory filter shapes from notched-noise data
    • B. R. Glasberg and B. C. Moore, "Derivation of auditory filter shapes from notched-noise data," Hearing Research, vol. 47, no. 1-2, pp. 103-138, 1990.
    • (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.2
  • 17
    • 0025126556
    • A cochlear frequency-position function for several species - 29 years later
    • D. D. Greenwood, "A cochlear frequency-position function for several species - 29 years later," Journal of the Acoustical Society of America, vol. 87, no. 6, pp. 2592-2605, 1990.
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.6 , pp. 2592-2605
    • Greenwood, D.D.1
  • 19
    • 0020459640
    • The deterioration of hearing with age: Frequency selectivity, the critical ratio, the audiogram, and speech threshold
    • R. D. Patterson, I. Nimmo-Smith, D. L. Weber, and R. Milroy, "The deterioration of hearing with age: Frequency selectivity, the critical ratio, the audiogram, and speech threshold," Journal of the Acoustical Society of America, vol. 72, no. 6, pp. 1788-1803, 1982.
    • (1982) Journal of the Acoustical Society of America , vol.72 , Issue.6 , pp. 1788-1803
    • Patterson, R.D.1    Nimmo-Smith, I.2    Weber, D.L.3    Milroy, R.4
  • 21
    • 0001990961
    • Overview: Cochlear neurobiology
    • P. Dallos, A. Popper, and R. Fay, Eds., Springer Verlag, New York, NY, USA
    • P. Dallos, "Overview: Cochlear neurobiology," in The Cochlea, P. Dallos, A. Popper, and R. Fay, Eds., vol. 8, pp. 1-43, Springer Verlag, New York, NY, USA, 1996.
    • (1996) The Cochlea , vol.8 , pp. 1-43
    • Dallos, P.1
  • 22
    • 0031012771
    • A time-domain, level-dependent auditory filter: The gammachirp
    • T. Irino and R. D. Patterson, "A time-domain, level-dependent auditory filter: The gammachirp," Journal of the Acoustical Society of America, vol. 101, no. 1, pp. 412-419, 1997.
    • (1997) Journal of the Acoustical Society of America , vol.101 , Issue.1 , pp. 412-419
    • Irino, T.1    Patterson, R.D.2
  • 23
    • 79251542316
    • A computational model of filtering, detection, and compression in the cochlea
    • Paris, France, May
    • R. F. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '82), vol. 7, pp. 1282-1285, Paris, France, May 1982.
    • (1982) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '82) , vol.7 , pp. 1282-1285
    • Lyon, R.F.1
  • 24
    • 84928837806
    • A joint synchrony/mean-rate model of auditory speech processing
    • S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," Journal of Phonetics, vol. 16, pp. 55-76, 1988.
    • (1988) Journal of Phonetics , vol.16 , pp. 55-76
    • Seneff, S.1
  • 27
    • 0023448388
    • A pulse ribbon model of monaural phase perception
    • R. D. Patterson, "A pulse ribbon model of monaural phase perception," Journal of the Acoustical Society of America, vol. 82, no. 5, pp. 1560-1586, 1987.
    • (1987) Journal of the Acoustical Society of America , vol.82 , Issue.5 , pp. 1560-1586
    • Patterson, R.D.1
  • 29
    • 0029480251
    • Pattern playback from 1950 to 1995
    • Vancouver, BC, Canada, October
    • M. Slaney, "Pattern playback from 1950 to 1995," in Proc. IEEE Systems Man Cybern. Conf., vol. 4, pp. 3519-3524, Vancouver, BC, Canada, October 1995.
    • (1995) Proc. IEEE Systems Man Cybern. Conf. , vol.4 , pp. 3519-3524
    • Slaney, M.1
  • 30
    • 0018976681
    • Acoustics in human communication: Evolving ideas about the nature of speech
    • F. S. Cooper, "Acoustics in human communication: Evolving ideas about the nature of speech," Journal of the Acoustical Society of America, vol. 68, no. 1, pp. 18-21, 1980.
    • (1980) Journal of the Acoustical Society of America , vol.68 , Issue.1 , pp. 18-21
    • Cooper, F.S.1
  • 31
    • 0027814174
    • Signal reconstruction from modified auditory wavelet transform
    • T. Irino and H. Kawahara, "Signal reconstruction from modified auditory wavelet transform," IEEE Trans. Signal Processing, vol. 41, no. 12, pp. 3549-3554, 1993.
    • (1993) IEEE Trans. Signal Processing , vol.41 , Issue.12 , pp. 3549-3554
    • Irino, T.1    Kawahara, H.2
  • 34
    • 0026626445
    • Auditory representations of acoustic signals
    • X. Yang, K. Wang, and S. A. Shamma, "Auditory representations of acoustic signals," IEEE Trans. Inform. Theory, vol. 38, no. 2, pp. 824-839, 1992.
    • (1992) IEEE Trans. Inform. Theory , vol.38 , Issue.2 , pp. 824-839
    • Yang, X.1    Wang, K.2    Shamma, S.A.3
  • 35
    • 85013945605
    • Multiple-description coding (MDC) of speech with an invertible auditory model
    • Porvoo, Finland, June
    • G. Kubin and W. B. Kleijn, "Multiple-description coding (MDC) of speech with an invertible auditory model," in Proc. IEEE Speech Coding Workshop, pp. 81-83, Porvoo, Finland, June 1999.
    • (1999) Proc. IEEE Speech Coding Workshop , pp. 81-83
    • Kubin, G.1    Kleijn, W.B.2
  • 36
  • 37
    • 0040947452
    • Iterative reconstructions in irregular sampling with derivatives
    • H. N. Razafinjatovo, "Iterative reconstructions in irregular sampling with derivatives," J. Fourier Anal. Appl., vol. 1, no. 3, pp. 281-295, 1995.
    • (1995) J. Fourier Anal. Appl. , vol.1 , Issue.3 , pp. 281-295
    • Razafinjatovo, H.N.1
  • 40
    • 27844574563
    • M.S. thesis, Institute of Communications and Wave Propagation, Graz University of Technology, Graz, Austria
    • M. Stocker, Efficient coding methods for a perceptual speech coder, M.S. thesis, Institute of Communications and Wave Propagation, Graz University of Technology, Graz, Austria, 2003.
    • (2003) Efficient Coding Methods for A Perceptual Speech Coder
    • Stocker, M.1
  • 42
    • 0343410913
    • Multiple description coding using non-hierarchical signal decomposition
    • Rhodes, Greece, September
    • Y. Wang, "Multiple description coding using non-hierarchical signal decomposition," in Proc. European Conference Signal Processing (EUSIPCO '98), pp. 233-236, Rhodes, Greece, September 1998.
    • (1998) Proc. European Conference Signal Processing (EUSIPCO '98) , pp. 233-236
    • Wang, Y.1
  • 43
    • 0003913694
    • An efficient implementation of the Patterson-Holdsworth auditory filter bank
    • Apple Computer, New York, NY, USA
    • M. Slaney, "An efficient implementation of the Patterson-Holdsworth auditory filter bank," Tech. Rep. 35, Apple Computer, New York, NY, USA, 1993.
    • (1993) Tech. Rep. , vol.35
    • Slaney, M.1
  • 45
    • 0014975170
    • Computation of spectra with unequal resolution using the fast Fourier transform
    • A. Oppenheim, D. Johnson, and K. Steiglitz, "Computation of spectra with unequal resolution using the fast Fourier transform," Proc. IEEE, vol. 59, no. 2, pp. 299-301, 1971.
    • (1971) Proc. IEEE , vol.59 , Issue.2 , pp. 299-301
    • Oppenheim, A.1    Johnson, D.2    Steiglitz, K.3
  • 46
    • 84910250979
    • Ein Beitrag zur Kurzzeitspektralanalyse mit digitalen Systemen
    • Universität Erlangen, Erlangen, Germany
    • P. Vary, "Ein Beitrag zur Kurzzeitspektralanalyse mit digitalen Systemen," Ausgewählte Arbeiten über Nachrichtensysteme 32, Universität Erlangen, Erlangen, Germany, 1978.
    • (1978) Ausgewählte Arbeiten über Nachrichtensysteme , vol.32
    • Vary, P.1
  • 48
    • 0001481529
    • Bark and ERB bilinear transforms
    • J. Smith and J. Abel, "Bark and ERB bilinear transforms," IEEE Trans. Speech Audio Processing, vol. 7, no. 6, pp. 697-708, 1999.
    • (1999) IEEE Trans. Speech Audio Processing , vol.7 , Issue.6 , pp. 697-708
    • Smith, J.1    Abel, J.2
  • 49
    • 51449088366
    • Critically sampled frequency-warped perfect reconstruction filterbank
    • Krakow, Poland, September
    • C. Feldbauer and G. Kubin, "Critically sampled frequency-warped perfect reconstruction filterbank," in Proc. European Conference on Circuit Theory and Design (ECCTD '03), vol. 3, pp. 109-112, Krakow, Poland, September 2003.
    • (2003) Proc. European Conference on Circuit Theory and Design (ECCTD '03) , vol.3 , pp. 109-112
    • Feldbauer, C.1    Kubin, G.2
  • 50
    • 84912495580
    • Analytical expressions for critical-band rate and critical bandwidth as a function of frequency
    • E. Zwicker and E. Terhardt, "Analytical expressions for critical-band rate and critical bandwidth as a function of frequency," Journal of the Acoustical Society of America, vol. 68, pp. 1523-1525, 1980.
    • (1980) Journal of the Acoustical Society of America , vol.68 , pp. 1523-1525
    • Zwicker, E.1    Terhardt, E.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.