메뉴 건너뛰기




Volumn 10, Issue 1, 1996, Pages 1-22

Perceptual wavelet-representation of speech signals and its application to speech enhancement

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; AUDITION; HEURISTIC METHODS; SPECTROGRAPHS; SPEECH; SPEECH ANALYSIS; SPEECH INTELLIGIBILITY; TIME DOMAIN ANALYSIS; WAVELET TRANSFORMS;

EID: 0029735242     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1006/csla.1996.0001     Document Type: Article
Times cited : (25)

References (52)
  • 1
    • 0040677675 scopus 로고
    • Auditory-based wavelet representation
    • M. Cooke, S. Beet & M. Crawford, eds. John Wiley & Sons, London
    • d'Alessandro, C. (1993). Auditory-based wavelet representation. In Visual Representations of Speech Signals (M. Cooke, S. Beet & M. Crawford, eds). John Wiley & Sons, London.
    • (1993) Visual Representations of Speech Signals
    • D'Alessandro, C.1
  • 3
    • 0040084107 scopus 로고
    • The time-scale transformation method as an instrument for phonetic analysis
    • M. Cooke, S. Beet & M. Crawford, eds. John Wiley & Sons, London
    • Basile, P., Cutugno, F., Maturi, P. & Piccialli, A. (1993). The time-scale transformation method as an instrument for phonetic analysis. In Visual Representations of Speech Signals (M. Cooke, S. Beet & M. Crawford, eds). John Wiley & Sons, London.
    • (1993) Visual Representations of Speech Signals
    • Basile, P.1    Cutugno, F.2    Maturi, P.3    Piccialli, A.4
  • 5
    • 0027815284 scopus 로고
    • The scale representation
    • Cohen, L. (1993). The scale representation. IEEE SP 41, 3275-3292.
    • (1993) IEEE SP , vol.41 , pp. 3275-3292
    • Cohen, L.1
  • 6
    • 0007362382 scopus 로고
    • Orthonormal bases of wavelets with finite support - Connection with discrete filters
    • J. M. Combes, A. Grossmann & P. Tchamitchian, eds. Springer Verlag, Berlin
    • Daubechies, I. (1990a). Orthonormal bases of wavelets with finite support - connection with discrete filters. In Wavelets (J. M. Combes, A. Grossmann & P. Tchamitchian, eds). Springer Verlag, Berlin.
    • (1990) Wavelets
    • Daubechies, I.1
  • 7
    • 0025482241 scopus 로고
    • The wavelet transform, time-frequency localization and signal analysis
    • Daubechies, I. (1990b). The wavelet transform, time-frequency localization and signal analysis. IEEE IT 36, 961-1005.
    • (1990) IEEE IT , vol.36 , pp. 961-1005
    • Daubechies, I.1
  • 8
    • 0038899570 scopus 로고
    • Comparative evaluations of auditory representations of speech
    • M. Cooke, S. Beet & M. Crawford, eds. John Wiley & Sons, London
    • Dermody, P. et al. (1993). Comparative evaluations of auditory representations of speech. In Visual Representations of Speech Signals (M. Cooke, S. Beet & M. Crawford, eds). John Wiley & Sons, London.
    • (1993) Visual Representations of Speech Signals
    • Dermody, P.1
  • 9
    • 0027850722 scopus 로고
    • Wavelet analysis in recruitment of loudness compensation
    • Drake, L. A., Rutledge, J. C. & Cohen, C. (1993). Wavelet analysis in recruitment of loudness compensation. IEEE SP 41, 3306-3312.
    • (1993) IEEE SP , vol.41 , pp. 3306-3312
    • Drake, L.A.1    Rutledge, J.C.2    Cohen, C.3
  • 10
    • 0039492252 scopus 로고
    • Multiresolution time-sequency speech proceeding based on orthogonal wavelet packet pulse forms
    • 21-23 September, Berlin, Germany
    • Dryjalgo, A. (1993). Multiresolution time-sequency speech proceeding based on orthogonal wavelet packet pulse forms. In Proceedings of the Conference of Eurospeech '93, 21-23 September, Berlin, Germany, pp. 147-150.
    • (1993) Proceedings of the Conference of Eurospeech '93 , pp. 147-150
    • Dryjalgo, A.1
  • 11
    • 0027239229 scopus 로고
    • A signal subspace approach for speech enhancement
    • Ephraim, Y. & Van Trees, H. L. (1993). A signal subspace approach for speech enhancement. Proceedings of the ICASSP'93 II, 355-358.
    • (1993) Proceedings of the ICASSP'93 , vol.2 , pp. 355-358
    • Ephraim, Y.1    Van Trees, H.L.2
  • 12
    • 0039492242 scopus 로고
    • Some aspects of non-stationary signal processing with emphasis on time-frequency and time-scale methods
    • J. M. Combes, A. Grossmann & P. Tchamitchian, eds. Springer Verlag, Berlin
    • Flandrin, P. (1990). Some aspects of non-stationary signal processing with emphasis on time-frequency and time-scale methods. In Wavelets (J. M. Combes, A. Grossmann & P. Tchamitchian, eds). Springer Verlag, Berlin.
    • (1990) Wavelets
    • Flandrin, P.1
  • 13
    • 5644237214 scopus 로고
    • Acoustical quanta and the theory of hearing
    • Gábor, D. (1947). Acoustical quanta and the theory of hearing. Nature 169, 591-602.
    • (1947) Nature , vol.169 , pp. 591-602
    • Gábor, D.1
  • 14
    • 84991416125 scopus 로고
    • Auditory nerve representation as a front end for speech recognition in a noisy environment
    • Ghitza, O. (1986). Auditory nerve representation as a front end for speech recognition in a noisy environment. Computer Speech and Language 1, 109-130.
    • (1986) Computer Speech and Language , vol.1 , pp. 109-130
    • Ghitza, O.1
  • 16
    • 0021642206 scopus 로고
    • Cycle-octave and related transforms in seismic signal analysis
    • Goupillaud, P., Grossmann, A. & Morlet, J. (1948). Cycle-octave and related transforms in seismic signal analysis. Geoexploration 23, 85-102.
    • (1948) Geoexploration , vol.23 , pp. 85-102
    • Goupillaud, P.1    Grossmann, A.2    Morlet, J.3
  • 17
    • 84953650495 scopus 로고
    • Critical bandwidth and the frequency coordinates of the basilar membrane
    • Greenwood, D.D. (1961). Critical bandwidth and the frequency coordinates of the basilar membrane. Journal of the Acoustical Society of America 33, 1344-1356.
    • (1961) Journal of the Acoustical Society of America , vol.33 , pp. 1344-1356
    • Greenwood, D.D.1
  • 18
    • 0025126556 scopus 로고
    • A cochlear frequency-position function for several species - 29 years later
    • Greenwood, D.D. (1990). A cochlear frequency-position function for several species - 29 years later. Journal of the Acoustical Society of America 87, 2592-2605.
    • (1990) Journal of the Acoustical Society of America , vol.87 , pp. 2592-2605
    • Greenwood, D.D.1
  • 19
    • 0028377806 scopus 로고
    • An optimal time-frequency distribution for speech analysis
    • Heitz, C. & Becker, J. D. (1994). An optimal time-frequency distribution for speech analysis. Speech Communication 41, 1-18.
    • (1994) Speech Communication , vol.41 , pp. 1-18
    • Heitz, C.1    Becker, J.D.2
  • 20
    • 0008806588 scopus 로고
    • Pitch analysis
    • M. Cooke, S. Beet & M. Crawford, eds. John Wiley & Sons, London
    • Hermes, D. J. (1993). Pitch analysis. In Visual Representations of Speech Signals (M. Cooke, S. Beet & M. Crawford, eds). John Wiley & Sons, London.
    • (1993) Visual Representations of Speech Signals
    • Hermes, D.J.1
  • 21
    • 0027814174 scopus 로고
    • Signal reconstruction from modified auditory wavelet transform
    • Irino, T. & Kawahara, H. (1993). Signal reconstruction from modified auditory wavelet transform. IEEE SP 41, 3549-3554.
    • (1993) IEEE SP , vol.41 , pp. 3549-3554
    • Irino, T.1    Kawahara, H.2
  • 22
    • 0026727405 scopus 로고
    • Application of the wavelet transform for pitch detection of speech signals
    • Kadambe, S. & Boudreaux-Bartels, G. F. (1992). Application of the wavelet transform for pitch detection of speech signals. IEEE IT 38, 917-924.
    • (1992) IEEE IT , vol.38 , pp. 917-924
    • Kadambe, S.1    Boudreaux-Bartels, G.F.2
  • 24
    • 0025659579 scopus 로고
    • An orthogonal set of frequency and amplitude modulated (FAM) functions for variable resolution signal analysis
    • Laine, U. K. (1990). An orthogonal set of frequency and amplitude modulated (FAM) functions for variable resolution signal analysis. Proceedings of the ICASSP'90, pp. 1615-1618.
    • (1990) Proceedings of the ICASSP'90 , pp. 1615-1618
    • Laine, U.K.1
  • 27
    • 0040677672 scopus 로고
    • Wavelets and granular analysis of speech
    • J. M. Combes, A. Grossmann & P. Tchamitchian, eds. Springer Verlag, Berlin
    • Lienard, J. S. & d'Alessandro, C. (1990). Wavelets and granular analysis of speech. In Wavelets (J. M. Combes, A. Grossmann & P. Tchamitchian, eds). Springer Verlag, Berlin.
    • (1990) Wavelets
    • Lienard, J.S.1    D'Alessandro, C.2
  • 28
    • 0024700097 scopus 로고
    • A theory for multiresolution signal decomposition: The wavelet transform
    • Mallat, S. (1989). A theory for multiresolution signal decomposition: the wavelet transform. IEEE PAMI 11, 674-693.
    • (1989) IEEE PAMI , vol.11 , pp. 674-693
    • Mallat, S.1
  • 30
    • 0038899513 scopus 로고
    • The structure and synthesis of the most frequent elements of the Hungarian speech
    • 121.sz
    • Olaszy, G. (1985). The structure and synthesis of the most frequent elements of the Hungarian speech (in Hungarian). Nyelvtudományi Értekezések 121.sz.
    • (1985) Nyelvtudományi Értekezések
    • Olaszy, G.1
  • 31
    • 0014831986 scopus 로고
    • Speech spectrograms using the fast Fourier transform
    • Oppenheim, A. V. (1970). Speech spectrograms using the fast Fourier transform. IEEE Spectrum 7, 57-62.
    • (1970) IEEE Spectrum , vol.7 , pp. 57-62
    • Oppenheim, A.V.1
  • 32
    • 0040084028 scopus 로고
    • Speech enhancement by soft thresholding in the perceptual wavelet domain
    • 20-22 June, Neos Marmaras, Greece
    • Pintér, I. (1995). Speech enhancement by soft thresholding in the perceptual wavelet domain. IEEE Workshop on Nonlinear Signal and Image Processing II, 666-669, 20-22 June, Neos Marmaras, Greece.
    • (1995) IEEE Workshop on Nonlinear Signal and Image Processing , vol.2 , pp. 666-669
    • Pintér, I.1
  • 36
    • 0026745217 scopus 로고
    • Fast algorithms for discrete and continuous wavelet transforms
    • Rioul, A. & Duhamel, P. (1992). Fast algorithms for discrete and continuous wavelet transforms. IEEE IT 38, 569-586.
    • (1992) IEEE IT , vol.38 , pp. 569-586
    • Rioul, A.1    Duhamel, P.2
  • 37
    • 0040677603 scopus 로고
    • A brief history of synthetic speech
    • Schroeder, M. R. (1993). A brief history of synthetic speech. Speech Communication 13, 231-239.
    • (1993) Speech Communication , vol.13 , pp. 231-239
    • Schroeder, M.R.1
  • 38
    • 0022741733 scopus 로고
    • A biophysical model of cochlear processing: Intensity dependence of pure tone responses
    • Shamma, S. A. (1986). A biophysical model of cochlear processing: intensity dependence of pure tone responses. Journal of the Acoustical Society of America 80, 133-145.
    • (1986) Journal of the Acoustical Society of America , vol.80 , pp. 133-145
    • Shamma, S.A.1
  • 39
    • 0003788577 scopus 로고
    • Spatial and temporal processing in central auditory networks
    • C. Koch & I. Segev, eds. Bradford Books, MIT Press
    • Shamma, S. A. (1989). Spatial and temporal processing in central auditory networks. In Methods in Neuronal Modelling: From Synapses to Networks (C. Koch & I. Segev, eds). Bradford Books, MIT Press.
    • (1989) Methods in Neuronal Modelling: From Synapses to Networks
    • Shamma, S.A.1
  • 41
    • 0027842082 scopus 로고
    • Low bit rate transparent audio compression using adapted wavelets
    • Sinha, D. & Tewfik, A. H. (1993). Low bit rate transparent audio compression using adapted wavelets. IEEE SP 41, 3463-3479.
    • (1993) IEEE SP , vol.41 , pp. 3463-3479
    • Sinha, D.1    Tewfik, A.H.2
  • 42
    • 0040084104 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • April
    • Steven, B. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE ASSP 27, April.
    • (1979) IEEE ASSP , vol.27
    • Steven, B.1
  • 46
    • 0001843298 scopus 로고
    • Théorie et applications de la notion de signal analytique
    • 2ème A.
    • Ville, J. (1948). Théorie et applications de la notion de signal analytique. Cables et Transmission, 2ème A. (1), pp. 61-74.
    • (1948) Cables et Transmission , Issue.1 , pp. 61-74
    • Ville, J.1
  • 47
    • 0027316613 scopus 로고
    • Noise robustness in the auditory representation of speech signals
    • Wang, K., Shamma, S. A. & Byrne, W. J. (1993). Noise robustness in the auditory representation of speech signals. Proceedings of the ICASSP'93 II, 335-338.
    • (1993) Proceedings of the ICASSP'93 , vol.2 , pp. 335-338
    • Wang, K.1    Shamma, S.A.2    Byrne, W.J.3
  • 48
    • 0038899511 scopus 로고
    • Adapted local trigonometric transforms and speech processing
    • Wesfreid, E. & Wickerhauser, M. V (1993). Adapted local trigonometric transforms and speech processing. IEEE SP 41, 3597-3600.
    • (1993) IEEE SP , vol.41 , pp. 3597-3600
    • Wesfreid, E.1    Wickerhauser, M.V.2
  • 49
    • 0000549020 scopus 로고
    • Entropy based algorithms for best basis selection
    • Wickerhauser, M. V & Coifman, R. R. (1992). Entropy based algorithms for best basis selection. IEEE IT 38, 712-718.
    • (1992) IEEE IT , vol.38 , pp. 712-718
    • Wickerhauser, M.V.1    Coifman, R.R.2
  • 50
    • 33745014742 scopus 로고
    • On the quantum correction for thermodynamic equilibrium
    • Wigner, E. P. (1932). On the quantum correction for thermodynamic equilibrium. Physical Review 40, 749-759.
    • (1932) Physical Review , vol.40 , pp. 749-759
    • Wigner, E.P.1
  • 51
    • 0026626445 scopus 로고
    • Auditory representation of acoustic signals
    • Yang, X., Wang, K. & Shamma, S. A. (1992). Auditory representation of acoustic signals. IEEE IT 38, 824-839.
    • (1992) IEEE IT , vol.38 , pp. 824-839
    • Yang, X.1    Wang, K.2    Shamma, S.A.3
  • 52
    • 84953656445 scopus 로고
    • Subdivision of the audible frequency range into critical bands (frequenzgruppen)
    • Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (frequenzgruppen). Journal of the Acoustical Society of America 33, 248.
    • (1961) Journal of the Acoustical Society of America , vol.33 , pp. 248
    • Zwicker, E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.