



Volume 9780521519540, 2009, Pages 1-206

Applied Speech and Audio Processing: With MATLAB® Examples

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTICS; AUDIO SYSTEMS; AUDITION; SIGNAL PROCESSING; SPEECH; STUDENTS

EID: 84926154764     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1017/CBO9780511609640     Document Type: Book
Times cited : (90)

References (86)
  • 1
    • 70349549530
    • Digital signal processing: A practical guide for engineers and scientists
    • S. W. Smith. Digital Signal Processing: A Practical Guide for Engineers and Scientists. Newnes, 2000. URL www.dspguide.com.
    • (2000) Newnes
    • Smith, S.W.1
  • 2
    • 51149145354
    • Fourier series
    • J.W. Gibbs. Fourier series. Nature, 59: 606, 1899.
    • (1899) Nature , vol.59 , pp. 606
    • Gibbs, J.W.1
  • 3
    • 0016494264
    • Digital representation of speech signals
    • R. W. Schaefer and L. R. Rabiner. Digital representation of speech signals. Proc. IEEE, 63(4): 662–677, 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 662-677
    • Schaefer, R.W.1    Rabiner, L.R.2
  • 4
    • 0002161311
    • The quefrency analysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking
    • M. Rosenblatt, editor, John Wiley
    • B. P. Bogert, M. J. R. Healy, and J. W. Tukey. The quefrency analysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking. In M. Rosenblatt, editor, Proceedings of the Symposium on Time-Series Analysis, pages 209–243. John Wiley, 1963.
    • (1963) Proceedings of the Symposium on Time-Series Analysis , pp. 209-243
    • Bogert, B.P.1    Healy, M.J.2    Tukey, J.W.3
  • 5
    • 0017542202
    • The cepstrum: A guide to processing
    • D. G. Childers, D. P. Skinner, and R. C. Kemerait. The cepstrum: A guide to processing. Proc. IEEE, 65(10): 1428–1443, 1977.
    • (1977) Proc. IEEE , vol.65 , Issue.10 , pp. 1428-1443
    • Childers, D.G.1    Skinner, D.P.2    Kemerait, R.C.3
  • 6
    • 0035506942
    • Comparison of different implementations of MFCC
    • F. Zheng, G. Zhang, and Z. Song. Comparison of different implementations of MFCC. J. Computer Sci. and Technol., 16(6): 582–589, 2001.
    • (2001) J. Computer Sci. and Technol , vol.16 , Issue.6 , pp. 582-589
    • Zheng, F.1    Zhang, G.2    Song, Z.3
  • 7
    • 0032663191
    • Is the sine-wave speech cocktail party worth attending?
    • J. Barker and M. Cooke. Is the sine-wave speech cocktail party worth attending? Speech Comm., 27(3-4): 159–174, 1999.
    • (1999) Speech Comm , vol.27 , Issue.3-4 , pp. 159-174
    • Barker, J.1    Cooke, M.2
  • 8
    • 84866492988
    • Optimizing digital speech coders by exploiting masking properties of the human ear
    • M. R. Schroeder, B.S. Atal, and J.L. Hall. Optimizing digital speech coders by exploiting masking properties of the human ear. J. Acoustical Soc. America, 66(6): 1647–1652, 1979.
    • (1979) J. Acoustical Soc. America , vol.66 , Issue.6 , pp. 1647-1652
    • Schroeder, M.R.1    Atal, B.S.2    Hall, J.L.3
  • 11
    • 0344765887
    • The influence of first and second formants on the intelligibility of clipped speech
    • I. B. Thomas. The influence of first and second formants on the intelligibility of clipped speech. J. Acoustical Soc. America, 16: 182–185, 1968.
    • (1968) J. Acoustical Soc. America , vol.16 , pp. 182-185
    • Thomas, I.B.1
  • 12
    • 85032421322
    • The sounds of speech communication
    • J. Pickett. The Sounds of Speech Communication. Allyn and Bacon, 1980.
    • (1980) Allyn and Bacon
    • Pickett, J.1
  • 13
    • 0033691440
    • Proposal of standards for intelligibility tests of Chinese speech
    • Z. Li, E. C. Tan, I. McLoughlin, and T. T. Teo. Proposal of standards for intelligibility tests of Chinese speech. IEE Proc. Vision Image Sig. Proc., 147(3): 254–260, 2000.
    • (2000) IEE Proc. Vision Image Sig. Proc , vol.147 , Issue.3 , pp. 254-260
    • Li, Z.1    Tan, E.C.2    McLoughlin, I.3    Teo, T.T.4
  • 14
    • 34249288642
    • A methodology for improving PESQ accuracy for Chinese speech
    • Melbourne, November
    • F. L. Chong, I. McLoughlin, and K. Pawlikowski. A methodology for improving PESQ accuracy for Chinese speech. In Proc. IEEE TENCON, Melbourne, November 2005.
    • (2005) Proc. IEEE TENCON
    • Chong, F.L.1    McLoughlin, I.2    Pawlikowski, K.3
  • 16
    • 84932441923
    • The design of speech communications systems
    • L. L. Beranek. The design of speech communications systems. Proc. IRE, pages 880–890, 1947.
    • (1947) Proc. IRE , pp. 880-890
    • Beranek, L.L.1
  • 20
    • 0023217566
    • Acoustic parameters measured by a formant estimating speech processor for a multiple-channel cochlear implant
    • P. J. Blamey, R. C. Dowell, and G. M. Clark. Acoustic parameters measured by a formant estimating speech processor for a multiple-channel cochlear implant. J. Acoustical Soc. America, 82(1): 38–47, 1987.
    • (1987) J. Acoustical Soc. America , vol.82 , Issue.1 , pp. 38-47
    • Blamey, P.J.1    Dowell, R.C.2    Clark, G.M.3
  • 21
    • 0016494264
    • Digital representation of speech signals
    • R. W. Schaefer and L. R. Rabiner. Digital representation of speech signals. Proc. IEEE, 63(4): 662–677, 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 662-677
    • Schaefer, R.W.1    Rabiner, L.R.2
  • 23
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. F. Boll. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoustics, Speech Signal Proc., 27(2): 113–120, 1979.
    • (1979) IEEE Trans. Acoustics, Speech Signal Proc , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 25
    • 0028125658
    • Preliminary evaluation of a formant enhancement algorithm on the perception of speech in noise for normally hearing listeners
    • J. I. Alcantera, G. J. Dooley, P. J. Blamey, and P. M. Seligman. Preliminary evaluation of a formant enhancement algorithm on the perception of speech in noise for normally hearing listeners. J. Audiology, 33(1): 15–24, 1994.
    • (1994) J. Audiology , vol.33 , Issue.1 , pp. 15-24
    • Alcantera, J.I.1    Dooley, G.J.2    Blamey, P.J.3    Seligman, P.M.4
  • 27
    • 0348076591
    • The intelligibility of speech as a function of the context of the test materials
    • G. A. Miller, G. A. Heise, and W. Lichten. The intelligibility of speech as a function of the context of the test materials. Experi. Psychol., 41: 329–335, 1951.
    • (1951) Experi. Psychol , vol.41 , pp. 329-335
    • Miller, G.A.1    Heise, G.A.2    Lichten, W.3
  • 28
    • 0016494264
    • Digital representation of speech signals
    • R. W. Schaefer and L. R. Rabiner. Digital representation of speech signals. Proc. IEEE, 63(4): 662–677, 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 662-677
    • Schaefer, R.W.1    Rabiner, L.R.2
  • 29
    • 84926206793
    • Anatomy and physiology for nurses and students of human biology
    • 4th edition
    • W. G. Sears. Anatomy and Physiology for Nurses and Students of Human Biology. Arnold, 4th edition, 1967.
    • (1967) Arnold
    • Sears, W.G.1
  • 34
    • 84932441923
    • The design of speech communications systems
    • September
    • L. L. Beranek. The design of speech communications systems. Proc. IRE, pages 880–890, September 1947.
    • (1947) Proc. IRE , pp. 880-890
    • Beranek, L.L.1
  • 37
    • 0023217566
    • Acoustic parameters measured by a formant estimating speech processor for a multiple-channel cochlear implant
    • P. J. Blamey, R. C. Dowell, and G. M. Clark. Acoustic parameters measured by a formant estimating speech processor for a multiple-channel cochlear implant. J. Acoustical Soc. America, 82(1): 38–47, 1987.
    • (1987) J. Acoustical Soc. America , vol.82 , Issue.1 , pp. 38-47
    • Blamey, P.J.1    Dowell, R.C.2    Clark, G.M.3
  • 38
    • 84866492988
    • Optimizing digital speech coders by exploiting masking properties of the human ear
    • M. R. Schroeder, B. S. Atal, and J. L. Hall. Optimizing digital speech coders by exploiting masking properties of the human ear. J. Acoustical Soc. America, 66(6): 1647–1652, 1979.
    • (1979) J. Acoustical Soc. America , vol.66 , Issue.6 , pp. 1647-1652
    • Schroeder, M.R.1    Atal, B.S.2    Hall, J.L.3
  • 40
    • 0026219140
    • Speech enhancement based conceptually on auditory evidence
    • Y. M. Cheng and D. O’Shaughnessy. Speech enhancement based conceptually on auditory evidence. IEEE Trans. Signal Proc., 39(9): 1943–1954, 1991.
    • (1991) IEEE Trans. Signal Proc , vol.39 , Issue.9 , pp. 1943-1954
    • Cheng, Y.M.1    O’Shaughnessy, D.2
  • 43
    • 0028997028
    • Speech enhancement based on masking properties of the auditory system
    • N. Virag. Speech enhancement based on masking properties of the auditory system. Proc. Int. Conf. on Acoustics, Speech and Signal Processing, Vol. 1, pages 796–799, 1995.
    • (1995) Proc. Int. Conf. on Acoustics, Speech and Signal Processing , vol.1 , pp. 796-799
    • Virag, N.1
  • 44
    • 85048425615
    • Auditory feature analysis
    • Academic Press
    • J. C. R. Licklider. Auditory feature analysis. In Information Theory. Academic Press, 1956.
    • (1956) Information Theory
    • Licklider, J.1
  • 46
    • 0022686803
    • Mistuning a harmonic of a vowel: Grouping and phase effects on vowel quality
    • C. R. Darwin and R. B. Gardner. Mistuning a harmonic of a vowel: Grouping and phase effects on vowel quality. J. Acoustical Soc. America, 79: 838–845, 1986.
    • (1986) J. Acoustical Soc. America , vol.79 , pp. 838-845
    • Darwin, C.R.1    Gardner, R.B.2
  • 48
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky. Perceptual linear predictive (PLP) analysis of speech. J. Acoustical Soc. America, 87(4): 1738–1752, 1990.
    • (1990) J. Acoustical Soc. America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 49
    • 84926209103
    • Editorial pages
    • July
    • “ISO/MPEG – Audio Standard layers”. Editorial pages. Sound Studio Magazine, pages 40–41, July 1992.
    • (1992) Sound Studio Magazine , pp. 40-41
  • 50
    • 0028125658
    • Preliminary evaluation of a formant enhancement algorithm on the perception of speech in noise for normally hearing listeners
    • J. I. Alcantera, G. J. Dooley, P. J. Blamey, and P. M. Seligman. Preliminary evaluation of a formant enhancement algorithm on the perception of speech in noise for normally hearing listeners. J. Audiology, 33(1): 15–24, 1994.
    • (1994) J. Audiology , vol.33 , Issue.1 , pp. 15-24
    • Alcantera, J.I.1    Dooley, G.J.2    Blamey, P.J.3    Seligman, P.M.4
  • 56
    • 0016495091
    • Linear prediction: A tutorial review
    • J. Makhoul. Linear prediction: A tutorial review. Proc. IEEE, 63(4): 561–580, 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 58
    • 0023965954
    • Quantizer design in LSP speech analysis-synthesis
    • N. Sugamura and N. Favardin. Quantizer design in LSP speech analysis-synthesis. IEEE J. Selec. Areas Comms, 6(2): 432–440, 1988.
    • (1988) IEEE J. Selec. Areas Comms , vol.6 , Issue.2 , pp. 432-440
    • Sugamura, N.1    Favardin, N.2
  • 59
    • 0026910787
    • A new efficient algorithm to compute the LSP parameters for speech coding
    • S. Saoudi, J. Boucher, and A. Guyader. A new efficient algorithm to compute the LSP parameters for speech coding. Signal Proc., 28(2): 201–212, 1995.
    • (1995) Signal Proc , vol.28 , Issue.2 , pp. 201-212
    • Saoudi, S.1    Boucher, J.2    Guyader, A.3
  • 60
    • 0001390793
    • Speech analysis and synthesis methods developed at ECL in NTT – from LPC to LSP
    • N. Sugamura and F. Itakura. Speech analysis and synthesis methods developed at ECL in NTT – from LPC to LSP. Speech Commun., 5: 213–229, 1986.
    • (1986) Speech Commun , vol.5 , pp. 213-229
    • Sugamura, N.1    Itakura, F.2
  • 64
  • 67
    • 0031153686
    • A classified vector quantization of LSF parameters
    • D. Chang, S. Ann, and C.W. Lee. A classified vector quantization of LSF parameters. Signal Proc., 59(3): 267–273, June 1997.
    • (1997) Signal Proc , vol.59 , Issue.3 , pp. 267-273
    • Chang, D.1    Ann, S.2    Lee, C.W.3
  • 70
    • 0027271246
    • A long history quantization approach to scalar and vector quantization of LSP coefficients
    • C. S. Xydeas and K. K. M. So. A long history quantization approach to scalar and vector quantization of LSP coefficients. In Proc. Int. Conf. on Acoustics, Speech and Signal Processing, pages 1–4, 1993.
    • (1993) Proc. Int. Conf. on Acoustics, Speech and Signal Processing , pp. 1-4
    • Xydeas, C.S.1    So, K.K.2
  • 72
    • 0020114643
    • Predictive coding of speech at low bitrates
    • B. S. Atal. Predictive coding of speech at low bitrates. IEEE Trans. Commun., COM30: 600–614, 1982.
    • (1982) IEEE Trans. Commun , vol.30 , pp. 600-614
    • Atal, B.S.1
  • 79
    • 0025588056
    • A study of LSF representation for speaker-dependent and speaker-independent HMM-based speech recognition systems
    • K. K. Paliwal. A study of LSF representation for speaker-dependent and speaker-independent HMM-based speech recognition systems. In Proc. Int. Conf. on Acoustics, Speech and Signal Processing, Volume 2, pages 801–804, 1990.
    • (1990) Proc. Int. Conf. on Acoustics, Speech and Signal Processing , vol.2 , pp. 801-804
    • Paliwal, K.K.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.