Volume 2007, 2007

Significance of joint features derived from the modified group delay function in speech processing

EID: 33845951461     PISSN: 16874714     EISSN: 16874722     Source Type: Journal    
DOI: 10.1155/2007/79032     Document Type: Article
Times cited: 21

References (45)
  • 3
    • 0002735918
    • Optimization of time-frequency masking filters using the minimum classification error criterion
    • Adelaide, SA, Australia, April
    • M. Bacchiani and K. Aikawa, "Optimization of time-frequency masking filters using the minimum classification error criterion," in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94), vol. 2, pp. 197-200, Adelaide, SA, Australia, April 1994.
    • (1994) IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94) , vol.2 , pp. 197-200
    • Bacchiani, M.1    Aikawa, K.2
  • 4
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 5
    • 0028312802
    • Auditory models and human performance in tasks related to speech coding and speech recognition
    • O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, part 2, pp. 115-132, 1994.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.1 PART 2 , pp. 115-132
    • Ghitza, O.1
  • 6
    • 0023841401
    • Vowel processing by a model of the auditory periphery: A comparison to eighth-nerve responses
    • K. L. Payton, "Vowel processing by a model of the auditory periphery: a comparison to eighth-nerve responses," The Journal of the Acoustical Society of America, vol. 83, no. 1, pp. 145-162, 1988.
    • (1988) The Journal of the Acoustical Society of America , vol.83 , Issue.1 , pp. 145-162
    • Payton, K.L.1
  • 8
    • 84928837806
    • A joint synchrony/mean-rate model of auditory speech processing
    • S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," Journal of Phonetics, vol. 16, no. 1, pp. 55-76, 1988.
    • (1988) Journal of Phonetics , vol.16 , Issue.1 , pp. 55-76
    • Seneff, S.1
  • 9
    • 0024392496
    • Application of an auditory model to speech recognition
    • J. R. Cohen, "Application of an auditory model to speech recognition," The Journal of the Acoustical Society of America, vol. 85, no. 6, pp. 2623-2629, 1989.
    • (1989) The Journal of the Acoustical Society of America , vol.85 , Issue.6 , pp. 2623-2629
    • Cohen, J.R.1
  • 11
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 12
    • 13544259544
    • On the usefulness of STFT phase spectrum in human listening tests
    • K. K. Paliwal and L. D. Alsteris, "On the usefulness of STFT phase spectrum in human listening tests," Speech Communication, vol. 45, no. 2, pp. 153-170, 2005.
    • (2005) Speech Communication , vol.45 , Issue.2 , pp. 153-170
    • Paliwal, K.K.1    Alsteris, L.D.2
  • 21
    • 74549174907
    • Feature stream combination before and/or after the acoustic model
    • Tech. Rep. TR-00-007, International Computer Science Institute, Berkeley, Calif, USA
    • D. Ellis, "Feature stream combination before and/or after the acoustic model," Tech. Rep. TR-00-007, International Computer Science Institute, Berkeley, Calif, USA, 2000.
    • (2000)
    • Ellis, D.1
  • 22
    • 33845961024
    • Speech recognition using heterogenous information extraction in multi-stream based systems
    • Ph.D. dissertation, Aalborg University, Aalborg, Denmark
    • H. Christensen, "Speech recognition using heterogenous information extraction in multi-stream based systems," Ph.D. dissertation, Aalborg University, Aalborg, Denmark, 2002.
    • (2002)
    • Christensen, H.1
  • 26
    • 0032627223
    • Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values
    • K. Kirchhoff and J. A. Bilmes, "Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '99), vol. 2, pp. 693-696, Phoenix, Ariz, USA, March 1999.
  • 27
    • 28244462378
    • Speech and Vision Lab, IIT Madras, Chennai, India
    • Database for Indian Languages, Speech and Vision Lab, IIT Madras, Chennai, India, 2001.
    • (2001) Database for Indian Languages
  • 29
    • 0025680225
    • NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database
    • C. Jankowski, A. Kalyanswamy, S. Basson, and J. Spitz, "NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '90), vol. 1, pp. 109-112, Albuquerque, NM, USA, April 1990.
  • 33
    • 33751561891
    • Linear and order statistics combiners for reliable pattern classification
    • Ph.D. dissertation, University of Texas at Austin, Austin, Tex, USA, May
    • K. Turner, "Linear and order statistics combiners for reliable pattern classification," Ph.D. dissertation, University of Texas at Austin, Austin, Tex, USA, May 1996.
    • (1996)
    • Turner, K.1
  • 34
    • 0000926506
    • When networks disagree: Ensemble methods for hybrid neural networks
    • Chapman-Hall, London, UK
    • M. P. Perrone and L. N. Cooper, "When networks disagree: ensemble methods for hybrid neural networks," in Neural Networks for Speech and Image Processing, pp. 126-142, Chapman-Hall, London, UK, 1993.
    • (1993) Neural Networks for Speech and Image Processing , pp. 126-142
    • Perrone, M.P.1    Cooper, L.N.2
  • 36
    • 85054435084
    • Neural network ensembles, cross validation, and active learning
    • MIT Press, Cambridge, Mass, USA
    • A. Krogh and J. Vedelsby, "Neural network ensembles, cross validation, and active learning," in Advances in Neural Information Processing Systems, vol. 7, pp. 231-238, MIT Press, Cambridge, Mass, USA, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 231-238
    • Krogh, A.1    Vedelsby, J.2
  • 37
    • 0026204672
    • Formant extraction from group delay function
    • H. A. Murthy and B. Yegnanarayana, "Formant extraction from group delay function," Speech Communication, vol. 10, no. 3, pp. 209-221, 1991.
    • (1991) Speech Communication , vol.10 , Issue.3 , pp. 209-221
    • Murthy, H.A.1    Yegnanarayana, B.2
  • 39
    • 1842475640
    • Automatic segmentation of continuous speech using minimum phase group delay functions
    • V. K. Prasad, T. Nagarajan, and H. A. Murthy, "Automatic segmentation of continuous speech using minimum phase group delay functions," Speech Communication, vol. 42, no. 3-4, pp. 429-446, 2004.
    • (2004) Speech Communication , vol.42 , Issue.3-4 , pp. 429-446
    • Prasad, V.K.1    Nagarajan, T.2    Murthy, H.A.3
  • 40
    • 0026923568
    • Significance of group delay functions in spectrum estimation
    • B. Yegnanarayana and H. A. Murthy, "Significance of group delay functions in spectrum estimation," IEEE Transactions on Signal Processing, vol. 40, no. 9, pp. 2281-2289, 1992.
    • (1992) IEEE Transactions on Signal Processing , vol.40 , Issue.9 , pp. 2281-2289
    • Yegnanarayana, B.1    Murthy, H.A.2
  • 42
    • 0004319975
    • Acoustical and environmental robustness in automatic speech recognition
    • Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, Pa, USA
    • A. Acero, "Acoustical and environmental robustness in automatic speech recognition," Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, Pa, USA, 1990.
    • (1990)
    • Acero, A.1
  • 44
    • 0027622158
    • Root cepstral analysis: A unified view. Application to speech processing in car noise environments
    • P. Alexandre and P. Lockwood, "Root cepstral analysis: a unified view. Application to speech processing in car noise environments," Speech Communication, vol. 12, no. 3, pp. 277-288, 1993.
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 277-288
    • Alexandre, P.1    Lockwood, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.