메뉴 건너뛰기




Volumn 15, Issue 1, 2007, Pages 235-245

Training wideband acoustic models using mixed-bandwidth training data for speech recognition

Author keywords

Acoustic modeling; Bandwidth extension; Hidden Markov models (HMMs); Speech recognition; Telephone speech

Indexed keywords

ACOUSTIC MODELING; ACOUSTIC MODELS; BANDWIDTH EXTENSION; EXPECTATION-MAXIMIZATION ALGORITHMS; HIDDEN MARKOV MODELS (HMMS); NARROW BANDS; RECOGNITION ACCURACIES; RECOGNITION SYSTEMS; SPEECH RECOGNIZERS; SUB-OPTIMAL PERFORMANCE; TELEPHONE BANDWIDTHS; TELEPHONE SPEECH; TRAINING ALGORITHMS; TRAINING DATUM; TRAINING SCHEMES; TRAINING STRATEGIES; WIDE BANDS; WIDEBAND SPEECH;

EID: 64149084747     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876774     Document Type: Article
Times cited : (18)

References (21)
  • 3
    • 85079086476 scopus 로고
    • Sources of degradation of speech recognition in the telephone network
    • Adelaide, Australia, Apr
    • P. Moreno and R. M. Stern, "Sources of degradation of speech recognition in the telephone network," in Proc. ICASSP, Adelaide, Australia, Apr. 1994, vol. I, pp. 109-112.
    • (1994) Proc. ICASSP , vol.1 , pp. 109-112
    • Moreno, P.1    Stern, R.M.2
  • 4
    • 0001551844 scopus 로고
    • Supervised learning from incomplete data via an EM approach
    • Z. Ghahramani and M. I. Jordan, "Supervised learning from incomplete data via an EM approach," Adv. Neural Inf. Proc. Sys., pp. 120-127, 1994.
    • (1994) Adv. Neural Inf. Proc. Sys , pp. 120-127
    • Ghahramani, Z.1    Jordan, M.I.2
  • 5
    • 4644336054 scopus 로고    scopus 로고
    • Reconstruction of damaged spectrographic features for robust speech recognition
    • Sep
    • B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of damaged spectrographic features for robust speech recognition," Speech Commun., vol. 43, no. 4, pp. 275-296, Sep. 2004.
    • (2004) Speech Commun , vol.43 , Issue.4 , pp. 275-296
    • Raj, B.1    Seltzer, M.L.2    Stern, R.M.3
  • 6
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Jun
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, Jun. 2001.
    • (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 7
    • 4644317224 scopus 로고    scopus 로고
    • Classifier-based mask estimation for missing feature methods of robust speech recognition
    • Sep
    • M. L. Seltzer, B. Raj, and R. M. Stern, "Classifier-based mask estimation for missing feature methods of robust speech recognition," Speech Commun., vol. 43, no. 4, pp. 379-393, Sep. 2004.
    • (2004) Speech Commun , vol.43 , Issue.4 , pp. 379-393
    • Seltzer, M.L.1    Raj, B.2    Stern, R.M.3
  • 8
    • 0028516117 scopus 로고
    • Training issues and channel equalization techniques for the construction of telephone acoustic models using a high-quality speech corpus
    • Oct
    • L. G. Neumeyer, V. V. Digalakis, and M. Weintraub, "Training issues and channel equalization techniques for the construction of telephone acoustic models using a high-quality speech corpus," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 590-597, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 590-597
    • Neumeyer, L.G.1    Digalakis, V.V.2    Weintraub, M.3
  • 9
    • 0028517647 scopus 로고
    • Statistical recovery of wideband speech from narrowband speech
    • Oct
    • Y. M. Cheng, D. O'Shaughnessy, and P. Mermelstein, "Statistical recovery of wideband speech from narrowband speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 544-548, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 544-548
    • Cheng, Y.M.1    O'Shaughnessy, D.2    Mermelstein, P.3
  • 10
    • 0033692729 scopus 로고    scopus 로고
    • Narrowband to wideband conversion of speech using GMM based transformation
    • Istanbul, Turkey, Jun
    • K.-Y. Park and H. S. Kim, "Narrowband to wideband conversion of speech using GMM based transformation," in Proc. ICASSP, Istanbul, Turkey, Jun. 2000, vol. 3, pp. 1843-1846.
    • (2000) Proc. ICASSP , vol.3 , pp. 1843-1846
    • Park, K.-Y.1    Kim, H.S.2
  • 11
    • 84951992170 scopus 로고    scopus 로고
    • Wideband extension of telephone speech using a hidden Markov model
    • Delavan, WI, Sep
    • P. Jax and P. Vary, "Wideband extension of telephone speech using a hidden Markov model," in IEEEWorkshop on Speech Coding, Delavan, WI, Sep. 2000, pp. 133-135.
    • (2000) IEEEWorkshop on Speech Coding , pp. 133-135
    • Jax, P.1    Vary, P.2
  • 12
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statistical Soc., vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statistical Soc , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 14
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1990.
    • (1990) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 16
    • 0012330750 scopus 로고
    • The design of the Wall Street Journalbased CSR corpus
    • Harriman, NY, Feb
    • D. B. Paul and J. M. Baker, "The design of the Wall Street Journalbased CSR corpus," in Proc. ARPA Speech Nat. Lang. Workshop, Harriman, NY, Feb. 1992, pp. 357-362.
    • (1992) Proc. ARPA Speech Nat. Lang. Workshop , pp. 357-362
    • Paul, D.B.1    Baker, J.M.2
  • 17
    • 64149129100 scopus 로고    scopus 로고
    • S. Young, The HTK Hidden Markov Model Toolkit: Design and Philosophy, Cambridge Univ. Tech. Rep., Cambridge, U.K., 1994.
    • S. Young, "The HTK Hidden Markov Model Toolkit: Design and Philosophy," Cambridge Univ. Tech. Rep., Cambridge, U.K., 1994.
  • 18
    • 33646785081 scopus 로고    scopus 로고
    • Training wideband acoustic models using mixed-bandwidth training data via feature bandwidth extension
    • Philadelphia, PA, Mar
    • M. L. Seltzer and A. Acero, "Training wideband acoustic models using mixed-bandwidth training data via feature bandwidth extension," in Proc. ICASSP, Philadelphia, PA, Mar. 2005, vol. 1, pp. 921-924.
    • (2005) Proc. ICASSP , vol.1 , pp. 921-924
    • Seltzer, M.L.1    Acero, A.2
  • 19
    • 64149086333 scopus 로고    scopus 로고
    • C. J. Leggetter and P. C. Woodland, Speaker Adaptation of HMMs Using Linear Regression, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR. 181, Jun. 1994.
    • C. J. Leggetter and P. C. Woodland, "Speaker Adaptation of HMMs Using Linear Regression," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR. 181, Jun. 1994.
  • 20
    • 85016587886 scopus 로고
    • SWITCHBOARD: Telephone speech corpus for research and development
    • San Francisco, CA, Mar
    • J. Godfrey, E. C. Holliman, and J. McDaniel, "SWITCHBOARD: telephone speech corpus for research and development," in Proc. ICASSP, San Francisco, CA, Mar. 1992, vol. 1, pp. 517-520.
    • (1992) Proc. ICASSP , vol.1 , pp. 517-520
    • Godfrey, J.1    Holliman, E.C.2    McDaniel, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.