메뉴 건너뛰기




Volumn 17, Issue 1, 2009, Pages 138-149

Unsupervised adaptation of categorical prosody models for prosody labeling and speech recognition

Author keywords

Categorical prosody models; Lattice enrichment speech recognition; Unsupervised adaptation

Indexed keywords

ACOUSTIC COMPONENTS; ACOUSTIC MODEL; AUTOMATIC SPEECH RECOGNITION SYSTEM; BASELINE SYSTEMS; BOSTON UNIVERSITY; BREAK INDICES; CATEGORICAL PROSODY MODELS; CLASSIFICATION ERROR RATE; HUMAN SPEECH; LATTICE ENRICHMENT SPEECH RECOGNITION; PITCH ACCENTS; PROSODY LABELING; PROSODY MODEL; RELATIVE REDUCTION; SEED MODEL; SPEECH RECOGNIZER; UNSUPERVISED ADAPTATION; WORD ERROR RATE;

EID: 70350458869     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2005347     Document Type: Article
Times cited : (20)

References (25)
  • 2
    • 0003665661 scopus 로고    scopus 로고
    • D. Hirst and A. D. Cristo, , D. Hirst and A. D. Cristo, Eds., Cambridge, U.K.: Cambridge Univ. Press
    • D. Hirst and A. D. Cristo, , D. Hirst and A. D. Cristo, Eds., Intonation Systems: A Survey of Twenty Languages. Cambridge, U.K.: Cambridge Univ. Press, 1998.
    • (1998) Intonation Systems: A Survey of Twenty Languages
  • 3
    • 33646805961 scopus 로고    scopus 로고
    • IViE-A comparative transcription system for intonational variation in English
    • E. Grabe, F. Nolan, and K. Farrar, "IViE-A comparative transcription system for intonational variation in English," in Proc. Int. Conf. Spoken Lang. Process., 1998, pp. 1259-1262.
    • (1998) Proc. Int. Conf. Spoken Lang. Process , pp. 1259-1262
    • Grabe, E.1    Nolan, F.2    Farrar, K.3
  • 4
    • 60849083145 scopus 로고    scopus 로고
    • Automatic prosodic event detection using acoustic, lexical, and syntactic evidence
    • Jan.
    • S. Ananthakrishnan and S. Narayanan, "Automatic prosodic event detection using acoustic, lexical, and syntactic evidence," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.1, pp. 216-228, Jan. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process , vol.16 , Issue.1 , pp. 216-228
    • Ananthakrishnan, S.1    Narayanan, S.2
  • 6
    • 85009102907 scopus 로고    scopus 로고
    • Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the JUPITER domain
    • C. Wang and S. Seneff, "Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the JUPITER domain," in Proc. 7th Eur. Conf. Speech Commun. Technol., 2001, pp. 2761-2764.
    • (2001) Proc. 7th Eur. Conf. Speech Commun. Technol. , pp. 2761-2764
    • Wang, C.1    Seneff, S.2
  • 10
    • 34547525606 scopus 로고    scopus 로고
    • Improved speech recognition using acoustic and lexical correlates of pitch accent in a N-best rescoring framework
    • S. Ananthakrishnan and S. Narayanan, "Improved speech recognition using acoustic and lexical correlates of pitch accent in a N-best rescoring framework," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2007, pp. 873-876.
    • (2007) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 873-876
    • Ananthakrishnan, S.1    Narayanan, S.2
  • 12
    • 0035156005 scopus 로고    scopus 로고
    • Automatic ToBI prediction and alignment to speed manual labeling of prosody
    • A. Syrdal, J. Hirschberg, J. McGory, and M. Beckman, "Automatic ToBI prediction and alignment to speed manual labeling of prosody," Speech Commun., vol.33, pp. 135-151, 2001.
    • (2001) Speech Commun. , vol.33 , pp. 135-151
    • Syrdal, A.1    Hirschberg, J.2    McGory, J.3    Beckman, M.4
  • 17
    • 70350481607 scopus 로고    scopus 로고
    • CSR-II (WSJ1) complete
    • Philadelphia, PA
    • "CSR-II (WSJ1) Complete," Linguistic Data Consortium, Philadelphia, PA, 1994.
    • Linguistic Data Consortium , pp. 1994
  • 19
    • 84891308106 scopus 로고    scopus 로고
    • SRILM-An extensible language modeling toolkit
    • Denver, CO
    • A. Stolcke, "SRILM-An extensible language modeling toolkit," in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, 2002, vol.2, pp. 901-904.
    • (2002) Proc. Int. Conf. Spoken Lang. Process , vol.2 , pp. 901-904
    • Stolcke, A.1
  • 20
    • 0034296009 scopus 로고    scopus 로고
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: Word error minimization and other applications of confusion networks," Computer, Speech, Lang., vol.14, no.4, pp. 373-400, 2000.
    • (2000) Computer, Speech, Lang. , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 22
    • 0003857778 scopus 로고    scopus 로고
    • A Gentle tutorial on the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
    • J. Bilmes, "A Gentle tutorial on the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models," Univ. of Berkeley, Berkeley, CA, Tech. Rep. ICSI-TR-97- 021, 1997.
    • (1997) Univ. of Berkeley, Berkeley, CA, Tech. Rep. ICSI-TR-97-021
    • Bilmes, J.1
  • 23
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 24
    • 0040262052 scopus 로고
    • Bayesian learning of Gaussian mixture densities for hidden Markov models
    • Pacific Grove, CA, Morgan-Kaufmann
    • J.-L. Gauvain and C.-H. Lee, "Bayesian learning of Gaussian mixture densities for hidden Markov models," in Proc. DARPA Speech and Natural Language Workshop, Pacific Grove, CA, 1991, pp. 272-277, Morgan-Kaufmann.
    • (1991) Proc. DARPA Speech and Natural Language Workshop , pp. 272-277
    • Gauvain, J.-L.1    Lee, C.-H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.