메뉴 건너뛰기




Volumn 26, Issue 2, 2012, Pages 67-89

Integrating imperfect transcripts into speech recognition systems for building high-quality corpora

Author keywords

Acoustic model training; Speech processing; Text to speech alignment

Indexed keywords

ACOUSTIC MODEL; AUTOMATIC SPEECH RECOGNITION SYSTEM; CORRECT ERROR; DECODING STRATEGY; HIGH QUALITY; LOW-COST SOLUTION; SEARCH ALGORITHMS; SPEECH CORPORA; SPEECH RECOGNITION SYSTEMS; SPEECH SIGNALS; TEMPORAL INFORMATION; TEXT TO SPEECH; TRAINING CORPUS;

EID: 80055054639     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2011.06.001     Document Type: Article
Times cited : (13)

References (52)
  • 4
    • 33745188444 scopus 로고    scopus 로고
    • Segmentation of recordings based on partial transcriptions
    • P. Cardinal, G. Boulianne, and M. Comeau Segmentation of recordings based on partial transcriptions Proc. Interspeech'05 2005 3345 3348
    • (2005) Proc. Interspeech'05 , pp. 3345-3348
    • Cardinal, P.1    Boulianne, G.2    Comeau, M.3
  • 15
    • 0002910412 scopus 로고    scopus 로고
    • Stemming algorithms: A case study for detailed evaluation
    • D.A. Hull Stemming algorithms: a case study for detailed evaluation Journal of the American Society of Information Science 47 1996 70 84 (Pubitemid 126582657)
    • (1996) Journal of the American Society for Information Science , vol.47 , Issue.1 , pp. 70-84
    • Hull, D.A.1
  • 17
    • 0032785782 scopus 로고    scopus 로고
    • Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
    • R. Iyer, and M. Ostendorf Modeling long distance dependence in language: topic mixtures versus dynamic cache models IEEE Transactions on Speech and Audio Processing 7 Jan 1999 30 39
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 30-39
    • Iyer, R.1    Ostendorf, M.2
  • 19
    • 85135261720 scopus 로고    scopus 로고
    • Unsupervised training of a speech recognizer: Recent experiments
    • T. Kemp, and A. Waibel Unsupervised training of a speech recognizer: recent experiments Eurospeech'99 1999 2725 2728
    • (1999) Eurospeech'99 , pp. 2725-2728
    • Kemp, T.1    Waibel, A.2
  • 22
    • 0036460908 scopus 로고    scopus 로고
    • Lightly supervised and unsupervised acoustic models training
    • L. Lamel, J.-L. Gauvain, and G. Adda Lightly supervised and unsupervised acoustic models training Computer Speech and Language 16 2002 115 229
    • (2002) Computer Speech and Language , vol.16 , pp. 115-229
    • Lamel, L.1    Gauvain, J.-L.2    Adda, G.3
  • 32
    • 0027929445 scopus 로고
    • On structuring probabilistic dependencies in stochastic language modeling
    • H. Ney, U. Essen, and R. Kneser On structuring probabilistic dependencies in stochastic language modeling Computer Speech and Language 8 1994 1 38
    • (1994) Computer Speech and Language , vol.8 , pp. 1-38
    • Ney, H.1    Essen, U.2    Kneser, R.3
  • 35
    • 84867216798 scopus 로고    scopus 로고
    • Lightly supervised acoustic model training on epps recordings
    • M. Paulik, and A. Waibel Lightly supervised acoustic model training on epps recordings Proc. Interspeech'08 2008 224 227
    • (2008) Proc. Interspeech'08 , pp. 224-227
    • Paulik, M.1    Waibel, A.2
  • 41
  • 42
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • G. Salton, and C. Buckley Term-weighting approaches in automatic text retrieval Information Processing & Management 24 1988 513 523
    • (1988) Information Processing & Management , vol.24 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 43
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • T.F. Smith, and M.S. Waterman Identification of common molecular subsequences Molecular Biology 147 1981 195 197
    • (1981) Molecular Biology , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 47
    • 0015960104 scopus 로고
    • The string-to-string correction problem
    • R. Wagner, and M. Fisher The string-to-string correction problem The Journal of the ACM 1 1974 168 173
    • (1974) The Journal of the ACM , vol.1 , pp. 168-173
    • Wagner, R.1    Fisher, M.2
  • 48
    • 11144239919 scopus 로고    scopus 로고
    • Unsupervised training of acoustic models for large vocabulary continuous speech recognition
    • DOI 10.1109/TSA.2004.838537
    • F. Wessel, and H. Ney Unsupervised training of acoustic models for large vocabulary continuous speech recognition IEEE Transactions on Speech and Audio Processing 13 2005 23 31 (Pubitemid 40049937)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.1 , pp. 23-31
    • Wessel, F.1    Ney, H.2
  • 50
    • 0343950213 scopus 로고    scopus 로고
    • Improving acoustic models by watching television
    • Carnegie Mellon University
    • Witbrock, M.J., Hauptmann, A.G., 1998. Improving acoustic models by watching television. Tech. Rep., CMU-CS-98-110, Carnegie Mellon University.
    • (1998) Tech. Rep., CMU-CS-98-110
    • Witbrock, M.J.1    Hauptmann, A.G.2
  • 51
    • 0036461035 scopus 로고    scopus 로고
    • Large scale discriminative training of HMM for speech recognition
    • P. Woodland, and D. Povey Large scale discriminative training of HMM for speech recognition Computer Speech and Language 16 2002 25 47
    • (2002) Computer Speech and Language , vol.16 , pp. 25-47
    • Woodland, P.1    Povey, D.2
  • 52
    • 79951779719 scopus 로고    scopus 로고
    • Unsupervised training and directed manual transcription for lvcsr
    • K. Yu, M. Gales, L. Wang, and P.C. Woodland Unsupervised training and directed manual transcription for lvcsr Speech Communication 52 2010 652 663
    • (2010) Speech Communication , vol.52 , pp. 652-663
    • Yu, K.1    Gales, M.2    Wang, L.3    Woodland, P.C.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.