메뉴 건너뛰기




Volumn 13, Issue 6, 2005, Pages 1173-1185

Automatic transcription of conversational telephone speech

Author keywords

Large vocabulary conversational speech recognition; Telephone speech recognition

Indexed keywords

ACOUSTIC NOISE; AUTOMATION; COMPUTER SIMULATION; DATA ACQUISITION; INTERPOLATION; SPEECH ANALYSIS; TELEPHONE SYSTEMS;

EID: 27744599401     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.852999     Document Type: Article
Times cited : (17)

References (37)
  • 2
    • 0003396042 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • Computer Science Group, Harvard Univ., Cambridge, MA
    • S. F. Chen and J. Goodman, "An Empirical Study of Smoothing Techniques for Language Modeling," Computer Science Group, Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
    • (1998) Tech. Rep. , vol.TR-10-98
    • Chen, S.F.1    Goodman, J.2
  • 3
    • 4544253834 scopus 로고    scopus 로고
    • Posterior probability decoding, confidence estimation and system combination
    • College Park, MD
    • G. Evermann and P. C. Woodland, "Posterior probability decoding, confidence estimation and system combination," in Proc. Speech Transcription Workshop, College Park, MD, 2000.
    • (2000) Proc. Speech Transcription Workshop
    • Evermann, G.1    Woodland, P.C.2
  • 4
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)
    • Santa Barbara, CA
    • J. G. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)," in Proc. IEEE ASRU Workshop, Santa Barbara, CA, 1997, pp. 347-354.
    • (1997) Proc. IEEE ASRU Workshop , pp. 347-354
    • Fiscus, J.G.1
  • 5
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech Lang., vol. 10, pp. 249-264, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 6
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 7
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • _, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Processing, vol. 7, pp. 272-281, 1999.
    • (1999) IEEE Trans. Speech Audio Processing , vol.7 , pp. 272-281
  • 10
    • 85016587886 scopus 로고
    • SWITCHBOARD: Telephone speech corpus for research and development
    • J. J. Godfrey, E. C. Holliman, and J. McDaniel, "SWITCHBOARD: Telephone speech corpus for research and development," in Proc. ICASSP'92, 1992, pp. 517-520.
    • (1992) Proc. ICASSP'92 , pp. 517-520
    • Godfrey, J.J.1    Holliman, E.C.2    McDaniel, J.3
  • 11
    • 0025952278 scopus 로고
    • An inequality for rational functions with applications to some statistical estimation problems
    • P. S. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inform. Theory, vol. 37, pp. 107-113, 1991.
    • (1991) IEEE Trans. Inform. Theory , vol.37 , pp. 107-113
    • Gopalakrishnan, P.S.1    Kanevsky, D.2    Nadas, A.3    Nahamoo, D.4
  • 12
    • 85153381142 scopus 로고    scopus 로고
    • CU-HTK acoustic modeling experiments
    • Linthicum Heights, MD
    • T. Hain and P. C. Woodland, "CU-HTK acoustic modeling experiments," in Proc. NIST Hub5 Workshop, Linthicum Heights, MD, 1998.
    • (1998) Proc. NIST Hub5 Workshop
    • Hain, T.1    Woodland, P.C.2
  • 13
    • 0034847002 scopus 로고    scopus 로고
    • The 1998 HTK system for transcription of conversational telephone speech
    • T. Hain, P. C. Woodland, T. R. Niesler, and E. W. D. Whittaker, "The 1998 HTK system for transcription of conversational telephone speech," in Proc. ICASSP'99, 1998, pp. 57-60.
    • (1998) Proc. ICASSP'99 , pp. 57-60
    • Hain, T.1    Woodland, P.C.2    Niesler, T.R.3    Whittaker, E.W.D.4
  • 15
    • 85153334377 scopus 로고    scopus 로고
    • New features in the CU-HTK system for transcription of conversational telephone speech
    • Salt Lake City, UT
    • _, "New features in the CU-HTK system for transcription of conversational telephone speech," in Proc. ICASSP'01, Salt Lake City, UT, 1999.
    • (1999) Proc. ICASSP'01
  • 16
    • 85153373801 scopus 로고    scopus 로고
    • Implicit modeling of pronunciation variation in automatic speech recognition
    • to be published
    • T. Hain, "Implicit modeling of pronunciation variation in automatic speech recognition," Speech Commun., 2003, to be published.
    • (2003) Speech Commun.
    • Hain, T.1
  • 17
    • 85123963268 scopus 로고
    • Improved clustering techniques for class-based statistical language modeling
    • Berlin, Germany
    • R. Kneser and H. Ney, "Improved clustering techniques for class-based statistical language modeling," in Proc. Eurospeech'93, Berlin, Germany, 1993, pp. 973-976.
    • (1993) Proc. Eurospeech'93 , pp. 973-976
    • Kneser, R.1    Ney, H.2
  • 20
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • L. Lee and R. C. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. ICASSP'96, 1996, pp. 353-356.
    • (1996) Proc. ICASSP'96 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 21
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs," Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-186
    • Leggetter, C.J.1    Woodland, P.C.2
  • 22
    • 0001135471 scopus 로고
    • Flexible speaker adaptation using maximum likelihood linear regression
    • Madrid, Spain
    • _, "Flexible speaker adaptation using maximum likelihood linear regression," in Proc. Eurospeech'95, Madrid, Spain, 1995, pp. 1155-1158.
    • (1995) Proc. Eurospeech'95 , pp. 1155-1158
  • 23
    • 85135271674 scopus 로고    scopus 로고
    • Finding consensus among words: Lattice-based word error minimization
    • Budapest, Hungary
    • L. Mangu, E. Brill, and A. Stolcke, "Finding consensus among words: lattice-based word error minimization," in Proc. Eurospeech'99, Budapest, Hungary, 1999, pp. 495-498.
    • (1999) Proc. Eurospeech'99 , pp. 495-498
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 24
    • 85135152717 scopus 로고
    • Algorithms for bigram and trigram clustering
    • Madrid, Spain
    • S. Martin, J. Liermann, and H. Ney, "Algorithms for bigram and trigram clustering," in Proc. Eurospeech'95, Madrid, Spain, 1995, pp. 1253-1256.
    • (1995) Proc. Eurospeech'95 , pp. 1253-1256
    • Martin, S.1    Liermann, J.2    Ney, H.3
  • 25
    • 84902047630 scopus 로고    scopus 로고
    • Single-pass adapted training with all-pass transforms
    • Budapest, Hungary
    • J. McDonough and W. Byrne, "Single-pass adapted training with all-pass transforms," in Proc. Eurospeech'99, Budapest, Hungary, 1999, pp. 2737-2740.
    • (1999) Proc. Eurospeech'99 , pp. 2737-2740
    • McDonough, J.1    Byrne, W.2
  • 27
    • 0031628780 scopus 로고    scopus 로고
    • Comparison of part-of-speech and automatically derived category-based language models for speech recognition
    • Seattle, WA
    • T. R. Niesler, E. W. D. Whittaker, and P. C. Woodland, "Comparison of part-of-speech and automatically derived category-based language models for speech recognition," in Proc. ICASSP'98, Seattle, WA, 1998, pp. 177-180.
    • (1998) Proc. ICASSP'98 , pp. 177-180
    • Niesler, T.R.1    Whittaker, E.W.D.2    Woodland, P.C.3
  • 29
    • 0026372945 scopus 로고
    • An Improved MMIE training algorithm for speaker independent, small vocabulary, continuous speech recognition
    • Toronto, ON, Canada
    • Y. Normandin, "An Improved MMIE training algorithm for speaker independent, small vocabulary, continuous speech recognition," in Proc. ICASSP'91, Toronto, ON, Canada, 1991, pp. 537-540.
    • (1991) Proc. ICASSP'91 , pp. 537-540
    • Normandin, Y.1
  • 31
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • Orlando, FL
    • D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP'02, Orlando, FL, 2002.
    • (2002) Proc. ICASSP'02
    • Povey, D.1    Woodland, P.C.2
  • 33
    • 0030643667 scopus 로고    scopus 로고
    • Broadcast news transcription using HTK
    • Munich, Germany
    • P. C. Woodland, M. J. F. Gales, D. Pye, and S. J. Young, "Broadcast news transcription using HTK," in Proc. ICASSP'97, Munich, Germany, 1997, pp. 719-722.
    • (1997) Proc. ICASSP'97 , pp. 719-722
    • Woodland, P.C.1    Gales, M.J.F.2    Pye, D.3    Young, S.J.4
  • 34
    • 0002867698 scopus 로고    scopus 로고
    • Large scale discriminative training for speech recognition
    • Paris, France
    • P. C. Woodland and D. Povey, "Large scale discriminative training for speech recognition," in Proc. ISCA ITRW ASR2000, Paris, France, 2000, pp. 7-16.
    • (2000) Proc. ISCA ITRW ASR2000 , pp. 7-16
    • Woodland, P.C.1    Povey, D.2
  • 35
    • 0036567794 scopus 로고    scopus 로고
    • The development of the HTK broadcast news transcription system: An overview
    • P. C. Woodland, "The development of the HTK broadcast news transcription system: an overview," Speech Commun., vol. 37, pp. 47-67, 2002.
    • (2002) Speech Commun. , vol.37 , pp. 47-67
    • Woodland, P.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.