메뉴 건너뛰기




Volumn 12, Issue 4, 2004, Pages 420-435

Automatic recognition of spontaneous speech for access to multilingual oral history archives

Author keywords

Automatic speech recognition (ASR); Information retrieval; Multilingual ASR; Oral history; Spoken document retrieval; Spontaneous speech

Indexed keywords

BROADCASTING; INFORMATION RETRIEVAL; INFORMATION RETRIEVAL SYSTEMS; INFORMATION TECHNOLOGY; SPEECH SYNTHESIS; TELEPHONE;

EID: 3042820894     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2004.828702     Document Type: Conference Paper
Times cited : (115)

References (43)
  • 11
    • 0141480043 scopus 로고    scopus 로고
    • Toward automatic transcription of large spoken archives - English ASR for the MALACH project
    • Hong Kong
    • B. Ramabhadran, J. Huang, and M. Picheny, "Toward automatic transcription of large spoken archives - English ASR for the MALACH project," in Proc. ICASSP, Hong Kong, 2003.
    • (2003) Proc. ICASSP
    • Ramabhadran, B.1    Huang, J.2    Picheny, M.3
  • 12
    • 85009288286 scopus 로고    scopus 로고
    • Large vocabulary conversational speech recognition with the Extended Maximum Likelihood Linear Transformation (EMLLT) model
    • Denver, CO
    • J. Huang, V. Goel, R. Gopinath, B. Kingsbury, P. Olsen, and K. Visweswariah, "Large vocabulary conversational speech recognition with the Extended Maximum Likelihood Linear Transformation (EMLLT) model," in Proc. ICSLP, Denver, CO, 2002, pp. 2597-2600.
    • (2002) Proc. ICSLP , pp. 2597-2600
    • Huang, J.1    Goel, V.2    Gopinath, R.3    Kingsbury, B.4    Olsen, P.5    Visweswariah, K.6
  • 13
    • 85079084846 scopus 로고
    • Robust methods for using context dependent features and models in a continuous speech recognizer
    • Geneva, Switzerland
    • L. R. Bahl, P. de Souza, P. S. Gopalakrishnan, D. Nahamoo, and M. Picheny, "Robust methods for using context dependent features and models in a continuous speech recognizer," in Proc. ICASSP, Geneva, Switzerland, 1994.
    • (1994) Proc. ICASSP
    • Bahl, L.R.1    De Souza, P.2    Gopalakrishnan, P.S.3    Nahamoo, D.4    Picheny, M.5
  • 14
    • 0030362995 scopus 로고    scopus 로고
    • A compact model for speaker-adaptive training
    • Philadelphia, PA
    • T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. ICSLP, Philadelphia, PA, 1996, pp. 1137-1140.
    • (1996) Proc. ICSLP , pp. 1137-1140
    • Anastasakos, T.1    McDonough, J.2    Schwartz, R.3    Makhoul, J.4
  • 15
    • 0003454539 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • CUED-F-INFENG-TR291
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Tech. Rep., CUED/F-INFENG/TR291, 1997.
    • (1997) Tech. Rep.
    • Gales, M.J.F.1
  • 18
    • 0033329799 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Comput. Speech Lang., vol. 13, no. 4, pp. 359-393, 1999.
    • (1999) Comput. Speech Lang. , vol.13 , Issue.4 , pp. 359-393
    • Chen, S.F.1    Goodman, J.2
  • 20
    • 0031209168 scopus 로고    scopus 로고
    • Using out-of-domain data to improve in-domain language models
    • Aug.
    • R. Iyer, M. Ostendorf, and H. Gish, "Using out-of-domain data to improve in-domain language models," IEEE Signal Processing Lett., vol. 4, pp. 221-223, Aug. 1997.
    • (1997) IEEE Signal Processing Lett. , vol.4 , pp. 221-223
    • Iyer, R.1    Ostendorf, M.2    Gish, H.3
  • 22
    • 0003843502 scopus 로고
    • Syllable-based generalizations in english phonology
    • Bloomington, IN
    • D. Kahn, "Syllable-based generalizations in english phonology," in Indiana Univ. Linguistics Club, Bloomington, IN, 1976.
    • (1976) Indiana Univ. Linguistics Club
    • Kahn, D.1
  • 25
    • 84891308106 scopus 로고    scopus 로고
    • SRILM - An extensible language modeling toolkit
    • Denver, CO
    • A. Stolcke, "SRILM - An extensible language modeling toolkit," in Proc. Int. Conf. Spoken Language Processing, Denver, CO, 2002, pp. 901-904.
    • (2002) Proc. Int. Conf. Spoken Language Processing , pp. 901-904
    • Stolcke, A.1
  • 26
    • 85009165976 scopus 로고    scopus 로고
    • Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives
    • Geneva, Switzerland
    • B. Ramabhadran, J. Huang, U. Chaudhari, G. Iyengar, and H. J. Nock, "Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
    • (2003) Proc. EUROSPEECH 2003
    • Ramabhadran, B.1    Huang, J.2    Chaudhari, U.3    Iyengar, G.4    Nock, H.J.5
  • 27
    • 3042855890 scopus 로고    scopus 로고
    • Arc minimization in finite state decoding graphs with cross-word acoustic context
    • Geneva, Switzerland
    • G. Zweig, G. Saon, and F. Yvon, "Arc minimization in finite state decoding graphs with cross-word acoustic context," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
    • (2003) Proc. EUROSPEECH 2003
    • Zweig, G.1    Saon, G.2    Yvon, F.3
  • 28
    • 85009192356 scopus 로고    scopus 로고
    • An architecture for rapid decoding of large vocabulary conversational speech
    • Geneva, Switzerland
    • G. Saon, G. Zweig, B. Kingsbury, L. Mangu, and U. Chaudhari, "An architecture for rapid decoding of large vocabulary conversational speech," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
    • (2003) Proc. EUROSPEECH 2003
    • Saon, G.1    Zweig, G.2    Kingsbury, B.3    Mangu, L.4    Chaudhari, U.5
  • 29
    • 44849116447 scopus 로고    scopus 로고
    • Unsupervised and supervised clustering for topic tracking
    • Gaithersburg, MD, [Online]
    • M. Franz, J. S. McCarley, T. Ward, and W.-J. Zhu, "Unsupervised and supervised clustering for topic tracking," in Topic Detection and Tracking 2000 Workshop, Gaithersburg, MD, [Online] Available: http://www.nist.gov/speech/ tests/tdt2000/papers.htm 2000.
    • (2000) Topic Detection and Tracking 2000 Workshop
    • Franz, M.1    McCarley, J.S.2    Ward, T.3    Zhu, W.-J.4
  • 31
  • 34
    • 0002652285 scopus 로고    scopus 로고
    • A maximum entropy approach to natural language processing
    • A. L. Berger, V. D. Pietra, and S. D. Pietra, "A maximum entropy approach to natural language processing," Computat. Ling., vol. 22, no. 1, pp. 39-71, 1996.
    • (1996) Computat. Ling. , vol.22 , Issue.1 , pp. 39-71
    • Berger, A.L.1    Pietra, V.D.2    Pietra, S.D.3
  • 40
    • 3042853618 scopus 로고    scopus 로고
    • Searching large collections of recorded speech: A preliminary study
    • Medford, NJ: Information Today, to be published
    • J. Kim, D. Oard, and D. Soergel, "Searching large collections of recorded speech: A preliminary study," in Proceedings of the ASIST Annual Meeting. Medford, NJ: Information Today, 2003, pp. 330-339, to be published.
    • (2003) Proceedings of the ASIST Annual Meeting , pp. 330-339
    • Kim, J.1    Oard, D.2    Soergel, D.3
  • 41
    • 0013233910 scopus 로고    scopus 로고
    • An empirical study of the optimal presentation of multimedia summaries of broadcast news
    • I. Mani and M. Maybury, Eds.
    • A. Merlino and M. Maybury, "An empirical study of the optimal presentation of multimedia summaries of broadcast news," in Automated Text Summarization, I. Mani and M. Maybury, Eds., 1999.
    • (1999) Automated Text Summarization
    • Merlino, A.1    Maybury, M.2
  • 43
    • 85009170963 scopus 로고    scopus 로고
    • Automated transcription and topic segmentation of large spoken archives
    • Geneva, Switzerland
    • M. Franz, B. Ramabhadran, T. Ward, and M. Picheny, "Automated transcription and topic segmentation of large spoken archives," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003, pp. 953-956.
    • (2003) Proc. EUROSPEECH 2003 , pp. 953-956
    • Franz, M.1    Ramabhadran, B.2    Ward, T.3    Picheny, M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.