SCOPUS Information Retrieval Platform

IEEE Transactions on Speech and Audio Processing

Volume 12, Issue 4, 2004, Pages 420-435

Automatic recognition of spontaneous speech for access to multilingual oral history archives

Authors (12): Byrne, William (a); Doermann, David (b); Franz, Martin (c); Gustman, Samuel (d); Hajič, Jan (e); Oard, Douglas (b); Picheny, Michael (c); Psutka, Josef (f); Ramabhadran, Bhuvana (c); Soergel, Dagobert (b); Ward, Todd (c); Zhu, Wei Jing (c)

a Johns Hopkins University (United States)

b UNIVERSITY OF MARYLAND (United States)

c IBM T J WATSON RESEARCH CENTER (United States)

d Survivors Shoah Vis Hist Found (United States)

e CHARLES UNIVERSITY (Czech Republic)

f UNIVERSITY OF WEST BOHEMIA (Czech Republic)

Author keywords

Automatic speech recognition (ASR); Information retrieval; Multilingual ASR; Oral history; Spoken document retrieval; Spontaneous speech

Indexed keywords

BROADCASTING; INFORMATION RETRIEVAL; INFORMATION RETRIEVAL SYSTEMS; INFORMATION TECHNOLOGY; SPEECH SYNTHESIS; TELEPHONE;

AUTOMATIC SPEECH RECOGNITION (ASR); MULTILINGUAL ASR; ORAL HISTORY; SPOKEN DOCUMENT RETRIEVAL; SPONTANEOUS SPEECH;

SPEECH RECOGNITION;

EID: 3042820894 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2004.828702 Document Type: Conference Paper

Times cited : (115)

References (43)

1
- 3042816531
- [On-line]
- DELOS/NSF. (2003) E.-U. W. on Spoken-Word Audio Collections. [On-line] Available: http://www.dcs.shef.ac.uk/spandh/projects/swag/
- (2003) E.-U. W. on Spoken-word Audio Collections

2
- 0013182765
- Boston, MA: Kluwer
- J. Allan, Ed., Topic Detection and Tracking: Event-Based Information Organization. Boston, MA: Kluwer, 2002.
- (2002) Topic Detection and Tracking: Event-based Information Organization
- Allan, J.¹

3
- 85016587886
- SWITCHBOARD: Telephone speech corpus for research and development
- J. Godfrey, E. Holliman, and J. McDaniel, "SWITCHBOARD: Telephone speech corpus for research and development," in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, 1992, pp. 517-520.
- (1992) Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing , pp. 517-520
- Godfrey, J.¹ Holliman, E.² McDaniel, J.³

4
- 0001450951
- The TREC spoken document retrieval track: A success story
- [Online], Nov.
- J. S. Garofolo, C. G. P. Auzanne, and E. M. Voorhees, "The TREC spoken document retrieval track: a success story," in Proc. 8th Text Retrieval Conf. (TREC-8), [Online] Available: http://trec.nist.gov, Nov. 1999.
- (1999) Proc. 8th Text Retrieval Conf. (TREC-8)
- Garofolo, J.S.¹ Auzanne, C.G.P.² Voorhees, E.M.³

5
- 0038452150
- [Online]
- Survivors of the Shoah Visual History Foundation. [Online] Available: http://www.vhf.org
- Survivors of the Shoah Visual History Foundation

6
- 85009102300
- Document expansion for speech retrieval
- Aug.
- A. Singhal and F. Pereira, "Document expansion for speech retrieval," in Proc. 22nd Int. Conf. Research and Development in Information Retrieval, Aug. 1999, pp. 34-41.
- (1999) Proc. 22nd Int. Conf. Research and Development in Information Retrieval , pp. 34-41
- Singhal, A.¹ Pereira, F.²

7
- 85009119463
- Statistical methods for topic segmentation
- Beijing, China
- S. Dharanipragada, M. Franz, J. S. McCarley, K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, "Statistical methods for topic segmentation," in Proc. 6th Int. Conf. Spoken Language Processing, Beijing, China, 2000, pp. 516-519.
- (2000) Proc. 6th Int. Conf. Spoken Language Processing , pp. 516-519
- Dharanipragada, S.¹ Franz, M.² McCarley, J.S.³ Papineni, K.⁴ Roukos, S.⁵ Ward, T.⁶ Zhu, W.-J.⁷

8
- 0033650326
- Influence of speech recognition errors on topic detection
- Athens, Greece
- J. S. McCarley and M. Franz, "Influence of speech recognition errors on topic detection," in Proc. 23rd ACM SIGIR Conf. Information Retrieval, Athens, Greece, 2000, pp. 342-344.
- (2000) Proc. 23rd ACM SIGIR Conf. Information Retrieval , pp. 342-344
- McCarley, J.S.¹ Franz, M.²

9
- 0035148809
- Transcriber: Development and use of a tool for assisting speech corpora production
- Jan.
- C. Barras, E. Geoffrois, Z. Wu, and M. Liberman, "Transcriber: development and use of a tool for assisting speech corpora production," Speech Communication - Special Issue on Speech Annotation and Corpus Tools, vol. 33, no. 1-2, pp. 5-22, Jan. 2000.
- (2000) Speech Communication - Special Issue on Speech Annotation and Corpus Tools , vol.33 , Issue.1-2 , pp. 5-22
- Barras, C.¹ Geoffrois, E.² Wu, Z.³ Liberman, M.⁴

10
- 9444227276
- Automatic transcription of Czech language oral history in the MALACH project: Resources and initial experiments
- Berlin/Heidelberg, Germany
- J. Psutka, P. Ircing, J. Psutka, V. Radova, W. Byrne, J. Hajič, S. Gustman, and B. Ramabhadran, "Automatic transcription of Czech language oral history in the MALACH project: Resources and initial experiments," in Proc. Text, Speech, and Dialog Workshop, Berlin/Heidelberg, Germany, 2002.
- (2002) Proc. Text, Speech, and Dialog Workshop
- Psutka, J.¹ Ircing, P.² Psutka, J.³ Radova, V.⁴ Byrne, W.⁵ Hajič, J.⁶ Gustman, S.⁷ Ramabhadran, B.⁸

11
- 0141480043
- Toward automatic transcription of large spoken archives - English ASR for the MALACH project
- Hong Kong
- B. Ramabhadran, J. Huang, and M. Picheny, "Toward automatic transcription of large spoken archives - English ASR for the MALACH project," in Proc. ICASSP, Hong Kong, 2003.
- (2003) Proc. ICASSP
- Ramabhadran, B.¹ Huang, J.² Picheny, M.³

12
- 85009288286
- Large vocabulary conversational speech recognition with the Extended Maximum Likelihood Linear Transformation (EMLLT) model
- Denver, CO
- J. Huang, V. Goel, R. Gopinath, B. Kingsbury, P. Olsen, and K. Visweswariah, "Large vocabulary conversational speech recognition with the Extended Maximum Likelihood Linear Transformation (EMLLT) model," in Proc. ICSLP, Denver, CO, 2002, pp. 2597-2600.
- (2002) Proc. ICSLP , pp. 2597-2600
- Huang, J.¹ Goel, V.² Gopinath, R.³ Kingsbury, B.⁴ Olsen, P.⁵ Visweswariah, K.⁶

13
- 85079084846
- Robust methods for using context dependent features and models in a continuous speech recognizer
- Geneva, Switzerland
- L. R. Bahl, P. de Souza, P. S. Gopalakrishnan, D. Nahamoo, and M. Picheny, "Robust methods for using context dependent features and models in a continuous speech recognizer," in Proc. ICASSP, Geneva, Switzerland, 1994.
- (1994) Proc. ICASSP
- Bahl, L.R.¹ De Souza, P.² Gopalakrishnan, P.S.³ Nahamoo, D.⁴ Picheny, M.⁵

14
- 0030362995
- A compact model for speaker-adaptive training
- Philadelphia, PA
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. ICSLP, Philadelphia, PA, 1996, pp. 1137-1140.
- (1996) Proc. ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

15
- 0003454539
- Maximum likelihood linear transformations for HMM-based speech recognition
- CUED-F-INFENG-TR291
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Tech. Rep., CUED/F-INFENG/TR291, 1997.
- (1997) Tech. Rep.
- Gales, M.J.F.¹

16
- 0003822743
- [Online]
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book: [Online] Available: http://htk.eng.cam.ac.uk/, 1997.
- (1997) The HTK Book
- Young, S.¹ Evermann, G.² Kershaw, D.³ Moore, G.⁴ Odell, J.⁵ Ollason, D.⁶ Valtchev, V.⁷ Woodland, P.⁸

17
- 11844267507
- On large vocabulary continuous speech recognition of highly inflectional language - Czech
- Aalborg, Denmark
- P. Ircing, P. Krbec, J. Hajič, S. Khudanpur, F. Jelinek, J. Psutka, and W. Byrne, "On large vocabulary continuous speech recognition of highly inflectional language - Czech," in Proc. 7th European Conf. Speech Communication and Technology (EUROSPEECH), Aalborg, Denmark, 2001.
- (2001) Proc. 7th European Conf. Speech Communication and Technology (EUROSPEECH)
- Ircing, P.¹ Krbec, P.² Hajič, J.³ Khudanpur, S.⁴ Jelinek, F.⁵ Psutka, J.⁶ Byrne, W.⁷

18
- 0033329799
- An empirical study of smoothing techniques for language modeling
- S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Comput. Speech Lang., vol. 13, no. 4, pp. 359-393, 1999.
- (1999) Comput. Speech Lang. , vol.13 , Issue.4 , pp. 359-393
- Chen, S.F.¹ Goodman, J.²

19
- 85009168011
- Large vocabulary ASR for spontaneous Czech in the MALACH project
- Geneva, Switzerland
- J. Psutka, P. Ircing, J. V. Psutka, V. Radová, W. J. Byrne, J. Hajič, J. Mírovský, and S. Gustman, "Large vocabulary ASR for spontaneous Czech in the MALACH project," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
- (2003) Proc. EUROSPEECH 2003
- Psutka, J.¹ Ircing, P.² Psutka, J.V.³ Radová, V.⁴ Byrne, W.J.⁵ Hajič, J.⁶ Mírovský, J.⁷ Gustman, S.⁸

20
- 0031209168
- Using out-of-domain data to improve in-domain language models
- Aug.
- R. Iyer, M. Ostendorf, and H. Gish, "Using out-of-domain data to improve in-domain language models," IEEE Signal Processing Lett., vol. 4, pp. 221-223, Aug. 1997.
- (1997) IEEE Signal Processing Lett. , vol.4 , pp. 221-223
- Iyer, R.¹ Ostendorf, M.² Gish, H.³

21
- 3042814146
- Syllabification software
- Gaithersburg, MD, [Online]
- W. M. Fisher, "Syllabification software," in The Spoken Natural Language Processing Group, National Institute of Standards and Technology, Gaithersburg, MD, 1976, [Online] Available: http://www.itl.nist.gov/div894/894.01/slp.htm.
- (1976) The Spoken Natural Language Processing Group, National Institute of Standards and Technology
- Fisher, W.M.¹

22
- 0003843502
- Syllable-based generalizations in English phonology
- Bloomington, IN
- D. Kahn, "Syllable-based generalizations in English phonology," in Indiana Univ. Linguistics Club, Bloomington, IN, 1976.
- (1976) Indiana Univ. Linguistics Club
- Kahn, D.¹

23
- 34547171560
- Improvements in English ASR for the MALACH project using syllable-centric models
- Virgin Islands
- A. Sethy, B. Ramabhadran, and S. Narayanan, "Improvements in English ASR for the MALACH project using syllable-centric models," in Proc. Automatic Speech Recognition and Understanding Workshop (ASRU 03), Virgin Islands, 2003.
- (2003) Proc. Automatic Speech Recognition and Understanding Workshop (ASRU 03)
- Sethy, A.¹ Ramabhadran, B.² Narayanan, S.³

24
- 0002824030
- Weighted finite-state transducers in speech recognition
- Paris, France, Sept.
- M. Mohri, M. Riley, and F. C. Pereira, "Weighted finite-state transducers in speech recognition," in Proc. Int. Workshop on Automatic Speech Recognition: Challenges for the Next Millenium, Paris, France, Sept. 2000, pp. 97-106.
- (2000) Proc. Int. Workshop on Automatic Speech Recognition: Challenges for the Next Millenium , pp. 97-106
- Mohri, M.¹ Riley, M.² Pereira, F.C.³

25
- 84891308106
- SRILM - An extensible language modeling toolkit
- Denver, CO
- A. Stolcke, "SRILM - An extensible language modeling toolkit," in Proc. Int. Conf. Spoken Language Processing, Denver, CO, 2002, pp. 901-904.
- (2002) Proc. Int. Conf. Spoken Language Processing , pp. 901-904
- Stolcke, A.¹

26
- 85009165976
- Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives
- Geneva, Switzerland
- B. Ramabhadran, J. Huang, U. Chaudhari, G. Iyengar, and H. J. Nock, "Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
- (2003) Proc. EUROSPEECH 2003
- Ramabhadran, B.¹ Huang, J.² Chaudhari, U.³ Iyengar, G.⁴ Nock, H.J.⁵

27
- 3042855890
- Arc minimization in finite state decoding graphs with cross-word acoustic context
- Geneva, Switzerland
- G. Zweig, G. Saon, and F. Yvon, "Arc minimization in finite state decoding graphs with cross-word acoustic context," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
- (2003) Proc. EUROSPEECH 2003
- Zweig, G.¹ Saon, G.² Yvon, F.³

28
- 85009192356
- An architecture for rapid decoding of large vocabulary conversational speech
- Geneva, Switzerland
- G. Saon, G. Zweig, B. Kingsbury, L. Mangu, and U. Chaudhari, "An architecture for rapid decoding of large vocabulary conversational speech," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003.
- (2003) Proc. EUROSPEECH 2003
- Saon, G.¹ Zweig, G.² Kingsbury, B.³ Mangu, L.⁴ Chaudhari, U.⁵

29
- 44849116447
- Unsupervised and supervised clustering for topic tracking
- Gaithersburg, MD, [Online]
- M. Franz, J. S. McCarley, T. Ward, and W.-J. Zhu, "Unsupervised and supervised clustering for topic tracking," in Topic Detection and Tracking 2000 Workshop, Gaithersburg, MD, [Online] Available: http://www.nist.gov/speech/tests/tdt2000/papers.htm, 2000.
- (2000) Topic Detection and Tracking 2000 Workshop
- Franz, M.¹ McCarley, J.S.² Ward, T.³ Zhu, W.-J.⁴

30
- 3042775942
- Segmentation and detection at IBM: Hybrid statistical models and two-tiered clustering
- Norwell, MA: Kluwer
- S. Dharanipragada, M. Franz, J. S. McCarley, T. Ward, and W.-J. Zhu, "Segmentation and detection at IBM: Hybrid statistical models and two-tiered clustering," in Topic Detection and Tracking: Event-Based Information Organization. Norwell, MA: Kluwer, 2002.
- (2002) Topic Detection and Tracking: Event-based Information Organization
- Dharanipragada, S.¹ Franz, M.² McCarley, J.S.³ Ward, T.⁴ Zhu, W.-J.⁵

31
- 84938213121
- Nymble: A high-performance learning name-finder
- San Francisco, CA
- S. M. D. Bikel and R. Schwartz, "Nymble: a high-performance learning name-finder," in Proc. Applied Natural Language Processing, San Francisco, CA, 1997, pp. 194-201.
- (1997) Proc. Applied Natural Language Processing , pp. 194-201
- Bikel, S.M.D.¹ Schwartz, R.²

32
- 1642280433
- IBM's statistical question answering system - TREC-10
- Gaithersburg, MD
- A. Ittycheriah, M. Franz, and S. Roukos, "IBM's statistical question answering system - TREC-10," in Proc. 10th Text Retrieval Conf. (TREC-10), Gaithersburg, MD, 2001, pp. 258-264.
- (2001) Proc. 10th Text Retrieval Conf. (TREC-10) , pp. 258-264
- Ittycheriah, A.¹ Franz, M.² Roukos, S.³

33
- 0012614532
- Performance measures for information extraction
- Herndon, VA
- J. Makhoul, F. Kubala, R. Schwartz, and R. Weischedel, "Performance measures for information extraction," in 1998 DARPA Broadcast News Workshop, Herndon, VA, 1998, pp. 249-252.
- (1998) 1998 DARPA Broadcast News Workshop , pp. 249-252
- Makhoul, J.¹ Kubala, F.² Schwartz, R.³ Weischedel, R.⁴

34
- 0002652285
- A maximum entropy approach to natural language processing
- A. L. Berger, V. D. Pietra, and S. D. Pietra, "A maximum entropy approach to natural language processing," Computat. Ling., vol. 22, no. 1, pp. 39-71, 1996.
- (1996) Computat. Ling. , vol.22 , Issue.1 , pp. 39-71
- Berger, A.L.¹ Pietra, V.D.² Pietra, S.D.³

35
- 0002779049
- Topic detection and tracking evaluation overview
- J. G. Fiscus and G. R. Doddington, "Topic detection and tracking evaluation overview," in Topic Detection and Tracking: Event-Based Information Organization, 2002.
- (2002) Topic Detection and Tracking: Event-based Information Organization
- Fiscus, J.G.¹ Doddington, G.R.²

36
- 85024373635
- A re-examination of text categorization methods
- Berkeley, CA
- Y. Yang and X. Liu, "A re-examination of text categorization methods," in Proc. 22nd ACM SIGIR Conf. Information Retrieval, Berkeley, CA, 1999, pp. 42-49.
- (1999) Proc. 22nd ACM SIGIR Conf. Information Retrieval , pp. 42-49
- Yang, Y.¹ Liu, X.²

37
- 0001319911
- Okapi at TREC-3
- Gaithersburg, MD
- S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford, "Okapi at TREC-3," in Proc. 3rd Text Retrieval Conf. (TREC-3), Gaithersburg, MD, 1995, pp. 109-126.
- (1995) Proc. 3rd Text Retrieval Conf. (TREC-3) , pp. 109-126
- Robertson, S.¹ Walker, S.² Jones, S.³ Hancock-Beaulieu, M.⁴ Gatford, M.⁵

38
- 25944436524
- TREC 2001 results
- Gaithersburg, MD
- E. M. Voorhees and D. K. Harman, "TREC 2001 results," in Proc. 10th Text Retrieval Conf. (TREC-10), Gaithersburg, MD, 2001, p. A-14.
- (2001) Proc. 10th Text Retrieval Conf. (TREC-10)
- Voorhees, E.M.¹ Harman, D.K.²

39
- 3042814145
- College of Inform. Studies, Univ. of Maryland, College Park, [Online]
- D. Soergel, D. Oard, S. Gustman, L. Fraser, J. Kim, J. Meyer, E. Proffen, and T. Sartori, "The many uses of digitized oral history collections: Implications for design," College of Inform. Studies, Univ. of Maryland, College Park, [Online] Available: http://www.clsp.jhu.edu/research/malach/pubs, 2002.
- (2002) The Many Uses of Digitized Oral History Collections: Implications for Design
- Soergel, D.¹ Oard, D.² Gustman, S.³ Fraser, L.⁴ Kim, J.⁵ Meyer, J.⁶ Proffen, E.⁷ Sartori, T.⁸

40
- 3042853618
- Searching large collections of recorded speech: A preliminary study
- Medford, NJ: Information Today, to be published
- J. Kim, D. Oard, and D. Soergel, "Searching large collections of recorded speech: A preliminary study," in Proceedings of the ASIST Annual Meeting. Medford, NJ: Information Today, 2003, pp. 330-339, to be published.
- (2003) Proceedings of the ASIST Annual Meeting , pp. 330-339
- Kim, J.¹ Oard, D.² Soergel, D.³

41
- 0013233910
- An empirical study of the optimal presentation of multimedia summaries of broadcast news
- I. Mani and M. Maybury, Eds.
- A. Merlino and M. Maybury, "An empirical study of the optimal presentation of multimedia summaries of broadcast news," in Automated Text Summarization, I. Mani and M. Maybury, Eds., 1999.
- (1999) Automated Text Summarization
- Merlino, A.¹ Maybury, M.²

42
- 84886671683
- Searching recorded speech based on the temporal extent of topic labels
- Palo Alto, CA, Mar.
- D. W. Oard and A. Leuski, "Searching recorded speech based on the temporal extent of topic labels," in AAAI Spring Symp. Intelligent Multimedia Knowledge Management, Palo Alto, CA, Mar. 2003.
- (2003) AAAI Spring Symp. Intelligent Multimedia Knowledge Management
- Oard, D.W.¹ Leuski, A.²

43
- 85009170963
- Automated transcription and topic segmentation of large spoken archives
- Geneva, Switzerland
- M. Franz, B. Ramabhadran, T. Ward, and M. Picheny, "Automated transcription and topic segmentation of large spoken archives," in Proc. EUROSPEECH 2003, Geneva, Switzerland, 2003, pp. 953-956.
- (2003) Proc. EUROSPEECH 2003 , pp. 953-956
- Franz, M.¹ Ramabhadran, B.² Ward, T.³ Picheny, M.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.