SCOPUS information search platform

Foundations and Trends in Information Retrieval

Volume 5, Issue 4-5, 2011, Pages 235-422

Spoken content retrieval: A survey of techniques and technologies

(2) Larson, Martha (a); Jones, Gareth J. F. (b)

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b DUBLIN CITY UNIVERSITY (Ireland)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; COMPONENT TECHNOLOGIES; CONTENT RETRIEVAL; DIGITAL AUDIO; INDEXING AND RETRIEVAL; RESEARCH AND DEVELOPMENT; SPEECH PROCESSING TECHNOLOGIES; SPEECH TECHNOLOGY; USER INTERACTION; RESEARCH; SURVEYS; TECHNOLOGY

EID: 84865249159
PISSN: 15540669
EISSN: 15540677
Source Type: Journal
DOI: 10.1561/1500000020
Document Type: Article

Times cited : (72)

References (317)

1
- 84865275963
- Overview of the IR for spoken documents task in NTCIR-9 Workshop
- T. Akiba, H. Nishizaki, K. Aikawa, T. Kawahara, and T. Matsui, "Overview of the IR for spoken documents task in NTCIR-9 Workshop," in Proceedings of the NII Test Collection for IR Systems Workshop, pp. 223-235, 2011.
- (2011) Proceedings of the NII Test Collection for IR Systems Workshop , pp. 223-235
- Akiba, T.¹ Nishizaki, H.² Aikawa, K.³ Kawahara, T.⁴ Matsui, T.⁵

2
- 70349217247
- An audio indexing system for election video material
- C. Alberti, M. Bacchiani, A. Bezman, C. Chelba, A. Drofa, H. Liao, P. Moreno, T. Power, A. Sahuguet, M. Shugrina, and O. Siohan, "An audio indexing system for election video material," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4873-4876, 2009.
- (2009) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 4873-4876
- Alberti, C.¹ Bacchiani, M.² Bezman, A.³ Chelba, C.⁴ Drofa, A.⁵ Liao, H.⁶ Moreno, P.⁷ Power, T.⁸ Sahuguet, A.⁹ Shugrina, M.¹⁰ Siohan, O.¹¹

3
- 0013252919
- Perspectives on information retrieval and speech
- (A. R. Coden, E. W. Brown, and S. Srinivasan, eds.), Springer Berlin/Heidelberg
- J. Allan, "Perspectives on information retrieval and speech," in Information Retrieval Techniques for Speech Applications, (A. R. Coden, E. W. Brown, and S. Srinivasan, eds.), pp. 323-326, Springer Berlin/Heidelberg, 2002.
- (2002) Information Retrieval Techniques for Speech Applications , pp. 323-326
- Allan, J.¹

4
- 84865277132
- Topic detection and tracking: Event-based information organization
- Springer
- J. Allan, "Topic detection and tracking: Event-based information organization," in The Kluwer International Series on Information Retrieval, vol. 12, Springer, 2002.
- (2002) The Kluwer International Series on Information Retrieval , vol.12
- Allan, J.¹

5
- 0037300570
- Robust techniques for organizing and retrieving spoken documents
- J. Allan, "Robust techniques for organizing and retrieving spoken documents," EURASIP Journal on Advances in Signal Processing, vol. 2003, no. 1, pp. 103-114, 2003.
- (2003) EURASIP Journal on Advances in Signal Processing , vol.2003 , Issue.1 , pp. 103-114
- Allan, J.¹

6
- 34548361194
- Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system
- X. Anguera, C. Wooters, B. Peskin, and M. Aguilo, "Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system," in Proceedings of the NIST Machine Learning for Multimodal Interaction, Meeting Recognition Workshop, pp. 26-38, 2005.
- (2005) Proceedings of the NIST Machine Learning for Multimodal Interaction, Meeting Recognition Workshop , pp. 26-38
- Anguera, X.¹ Wooters, C.² Peskin, B.³ Aguilo, M.⁴

7
- 0142004229
- Bedford/St. Martin's
- J. Archibald and W. O'Grady, Contemporary Linguistics. Bedford/St. Martin's, 2001.
- (2001) Contemporary Linguistics
- Archibald, J.¹ O'Grady, W.²

8
- 85008055179
- Turkish broadcast news transcription and retrieval
- E. Arisoy, D. Can, S. Parlak, H. Sak, and M. Saraclar, "Turkish broadcast news transcription and retrieval," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 5, pp. 874-883, 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.5 , pp. 874-883
- Arisoy, E.¹ Can, D.² Parlak, S.³ Sak, H.⁴ Saraclar, M.⁵

9
- 85003876001
- SpeechSkimmer: Interactively skimming recorded speech
- Atlanta
- B. Arons, "SpeechSkimmer: Interactively skimming recorded speech," in Proceedings of the ACM User Interface Software and Technology Conference, Atlanta, 1993.
- (1993) Proceedings of the ACM User Interface Software and Technology Conference
- Arons, B.¹

10
- 0002494419
- SpeechSkimmer: A system for interactively skimming recorded speech
- B. Arons, "SpeechSkimmer: A system for interactively skimming recorded speech," Transactions on Computer Human Interaction, vol. 4, no. 1, pp. 3-38, 1997.
- (1997) Transactions on Computer Human Interaction , vol.4 , Issue.1 , pp. 3-38
- Arons, B.¹

11
- 0039737345
- The future of speech and audio in the interface: A CHI '94 workshop
- B. Arons and E. Mynatt, "The future of speech and audio in the interface: A CHI '94 workshop," SIGCHI Bulletin, vol. 26, no. 4, pp. 44-48, 1994.
- (1994) SIGCHI Bulletin , vol.26 , Issue.4 , pp. 44-48
- Arons, B.¹ Mynatt, E.²

12
- 0036460898
- An overview of decoding techniques for large vocabulary continuous speech recognition
- X. Aubert, "An overview of decoding techniques for large vocabulary continuous speech recognition," Computer Speech & Language, vol. 16, no. 1, pp. 89-114, 2002.
- (2002) Computer Speech & Language , vol.16 , Issue.1 , pp. 89-114
- Aubert, X.¹

13
- 4544292295
- Automatic language model adaptation for spoken document retrieval
- C. Auzanne, J. S. Garofolo, J. G. Fiscus, and W. M. Fisher, "Automatic language model adaptation for spoken document retrieval," in Proceedings of the RIAO Conference on Content-Based Multimedia Information Access, pp. 132-141, 2000.
- (2000) Proceedings of the RIAO Conference on Content-Based Multimedia Information Access , pp. 132-141
- Auzanne, C.¹ Garofolo, J.S.² Fiscus, J.G.³ Fisher, W.M.⁴

14
- 0005540823
- Addison-Wesley Longman Publishing Co., Inc
- R. A. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval: The Concepts and Technology Behind Search. Addison-Wesley Longman Publishing Co., Inc., 2010.
- (2010) Modern Information Retrieval: The Concepts and Technology behind Search
- Baeza-Yates, R.A.¹ Ribeiro-Neto, B.²

15
- 0030376675
- Very-large-vocabulary Mandarin voice message file retrieval using speech queries
- B.-R. Bai, L.-F. Chien, and L.-S. Lee, "Very-large-vocabulary Mandarin voice message file retrieval using speech queries," in Proceedings of the International Conference on Spoken Language Processing, pp. 1950-1953, 1996.
- (1996) Proceedings of the International Conference on Spoken Language Processing , pp. 1950-1953
- Bai, B.-R.¹ Chien, L.-F.² Lee, L.-S.³

16
- 0016663359
- The DRAGON system - An overview
- J. Baker, "The DRAGON system - an overview," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 23, no. 1, pp. 24-29, 1975.
- (1975) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.23 , Issue.1 , pp. 24-29
- Baker, J.¹

17
- 44949112191
- A TextTiling based approach to topic boundary detection in meetings
- S. Banerjee and A. Rudnicky, "A TextTiling based approach to topic boundary detection in meetings," in Proceedings of Interspeech, 2006.
- (2006) Proceedings of Interspeech
- Banerjee, S.¹ Rudnicky, A.²

18
- 53149126088
- Recovering capitalization and punctuation marks for automatic speech recognition: Case study for Portuguese broadcast news
- F. Batista, D. Caseiro, N. Mamede, and I. Trancoso, "Recovering capitalization and punctuation marks for automatic speech recognition: Case study for Portuguese broadcast news," Speech Communication, vol. 50, no. 10, pp. 847-862, 2008.
- (2008) Speech Communication , vol.50 , Issue.10 , pp. 847-862
- Batista, F.¹ Caseiro, D.² Mamede, N.³ Trancoso, I.⁴

19
- 0029304819
- Combining the evidence of multiple query representations for information retrieval
- N. J. Belkin, P. Kantor, E. A. Fox, and J. A. Shaw, "Combining the evidence of multiple query representations for information retrieval," Information Processing & Management, vol. 31, no. 3, pp. 431-448, 1995.
- (1995) Information Processing & Management , vol.31 , Issue.3 , pp. 431-448
- Belkin, N.J.¹ Kantor, P.² Fox, E.A.³ Shaw, J.A.⁴

20
- 33947709432
- Automatic speech recognition and intrinsic speech variation
- M. Benzeguiba, R. D. Mori, O. Deroo, S. Dupont, T. Erbes, D. Jouvet, L. Fissore, P. Laface, A. Mertins, C. Ris, R. Rose, V. Tyagi, and C. Wellekens, "Automatic speech recognition and intrinsic speech variation," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. V/1021-V/1024, 2006.
- (2006) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Benzeguiba, M.¹ Mori, R.D.² Deroo, O.³ Dupont, S.⁴ Erbes, T.⁵ Jouvet, D.⁶ Fissore, L.⁷ Laface, P.⁸ Mertins, A.⁹ Ris, C.¹⁰ Rose, R.¹¹ Tyagi, V.¹² Wellekens, C.¹³

21
- 77956629726
- Podcast search: User goals and retrieval technologies
- J. Besser, M. Larson, and K. Hofmann, "Podcast search: User goals and retrieval technologies," Online Information Review, vol. 34, no. 3, 2010.
- (2010) Online Information Review , vol.34 , Issue.3
- Besser, J.¹ Larson, M.² Hofmann, K.³

22
- 0030142722
- Towards increasing speech recognition error rates
- DOI 10.1016/0167-6393(96)00003-9, PII S0167639396000039
- H. Bourlard, H. Hermansky, and N. Morgan, "Towards increasing speech recognition error rates," Speech Communication, vol. 18, pp. 205-231, May 1996. (Pubitemid 126362800)
- (1996) Speech Communication , vol.18 , Issue.3 , pp. 205-231
- Bourlard, H.¹ Hermansky, H.² Morgan, N.³

23
- 84862119541
- Recognition and understanding of meetings overview of the European AMI and AMIDA projects
- H. Bourlard and S. Renals, "Recognition and understanding of meetings overview of the European AMI and AMIDA projects," IDIAP-RR 27 Technical Report, 2008.
- (2008) IDIAP-RR 27 Technical Report
- Bourlard, H.¹ Renals, S.²

24
- 0038589165
- The anatomy of a large-scale hypertextual web search engine
- S. Brin and L. Page, "The anatomy of a large-scale hypertextual web search engine," Computer Networks and ISDN Systems, vol. 30, no. 1-7, pp. 107-117, 1998.
- (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
- Brin, S.¹ Page, L.²

25
- 0035215786
- Toward speech as a knowledge resource
- E. W. Brown, S. Srinivasan, A. Coden, D. Ponceleon, J. W. Cooper, and A. Amir, "Toward speech as a knowledge resource," IBM Systems Journal, vol. 40, no. 4, pp. 985-1001, 2001. (Pubitemid 33149292)
- (2001) IBM Systems Journal , vol.40 , Issue.4 , pp. 985-1001
- Brown, E.W.¹ Srinivasan, S.² Coden, A.³ Ponceleon, D.⁴ Cooper, J.W.⁵ Amir, A.⁶

26
- 0029451866
- Automatic content-based retrieval of broadcast news
- M. G. Brown, J. T. Foote, G. J. F. Jones, K. S. Jones, and S. J. Young, "Automatic content-based retrieval of broadcast news," in Proceedings of the Annual ACM International Conference on Multimedia, pp. 35-43, 1995.
- (1995) Proceedings of the Annual ACM International Conference on Multimedia , pp. 35-43
- Brown, M.G.¹ Foote, J.T.² Jones, G.J.F.³ Jones, K.S.⁴ Young, S.J.⁵

27
- 0030394830
- Open-vocabulary speech indexing for voice and video mail retrieval
- M. G. Brown, J. T. Foote, G. J. F. Jones, K. S. Jones, and S. J. Young, "Open-vocabulary speech indexing for voice and video mail retrieval," in Proceedings of the ACM International Conference on Multimedia, pp. 307-316, 1996.
- (1996) Proceedings of the ACM International Conference on Multimedia , pp. 307-316
- Brown, M.G.¹ Foote, J.T.² Jones, G.J.F.³ Jones, K.S.⁴ Young, S.J.⁵

28
- 0010258486
- Video mail retrieval using voice: An overview of the Cambridge/Olivetti retrieval system
- M. G. Brown, J. T. Foote, G. J. F. Jones, K. Spärck Jones, and S. J. Young, "Video mail retrieval using voice: An overview of the Cambridge/Olivetti retrieval system," in Proceedings of the ACM Multimedia Workshop on Multimedia Database Management Systems, pp. 47-55, 1994.
- (1994) Proceedings of the ACM Multimedia Workshop on Multimedia Database Management Systems , pp. 47-55
- Brown, M.G.¹ Foote, J.T.² Jones, G.J.F.³ Spärck Jones, K.⁴ Young, S.J.⁵

29
- 0029451866
- Automatic content-based retrieval of broadcast news
- M. G. Brown, J. T. Foote, G. J. F. Jones, K. Spärck Jones, and S. J. Young, "Automatic content-based retrieval of broadcast news," in Proceedings of the Third ACM International Conference on Multimedia, pp. 35-43, 1995.
- (1995) Proceedings of the Third ACM International Conference on Multimedia , pp. 35-43
- Brown, M.G.¹ Foote, J.T.² Jones, G.J.F.³ Spärck Jones, K.⁴ Young, S.J.⁵

30
- 0002039278
- Automatic query expansion using SMART: TREC 3
- C. Buckley, G. Salton, J. Allan, and A. Singhal, "Automatic query expansion using SMART: TREC 3," in Proceedings of the Third Text Retrieval Conference, pp. 69-80, 1995.
- (1995) Proceedings of the Third Text Retrieval Conference , pp. 69-80
- Buckley, C.¹ Salton, G.² Allan, J.³ Singhal, A.⁴

31
- 0040283968
- Spontaneous speech effects in large vocabulary speech recognition applications
- J. Butzberger, H. Murveit, E. Shriberg, and P. Price, "Spontaneous speech effects in large vocabulary speech recognition applications," in Proceedings of the Workshop on Speech and Natural Language, pp. 339-343, 1992.
- (1992) Proceedings of the Workshop on Speech and Natural Language , pp. 339-343
- Butzberger, J.¹ Murveit, H.² Shriberg, E.³ Price, P.⁴

32
- 78449292738
- MIT Press
- S. Büttcher, C. L. A. Clarke, and G. V. Cormack, Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, 2010.
- (2010) Information Retrieval: Implementing and Evaluating Search Engines
- Büttcher, S.¹ Clarke, C.L.A.² Cormack, G.V.³

33
- 3042820894
- Automatic recognition of spontaneous speech for access to multilingual oral history archives
- W. Byrne, D. Doermann, M. Franz, S. Gustman, J. Hajic, D. Oard, M. Picheny, J. Psutka, B. Ramabhadran, D. Soergel, T. Ward, and W.-J. Zhu, "Automatic recognition of spontaneous speech for access to multilingual oral history archives," IEEE Transactions on Speech and Audio Processing, Special Issue on Spontaneous Speech Processing, vol. 12, no. 4, pp. 420-435, 2004.
- (2004) IEEE Transactions on Speech and Audio Processing, Special Issue on Spontaneous Speech Processing , vol.12 , Issue.4 , pp. 420-435
- Byrne, W.¹ Doermann, D.² Franz, M.³ Gustman, S.⁴ Hajic, J.⁵ Oard, D.⁶ Picheny, M.⁷ Psutka, J.⁸ Ramabhadran, B.⁹ Soergel, D.¹⁰ Ward, T.¹¹ Zhu, W.-J.¹²

34
- 33745530242
- The AMI Meeting Corpus: A pre-announcement
- DOI 10.1007/11677482-3, Machine Learning for Multimodal Interaction - Second International Workshop, MLMI 2005, Revised Selected Papers LNCS
- J. Carletta, S. Ashby, S. Bourban, M. Flynn, M. Guillemot, T. Hain, J. Kadlec, V. Karaiskos, W. Kraaij, M. Kronenthal, G. Lathoud, M. Lincoln, A. Lisowska, I. McCowan, W. Post, D. Reidsma, and P. Wellner, "The AMI meeting corpus: A pre-announcement," in Machine Learning for Multimodal Interaction, Chapter 3, pp. 28-39, Springer, 2006. (Pubitemid 43979723)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.3869 , pp. 28-39
- Carletta, J.¹ Ashby, S.² Bourban, S.³ Flynn, M.⁴ Guillemot, M.⁵ Hain, T.⁶ Kadlec, J.⁷ Karaiskos, V.⁸ Kraaij, W.⁹ Kronenthal, M.¹⁰ Lathoud, G.¹¹ Lincoln, M.¹² Lisowska, A.¹³ McCowan, I.¹⁴ Post, W.¹⁵ Reidsma, D.¹⁶ Wellner, P.¹⁷

35
- 51849151694
- Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data
- London, U.K.
- J. Carmichael, M. Larson, J. Marlow, E. Newman, P. Clough, O. Oomen, and S. Sav, "Multimodal indexing of digital audio-visual documents: A case study for cultural heritage data," in Proceedings of the International Workshop on Content-Based Multimedia Indexing, pp. 93-100, London, U.K., 2008.
- (2008) Proceedings of the International Workshop on Content-Based Multimedia Indexing , pp. 93-100
- Carmichael, J.¹ Larson, M.² Marlow, J.³ Newman, E.⁴ Clough, P.⁵ Oomen, O.⁶ Sav, S.⁷

36
- 35448958268
- Wiley-Blackwell
- J. K. Chambers, P. Trudgill, and N. Schilling-Estes, eds., The Handbook of Language Variation and Change, Blackwell Handbooks in Linguistics. Wiley-Blackwell, 2004.
- (2004) The Handbook of Language Variation and Change, Blackwell Handbooks in Linguistics
- Chambers, J.K.¹ Trudgill, P.² Schilling-Estes, N.³

37
- 44849083548
- Position specific posterior lattices for indexing speech
- Morristown, NJ, USA
- C. Chelba and A. Acero, "Position specific posterior lattices for indexing speech," in Proceedings of the Annual Meeting on Association for Computational Linguistics, pp. 443-450, Morristown, NJ, USA, 2005.
- (2005) Proceedings of the Annual Meeting on Association for Computational Linguistics , pp. 443-450
- Chelba, C.¹ Acero, A.²

38
- 85032751967
- Retrieval and browsing of spoken content
- DOI 10.1109/MSP.2008.917992
- C. Chelba, T. J. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Processing Magazine, vol. 25, no. 3, pp. 39-49, 2008. (Pubitemid 351695639)
- (2008) IEEE Signal Processing Magazine , vol.25 , Issue.3 , pp. 39-49
- Chelba, C.¹ Hazen, T.J.² Saraclar, M.³

39
- 33847607574
- Soft indexing of speech content for search in spoken documents
- DOI 10.1016/j.csl.2006.09.001, PII S0885230806000313
- C. Chelba, J. Silva, and A. Acero, "Soft indexing of speech content for search in spoken documents," Computer Speech and Language, vol. 21, no. 3, pp. 458-478, 2007. (Pubitemid 46367509)
- (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 458-478
- Chelba, C.¹ Silva, J.² Acero, A.³

40
- 27744494029
- Exploring the use of latent topical information for statistical Chinese spoken document retrieval
- DOI 10.1016/j.patrec.2005.06.010, PII S0167865505001704
- B. Chen, "Exploring the use of latent topical information for statistical Chinese spoken document retrieval," Pattern Recognition Letters, vol. 27, no. 1, pp. 9-18, 2006. (Pubitemid 41625538)
- (2006) Pattern Recognition Letters , vol.27 , Issue.1 , pp. 9-18
- Chen, B.¹

41
- 0036649836
- Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese
- DOI 10.1109/TSA.2002.802541, PII 1011092002802541
- B. Chen, H.-M. Wang, and L.-S. Lee, "Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese," IEEE Transactions on Speech and Audio Processing, vol. 10, no. 5, pp. 303-314, 2002. (Pubitemid 34950068)
- (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.5 , pp. 303-314
- Chen, B.¹ Wang, H.-M.² Lee, L.-S.³

42
- 85017351590
- The use of emphasis to automatically summarize a spoken discourse
- F. R. Chen and M. Withgott, "The use of emphasis to automatically summarize a spoken discourse," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/229-I/232, 1992.
- (1992) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Chen, F.R.¹ Withgott, M.²

43
- 0033329799
- An empirical study of smoothing techniques for language modeling
- S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Computer Speech and Language, vol. 13, no. 4, pp. 359-393, 1999.
- (1999) Computer Speech and Language , vol.13 , Issue.4 , pp. 359-393
- Chen, S.F.¹ Goodman, J.²

44
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1998.
- (1998) Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop
- Chen, S.S.¹ Gopalakrishnan, P.S.²

45
- 67149133555
- A probabilistic generative framework for extractive broadcast news speech summarization
- Y.-T. Chen, B. Chen, and H.-M. Wang, "A probabilistic generative framework for extractive broadcast news speech summarization," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 1, pp. 95-106, 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.1 , pp. 95-106
- Chen, Y.-T.¹ Chen, B.² Wang, H.-M.³

46
- 84865209295
- Improving the front-end of Kunststofzuiger
- University of Amsterdam
- T. Cheong, R. Kok, J. Schuurman, and B. Stukart, "Improving the front-end of Kunststofzuiger," Final Report Project Information Retrieval, University of Amsterdam, 2008.
- (2008) Final Report Project Information Retrieval
- Cheong, T.¹ Kok, R.² Schuurman, J.³ Stukart, B.⁴

47
- 79851497439
- Statistical lattice-based spoken document retrieval
- T. K. Chia, K. C. Sim, H. Li, and H. T. Ng, "Statistical lattice-based spoken document retrieval," ACM Transactions on Information Systems, vol. 28, no. 1, pp. 1-30, 2010.
- (2010) ACM Transactions on Information Systems , vol.28 , Issue.1 , pp. 1-30
- Chia, T.K.¹ Sim, K.C.² Li, H.³ Ng, H.T.⁴

48
- 85115050772
- Advances in domain independent linear text segmentation
- F. Y. Y. Choi, "Advances in domain independent linear text segmentation," in Proceedings of the North American Chapter of the Association for Computational Linguistics Conference, pp. 26-33, 2000.
- (2000) Proceedings of the North American Chapter of the Association for Computational Linguistics Conference , pp. 26-33
- Choi, F.Y.Y.¹

49
- 36849085977
- Merging storyboard strategies and automatic retrieval for improving interactive video search
- DOI 10.1145/1282280.1282351, Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007
- M. G. Christel and R. Yan, "Merging storyboard strategies and automatic retrieval for improving interactive video search," in Proceedings of the ACM International Conference on Image and Video Retrieval, pp. 486-493, 2007. (Pubitemid 350229663)
- (2007) Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007 , pp. 486-493
- Christel, M.G.¹ Yan, R.²

50
- 22944486029
- Speech and language processing: Can we use the past to predict the future?
- (P. Sojka, I. Kopecek, and K. Pala, eds.), Springer Berlin/Heidelberg
- K. W. Church, "Speech and language processing: Can we use the past to predict the future?," in Text, Speech and Dialogue, vol. 3206 of Lecture Notes in Computer Science, (P. Sojka, I. Kopecek, and K. Pala, eds.), pp. 3-13, Springer Berlin/Heidelberg, 2004.
- (2004) Text, Speech and Dialogue, vol. 3206 of Lecture Notes in Computer Science , pp. 3-13
- Church, K.W.¹

51
- 0004147298
- Wiley-Blackwell
- J. Clark, C. Yallop, and J. Fletcher, An Introduction to Phonetics and Phonology (Blackwell Textbooks in Linguistics). Wiley-Blackwell, 2007.
- (2007) An Introduction to Phonetics and Phonology (Blackwell Textbooks in Linguistics)
- Clark, J.¹ Yallop, C.² Fletcher, J.³

52
- 0034977719
- Speech transcript analysis for automatic search
- A. R. Coden and E. W. Brown, "Speech transcript analysis for automatic search," in Proceedings of the Annual Hawaii International Conference on System Sciences, 2001.
- (2001) Proceedings of the Annual Hawaii International Conference on System Sciences , vol.2001
- Coden, A.R.¹ Brown, E.W.²

53
- 84865221228
- ACM SIGIR 2001 workshop "Information Retrieval Techniques for Speech Applications"
- A. R. Coden, E. W. Brown, and S. Srinivasan, "ACM SIGIR 2001 workshop 'Information Retrieval Techniques for Speech Applications'," SIGIR Forum, vol. 36, no. 1, pp. 10-13, 2002.
- (2002) SIGIR Forum , vol.36 , Issue.1 , pp. 10-13
- Coden, A.R.¹ Brown, E.W.² Srinivasan, S.³

54
- 0029230678
- The challenge of spoken language systems: Research directions for the nineties
- R. Cole, L. Hirschman, L. Atlas, M. Beckman, A. Biermann, M. Bush, M. Clements, L. Cohen, O. Garcia, B. Hanson, H. Hermansky, S. Levinson, K. McKeown, N. Morgan, D. G. Novick, M. Ostendorf, S. Oviatt, P. Price, H. Silverman, J. Spiitz, A. Waibel, C. Weinstein, S. Zahorian, and V. Zue, "The challenge of spoken language systems: Research directions for the nineties," IEEE Transactions on Speech and Audio Processing, vol. 3, no. 1, pp. 1-21, 1995.
- (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , Issue.1 , pp. 1-21
- Cole, R.¹ Hirschman, L.² Atlas, L.³ Beckman, M.⁴ Biermann, A.⁵ Bush, M.⁶ Clements, M.⁷ Cohen, L.⁸ Garcia, O.⁹ Hanson, B.¹⁰ Hermansky, H.¹¹ Levinson, S.¹² McKeown, K.¹³ Morgan, N.¹⁴ Novick, D.G.¹⁵ Ostendorf, M.¹⁶ Oviatt, S.¹⁷ Price, P.¹⁸ Silverman, H.¹⁹ Spiitz, J.²⁰ et al.

55
- 84865235825
- Sibyl, a factoid question answering system for spoken documents
- P. R. Comas, J. Turmo, and L. Marquez, "Sibyl, a factoid question answering system for spoken documents," ACM Transactions on Information Systems, vol. 30, no. 3, 2012.
- (2012) ACM Transactions on Information Systems , vol.30 , Issue.3
- Comas, P.R.¹ Turmo, J.² Marquez, L.³

56
- 33646687756
- Written versus spoken queries: A qualitative and quantitative comparative analysis
- DOI 10.1002/asi.20350
- F. Crestani and H. Du, "Written versus spoken queries: A qualitative and quantitative comparative analysis," Journal of the American Society for Information Science and Technology, vol. 57, no. 7, pp. 881-890, 2006. (Pubitemid 43734456)
- (2006) Journal of the American Society for Information Science and Technology , vol.57 , Issue.7 , pp. 881-890
- Crestani, F.¹ Du, H.²

57
- 62549107194
- Addison Wesley, 1st Edition, February
- B. Croft, D. Metzler, and T. Strohman, Search Engines: Information Retrieval in Practice. Addison Wesley, 1st Edition, February 2009.
- (2009) Search Engines: Information Retrieval in Practice
- Croft, B.¹ Metzler, D.² Strohman, T.³

58
- 84865263072
- Speech in noisy environments (SPINE) adds new dimension to speech recognition R&D
- T. H. Crystal, A. Schmidt-Nielsen, and E. Marsh, "Speech in noisy environments (SPINE) adds new dimension to speech recognition R&D," in Proceedings of the International Conference on Human Language Technology Research, pp. 212-216, 2002.
- (2002) Proceedings of the International Conference on Human Language Technology Research , pp. 212-216
- Crystal, T.H.¹ Schmidt-Nielsen, A.² Marsh, E.³

59
- 0038715064
- Distributed meetings: A meeting capture and broadcasting system
- R. Cutler, Y. Rui, A. Gupta, J. J. Cadiz, I. Tashev, L.-W. He, A. Colburn, Z. Zhang, Z. Liu, and S. Silverberg, "Distributed meetings: A meeting capture and broadcasting system," in Proceedings of the ACM International Conference on Multimedia, pp. 503-512, 2002.
- (2002) Proceedings of the ACM International Conference on Multimedia , pp. 503-512
- Cutler, R.¹ Rui, Y.² Gupta, A.³ Cadiz, J.J.⁴ Tashev, I.⁵ He, L.-W.⁶ Colburn, A.⁷ Zhang, Z.⁸ Liu, Z.⁹ Silverberg, S.¹⁰

60
- 64149119901
- A novel feature combination approach for spoken document classification with support vector machines
- P. Dai, U. Iurgel, and G. Rigoll, "A novel feature combination approach for spoken document classification with support vector machines," in Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR) Multimedia Information Retrieval Workshop, 2003.
- (2003) Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR) Multimedia Information Retrieval Workshop
- Dai, P.¹ Iurgel, U.² Rigoll, G.³

61
- 84979800444
- Access to recorded interviews: A research agenda
- F. M. G. de Jong, D. W. Oard, W. F. L. Heeren, and R. J. F. Ordelman, "Access to recorded interviews: A research agenda," ACM Journal on Computing and Cultural Heritage, vol. 1, no. 1, pp. 3:1-3:27, 2008.
- (2008) ACM Journal on Computing and Cultural Heritage , vol.1 , Issue.1 , pp. 3:1-3:27
- De Jong, F.M.G.¹ Oard, D.W.² Heeren, W.F.L.³ Ordelman, R.J.F.⁴

62
- 84885771294
- Automated speech and audio analysis for semantic access to multimedia
- Chapter 18, (Y. Avrithis, Y. Kompatsiaris, S. Staab, and N. O'Connor, eds.), Springer Berlin/Heidelberg: Berlin, Heidelberg
- F. M. G. de Jong, R. J. F. Ordelman, and M. A. H. Huijbregts, "Automated speech and audio analysis for semantic access to multimedia," in Semantic Multimedia, vol. 4306 of Lecture Notes in Computer Science, Chapter 18, (Y. Avrithis, Y. Kompatsiaris, S. Staab, and N. O'Connor, eds.), pp. 226-240, Springer Berlin/Heidelberg: Berlin, Heidelberg, 2006.
- (2006) Semantic Multimedia, vol. 4306 of Lecture Notes in Computer Science , pp. 226-240
- De Jong, F.M.G.¹ Ordelman, R.J.F.² Huijbregts, M.A.H.³

63
- 33847768236
- Multimedia search without visual analysis: The value of linguistic and contextual information
- DOI 10.1109/TCSVT.2007.890834
- F. M. G. de Jong, T. Westerveld, and A. P. de Vries, "Multimedia search without visual analysis: The value of linguistic and contextual information," IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 3, pp. 365-371, 2007. (Pubitemid 46393334)
- (2007) IEEE Transactions on Circuits and Systems for Video Technology , vol.17 , Issue.3 , pp. 365-371
- De Jong, F.M.G.¹ Westerveld, T.² De Vries, A.P.³

64
- 0345120094
- Improving information retrieval with latent semantic indexing
- (C. L. Borgman and E. Y. H. Pai, eds.)
- S. Deerwester, "Improving information retrieval with latent semantic indexing," in Proceedings of the 51st ASIS Annual Meeting, vol. 25, (C. L. Borgman and E. Y. H. Pai, eds.), 1988.
- (1988) Proceedings of the 51st ASIS Annual Meeting , vol.25
- Deerwester, S.¹

65
- 84944038525
- Extracting keyphrases from spoken audio documents
- London, UK Springer
- A. Désilets, B. de Bruijn, and J. Martin, "Extracting keyphrases from spoken audio documents," in Information Retrieval Techniques for Speech Applications, pp. 36-50, London, UK, Springer, 2002.
- (2002) Information Retrieval Techniques for Speech Applications , pp. 36-50
- Désilets, A.¹ De Bruijn, B.² Martin, J.³

66
- 36348937811
- Topic segmentation algorithms for text summarization and passage retrieval: An exhaustive evaluation
- AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
- G. Dias, E. Alves, and J. G. P. Lopes, "Topic segmentation algorithms for text summarization and passage retrieval: An exhaustive evaluation," in Proceedings of the National Conference on Artificial Intelligence - Volume 2, pp. 1334-1339, 2007. (Pubitemid 350149752)
- (2007) Proceedings of the National Conference on Artificial Intelligence , vol.2 , pp. 1334-1339
- Dias, G.¹ Alves, E.² Lopes, J.G.P.³

67
- 0003835196
- Cambridge University Press
- R. M. W. Dixon, The Rise and Fall of Languages. Cambridge University Press, 1998.
- (1998) The Rise and Fall of Languages
- Dixon, R.M.W.¹

68
- 0028996886
- Understanding and improving speech recognition performance through the use of diagnostic tools
- E. Eide, H. Gish, P. Jeanrenaud, and A. Mielke, "Understanding and improving speech recognition performance through the use of diagnostic tools," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/221-I/224, 1995.
- (1995) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Eide, E.¹ Gish, H.² Jeanrenaud, P.³ Mielke, A.⁴

69
- 78649328053
- Survey on speech emotion recognition: Features, classification schemes, and databases
- M. El Ayadi, M. S. Kamel, and F. Karray, "Survey on speech emotion recognition: Features, classification schemes, and databases," Pattern Recognition, vol. 44, no. 3, pp. 572-587, 2011.
- (2011) Pattern Recognition , vol.44 , Issue.3 , pp. 572-587
- El Ayadi, M.¹ Kamel, M.S.² Karray, F.³

70
- 33745578668
- Information retrieval from spoken documents
- Computational Linguistics and Intelligent Text Processing - 7th International Conference, CICLing 2006, Proceedings LNCS
- M. Fapso, P. Smrz, P. Schwarz, I. Szoke, M. Schwarz, J. Cernocky, M. Karafiat, and L. Burget, "Information retrieval from spoken documents," in Proceedings of the International Conference on Intelligent Text Processing and Computational Linguistics, pp. 410-416, 2006. (Pubitemid 43979966)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.3878 , pp. 410-416
- Fapso, M.¹ Smrz, P.² Schwarz, P.³ Szoke, I.⁴ Schwarz, M.⁵ Cernocky, J.⁶ Karafiat, M.⁷ Burget, L.⁸

71
- 0034274033
- A system for the retrieval of Italian broadcast news
- M. Federico, "A system for the retrieval of Italian broadcast news," Speech Communication, vol. 32, no. 1-2, pp. 37-47, 2000.
- (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 37-47
- Federico, M.¹

72
- 80155144871
- Phoneme-level indexing for fast and vocabulary-independent voice/voice retrieval
- A. Ferrieux and S. Peillon, "Phoneme-level indexing for fast and vocabulary-independent voice/voice retrieval," in Proceedings of the ESCA Workshop: Accessing Information in Spoken Audio, 1999.
- (1999) Proceedings of the ESCA Workshop: Accessing Information in Spoken Audio
- Ferrieux, A.¹ Peillon, S.²

73
- 0030638031
- A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (rover)
- J. Fiscus, "A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (rover)," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347-352, 1997.
- (1997) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 347-352
- Fiscus, J.¹

74
- 47749152568
- The rich transcription 2007 meeting recognition evaluation
- (R. Stiefelhagen, R. Bowers, and J. G. Fiscus, eds.), Berlin/Heidelberg: Springer-Verlag
- J. G. Fiscus, J. Ajot, and J. S. Garofolo, "The rich transcription 2007 meeting recognition evaluation," in Multimodal Technologies for Perception of Humans, (R. Stiefelhagen, R. Bowers, and J. G. Fiscus, eds.), pp. 373-389, Berlin/Heidelberg: Springer-Verlag, 2008.
- (2008) Multimodal Technologies for Perception of Humans , pp. 373-389
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³

75
- 79951634009
- Results of the 2006 spoken term detection evaluation
- Amsterdam, Netherlands
- J. G. Fiscus, J. Ajot, J. S. Garofolo, and G. Doddington, "Results of the 2006 spoken term detection evaluation," in Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR), Searching Spontaneous Conversational Speech Workshop, pp. 45-50, Amsterdam, Netherlands, 2007.
- (2007) Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR), Searching Spontaneous Conversational Speech Workshop , pp. 45-50
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³ Doddington, G.⁴

76
- 0032646977
- An overview of audio information retrieval
- J. T. Foote, "An overview of audio information retrieval," Multimedia Systems, vol. 7, no. 1, pp. 2-10, 1999.
- (1999) Multimedia Systems , vol.7 , Issue.1 , pp. 2-10
- Foote, J.T.¹

77
- 85135191939
- Talker-independent keyword spotting for information retrieval
- J. T. Foote, G. J. F. Jones, K. Sparck Jones, and S. J. Young, "Talker-independent keyword spotting for information retrieval," in Proceedings of Eurospeech, pp. 2145-2148, 1995.
- (1995) Proceedings of Eurospeech , pp. 2145-2148
- Foote, J.T.¹ Jones, G.J.F.² Sparck Jones, K.³ Young, S.J.⁴

78
- 84865222669
- Using term clouds to represent segment-level semantic content of podcasts
- M. Fuller, M. Tsagkias, E. Newman, J. Besser, M. Larson, G. J. F. Jones, and M. de Rijke, "Using term clouds to represent segment-level semantic content of podcasts," in Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR), Searching Spontaneous Conversational Speech Workshop, 2008.
- (2008) Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR), Searching Spontaneous Conversational Speech Workshop
- Fuller, M.¹ Tsagkias, M.² Newman, E.³ Besser, J.⁴ Larson, M.⁵ Jones, G.J.F.⁶ De Rijke, M.⁷

79
- 77949405459
- Transcription and distillation of spontaneous speech
- Chapter 32, (J. Benesty, M. M. Sondhi, and Y. A. Huang, eds.) , Berlin/Heidelberg: Springer Berlin/Heidelberg
- S. Furui and T. Kawahara, "Transcription and distillation of spontaneous speech," in Springer Handbook of Speech Processing, Chapter 32, (J. Benesty, M. M. Sondhi, and Y. A. Huang, eds.), pp. 627-652, Berlin/Heidelberg: Springer Berlin/Heidelberg, 2008.
- (2008) Springer Handbook of Speech Processing , pp. 627-652
- Furui, S.¹ Kawahara, T.²

80
- 3042826816
- Speech-to-text and speech-to-speech summarization of spontaneous speech
- S. Furui, T. Kikuchi, Y. Shinnaka, and C. Hori, "Speech-to-text and speech-to-speech summarization of spontaneous speech," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 4, pp. 401-408, 2004.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.4 , pp. 401-408
- Furui, S.¹ Kikuchi, T.² Shinnaka, Y.³ Hori, C.⁴

81
- 70349281382
- Now Publishers Inc., February
- M. Gales and S. J. Young, The Application of Hidden Markov Models in Speech Recognition. now Publishers Inc., February 2008.
- (2008) The Application of Hidden Markov Models in Speech Recognition
- Gales, M.¹ Young, S.J.²

82
- 0001935505
- The TREC spoken document retrieval track: A success story
- (J.-J. Mariani and D. Harman, eds.)
- J. S. Garofolo, C. G. P. Auzanne, and E. M. Voorhees, "The TREC spoken document retrieval track: A success story," in Proceedings of the RIAO Conference on Content-Based Multimedia Information Access, (J.-J. Mariani and D. Harman, eds.), pp. 1-20, 2000.
- (2000) Proceedings of the RIAO Conference on Content-Based Multimedia Information Access , pp. 1-20
- Garofolo, J.S.¹ Auzanne, C.G.P.² Voorhees, E.M.³

83
- 0002943779
- Spoken document retrieval: 1998 evaluation and investigation of new metrics
- J. S. Garofolo, E. M. Voorhees, C. G. P. Auzanne, and V. M. Stanford, "Spoken document retrieval: 1998 evaluation and investigation of new metrics," in Proceedings of the ESCA Workshop: Accessing Information in Spoken Audio, pp. 1-7, 1999.
- (1999) Proceedings of the ESCA Workshop: Accessing Information in Spoken Audio , pp. 1-7
- Garofolo, J.S.¹ Voorhees, E.M.² Auzanne, C.G.P.³ Stanford, V.M.⁴

84
- 0003128543
- Transcribing broadcast news for audio and video indexing
- J.-L. Gauvain, L. Lamel, and G. Adda, "Transcribing broadcast news for audio and video indexing," Communications of the ACM, vol. 43, no. 2, pp. 64-70, 2000.
- (2000) Communications of the ACM , vol.43 , Issue.2 , pp. 64-70
- Gauvain, J.-L.¹ Lamel, L.² Adda, G.³

85
- 0027311604
- Application of large vocabulary continuous speech recognition to topic and speaker identification using telephone speech
- L. Gillick, J. Baker, J. Bridle, M. Hunt, Y. Ito, S. Lowe, J. Orloff, B. Peskin, R. Roth, and F. Scattone, "Application of large vocabulary continuous speech recognition to topic and speaker identification using telephone speech," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. II/471-II/474, 1993.
- (1993) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Gillick, L.¹ Baker, J.² Bridle, J.³ Hunt, M.⁴ Ito, Y.⁵ Lowe, S.⁶ Orloff, J.⁷ Peskin, B.⁸ Roth, R.⁹ Scattone, F.¹⁰

86
- 43849107616
- Recent progress in the MIT spoken lecture processing project
- J. Glass, T. J. Hazen, S. Cyphers, I. Malioutov, D. Huynh, and R. Barzilay, "Recent progress in the MIT spoken lecture processing project," in Proceedings of Interspeech, pp. 2556-2556, 2007.
- (2007) Proceedings of Interspeech , pp. 2556-2556
- Glass, J.¹ Hazen, T.J.² Cyphers, S.³ Malioutov, I.⁴ Huynh, D.⁵ Barzilay, R.⁶

87
- 0026989462
- A system for retrieving speech documents
- ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval
- U. Glavitsch and P. Schäuble, "A system for retrieving speech documents," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 168-176, 1992.
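- (1992) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 168-176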
- Glavitsch, U.¹ Schäuble, P.²

88
- 84976842565
- Metadata for integrating speech documents in a text retrieval system
- December
- U. Glavitsch, P. Schäuble, and M. Wechsler, "Metadata for integrating speech documents in a text retrieval system," SIGMOD Record, vol. 23, no. 4, pp. 57-63, December 1994.
- (1994) SIGMOD Record , vol.23 , Issue.4 , pp. 57-63
- Glavitsch, U.¹ Schäuble, P.² Wechsler, M.³

89
- 84889275410
- John Wiley & Sons
- A. Goker, J. Davies, and M. Graham, Information Retrieval: Searching in the 21st Century. John Wiley & Sons, 2007.
- (2007) Information Retrieval: Searching in the 21st Century
- Goker, A.¹ Davies, J.² Graham, M.³

90
- 84891583348
- John Wiley & Sons, Inc
- B. Gold and N. Morgan, Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley & Sons, Inc., 1999.
- (1999) Speech and Audio Signal Processing: Processing and Perception of Speech and Music
- Gold, B.¹ Morgan, N.²

91
- 33846950613
- Accessing the spoken word
- J. Goldman, S. Renals, S. G. Bird, F. M. G. de Jong, M. Federico, C. Fleischhauer, M. Kornbluh, L. Lamel, D. W. Oard, C. Stewart, and R. Wright, "Accessing the spoken word," International Journal on Digital Libraries, vol. 5, no. 4, pp. 287-298, 2005.
- (2005) International Journal on Digital Libraries , vol.5 , Issue.4 , pp. 287-298
- Goldman, J.¹ Renals, S.² Bird, S.G.³ De Jong, F.M.G.⁴ Federico, M.⁵ Fleischhauer, C.⁶ Kornbluh, M.⁷ Lamel, L.⁸ Oard, D.W.⁹ Stewart, C.¹⁰ Wright, R.¹¹

92
- 84865716939
- PodCastle: Recent advances of a spoken document retrieval service improved by anonymous user contributions
- M. Goto and J. Ogata, "PodCastle: Recent advances of a spoken document retrieval service improved by anonymous user contributions," in Proceedings of Interspeech, pp. 3073-3076, 2011.
- (2011) Proceedings of Interspeech , pp. 3073-3076
- Goto, M.¹ Ogata, J.²

93
- 67149104848
- PodCastle: A Web 2.0 approach to speech recognition research
- M. Goto, J. Ogata, and K. Eto, "PodCastle: A Web 2.0 approach to speech recognition research," in Proceedings of Interspeech, pp. 2397-2400, 2007.
- (2007) Proceedings of Interspeech , pp. 2397-2400
- Goto, M.¹ Ogata, J.² Eto, K.³

94
- 0036989457
- Supporting access to large digital oral history archives
- S. Gustman, D. Soergel, D. Oard, W. Byrne, M. Picheny, B. Ramabhadran, and D. Greenberg, "Supporting access to large digital oral history archives," in Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 18-27, 2002.
- (2002) Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries , pp. 18-27
- Gustman, S.¹ Soergel, D.² Oard, D.³ Byrne, W.⁴ Picheny, M.⁵ Ramabhadran, B.⁶ Greenberg, D.⁷

95
- 0002751623
- Segment generation and clustering in the HTK Broadcast News Transcription System
- T. Hain, S. E. Johnson, A. Tuerk, P. C. Woodland, and S. J. Young, "Segment generation and clustering in the HTK Broadcast News Transcription System," in Proceedings of the Broadcast News Transcription and Understanding Workshop, pp. 133-137, 1998.
- (1998) Proceedings of the Broadcast News Transcription and Understanding Workshop , pp. 133-137
- Hain, T.¹ Johnson, S.E.² Tuerk, A.³ Woodland, P.C.⁴ Young, S.J.⁵

96
- 27744599401
- Automatic transcription of conversational telephone speech
- DOI 10.1109/TSA.2005.852999
- T. Hain, P. C. Woodland, G. Evermann, M. J. F. Gales, X. Liu, G. L. Moore, D. Povey, and L. Wang, "Automatic transcription of conversational telephone speech," IEEE Transactions on Speech and Audio Processing, vol. 13, no. 6, pp. 1173-1185, 2005. (Pubitemid 41605020)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.6 , pp. 1173-1185
- Hain, T.¹ Woodland, P.C.² Evermann, G.³ Gales, M.J.F.⁴ Liu, X.⁵ Moore, G.L.⁶ Povey, D.⁷ Wang, L.⁸

97
- 33746550354
- Beyond ASR 1-best: Using word confusion networks in spoken language understanding
- DOI 10.1016/j.csl.2005.07.005, PII S0885230805000495
- D. Hakkani-Tür, F. Bechet, G. Riccardi, and G. Tür, "Beyond ASR 1-best: Using word confusion networks in spoken language understanding," Computer Speech and Language, vol. 20, no. 4, pp. 495-514, 2006. (Pubitemid 44142006)
- (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 495-514
- Hakkani-Tür, D.¹ Bechet, F.² Riccardi, G.³ Tür, G.⁴

98
- 0141479125
- A general algorithm for word graph matrix decomposition
- D. Hakkani-Tür and G. Riccardi, "A general algorithm for word graph matrix decomposition," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/596-I/599, 2003.
- (2003) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Hakkani-Tür, D.¹ Riccardi, G.²

99
- 13144279345
- Affective video content representation and modeling
- DOI 10.1109/TMM.2004.840618
- A. Hanjalic and L.-Q. Xu, "Affective video content representation and modeling," IEEE Transactions on Multimedia, vol. 7, no. 1, pp. 143-154, 2005. (Pubitemid 40178377)
- (2005) IEEE Transactions on Multimedia , vol.7 , Issue.1 , pp. 143-154
- Hanjalic, A.¹ Xu, L.-Q.²

100
- 85008020310
- SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word
- J. H. L. Hansen, R. Huang, B. Zhou, M. Seadle, J. R. Deller, A. R. Gurijala, M. Kurimo, and P. Angkititrakul, "SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word," IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 712-730, 2005.
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 712-730
- Hansen, J.H.L.¹ Huang, R.² Zhou, B.³ Seadle, M.⁴ Deller, J.R.⁵ Gurijala, A.R.⁶ Kurimo, M.⁷ Angkititrakul, P.⁸

101
- 36448995740
- Selection and ranking of text from highly imperfect transcripts for retrieval of video content
- DOI 10.1145/1277741.1277911, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
- A. Haubold, "Selection and ranking of text from highly imperfect transcripts for retrieval of video content," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 791-792, 2007. (Pubitemid 350165072)
- (2007) Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 , pp. 791-792
- Haubold, A.¹

102
- 0029481553
- Speech recognition in the Informedia Digital Video Library: Uses and limitations
- A. G. Hauptmann, "Speech recognition in the Informedia Digital Video Library: Uses and limitations," in Proceedings of the International Conference on Tools with Artificial Intelligence, p. 288, 1995.
- (1995) Proceedings of the International Conference on Tools with Artificial Intelligence , pp. 288
- Hauptmann, A.G.¹

103
- 13444273036
- Successful approaches in the TREC video retrieval evaluations
- ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
- A. G. Hauptmann and M. G. Christel, "Successful approaches in the TREC video retrieval evaluations," in Proceedings of the Annual ACM International Conference on Multimedia, pp. 668-675, 2004. (Pubitemid 40211844)
- (2004) ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia , pp. 668-675
- Hauptmann, A.G.¹ Christel, M.G.²

104
- 0030648359
- Indexing and search of multimodal information
- A. G. Hauptmann and H. Wactlar, "Indexing and search of multimodal information," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/195-I/198, 1997.
- (1997) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Hauptmann, A.G.¹ Wactlar, H.²

105
- 0001374417
- Informedia: News-on-demand multimedia information acquisition and retrieval
- (M. T. Maybury, ed.) , The MIT Press
- A. G. Hauptmann and M. J. Witbrock, "Informedia: News-on-demand multimedia information acquisition and retrieval," in Intelligent Multimedia Information Retrieval, (M. T. Maybury, ed.), pp. 215-239, The MIT Press, 1997.
- (1997) Intelligent Multimedia Information Retrieval , pp. 215-239
- Hauptmann, A.G.¹ Witbrock, M.J.²

106
- 85149131035
- Multi-paragraph segmentation of expository text
- M. A. Hearst, "Multi-paragraph segmentation of expository text," in Proceedings of the Annual Meeting on Association for Computational Linguistics, pp. 9-16, 1994.
- (1994) Proceedings of the Annual Meeting on Association for Computational Linguistics , pp. 9-16
- Hearst, M.A.¹

107
- 84922848449
- Cambridge University Press
- M. A. Hearst, Search User Interfaces. Cambridge University Press, 2009.
- (2009) Search User Interfaces
- Hearst, M.A.¹

108
- 84865227528
- Disclosing spoken culture: User interfaces for access to spoken word archives
- W. F. L. Heeren and F. M. G. de Jong, "Disclosing spoken culture: User interfaces for access to spoken word archives," in Proceedings of the British HCI Group Annual Conference on Human Computer Interaction, pp. 23-32, 2008.
- (2008) Proceedings of the British HCI Group Annual Conference on Human Computer Interaction , pp. 23-32
- Heeren, W.F.L.¹ De Jong, F.M.G.²

109
- 36448946478
- Radio Oranje: Searching the queen's speech(es)
- DOI 10.1145/1277741.1277971, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
- W. F. L. Heeren, L. van der Werff, R. J. F. Ordelman, A. van Hessen, and F. M. G. de Jong, "Radio Oranje: Searching the Queen's speech(es)," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, p. 903, 2007. (Pubitemid 350165130)
- (2007) Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 , pp. 903
- Heeren, W.¹ Van Der Werff, L.² Ordelman, R.³ Van Hessen, A.⁴ De Jong, F.⁵

110
- 85135371918
- New words: Implications for continuous speech recognition
- I. L. Hetherington and V. W. Zue, "New words: Implications for continuous speech recognition," in Proceedings of Eurospeech, pp. 2121-2124, 1993.
- (1993) Proceedings of Eurospeech , pp. 2121-2124
- Hetherington, I.L.¹ Zue, V.W.²

111
- 0003754573
- Using language models for information retrieval (PhD thesis)
- D. Hiemstra, "Using language models for information retrieval," PhD thesis, University of Twente, 2001.
- (2001) University of Twente
- Hiemstra, D.¹

112
- 84865260137
- Studying search and archiving in a real audio database
- J. Hirschberg and S. Whittaker, "Studying search and archiving in a real audio database," in Working Notes of the AAAI Spring Symposium on Intelligent Integration and Use of Text, Image, Video and Audio Corpora, pp. 70-76, 1997.
- (1997) Working Notes of the AAAI Spring Symposium on Intelligent Integration and Use of Text, Image, Video and Audio Corpora , pp. 70-76
- Hirschberg, J.¹ Whittaker, S.²

113
- 0039141124
- Finding information in audio: A new paradigm for audio browsing/retrieval
- J. Hirschberg, S. Whittaker, D. Hindle, F. Pereira, and A. Singhal, "Finding information in audio: A new paradigm for audio browsing/retrieval," in Proceedings of the ESCA Workshop: Accessing Information in Spoken Audio, pp. 117-122, 1999.
- (1999) Proceedings of the ESCA Workshop: Accessing Information in Spoken Audio , pp. 117-122
- Hirschberg, J.¹ Whittaker, S.² Hindle, D.³ Pereira, F.⁴ Singhal, A.⁵

114
- 34547541175
- Open-vocabulary spoken utterance retrieval using confusion networks
- T. Hori, I. L. Hetherington, T. J. Hazen, and J. R. Glass, "Open-vocabulary spoken utterance retrieval using confusion networks," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. IV/73-IV/76, 2007.
- (2007) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Hori, T.¹ Hetherington, I.L.² Hazen, T.J.³ Glass, J.R.⁴

115
- 33947691278
- Improved spoken document retrieval with dynamic key term lexicon and Probabilistic Latent Semantic Analysis (PLSA)
- Y.-C. Hsieh, Y.-T. Huang, C.-C. Wang, and L.-S. Lee, "Improved spoken document retrieval with dynamic key term lexicon and Probabilistic Latent Semantic Analysis (PLSA)," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/961-I/964, 2006.
- (2006) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Hsieh, Y.-C.¹ Huang, Y.-T.² Wang, C.-C.³ Lee, L.-S.⁴

116
- 48749103162
- Automatic topic segmentation and labeling in multiparty dialogue
- P.-Y. Hsueh and J. D. Moore, "Automatic topic segmentation and labeling in multiparty dialogue," in IEEE Spoken Language Technology Workshop, pp. 98-101, 2006.
- (2006) IEEE Spoken Language Technology Workshop , pp. 98-101
- Hsueh, P.-Y.¹ Moore, J.D.²

117
- 0004056285
- Prentice Hall
- X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

118
- 70450194678
- The majority wins: A method for combining speaker diarization systems
- M. A. H. Huijbregts, D. A. van Leeuwen, and F. M. G. de Jong, "The majority wins: A method for combining speaker diarization systems," in Proceedings of Interspeech, pp. 924-927, 2009.
- (2009) Proceedings of Interspeech , pp. 924-927
- Huijbregts, M.A.H.¹ Van Leeuwen, D.A.² De Jong, F.M.G.³

119
- 48349117971
- Recording, summarizing, and accessing meeting videos: An overview of the AMI project
- A. Jaimes, H. Bourlard, S. Renals, and J. Carletta, "Recording, summarizing, and accessing meeting videos: An overview of the AMI project," in Proceedings of the IEEE International Conference of Image Analysis and Processing Workshops, pp. 59-64, 2007.
- (2007) Proceedings of the IEEE International Conference of Image Analysis and Processing Workshops , pp. 59-64
- Jaimes, A.¹ Bourlard, H.² Renals, S.³ Carletta, J.⁴

120
- 0029747181
- A system for unrestricted topic retrieval from radio news broadcasts
- D. A. James, "A system for unrestricted topic retrieval from radio news broadcasts," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/279-I/282, 1996.
- (1996) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- James, D.A.¹

121
- 0004671920
- PhD Thesis, University of Cambridge, June
- D. A. James, "The application of classical information retrieval techniques to spoken documents," PhD Thesis, University of Cambridge, June 1995.
- (1995) The Application of Classical Information Retrieval Techniques to Spoken Documents
- James, D.A.¹

122
- 85012973695
- A fast lattice-based approach to vocabulary independent wordspotting
- D. A. James and S. J. Young, "A fast lattice-based approach to vocabulary independent wordspotting," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/377-I/380, 1994.
- (1994) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- James, D.A.¹ Young, S.J.²

123
- 78650973206
- Joke-o-Mat HD: Browsing sitcoms with human derived transcripts
- A. Janin, L. Gottlieb, and G. Friedland, "Joke-o-Mat HD: Browsing sitcoms with human derived transcripts," in Proceedings of the ACM International Conference on Multimedia, pp. 1591-1594, 2010.
- (2010) Proceedings of the ACM International Conference on Multimedia , pp. 1591-1594
- Janin, A.¹ Gottlieb, L.² Friedland, G.³

124
- 0003786003
- The MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition (Language, Speech, and Communication). The MIT Press, 1998.
- (1998) Statistical Methods for Speech Recognition (Language, Speech, and Communication)
- Jelinek, F.¹

125
- 15844411850
- Confidence measures for speech recognition: A survey
- DOI 10.1016/j.specom.2004.12.004, PII S0167639305000051
- H. Jiang, "Confidence measures for speech recognition: A survey," Speech Communication, vol. 45, no. 4, pp. 455-470, 2005. (Pubitemid 40423290)
- (2005) Speech Communication , vol.45 , Issue.4 , pp. 455-470
- Jiang, H.¹

126
- 44949247571
- Automatic title generation for spoken broadcast news
- R. Jin and A. G. Hauptmann, "Automatic title generation for spoken broadcast news," in Proceedings of the International Conference on Human Language Technology Research, pp. 1-3, 2001.
- (2001) Proceedings of the International Conference on Human Language Technology Research , pp. 1-3
- Jin, R.¹ Hauptmann, A.G.²

127
- 0002623652
- Spoken document retrieval for TREC-9 at Cambridge University
- (E. Voorhees and D. Harman, eds.)
- S. E. Johnson, P. Jourlin, K. S. Jones, and P. Woodland, "Spoken document retrieval for TREC-9 at Cambridge University," in Proceedings of the Text REtrieval Conference, (E. Voorhees and D. Harman, eds.), pp. 117-126, 2000.
- (2000) Proceedings of the Text REtrieval Conference , pp. 117-126
- Johnson, S.E.¹ Jourlin, P.² Jones, K.S.³ Woodland, P.⁴

128
- 84865274068
- Exploring the incorporation of acoustic information into term weights for spoken document retrieval
- G. J. F. Jones, "Exploring the incorporation of acoustic information into term weights for spoken document retrieval," in Proceedings of the BCS Information Retrieval Specialist Group Colloquium on Information Retrieval Research, pp. 118-131, 2000.
- (2000) Proceedings of the BCS Information Retrieval Specialist Group Colloquium on Information Retrieval Research , pp. 118-131
- Jones, G.J.F.¹

129
- 84865209310
- Multimedia information extraction
- IEEE Computer Society Press
- G. J. F. Jones and C. H. Chan, "Multimedia information extraction," Chapter Affect-Based Indexing for Multimedia Data. IEEE Computer Society Press, 2012.
- (2012) Chapter Affect-Based Indexing for Multimedia Data
- Jones, G.J.F.¹ Chan, C.H.²

130
- 84865267414
- Automated alignment and annotation of audio-visual presentations
- (M. Agosti and C. Thanos, eds.), Springer Berlin/Heidelberg
- G. J. F. Jones and R. Edens, "Automated alignment and annotation of audio-visual presentations," in Research and Advanced Technology for Digital Libraries, vol. 2458 of Lecture Notes in Computer Science, (M. Agosti and C. Thanos, eds.), pp. 187-196, Springer Berlin/Heidelberg, 2002.
- (2002) Research and Advanced Technology for Digital Libraries, vol. 2458 of Lecture Notes in Computer Science , pp. 187-196
- Jones, G.J.F.¹ Edens, R.²

131
- 0028996903
- Video mail retrieval: The effect of word spotting accuracy on precision
- G. J. F. Jones, J. T. Foote, K. Sparck Jones, and S. J. Young, "Video mail retrieval: The effect of word spotting accuracy on precision," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/309-I/312, 1995.
- (1995) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Jones, G.J.F.¹ Foote, J.T.² Sparck Jones, K.³ Young, S.J.⁴

132
- 0030379111
- Retrieving spoken documents by combining multiple index sources
- G. J. F. Jones, J. T. Foote, K. Spärck Jones, and S. J. Young, "Retrieving spoken documents by combining multiple index sources," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 30-38, 1996.
- (1996) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 30-38
- Jones, G.J.F.¹ Foote, J.T.² Spärck Jones, K.³ Young, S.J.⁴

133
- 84865235834
- A critical review of state-of-the-art technologies for cross-language speech retrieval
- Menlo Park, California
- G. J. F. Jones and D. A. James, "A critical review of state-of-the-art technologies for cross-language speech retrieval," in Cross-Language Text and Speech Retrieval Papers from the 1997 AAAI Spring Symposium, Technical Report SS-97-05, Menlo Park, California, 1997.
- (1997) Cross-Language Text and Speech Retrieval Papers from the 1997 AAAI Spring Symposium, Technical Report SS-97-05
- Jones, G.J.F.¹ James, D.A.²

134
- 84865264734
- Exeter at CLEF 2003: Cross-language spoken document retrieval experiments
- (C. Peters, J. Gonzalo, M. Braschler, and M. Kluck, eds.), Springer Berlin/Heidelberg
- G. J. F. Jones and A. M. Lam-Adesina, "Exeter at CLEF 2003: Cross-language spoken document retrieval experiments," in Comparative Evaluation of Multilingual Information Access Systems, vol. 3237 of Lecture Notes in Computer Science, (C. Peters, J. Gonzalo, M. Braschler, and M. Kluck, eds.), pp. 553-558, Springer Berlin/Heidelberg, 2004.
- (2004) Comparative Evaluation of Multilingual Information Access Systems, vol. 3237 of Lecture Notes in Computer Science , pp. 553-558
- Jones, G.J.F.¹ Lam-Adesina, A.M.²

135
- 84865263079
- Examining the contributions of automatic speech transcriptions and metadata sources for searching spontaneous conversational speech
- G. J. F. Jones, K. Zhang, E. Newman, and A. M. Lam-Adesina, "Examining the contributions of automatic speech transcriptions and metadata sources for searching spontaneous conversational speech," in Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR) Searching Spontaneous Conversational Speech Workshop, 2007.
- (2007) Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR) Searching Spontaneous Conversational Speech Workshop
- Jones, G.J.F.¹ Zhang, K.² Newman, E.³ Lam-Adesina, A.M.⁴

136
- 0002672937
- Improving retrieval on imperfect speech transcriptions (poster abstract)
- P. Jourlin, S. E. Johnson, K. Spärck Jones, and P. C. Woodland, "Improving retrieval on imperfect speech transcriptions (poster abstract)," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 283-284, 1999.
- (1999) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 283-284
- Jourlin, P.¹ Johnson, S.E.² Spärck Jones, K.³ Woodland, P.C.⁴

137
- 0034275589
- Spoken document representations for probabilistic retrieval
- P. Jourlin, S. E. Johnson, K. Spärck Jones, and P. C. Woodland, "Spoken document representations for probabilistic retrieval," Speech Communication, vol. 32, pp. 21-36, 2000.
- (2000) Speech Communication , vol.32 , pp. 21-36
- Jourlin, P.¹ Johnson, S.E.² Spärck Jones, K.³ Woodland, P.C.⁴

138
- 72449201883
- Automatic speech recognition - A brief history of the technology
- Second Edition, Elsevier
- B. H. Juang and L. R. Rabiner, "Automatic speech recognition - a brief history of the technology," in Elsevier Encyclopedia of Language and Linguistics, Second Edition, Elsevier, 2005.
- (2005) Elsevier Encyclopedia of Language and Linguistics
- Juang, B.H.¹ Rabiner, L.R.²

139
- 0003847769
- Prentice Hall
- D. Jurafsky and J. H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, 2008.
- (2008) Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition
- Jurafsky, D.¹ Martin, J.H.²

140
- 73849099105
- Social summarization: Does social feedback improve access to speech data?
- V. Kalnikaité and S. Whittaker, "Social summarization: Does social feedback improve access to speech data?," in Proceedings of the ACM Conference on Computer Supported Cooperative Work, pp. 9-12, 2008.
- (2008) Proceedings of the ACM Conference on Computer Supported Cooperative Work , pp. 9-12
- Kalnikaité, V.¹ Whittaker, S.²

141
- 70450284540
- A critical assessment of spoken utterance retrieval through approximate lattice representations
- S. Kazemian, F. Rudzicz, G. Penn, and C. Munteanu, "A critical assessment of spoken utterance retrieval through approximate lattice representations," in Proceeding of the ACM International Conference on Multimedia Information Retrieval, pp. 83-88, 2008.
- (2008) Proceeding of the ACM International Conference on Multimedia Information Retrieval , pp. 83-88
- Kazemian, S.¹ Rudzicz, F.² Penn, G.³ Munteanu, C.⁴

142
- 85135146711
- Estimating confidence using word lattices
- T. Kemp and T. Schaaf, "Estimating confidence using word lattices," in Proceedings of Eurospeech, pp. 827-830, 1997.
- (1997) Proceedings of Eurospeech , pp. 827-830
- Kemp, T.¹ Schaaf, T.²

143
- 78650906733
- The ambient spotlight: Queryless desktop search from meeting speech
- J. Kilgour, J. Carletta, and S. Renals, "The ambient spotlight: Queryless desktop search from meeting speech," in Proceedings of the ACM Multimedia Searching Spontaneous Conversational Speech Workshop, pp. 49-52, 2010.
- (2010) Proceedings of the ACM Multimedia Searching Spontaneous Conversational Speech Workshop , pp. 49-52
- Kilgour, J.¹ Carletta, J.² Renals, S.³

144
- 84898268902
- Information Science Reference
- W. Kim and J. Hansen, Speechfind: Advances in Rich Content Based Spoken Document Retrieval. pp. 173-187. Information Science Reference, 2009.
- (2009) Speechfind: Advances in Rich Content Based Spoken Document Retrieval , pp. 173-187
- Kim, W.¹ Hansen, J.²

145
- 0029202108
- Speaker segmentation for browsing recorded audio
- D. G. Kimber, L. D. Wilcox, F. R. Chen, and T. P. Moran, "Speaker segmentation for browsing recorded audio," in Conference Companion on Human Factors in Computing Systems, pp. 212-213, 1995.
- (1995) Conference Companion on Human Factors in Computing Systems , pp. 212-213
- Kimber, D.G.¹ Wilcox, L.D.² Chen, F.R.³ Moran, T.P.⁴

146
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: From features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, 2010.
- (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

147
- 0035426911
- Multilingual phone models for vocabulary-independent speech recognition tasks
- DOI 10.1016/S0167-6393(00)00093-5, PII S0167639300000935
- J. Köhler, "Multilingual phone models for vocabulary-independent speech recognition tasks," Speech Communication, vol. 35, no. 1-2, pp. 21-30, 2001. (Pubitemid 32599644)
- (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 21-30
- Köhler, J.¹

148
- 85032751882
- Content-based access to spoken audio
- DOI 10.1109/MSP.2005.1511824
- K. Koumpis and S. Renals, "Content-based access to spoken audio," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 61-69, 2005. (Pubitemid 41488521)
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 61-69
- Koumpis, K.¹ Renals, S.²

149
- 33947673332
- Automatic summarization of voicemail messages using lexical and prosodic features
- K. Koumpis and S. Renals, "Automatic summarization of voicemail messages using lexical and prosodic features," ACM Transactions on Speech and Language Processing, vol. 2, no. 1, pp. 1-24, 2005.
- (2005) ACM Transactions on Speech and Language Processing , vol.2 , Issue.1 , pp. 1-24
- Koumpis, K.¹ Renals, S.²

150
- 84900191436
- Rough'n'Ready: A meeting recorder and browser
- F. Kubala, S. Colbath, D. Liu, and J. Makhoul, "Rough'n'Ready: A meeting recorder and browser," ACM Computing Surveys, vol. 1, no. 2, 1999.
- (1999) ACM Computing Surveys , vol.1 , Issue.2
- Kubala, F.¹ Colbath, S.² Liu, D.³ Makhoul, J.⁴

151
- 0344139642
- Speech-based retrieval using semantic co-occurrence filtering
- J. Kupiec, D. Kimber, and V. Balasubramanian, "Speech-based retrieval using semantic co-occurrence filtering," in Proceedings of the International Conference on Human Language Technology Research, pp. 350-354, 1994.
- (1994) Proceedings of the International Conference on Human Language Technology Research , pp. 350-354
- Kupiec, J.¹ Kimber, D.² Balasubramanian, V.³

152
- 0036722768
- Thematic indexing of spoken documents by using self-organizing maps
- DOI 10.1016/S0167-6393(01)00042-5, PII S0167639301000425
- M. Kurimo, "Thematic indexing of spoken documents by using self-organizing maps," Speech Communication, vol. 38, no. 1-2, pp. 29-45, 2002.
- (2002) Speech Communication , vol.38 , Issue.1-2 , pp. 29-45
- Kurimo, M.¹

153
- 85009133608
- An evaluation of a spoken document retrieval baseline system in Finnish
- M. Kurimo and V. Turunen, "An evaluation of a spoken document retrieval baseline system in Finnish," in Proceedings of Interspeech, pp. 1585-1588, 2004.
- (2004) Proceedings of Interspeech , pp. 1585-1588
- Kurimo, M.¹ Turunen, V.²

154
- 33750372012
- Using string comparison in context for improved relevance feedback in different text media
- String Processing and Information Retrieval - 13th International Conference, SPIRE 2006, Proceedings LNCS
- A. M. Lam-Adesina and G. J. F. Jones, "Using string comparison in context for improved relevance feedback in different text media," in Proceedings of the String Processing on Information Retrieval Conference, pp. 229-241, 2006. (Pubitemid 44619011)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4209 , pp. 229-241
- Lam-Adesina, A.M.¹ Jones, G.J.F.²

155
- 33745217037
- Using syllable-based indexing features and language models to improve German spoken document retrieval
- M. Larson and S. Eickeler, "Using syllable-based indexing features and language models to improve German spoken document retrieval," in Proceedings of Interspeech, pp. 1217-1220, 2003.
- (2003) Proceedings of Interspeech , pp. 1217-1220
- Larson, M.¹ Eickeler, S.²

156
- 84877728825
- Overview of MediaEval 2011 rich speech retrieval task and genre tagging task
- M. Larson, M. Eskevich, R. J. F. Ordelman, C. Kofler, S. Schmiedeke, and G. J. F. Jones, "Overview of MediaEval 2011 rich speech retrieval task and genre tagging task," in Working Notes Proceedings of the MediaEval Workshop, CEUR-WS.org, 2011.
- (2011) Working Notes Proceedings of the MediaEval Workshop, CEUR-WS.org
- Larson, M.¹ Eskevich, M.² Ordelman, R.J.F.³ Kofler, C.⁴ Schmiedeke, S.⁵ Jones, G.J.F.⁶

157
- 84865222680
- Structured audio player: Supporting radio archive workflows with automatically generated structure metadata
- M. Larson and J. Köhler, "Structured audio player: Supporting radio archive workflows with automatically generated structure metadata," in Proceedings of the RIAO Conference on Large-scale Semantic Access to Content (Text, Image, Video and Sound), 2007.
- (2007) Proceedings of the RIAO Conference on Large-scale Semantic Access to Content (Text, Image, Video and Sound)
- Larson, M.¹ Köhler, J.²

158
- 70549113366
- Overview of VideoCLEF 2008: Automatic generation of topic-based feeds for dual language audiovisual content
- (C. Peters, T. Deselaers, N. Ferro, J. Gonzalo, A. Penas, G. J. F. Jones, M. Kurimo, T. Mandl, and V. Petras, eds.), Springer Berlin/Heidelberg
- M. Larson, E. Newman, and G. J. F. Jones, "Overview of VideoCLEF 2008: Automatic generation of topic-based feeds for dual language audiovisual content," in Proceedings of the Cross-language Evaluation Forum Conference on Evaluating Systems for Multilingual and Multimodal Information Access, (C. Peters, T. Deselaers, N. Ferro, J. Gonzalo, A. Penas, G. J. F. Jones, M. Kurimo, T. Mandl, and V. Petras, eds.), pp. 906-917, Springer Berlin/Heidelberg, 2009.
- (2009) Proceedings of the Cross-language Evaluation Forum Conference on Evaluating Systems for Multilingual and Multimodal Information Access , pp. 906-917
- Larson, M.¹ Newman, E.² Jones, G.J.F.³

159
- 78049336944
- Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment
- (C. Peters, B. Caputo, J. Gonzalo, G. J. F. Jones, J. Kalpathy-Cramer, H. Müller, and T. Tsikrika, eds.), Springer Berlin/Heidelberg
- M. Larson, E. Newman, and G. J. F. Jones, "Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment," in Multilingual Information Access Evaluation II. Multimedia Experiments, vol. 6242 of Lecture Notes in Computer Science, (C. Peters, B. Caputo, J. Gonzalo, G. J. F. Jones, J. Kalpathy-Cramer, H. Müller, and T. Tsikrika, eds.), pp. 354-368, Springer Berlin/Heidelberg, 2010.
- (2010) Multilingual Information Access Evaluation II. Multimedia Experiments, vol. 6242 of Lecture Notes in Computer Science , pp. 354-368
- Larson, M.¹ Newman, E.² Jones, G.J.F.³

160
- 84865263098
- The community and the crowd: Developing large-scale data collections for multimedia benchmarking
- IEEE Computer Society Digital Library. IEEE Computer Society 15 May
- M. Larson, M. Soleymani, M. Eskevich, P. Serdyukov, R. Ordelman, and G. J. F. Jones, "The community and the crowd: Developing large-scale data collections for multimedia benchmarking," IEEE Multimedia, IEEE Computer Society Digital Library. IEEE Computer Society, 15 May 2012.
- (2012) IEEE Multimedia
- Larson, M.¹ Soleymani, M.² Eskevich, M.³ Serdyukov, P.⁴ Ordelman, R.⁵ Jones, G.J.F.⁶

161
- 85043080519
- Automatic tagging and geotagging in video collections and communities
- M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. A. Murdock, G. Friedland, R. J. F. Ordelman, and G. J. F. Jones, "Automatic tagging and geotagging in video collections and communities," in Proceedings of the 1st ACM International Conference on Multimedia Retrieval, pp. 1-51, 2011.
- (2011) Proceedings of the 1st ACM International Conference on Multimedia Retrieval , pp. 1-51
- Larson, M.¹ Soleymani, M.² Serdyukov, P.³ Rudinac, S.⁴ Wartena, C.⁵ Murdock, V.A.⁶ Friedland, G.⁷ Ordelman, R.J.F.⁸ Jones, G.J.F.⁹

162
- 67650695369
- Investigating the global semantic impact of speech recognition error on spoken content collections
- (M. Boughanem, C. Berrut, J. Mothe, and C. Soule-Dupuy, eds.), Springer Berlin/Heidelberg
- M. Larson, M. Tsagkias, J. He, and M. de Rijke, "Investigating the global semantic impact of speech recognition error on spoken content collections," in Advances in Information Retrieval. Proceedings of the European Conference on IR Research, vol. 5478 of Lecture Notes in Computer Science, (M. Boughanem, C. Berrut, J. Mothe, and C. Soule-Dupuy, eds.), pp. 755-760, Springer Berlin/Heidelberg, 2009.
- (2009) Advances in Information Retrieval. Proceedings of the European Conference on IR Research, vol. 5478 of Lecture Notes in Computer Science , pp. 755-760
- Larson, M.¹ Tsagkias, M.² He, J.³ De Rijke, M.⁴

163
- 80054370614
- Cambridge University Press
- J. Laver, Principles of Phonetics (Cambridge Textbooks in Linguistics). Cambridge University Press, 1994.
- (1994) Principles of Phonetics (Cambridge Textbooks in Linguistics)
- Laver, J.¹

164
- 0034785304
- Relevance based language models
- ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval
- V. Lavrenko and W. B. Croft, "Relevance based language models," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 120-127, 2001.
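- (2001) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 120-127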
- Lavrenko, V.¹ Croft, W.B.²

165
- 84904361490
- A Korean spoken document retrieval system for lecture search
- D. Lee and G. G. Lee, "A Korean spoken document retrieval system for lecture search," in Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR) Searching Spontaneous Conversational Speech Workshop, 2008.
- (2008) Proceedings of the ACM Special Interest Group on Information Retrieval (SIGIR) Searching Spontaneous Conversational Speech Workshop
- Lee, D.¹ Lee, G.G.²

166
- 85032751176
- Spoken document understanding and organization
- DOI 10.1109/MSP.2005.1511823
- L.-S. Lee and B. Chen, "Spoken document understanding and organization," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 42-60, 2005. (Pubitemid 41488520)
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 42-60
- Lee, L.-S.¹ Chen, B.²

167
- 33646815291
- Combining multiple subword representations for open-vocabulary spoken document retrieval
- S.-W. Lee, K. Tanaka, and Y. Itoh, "Combining multiple subword representations for open-vocabulary spoken document retrieval," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 505-508, 2005.
- (2005) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 505-508
- Lee, S.-W.¹ Tanaka, K.² Itoh, Y.³

168
- 33750329509
- One-sided measures for evaluating ranked retrieval effectiveness with spontaneous conversational speech
- B. Liu and D. W. Oard, "One-sided measures for evaluating ranked retrieval effectiveness with spontaneous conversational speech," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 673-674, 2006.
- (2006) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 673-674
- Liu, B.¹ Oard, D.W.²

169
- 34047266607
- Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
- Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper, "Enriching speech recognition with automatic detection of sentence boundaries and disfluencies," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 5, pp. 1526-1540, 2006.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.5 , pp. 1526-1540
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Hillard, D.⁴ Ostendorf, M.⁵ Harper, M.⁶

170
- 3042818033
- Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion
- W.-K. Lo, H. Meng, and P. C. Ching, "Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion," ACM Transactions on Asian Language Information Processing, vol. 2, no. 1, pp. 1-26, 2003.
- (2003) ACM Transactions on Asian Language Information Processing , vol.2 , Issue.1 , pp. 1-26
- Lo, W.-K.¹ Meng, H.² Ching, P.C.³

171
- 0038376815
- IFINDER: An MPEG-7-based retrieval system for distributed multimedia content
- J. Löffler, K. Biatov, C. Eckes, and J. Köhler, "IFINDER: An MPEG-7-based retrieval system for distributed multimedia content," in Proceedings of the ACM International Conference on Multimedia, pp. 431-435, 2002.
- (2002) Proceedings of the ACM International Conference on Multimedia , pp. 431-435
- Löffler, J.¹ Biatov, K.² Eckes, C.³ Köhler, J.⁴

172
- 78649263967
- Word and sub-word indexing approaches for reducing the effects of OOV queries on spoken audio
- B. Logan, P. Moreno, and O. Deshmukh, "Word and sub-word indexing approaches for reducing the effects of OOV queries on spoken audio," in Proceedings of the International Conference on Human Language Technology Research, pp. 31-35, 2002.
- (2002) Proceedings of the International Conference on Human Language Technology Research , pp. 31-35
- Logan, B.¹ Moreno, P.² Deshmukh, O.³

173
- 85009285063
- Confusion-based query expansion for OOV words in spoken document retrieval
- B. Logan and J. M. V. Thong, "Confusion-based query expansion for OOV words in spoken document retrieval," in Proceedings of Interspeech, pp. 1997-2000, 2002.
- (2002) Proceedings of Interspeech , pp. 1997-2000
- Logan, B.¹ Thong, J.M.V.²

174
- 26844534218
- Approaches to reduce the effects of OOV queries on indexed spoken audio
- DOI 10.1109/TMM.2005.854429
- B. Logan, J. M. Van Thong, and P. J. Moreno, "Approaches to reduce the effects of OOV queries on indexed spoken audio," IEEE Transactions on Multimedia, vol. 7, no. 5, pp. 899-906, 2005. (Pubitemid 41452518)
- (2005) IEEE Transactions on Multimedia , vol.7 , Issue.5 , pp. 899-906
- Logan, B.¹ Van Thong, J.M.² Moreno, P.J.³

175
- 77950799692
- Minimum cut model for spoken lecture segmentation
- I. Malioutov and R. Barzilay, "Minimum cut model for spoken lecture segmentation," in Proceedings of the International Conference on Computational Linguistics and the Annual Meeting of the Association for Computational Linguistics, pp. 25-32, 2006.
- (2006) Proceedings of the International Conference on Computational Linguistics and the Annual Meeting of the Association for Computational Linguistics , pp. 25-32
- Malioutov, I.¹ Barzilay, R.²

176
- 33750331971
- Spoken document retrieval from callcenter conversations
- J. Mamou, D. Carmel, and R. Hoory, "Spoken document retrieval from callcenter conversations," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 51-58, 2006.
- (2006) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 51-58
- Mamou, J.¹ Carmel, D.² Hoory, R.³

177
- 0034296009
- Finding consensus among words: Lattice-based word error minimisation
- L. Mangu, E. Brill, and A. Stolcke, "Finding consensus among words: Lattice-based word error minimisation," Computer Speech and Language, vol. 14, no. 4, pp. 373-400, 2000.
- (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

178
- 34548080780
- Cambridge University Press
- C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval. Cambridge University Press, 2008.
- (2008) Introduction to Information Retrieval
- Manning, C.D.¹ Raghavan, P.² Schütze, H.³

179
- 80053080845
- Automatic detection of well recognized words in automatic speech transcription
- J. Mauclair, Y. Esteve, S. Petitrenaud, and P. Deléglise, "Automatic detection of well recognized words in automatic speech transcription," in Proceedings of the International Conference on Language Resources and Evaluation, 2006.
- (2006) Proceedings of the International Conference on Language Resources and Evaluation
- Mauclair, J.¹ Esteve, Y.² Petitrenaud, S.³ Deléglise, P.⁴

180
- 0003515632
- The MIT Press
- M. T. Maybury, ed., Intelligent Multimedia Information Retrieval. The MIT Press, 1997.
- (1997) Intelligent Multimedia Information Retrieval
- Maybury, M.T.¹

181
- 84964500666
- Approaches to topic identification on the Switchboard corpus
- J. McDonough, K. Ng, P. Jeanrenaud, H. Gish, and J. R. Rohlicek, "Approaches to topic identification on the Switchboard corpus," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/385-I/388, 1994.
- (1994) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- McDonough, J.¹ Ng, K.² Jeanrenaud, P.³ Gish, H.⁴ Rohlicek, J.R.⁵

182
- 70349206325
- Improved lattice-based spoken document retrieval by directly learning from the evaluation measures
- C.-H. Meng, H.-Y. Lee, and L.-S. Lee, "Improved lattice-based spoken document retrieval by directly learning from the evaluation measures," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4893-4896, 2009.
- (2009) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 4893-4896
- Meng, C.-H.¹ Lee, H.-Y.² Lee, L.-S.³

183
- 12144286470
- Mandarin-English Information (MEI): Investigating translingual speech retrieval
- H. Meng, B. Chen, S. Khudanpur, G. Levow, W. Lo, D. W. Oard, P. Schone, K. Tang, H. Wang, and J. Wang, "Mandarin-English Information (MEI): Investigating translingual speech retrieval," Computer Speech and Language, vol. 18, no. 2, pp. 163-179, 2004.
- (2004) Computer Speech and Language , vol.18 , Issue.2 , pp. 163-179
- Meng, H.¹ Chen, B.² Khudanpur, S.³ Levow, G.⁴ Lo, W.⁵ Oard, D.W.⁶ Schone, P.⁷ Tang, K.⁸ Wang, H.⁹ Wang, J.¹⁰

184
- 70349212534
- Efficient subword lattice retrieval for German spoken term detection
- T. Mertens and D. Schneider, "Efficient subword lattice retrieval for German spoken term detection," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4885-4888, 2009.
- (2009) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 4885-4888
- Mertens, T.¹ Schneider, D.²

185
- 70450194220
- Merging search spaces for spoken term detection
- T. Mertens, D. Schneider, and J. Köhler, "Merging search spaces for spoken term detection," in Proceedings of Interspeech, pp. 2127-2130, 2009.
- (2009) Proceedings of Interspeech , pp. 2127-2130
- Mertens, T.¹ Schneider, D.² Köhler, J.³

186
- 84885662673
- A Markov random field model for term dependencies
- D. Metzler and W. B. Croft, "A Markov random field model for term dependencies," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 472-479, 2005.
- (2005) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 472-479
- Metzler, D.¹ Croft, W.B.²

187
- 24644434943
- Boosting Web retrieval through query operations
- Advances in Information Retrieval - 27th European Conference on IR Research, ECIR 2005
- G. Mishne and M. de Rijke, "Boosting web retrieval through query operations," in Advances in Information Retrieval, pp. 502-516, Springer, 2005. (Pubitemid 41272928)
- (2005) Lecture Notes in Computer Science , vol.3408 , pp. 502-516
- Mishne, G.¹ De Rijke, M.²

188
- 67649562145
- A similar content retrieval method for podcast episodes
- J. Mizuno, J. Ogata, and M. Goto, "A similar content retrieval method for podcast episodes," in IEEE Spoken Language Technology Workshop, pp. 297-300, 2009.
- (2009) IEEE Spoken Language Technology Workshop , pp. 297-300
- Mizuno, J.¹ Ogata, J.² Goto, M.³

189
- 34547513206
- Castsearch - Context based spoken document retrieval
- L. L. Molgaard, K. W. Jorgensen, and L. K. Hansen, "Castsearch - context based spoken document retrieval," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. IV/93-IV/96, 2007.
- (2007) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Molgaard, L.L.¹ Jorgensen, K.W.² Hansen, L.K.³

190
- 33750546860
- Infolink: Analysis of Dutch broadcast news and cross-media browsing
- DOI 10.1109/ICME.2005.1521738, 1521738, IEEE International Conference on Multimedia and Expo, ICME 2005
- J. Morang, R. J. F. Ordelman, F. M. G. de Jong, and A. J. van Hessen, "Infolink: Analysis of Dutch broadcast news and cross-media browsing," in IEEE International Conference on Multimedia and Expo, pp. 1582-1585, 2005. (Pubitemid 44669182)
- (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 1582-1585
- Morang, J.¹ Ordelman, R.² De Jong, F.³ Van Hessen, A.⁴

191
- 33745218075
- Comparison of different phone-based spoken document retrieval methods with text and spoken queries
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- N. Moreau, S. Jin, and T. Sikora, "Comparison of different phone-based spoken document retrieval methods with text and spoken queries," in Proceedings of Interspeech, pp. 641-644, 2005. (Pubitemid 43908144)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 641-644
- Moreau, N.¹ Jin, S.² Sikora, T.³

192
- 33745186799
- Phonetic confusion based document expansion for spoken document retrieval
- N. Moreau, H.-G. Kim, and T. Sikora, "Phonetic confusion based document expansion for spoken document retrieval," in Proceedings of Interspeech, pp. 1593-1596, 2004.
- (2004) Proceedings of Interspeech , pp. 1593-1596
- Moreau, N.¹ Kim, H.-G.² Sikora, T.³

193
- 0036534571
- From multimedia retrieval to knowledge management
- P. J. Moreno, J. M. Van Thong, B. Logan, and G. J. F. Jones, "From multimedia retrieval to knowledge management," Computer, vol. 35, no. 4, pp. 58-66, 2002. (Pubitemid 34291867)
- (2002) Computer , vol.35 , Issue.4 , pp. 58-66
- Moreno, P.J.¹ Van Thong, J.-M.² Logan, B.³ Jones, G.J.F.⁴

194
- 33745856298
- The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives
- CHI 2006: Conference on Human Factors in Computing Systems, Conference Proceedings SIGCHI
- C. Munteanu, R. Baecker, G. Penn, E. Toms, and D. James, "The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives," in Proceedings of the Special Interest Group on Computer-Human Interaction (SIGCHI) Conference on Human Factors in Computing Systems, pp. 493-502, 2006. (Pubitemid 44032136)
- (2006) Conference on Human Factors in Computing Systems - Proceedings , vol.1 , pp. 493-502
- Munteanu, C.¹ Baecker, R.² Penn, G.³ Toms, E.⁴ James, D.⁵

195
- 0034274806
- Experiments in spoken document retrieval using phoneme n-grams
- C. Ng, R. Wilkinson, and J. Zobel, "Experiments in spoken document retrieval using phoneme n-grams," Speech Communication, vol. 32, no. 1-2, pp. 61-77, 2000.
- (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 61-77
- Ng, C.¹ Wilkinson, R.² Zobel, J.³

196
- 0002470735
- Subword unit representations for spoken document retrieval
- K. Ng and V. W. Zue, "Subword unit representations for spoken document retrieval," in Proceedings of Eurospeech, pp. 1607-1610, 1997.
- (1997) Proceedings of Eurospeech , pp. 1607-1610
- Ng, K.¹ Zue, V.W.²

197
- 0034300710
- Subword-based approaches for spoken document retrieval
- K. Ng and V. W. Zue, "Subword-based approaches for spoken document retrieval," Speech Communication, vol. 32, no. 3, pp. 157-186, 2000.
- (2000) Speech Communication , vol.32 , Issue.3 , pp. 157-186
- Ng, K.¹ Zue, V.W.²

198
- 34250014992
- Language-dependent state clustering for multilingual acoustic modelling
- DOI 10.1016/j.specom.2007.04.001, PII S0167639307000611
- T. Niesler, "Language-dependent state clustering for multilingual acoustic modelling," Speech Communication, vol. 49, no. 6, pp. 453-463, 2007. (Pubitemid 46891623)
- (2007) Speech Communication , vol.49 , Issue.6 , pp. 453-463
- Niesler, T.¹

199
- 44849099471
- NIST
- NIST, The Spoken Term Detection (STD) 2006 Evaluation Plan, 2006.
- (2006) The Spoken Term Detection (STD) 2006 Evaluation Plan

200
- 33750225607
- A system for information retrieval from large records of Czech spoken data
- Text, Speech and Dialogue - 9th International Conference, TSD 2006, Proceedings LNCS
- J. Nouza, J. Zdansky, P. Cerva, and J. Kolorenc, "A system for information retrieval from large records of Czech spoken data," in Text, Speech and Dialogue, vol. 4188 of Lecture Notes in Computer Science, (P. Sojka, I. Kopecek, and K. Pala, eds.), pp. 485-492, Springer Berlin/Heidelberg, 2006. (Pubitemid 44609052)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4188 , pp. 485-492
- Nouza, J.¹ Zd'ansky, J.² Cerva, P.³ Kolorenc, J.⁴

201
- 85121253341
- The application of dynamic programming techniques to non-word based topic spotting
- P. Nowell and R. K. Moore, "The application of dynamic programming techniques to non-word based topic spotting," in Proceedings of Eurospeech, pp. 1355-1358, 1995.
- (1995) Proceedings of Eurospeech , pp. 1355-1358
- Nowell, P.¹ Moore, R.K.²

202
- 0012753821
- Speech-based information retrieval for digital libraries
- D. W. Oard, "Speech-based information retrieval for digital libraries," Technical Report CS-TR-3778, University of Maryland, 1997.
- (1997) Technical Report CS-TR-3778, University of Maryland
- Oard, D.W.¹

203
- 33745093235
- User interface design for speech-based retrieval
- D. W. Oard, "User interface design for speech-based retrieval," Bulletin of the American Society for Information Science and Technology, vol. 26, no. 5, pp. 20-22, 2000.
- (2000) Bulletin of the American Society for Information Science and Technology , vol.26 , Issue.5 , pp. 20-22
- Oard, D.W.¹

204
- 8644291021
- Building an information retrieval test collection for spontaneous conversational speech
- D. W. Oard, D. Soergel, D. Doermann, X. Huang, C. G. Murray, J. Wang, B. Ramabhadran, M. Franz, S. Gustman, J. Mayfield, L. Kharevych, and S. Strassel, "Building an information retrieval test collection for spontaneous conversational speech," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 41-48, 2004.
- (2004) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 41-48
- Oard, D.W.¹ Soergel, D.² Doermann, D.³ Huang, X.⁴ Murray, C.G.⁵ Wang, J.⁶ Ramabhadran, B.⁷ Franz, M.⁸ Gustman, S.⁹ Mayfield, J.¹⁰ Kharevych, L.¹¹ Strassel, S.¹²

205
- 38049169483
- Overview of the CLEF-2006 cross-language speech retrieval track
- (C. Peters, P. Clough, F. Gey, J. Karlgren, B. Magnini, D. Oard, M. de Rijke, and M. Stempfhuber, eds.), Springer Berlin/Heidelberg
- D. W. Oard, J. Wang, G. J. F. Jones, R. White, P. Pecina, D. Soergel, X. Huang, and I. Shafran, "Overview of the CLEF-2006 cross-language speech retrieval track," in Evaluation of Multilingual and Multi-modal Information Retrieval, vol. 4730 of Lecture Notes in Computer Science, (C. Peters, P. Clough, F. Gey, J. Karlgren, B. Magnini, D. Oard, M. de Rijke, and M. Stempfhuber, eds.), pp. 744-758, Springer Berlin/Heidelberg, 2007.
- (2007) Evaluation of Multilingual and Multi-modal Information Retrieval, vol. 4730 of Lecture Notes in Computer Science , pp. 744-758
- Oard, D.W.¹ Wang, J.² Jones, G.J.F.³ White, R.⁴ Pecina, P.⁵ Soergel, D.⁶ Huang, X.⁷ Shafran, I.⁸

206
- 34547311929
- Fischlar-TRECVid-2004: Combined text- and image-based searching of video archives
- N. A. O'Connor, H. Lee, A. F. Smeaton, G. J. F. Jones, E. Cooke, H. Le Borgne, and C. Gurrin, "Fischlar-TRECVid-2004: Combined text- and image-based searching of video archives," in Proceedings of the IEEE International Symposium on Circuits and Systems, 2006.
- (2006) Proceedings of the IEEE International Symposium on Circuits and Systems
- O'Connor, N.A.¹ Lee, H.² Smeaton, A.F.³ Jones, G.J.F.⁴ Cooke, E.⁵ Le Borgne, H.⁶ Gurrin, C.⁷

207
- 67149138696
- Automatic transcription for a Web 2.0 service to search podcasts
- J. Ogata, M. Goto, and K. Eto, "Automatic transcription for a Web 2.0 service to search podcasts," in Proceedings of Interspeech, pp. 2617-2620, 2007.
- (2007) Proceedings of Interspeech , pp. 2617-2620
- Ogata, J.¹ Goto, M.² Eto, K.³

208
- 84867193207
- Vocabulary independent discriminative term frequency estimation
- J. S. Olsson, "Vocabulary independent discriminative term frequency estimation," in Proceedings of Interspeech, pp. 2187-2190, 2008.
- (2008) Proceedings of Interspeech , pp. 2187-2190
- Olsson, J.S.¹

209
- 72449159219
- Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search
- J. S. Olsson and D. W. Oard, "Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 91-98, 2009.
- (2009) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 91-98
- Olsson, J.S.¹ Oard, D.W.²

210
- 84858392859
- Phrase-based query degradation modeling for vocabulary-independent ranked utterance retrieval
- J. S. Olsson and D. W. Oard, "Phrase-based query degradation modeling for vocabulary-independent ranked utterance retrieval," in Proceedings of Human Language Technologies Conference of the North American Chapter of the Association for Computational Linguistics, pp. 182-190, 2009.
- (2009) Proceedings of Human Language Technologies Conference of the North American Chapter of the Association for Computational Linguistics , pp. 182-190
- Olsson, J.S.¹ Oard, D.W.²

211
- 76649088991
- Towards affordable disclosure of spoken heritage archives
- R. J. F. Ordelman, W. F. L. Heeren, M. A. H. Huijbregts, F. M. G. de Jong, and D. Hiemstra, "Towards affordable disclosure of spoken heritage archives," Journal of Digital Information, Special Issue on Information Access to Cultural Heritage, vol. 10, no. 6, 2009.
- (2009) Journal of Digital Information, Special Issue on Information Access to Cultural Heritage , vol.10 , Issue.6
- Ordelman, R.J.F.¹ Heeren, W.F.L.² Huijbregts, M.A.H.³ De Jong, F.M.G.⁴ Hiemstra, D.⁵

212
- 84942565777
- Speech recognition issues for Dutch spoken document retrieval
- R. J. F. Ordelman, A. J. van Hessen, and F. M. G. de Jong, "Speech recognition issues for Dutch spoken document retrieval," in Proceedings of the International Conference on Text, Speech and Dialogue, pp. 258-265, 2001.
- (2001) Proceedings of the International Conference on Text, Speech and Dialogue , pp. 258-265
- Ordelman, R.J.F.¹ Van Hessen, A.J.² De Jong, F.M.G.³

213
- 79959723349
- SVM classification using sequences of phonemes and syllables
- (T. Elomaa, H. Mannila, and H. Toivonen, eds.), Springer Berlin/Heidelberg
- G. Paas, E. Leopold, M. Larson, J. Kindermann, and S. Eickeler, "SVM classification using sequences of phonemes and syllables," in Principles of Data Mining and Knowledge Discovery, vol. 2431 of Lecture Notes in Computer Science, (T. Elomaa, H. Mannila, and H. Toivonen, eds.), pp. 373-384, Springer Berlin/Heidelberg, 2002.
- (2002) Principles of Data Mining and Knowledge Discovery 2431 of Lecture Notes in Computer Science , pp. 373-384
- Paas, G.¹ Leopold, E.² Larson, M.³ Kindermann, J.⁴ Eickeler, S.⁵

214
- 0012739214
- Measurements in support of research accomplishments
- D. S. Pallett, J. S. Garofolo, and J. G. Fiscus, "Measurements in support of research accomplishments," Communications of the ACM, vol. 43, no. 2, pp. 75-79, 2000.
- (2000) Communications of the ACM , vol.43 , Issue.2 , pp. 75-79
- Pallett, D.S.¹ Garofolo, J.S.² Fiscus, J.G.³

215
- 77955759248
- Performance analysis for lattice-based speech indexing approaches using words and subword units
- Y.-C. Pan and L.-S. Lee, "Performance analysis for lattice-based speech indexing approaches using words and subword units," IEEE Transactions on Speech and Audio Processing, vol. 18, no. 6, pp. 1562-1574, 2010.
- (2010) IEEE Transactions on Speech and Audio Processing , vol.18 , Issue.6 , pp. 1562-1574
- Pan, Y.-C.¹ Lee, L.-S.²

216
- 0030657236
- Cross-language speech retrieval: Establishing a baseline performance
- P. Sheridan, M. Wechsler, and P. Schäuble, "Cross-language speech retrieval: Establishing a baseline performance," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 99-108, 1997. (Pubitemid 127720310)
- (1997) SIGIR Forum (ACM Special Interest Group on Information Retrieval) , vol.31 , Issue.1 SPEC. ISS. , pp. 99-108
- Sheridan, P.¹ Wechsler, M.² Schauble, P.³

217
- 70349788975
- Overview of the CLEF 2007 cross-language speech retrieval track
- (C. Peters, V. Jijkoun, T. Mandl, H. Müller, D. W. Oard, A. Penas, V. Petras, and D. Santos, eds.), Springer Berlin/Heidelberg
- P. Pecina, P. Hoffmannova, G. J. F. Jones, Y. Zhang, and D. W. Oard, "Overview of the CLEF 2007 cross-language speech retrieval track," in Advances in Multilingual and Multimodal Information Retrieval, vol. 5152 of Lecture Notes in Computer Science, (C. Peters, V. Jijkoun, T. Mandl, H. Müller, D. W. Oard, A. Penas, V. Petras, and D. Santos, eds.), pp. 674-686, Springer Berlin/Heidelberg, 2008.
- (2008) Advances in Multilingual and Multimodal Information Retrieval 5152 of Lecture Notes in Computer Science , pp. 674-686
- Pecina, P.¹ Hoffmannova, P.² Jones, G.J.F.³ Zhang, Y.⁴ Oard, D.W.⁵

218
- 0032268440
- A language modeling approach to information retrieval
- J. M. Ponte and W. B. Croft, "A language modeling approach to information retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 275-281, 1998.
- (1998) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 275-281
- Ponte, J.M.¹ Croft, W.B.²

219
- 70349199063
- Non-speech audio event detection
- J. Portelo, M. Bugalho, I. Trancoso, J. Neto, A. Abad, and A. Serralheiro, "Non-speech audio event detection," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1973-1976, 2009.
- (2009) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 1973-1976
- Portelo, J.¹ Bugalho, M.² Trancoso, I.³ Neto, J.⁴ Abad, A.⁵ Serralheiro, A.⁶

220
- 0347338002
- Robust recognition of children's speech
- A. Potamianos and S. Narayanan, "Robust recognition of children's speech," IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pp. 603-616, 2003.
- (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , Issue.6 , pp. 603-616
- Potamianos, A.¹ Narayanan, S.²

221
- 0004244302
- Prentice Hall
- L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. Prentice Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

222
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

223
- 0016939166
- Speech recognition by machine: A review
- D. R. Reddy, "Speech recognition by machine: A review," Proceedings of the IEEE, vol. 64, no. 4, pp. 501-531, 1976. (Pubitemid 8019231)
- (1976) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 501-531
- Reddy, D.R.¹

224
- 84962787580
- The ALERT system: Advanced broadcast speech recognition technology for selective dissemination of multimedia information
- G. Rigoll, "The ALERT system: Advanced broadcast speech recognition technology for selective dissemination of multimedia information," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 301-306, 2001.
- (2001) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 301-306
- Rigoll, G.¹

225
- 0001737422
- On term selection for query expansion
- S. E. Robertson, "On term selection for query expansion," Journal of Documentation, vol. 46, no. 4, pp. 359-364, 1990.
- (1990) Journal of Documentation , vol.46 , Issue.4 , pp. 359-364
- Robertson, S.E.¹

226
- 0016958419
- Relevance weighting of search terms
- S. E. Robertson and K. Spärck Jones, "Relevance weighting of search terms," Journal of the American Society of Information Science, vol. 27, no. 3, pp. 129-146, 1976.
- (1976) Journal of the American Society of Information Science , vol.27 , Issue.3 , pp. 129-146
- Robertson, S.E.¹ Spärck Jones, K.²

227
- 84966534942
- Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
- S. E. Robertson and S. Walker, "Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 232-241, 1994.
- (1994) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 232-241
- Robertson, S.E.¹ Walker, S.²

228
- 0001319911
- Okapi at TREC-3
- S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford, "Okapi at TREC-3," in Proceedings of the Text REtrieval Conference, pp. 109-126, 1996.
- (1996) Proceedings of the Text REtrieval Conference , pp. 109-126
- Robertson, S.E.¹ Walker, S.² Jones, S.³ Hancock-Beaulieu, M.M.⁴ Gatford, M.⁵

229
- 18744388867
- Simple BM25 extension to multiple weighted fields
- CIKM 2004: Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management
- S. E. Robertson, H. Zaragoza, and M. J. Taylor, "Simple BM25 extension to multiple weighted fields," in Proceedings of the International Conference on Information and Knowledge Management, pp. 42-49, 2004. (Pubitemid 40673422)
- (2004) International Conference on Information and Knowledge Management, Proceedings , pp. 42-49
- Robertson, S.¹ Zaragoza, H.² Taylor, M.³

230
- 0039627177
- Techniques for information retrieval from speech messages
- R. C. Rose, "Techniques for information retrieval from speech messages," Lincoln Laboratory Journal, vol. 4, no. 1, pp. 45-60, 1991.
- (1991) Lincoln Laboratory Journal , vol.4 , Issue.1 , pp. 45-60
- Rose, R.C.¹

231
- 0026388699
- Techniques for information retrieval from voice messages
- R. C. Rose, E. I. Chang, and R. P. Lippmann, "Techniques for information retrieval from voice messages," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/317-I/320, 1991.
- (1991) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Rose, R.C.¹ Chang, E.I.² Lippmann, R.P.³

232
- 0025592394
- A hidden Markov model based keyword recognition system
- R. C. Rose and D. B. Paul, "A hidden Markov model based keyword recognition system," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/129-I/132, 1990.
- (1990) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Rose, R.C.¹ Paul, D.B.²

233
- 44849099019
- The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions
- S. Rosset, O. Galibert, G. Adda, and E. Bilinski, "The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 647-652, 2007.
- (2007) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 647-652
- Rosset, S.¹ Galibert, O.² Adda, G.³ Bilinski, E.⁴

234
- 0034440695
- Automatically extracting highlights for TV baseball programs
- Y. Rui, A. Gupta, and A. Acero, "Automatically extracting highlights for TV baseball programs," in Proceedings of the ACM International Conference on Multimedia, pp. 105-115, 2000.
- (2000) Proceedings of the ACM International Conference on Multimedia , pp. 105-115
- Rui, Y.¹ Gupta, A.² Acero, A.³

235
- 45549117987
- Term-weighting approaches in automatic text retrieval
- G. Salton and C. Buckley, "Term-weighting approaches in automatic text retrieval," Information Processing and Management, vol. 24, no. 5, pp. 513-523, 1988.
- (1988) Information Processing and Management , vol.24 , Issue.5 , pp. 513-523
- Salton, G.¹ Buckley, C.²

236
- 84945186654
- Mixing and merging for spoken document retrieval
- M. Sanderson and F. Crestani, "Mixing and merging for spoken document retrieval," in Proceedings of the European Conference on Research and Advanced Technology for Digital Libraries, pp. 397-407, 1998.
- (1998) Proceedings of the European Conference on Research and Advanced Technology for Digital Libraries , pp. 397-407
- Sanderson, M.¹ Crestani, F.²

237
- 37149040193
- Search of spoken documents retrieves well recognized transcripts
- Advances in Information Retrieval - 29th European Conference on IR Research, ECIR 2007, Proceedings LNCS
- M. Sanderson and X.-M. Shou, "Search of spoken documents retrieves well recognized transcripts," in Advances in Information Retrieval. Proceedings of the European Conference on IR Research, (G. Amati, C. Carpineto, and G. Romano, eds.), pp. 505-516, Springer Berlin/Heidelberg, 2007. (Pubitemid 350259576)
- (2007) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4425 , pp. 505-516
- Sanderson, M.¹ Shou, X.M.²

238
- 85050187568
- Lattice-based search for spoken utterance retrieval
- M. Saraclar and R. W. Sproat, "Lattice-based search for spoken utterance retrieval," in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 129-136, 2004.
- (2004) Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics , pp. 129-136
- Saraclar, M.¹ Sproat, R.W.²

239
- 0030715426
- Confidence measures for spontaneous speech recognition
- T. Schaaf and T. Kemp, "Confidence measures for spontaneous speech recognition," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. II/875-II/878, 1997.
- (1997) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Schaaf, T.¹ Kemp, T.²

240
- 2642521115
- Assessing the retrieval effectiveness of a speech retrieval system by simulating recognition errors
- P. Schäuble and U. Glavitsch, "Assessing the retrieval effectiveness of a speech retrieval system by simulating recognition errors," in Proceedings of the Workshop on Human Language Technology, pp. 347-349, 1994.
- (1994) Proceedings of the Workshop on Human Language Technology , pp. 347-349
- Schäuble, P.¹ Glavitsch, U.²

241
- 0039789208
- First experiences with a system for content based retrieval of information from speech recordings
- P. Schäuble and M. Wechsler, "First experiences with a system for content based retrieval of information from speech recordings," in Proceedings of the IJCAI Workshop on Intelligent Multimedia Information Retrieval, pp. 59-69, 1995.
- (1995) Proceedings of the IJCAI Workshop on Intelligent Multimedia Information Retrieval , pp. 59-69
- Schäuble, P.¹ Wechsler, M.²

242
- 0019683470
- The intelligent ear: A graphical interface to digital audio
- C. Schmandt, "The intelligent ear: A graphical interface to digital audio," in Proceedings of the International Conference on Cybernetics and Society, pp. 393-397, 1981. (Pubitemid 12483507)
- (1981) Proceedings - International Conference on Cybernetics and Society , pp. 393-397
- Schmandt, C.¹

243
- 84865209315
- PhD thesis University of Bonn
- D. Schneider, "Holistic vocabulary independent spoken term detection," PhD thesis, University of Bonn, 2011.
- (2011) Holistic Vocabulary Independent Spoken Term Detection
- Schneider, D.¹

244
- 4544316885
- Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
- B. Schuller, G. Rigoll, and M. Lang, "Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/577-I/580, 2004.
- (2004) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Schuller, B.¹ Rigoll, G.² Lang, M.³

245
- 84855302239
- Experiments in spoken document retrieval at CMU
- M. A. Siegler, A. Berger, M. Witbrock, and A. Hauptmann, "Experiments in spoken document retrieval at CMU," in Proceedings of the Text Retrieval Conference, pp. 319-326, 1998.
- (1998) Proceedings of the Text Retrieval Conference , pp. 319-326
- Siegler, M.A.¹ Berger, A.² Witbrock, M.³ Hauptmann, A.⁴

246
- 0032649342
- Improving the suitability of imperfect transcriptions for information retrieval from spoken documents
- M. Siegler and M. Witbrock, "Improving the suitability of imperfect transcriptions for information retrieval from spoken documents," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/505-I/508, 1999.
- (1999) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Siegler, M.¹ Witbrock, M.²

247
- 33745185674
- PhD thesis Carnegie Mellon University
- M. A. Siegler, "Integration of continuous speech recognition and information retrieval for mutually optimal performance," PhD thesis, Carnegie Mellon University, 1999.
- (1999) Integration of Continuous Speech Recognition and Information Retrieval for Mutually Optimal Performance
- Siegler, M.A.¹

248
- 48749097764
- Integration of metadata in spoken document search using position specific posterior lattices
- J. Silva, C. Chelba, and A. Acero, "Integration of metadata in spoken document search using position specific posterior lattices," in Proceedings of the IEEE Spoken Language Technology Workshop, pp. 46-49, 2006.
- (2006) Proceedings of the IEEE Spoken Language Technology Workshop , pp. 46-49
- Silva, J.¹ Chelba, C.² Acero, A.³

249
- 0030402534
- Pivoted document length normalization
- A. Singhal, C. Buckley, and M. Mitra, "Pivoted document length normalization," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 21-29, 1996.
- (1996) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 21-29
- Singhal, A.¹ Buckley, C.² Mitra, M.³

250
- 0002878614
- AT&T at TREC-7
- A. Singhal, J. Choi, D. Hindle, D. D. Lewis, and F. Pereira, "AT&T at TREC-7," in Proceedings of the Text REtrieval Conference, pp. 239-252, 1999.
- (1999) Proceedings of the Text REtrieval Conference , pp. 239-252
- Singhal, A.¹ Choi, J.² Hindle, D.³ Lewis, D.D.⁴ Pereira, F.⁵

251
- 85009102300
- Document expansion for speech retrieval
- A. Singhal and F. Pereira, "Document expansion for speech retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 34-41, 1999.
- (1999) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 34-41
- Singhal, A.¹ Pereira, F.²

252
- 33745213806
- Fast vocabulary-independent audio search using path-based graph indexing
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- O. Siohan and M. Bacchiani, "Fast vocabulary-independent audio search using path-based graph indexing," in Proceedings of Interspeech, pp. 53-56, 2005. (Pubitemid 43907999)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 53-56
- Siohan, O.¹ Bacchiani, M.²

253
- 0031641167
- A graphical interface for speech-based retrieval
- L. Slaughter, D. W. Oard, V. L. Warnick, J. L. Harding, and G. J. Wilkerson, "A graphical interface for speech-based retrieval," in Proceedings of the ACM Conference on Digital Libraries, pp. 305-306, 1998.
- (1998) Proceedings of the ACM Conference on Digital Libraries , pp. 305-306
- Slaughter, L.¹ Oard, D.W.² Warnick, V.L.³ Harding, J.L.⁴ Wilkerson, G.J.⁵

254
- 84865231398
- Taiscéalaí: Information Retrieval from an Archive of Spoken Radio News
- Research and Advanced Technology for Digital Libraries
- A. F. Smeaton, M. Morony, G. Quinn, and R. Scaife, "Taiscéalaí: Information retrieval from an archive of spoken radio news," in Research and Advanced Technology for Digital Libraries, vol. 1513 of Lecture Notes in Computer Science, (C. Nikolaou and C. Stephanidis, eds.), pp. 429-442, Springer Berlin/Heidelberg, 1998. (Pubitemid 128145539)
- (1998) Lecture Notes in Computer Science , Issue.1513 , pp. 429-442
- Smeaton, A.F.¹ Morony, M.² Quinn, G.³ Scaife, R.⁴

255
- 34547401486
- Evaluation campaigns and TRECVid
- DOI 10.1145/1178677.1178722, Proceedings of the 8th ACM Multimedia International Workshop on Multimedia Information Retrieval, MIR 2006
- A. F. Smeaton, P. Over, and W. Kraaij, "Evaluation campaigns and TRECVid," in Proceedings of the ACM International Workshop on Multimedia Information Retrieval, pp. 321-330, 2006. (Pubitemid 47168230)
- (2006) Proceedings of the ACM International Multimedia Conference and Exhibition , pp. 321-330
- Smeaton, A.F.¹ Over, P.² Kraaij, W.³

256
- 0030192735
- Experiments in spoken document retrieval
- DOI 10.1016/0306-4573(95)00077-1
- K. Spärck Jones, G. J. F. Jones, J. T. Foote, and S. J. Young, "Experiments in spoken document retrieval," Information Processing and Management, vol. 32, no. 4, pp. 399-417, 1996. (Pubitemid 126371700)
- (1996) Information Processing and Management , vol.32 , Issue.4 , pp. 399-417
- Jones, K.S.¹ Jones, G.J.F.² Foote, J.T.³ Young, S.J.⁴

257
- 0033658324
- Phonetic confusion matrix based spoken document retrieval
- S. Srinivasan and D. Petkovic, "Phonetic confusion matrix based spoken document retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 81-87, 2000.
- (2000) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 81-87
- Srinivasan, S.¹ Petkovic, D.²

258
- 0038206636
- ASR satisficing: The effects of ASR accuracy on speech retrieval
- L. A. Stark, S. Whittaker, and J. Hirschberg, "ASR satisficing: The effects of ASR accuracy on speech retrieval," in Proceedings of Interspeech, pp. 1069-1072, 2000.
- (2000) Proceedings of Interspeech , pp. 1069-1072
- Stark, L.A.¹ Whittaker, S.² Hirschberg, J.³

259
- 0000023031
- Dialogue act modeling for automatic tagging and recognition of conversational speech
- A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. V. Ess-Dykema, and M. Meteer, "Dialogue act modeling for automatic tagging and recognition of conversational speech," Computational Linguistics, vol. 26, no. 3, pp. 339-373, 2000.
- (2000) Computational Linguistics , vol.26 , Issue.3 , pp. 339-373
- Stolcke, A.¹ Ries, K.² Coccaro, N.³ Shriberg, E.⁴ Bates, R.⁵ Jurafsky, D.⁶ Taylor, P.⁷ Martin, R.⁸ Ess-Dykema, C.V.⁹ Meteer, M.¹⁰

260
- 0037955911
- Combining words and speech prosody for automatic topic segmentation
- A. Stolcke, E. Shriberg, D. Hakkani-Tür, G. Tür, Z. Rivlin, and K. Sönmez, "Combining words and speech prosody for automatic topic segmentation," in Proceedings of DARPA Broadcast News Transcription and Understanding Workshop, pp. 61-64, 1999.
- (1999) Proceedings of DARPA Broadcast News Transcription and Understanding Workshop , pp. 61-64
- Stolcke, A.¹ Shriberg, E.² Hakkani-Tür, D.³ Tür, G.⁴ Rivlin, Z.⁵ Sönmez, K.⁶

261
- 0033335618
- Modeling pronunciation variation for ASR: A survey of the literature
- DOI 10.1016/S0167-6393(99)00038-2
- H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature," Speech Communication, vol. 29, no. 2-4, pp. 225-246, 1999. (Pubitemid 30514833)
- (1999) Speech Communication , vol.29 , Issue.2 , pp. 225-246
- Strik, H.¹ Cucchiarini, C.²

262
- 84865263089
- Comparison of methods for language-dependent and language-independent Query-by-Example spoken term detection
- J. Tejedor, M. Fapso, I. Szoke, J. Cernocky, and F. Grezl, "Comparison of methods for language-dependent and language-independent Query-by-Example spoken term detection," ACM Transactions on Information Systems, vol. 30, no. 3, 2012.
- (2012) ACM Transactions on Information Systems , vol.30 , Issue.3
- Tejedor, J.¹ Fapso, M.² Szoke, I.³ Cernocky, J.⁴ Grezl, F.⁵

263
- 54249088981
- A comparison of grapheme and phoneme-based units for Spanish spoken term detection
- J. Tejedor, D. Wang, J. Frankel, S. King, and J. Colas, "A comparison of grapheme and phoneme-based units for Spanish spoken term detection," Speech Communication, vol. 50, no. 11-12, pp. 980-991, 2008.
- (2008) Speech Communication , vol.50 , Issue.11-12 , pp. 980-991
- Tejedor, J.¹ Wang, D.² Frankel, J.³ King, S.⁴ Colas, J.⁵

264
- 54249103198
- Rapid yet accurate speech indexing using dynamic match lattice spotting
- K. Thambiratnam and S. Sridharan, "Rapid yet accurate speech indexing using dynamic match lattice spotting," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 1, pp. 346-357, 2007.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.1 , pp. 346-357
- Thambiratnam, K.¹ Sridharan, S.²

265
- 84865263087
- A study of users' perception of relevance of spoken documents
- International Computer Science Institute
- T. Tombros and F. Crestani, "A study of users' perception of relevance of spoken documents," Technical Report TR-99-013, International Computer Science Institute, 1999.
- (1999) Technical Report TR-99-013
- Tombros, T.¹ Crestani, F.²

266
- 34047261805
- An overview of automatic speaker diarization systems
- DOI 10.1109/TASL.2006.878256
- S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 5, pp. 1557-1565, 2006. (Pubitemid 46547580)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

267
- 0345164806
- Automatic genre identification for content-based video categorization
- B. T. Truong, S. Venkatesh, and C. Dorai, "Automatic genre identification for content-based video categorization," in Proceedings of the International Conference on Pattern Recognition, vol. 4, pp. 230-233, 2000.
- (2000) Proceedings of the International Conference on Pattern Recognition , vol.4 , pp. 230-233
- Truong, B.T.¹ Venkatesh, S.² Dorai, C.³

268
- 57349143307
- Term clouds as surrogates for user generated speech
- M. Tsagkias, M. Larson, and M. de Rijke, "Term clouds as surrogates for user generated speech," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 773-774, 2008.
- (2008) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 773-774
- Tsagkias, M.¹ Larson, M.² De Rijke, M.³

269
- 57849126781
- Time-compressing speech: ASR transcripts are an effective way to support gist extraction
- Chapter 21, (A. Popescu-Belis and R. Stiefelhagen, eds.), Springer Berlin/Heidelberg
- S. Tucker, N. Kyprianou, and S. Whittaker, "Time-compressing speech: ASR transcripts are an effective way to support gist extraction," in Machine Learning for Multimodal Interaction, vol. 5237 of Lecture Notes in Computer Science Chapter 21, (A. Popescu-Belis and R. Stiefelhagen, eds.), pp. 226-235, Springer Berlin/Heidelberg, 2008.
- (2008) Machine Learning for Multimodal Interaction 5237 of Lecture Notes in Computer Science , pp. 226-235
- Tucker, S.¹ Kyprianou, N.² Whittaker, S.³

270
- 57849129912
- Temporal compression of speech: An evaluation
- S. Tucker and S. Whittaker, "Temporal compression of speech: An evaluation," IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 4, 2008.
- (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.16 , Issue.4
- Tucker, S.¹ Whittaker, S.²

271
- 36448997316
- Indexing confusion networks for morph-based spoken document retrieval
- DOI 10.1145/1277741.1277849, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
- V. T. Turunen and M. Kurimo, "Indexing confusion networks for morph-based spoken document retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 631-638, 2007. (Pubitemid 350165013)
- (2007) Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 , pp. 631-638
- Turunen, V.T.¹ Kurimo, M.²

272
- 85119989274
- Data-oriented methods for grapheme-to-phoneme conversion
- A. van den Bosch and W. Daelemans, "Data-oriented methods for grapheme-to-phoneme conversion," in Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics, pp. 45-53, 1993.
- (1993) Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics , pp. 45-53
- Van den Bosch, A.¹ Daelemans, W.²

273
- 0004217877
- Butterworths
- C. J. van Rijsbergen, Information Retrieval. Butterworths, 1979.
- (1979) Information Retrieval
- Van Rijsbergen, C.J.¹

274
- 34648854438
- Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling
- A. Vinciarelli, "Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling," IEEE Transactions on Multimedia, vol. 9, no. 6, pp. 1215-1226, 2007.
- (2007) IEEE Transactions on Multimedia , vol.9 , Issue.6 , pp. 1215-1226
- Vinciarelli, A.¹

275
- 0002415750
- Retrieval from spoken documents using content and speaker information
- M. Viswanathan, H. S. M. Beigi, S. Dharanipragada, and A. Tritschler, "Retrieval from spoken documents using content and speaker information," in Proceedings of the International Conference on Document Analysis and Recognition, pp. 567-572, 1999.
- (1999) Proceedings of the International Conference on Document Analysis and Recognition , pp. 567-572
- Viswanathan, M.¹ Beigi, H.S.M.² Dharanipragada, S.³ Tritschler, A.⁴

276
- 0001739133
- Fusion via a linear combination of scores
- C. C. Vogt and G. W. Cottrell, "Fusion via a linear combination of scores," Information Retrieval, vol. 1, no. 3, pp. 151-173, 1999.
- (1999) Information Retrieval , vol.1 , Issue.3 , pp. 151-173
- Vogt, C.C.¹ Cottrell, G.W.²

277
- 8844267001
- Digital Libraries and Electronic Publishing, The MIT Press
- E. M. Voorhees and D. K. Harman, TREC: Experiment and Evaluation in Information Retrieval. Digital Libraries and Electronic Publishing, The MIT Press, 2005.
- (2005) TREC: Experiment and Evaluation in Information Retrieval
- Voorhees, E.M.¹ Harman, D.K.²

278
- 0002403499
- Complementary video and audio analysis for broadcast news archives
- H. D. Wactlar, A. G. Hauptmann, M. G. Christel, R. A. Houghton, and A. M. Olligschlaeger, "Complementary video and audio analysis for broadcast news archives," Communications of the ACM, vol. 43, no. 2, pp. 42-47, 2000.
- (2000) Communications of the ACM , vol.43 , Issue.2 , pp. 42-47
- Wactlar, H.D.¹ Hauptmann, A.G.² Christel, M.G.³ Houghton, R.A.⁴ Olligschlaeger, A.M.⁵

279
- 0042033109
- Morgan Kaufmann
- A. Waibel and K.-F. Lee, eds., Readings in Speech Recognition. Morgan Kaufmann, 1990.
- (1990) Readings in Speech Recognition
- Waibel, A.¹ Lee, K.-F.²

280
- 78650990390
- PhD thesis University of Edinburgh
- D. Wang, "Out-of-vocabulary spoken term detection," PhD thesis, University of Edinburgh, 2009.
- (2009) Out-of-vocabulary Spoken Term Detection
- Wang, D.¹

281
- 84865274432
- Direct posterior confidence estimation for out-of-vocabulary spoken term detection
- D. Wang, S. King, J. Frankel, R. Vipperla, N. Evans, and R. Troncy, "Direct posterior confidence estimation for out-of-vocabulary spoken term detection," ACM Transactions on Information Systems, vol. 30, no. 3, 2012.
- (2012) ACM Transactions on Information Systems , vol.30 , Issue.3
- Wang, D.¹ King, S.² Frankel, J.³ Vipperla, R.⁴ Evans, N.⁵ Troncy, R.⁶

282
- 0034275766
- Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese
- H.-M. Wang, "Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese," Speech Communication, vol. 32, no. 1-2, pp. 49-60, 2000.
- (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 49-60
- Wang, H.-M.¹

283
- 0039174218
- Mandarin spoken document retrieval based on syllable lattice matching
- H.-M. Wang, "Mandarin spoken document retrieval based on syllable lattice matching," Pattern Recognition Letters, vol. 21, no. 6-7, pp. 615-624, 2000.
- (2000) Pattern Recognition Letters , vol.21 , Issue.6-7 , pp. 615-624
- Wang, H.-M.¹

284
- 84946714447
- Is word error rate a good indicator for spoken language understanding accuracy
- Y.-Y. Wang, A. Acero, and C. Chelba, "Is word error rate a good indicator for spoken language understanding accuracy," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 577-582, 2003.
- (2003) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 577-582
- Wang, Y.-Y.¹ Acero, A.² Chelba, C.³

285
- 85032751364
- An introduction to voice search
- Y.-Y. Wang, D. Yu, Y.-C. Ju, and A. Acero, "An introduction to voice search," IEEE Signal Processing Magazine, vol. 25, no. 3, pp. 28-38, 2008.
- (2008) IEEE Signal Processing Magazine , vol.25 , Issue.3 , pp. 28-38
- Wang, Y.-Y.¹ Yu, D.² Ju, Y.-C.³ Acero, A.⁴

286
- 84865274434
- Topic spotting using subword units
- V. Warnke, S. Harbeck, E. Noth, and H. Niemann, "Topic spotting using subword units," in 9. Aachener Kolloquium "Signaltheorie" Bild- und Sprachsignale, pp. 287-291, 1997.
- (1997) Aachener Kolloquium "Signaltheorie" Bild- und Sprachsignale , pp. 287-291
- Warnke, V.¹ Harbeck, S.² Noth, E.³ Niemann, H.⁴

287
- 84975883723
- Multilingual topic detection and tracking: Successful research enabled by corpora and evaluation
- C. L. Wayne, "Multilingual topic detection and tracking: Successful research enabled by corpora and evaluation," in Proceedings of the International Conference on Language Resources and Evaluation, 2000.
- (2000) Proceedings of the International Conference on Language Resources and Evaluation
- Wayne, C.L.¹

288
- 0032282577
- New techniques for open-vocabulary spoken document retrieval
- M. Wechsler, E. Munteanu, and P. Schäuble, "New techniques for open-vocabulary spoken document retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 20-27, 1998.
- (1998) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 20-27
- Wechsler, M.¹ Munteanu, E.² Schäuble, P.³

289
- 1842797022
- New approaches to spoken document retrieval
- M. Wechsler, E. Munteanu, and P. Schäuble, "New approaches to spoken document retrieval," Information Retrieval, vol. 3, no. 3, pp. 173-188, 2000.
- (2000) Information Retrieval , vol.3 , Issue.3 , pp. 173-188
- Wechsler, M.¹ Munteanu, E.² Schäuble, P.³

290
- 0028996917
- LVCSR log-likelihood ratio scoring for keyword spotting
- M. Weintraub, "LVCSR log-likelihood ratio scoring for keyword spotting," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/297-I/300, 1995.
- (1995) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Weintraub, M.¹

291
- 0043086491
- Effect of speaking style on LVCSR performance
- M. Weintraub, K. Taussig, K. Hunicke-Smith, and A. Snodgrass, "Effect of speaking style on LVCSR performance," in Proceedings of the International Conference on Spoken Language Processing, pp. 16-19, 1996.
- (1996) Proceedings of the International Conference on Spoken Language Processing , pp. 16-19
- Weintraub, M.¹ Taussig, K.² Hunicke-Smith, K.³ Snodgrass, A.⁴

292
- 24144440949
- Browsing recorded meetings with ferret
- Machine Learning for Multimodal Interaction - First International Workshop, MLMI 2004
- P. Wellner, M. Flynn, and M. Guillemot, "Browsing recorded meetings with ferret," in Machine Learning for Multimodal Interaction, vol. 3361 of Lecture Notes in Computer Science, (S. Bengio and H. Bourlard, eds.), pp. 12-21, Springer Berlin/Heidelberg, 2005. (Pubitemid 41228874)
- (2005) Lecture Notes in Computer Science , vol.3361 , pp. 12-21
- Wellner, P.¹ Flynn, M.² Guillemot, M.³

293
- 84869113595
- A meeting browser evaluation test
- P. Wellner, M. Flynn, A. Tucker, and A. Whittaker, "A meeting browser evaluation test," in Computer-Human Interaction Extended Abstracts on Human Factors in Computing Systems, 2005.
- (2005) Computer-Human Interaction Extended Abstracts on Human Factors in Computing Systems
- Wellner, P.¹ Flynn, M.² Tucker, A.³ Whittaker, A.⁴

294
- 0035278951
- Confidence measures for large vocabulary continuous speech recognition
- DOI 10.1109/89.906002, PII S1063667601013281
- F. Wessel, R. Schluter, K. Macherey, and H. Ney, "Confidence measures for large vocabulary continuous speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 3, pp. 288-298, 2001. (Pubitemid 32286598)
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.3 , pp. 288-298
- Wessel, F.¹ Schluter, R.² Macherey, K.³ Ney, H.⁴

295
- 33749645070
- Overview of the CLEF-2005 cross-language speech retrieval track
- Accessing Multilingual Information Repositories - 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005 LNCS
- R. W. White, D. W. Oard, G. J. F. Jones, D. Soergel, and X. Huang, "Overview of the CLEF-2005 cross-language speech retrieval track," in Accessing Multilingual Information Repositories, vol. 4022 of Lecture Notes in Computer Science, (C. Peters, F. Gey, J. Gonzalo, H. Müller, G. J. F. Jones, M. Kluck, B. Magnini, and M. de Rijke, eds.), pp. 744-759, Springer Berlin/Heidelberg, 2006. (Pubitemid 44545951)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4022 , pp. 744-759
- White, R.W.¹ Oard, D.W.² Jones, G.J.F.³ Soergel, D.⁴ Huang, X.⁵

296
- 84962786558
- Vocabulary independent speech recognition using particles
- E. W. D. Whittaker, J. M. Van Thong, and P. J. Moreno, "Vocabulary independent speech recognition using particles," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 315-318, 2001.
- (2001) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 315-318
- Whittaker, E.W.D.¹ Van Thong, J.M.² Moreno, P.J.³

297
- 0037480836
- Scanmail: A voicemail interface that makes speech browsable readable and searchable
- S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, L. Isenhour, P. Stead, G. Zamchick, and A. Rosenberg, "Scanmail: A voicemail interface that makes speech browsable readable and searchable," in Proceedings of the Special Interest Group on Computer-Human Interaction (SIGCHI) Conference on Human Factors in Computing Systems, pp. 275-282, 2002.
- (2002) Proceedings of the Special Interest Group on Computer-Human Interaction (SIGCHI) Conference on Human Factors in Computing Systems , pp. 275-282
- Whittaker, S.¹ Hirschberg, J.² Amento, B.³ Stark, L.⁴ Bacchiani, M.⁵ Isenhour, L.⁶ Stead, P.⁷ Zamchick, G.⁸ Rosenberg, A.⁹

298
- 85002793295
- SCAN: Designing and evaluating user interfaces to support retrieval from speech archives
- S. Whittaker, J. Hirschberg, J. Choi, D. Hindle, F. Pereira, and A. Singhal, "SCAN: Designing and evaluating user interfaces to support retrieval from speech archives," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 26-33, 1999.
- (1999) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 26-33
- Whittaker, S.¹ Hirschberg, J.² Choi, J.³ Hindle, D.⁴ Pereira, F.⁵ Singhal, A.⁶

299
- 38749097384
- Design and evaluation of systems to support interaction capture and retrieval
- DOI 10.1007/s00779-007-0146-3, Special Issue: User-centred design and evaluation of ubiquitous groupware
- S. Whittaker, S. Tucker, K. Swampillai, and R. Laban, "Design and evaluation of systems to support interaction capture and retrieval," Personal and Ubiquitous Computing, vol. 12, no. 3, pp. 197-221, 2008. (Pubitemid 351176344)
- (2008) Personal and Ubiquitous Computing , vol.12 , Issue.3 , pp. 197-221
- Whittaker, S.¹ Tucker, S.² Swampillai, K.³ Laban, R.⁴

300
- 79952385877
- Segmentation of speech using speaker identification
- L. Wilcox, F. Chen, and V. Balasubramanian, "Segmentation of speech using speaker identification," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. I/161-I/164, 1994.
- (1994) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Wilcox, L.¹ Chen, F.² Balasubramanian, V.³

301
- 84947288844
- HMM-based wordspotting for voice editing and indexing
- L. D. Wilcox and M. A. Bush, "HMM-based wordspotting for voice editing and indexing," in Proceedings of Eurospeech, pp. 25-28, 1991.
- (1991) Proceedings of Eurospeech , pp. 25-28
- Wilcox, L.D.¹ Bush, M.A.²

302
- 85037089806
- Confidence measures for HMM-based speech recognition
- D. Willett, A. Worm, C. Neukirchen, and G. Rigoll, "Confidence measures for HMM-based speech recognition," in Proceedings of the International Conference on Spoken Language Processing, pp. 3241-3244, 1998.
- (1998) Proceedings of the International Conference on Spoken Language Processing , pp. 3241-3244
- Willett, D.¹ Worm, A.² Neukirchen, C.³ Rigoll, G.⁴

303
- 0003280323
- Speech recognition and information retrieval: Experiments in retrieving spoken documents
- M. J. Witbrock and A. G. Hauptmann, "Speech recognition and information retrieval: Experiments in retrieving spoken documents," in Proceedings of the DARPA Speech Recognition Workshop, 1997.
- (1997) Proceedings of the DARPA Speech Recognition Workshop
- Witbrock, M.J.¹ Hauptmann, A.G.²

304
- 0030676041
- Using words and phonetic strings for efficient information retrieval from imperfectly transcribed spoken documents
- M. J. Witbrock and A. G. Hauptmann, "Using words and phonetic strings for efficient information retrieval from imperfectly transcribed spoken documents," in Proceedings of the ACM International Conference on Digital Libraries, pp. 30-35, 1997.
- (1997) Proceedings of the ACM International Conference on Digital Libraries , pp. 30-35
- Witbrock, M.J.¹ Hauptmann, A.G.²

305
- 0003756969
- Morgan Kaufmann
- I. H. Witten, A. Moffat, and T. C. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images. Morgan Kaufmann, 1999.
- (1999) Managing Gigabytes: Compressing and Indexing Documents and Images
- Witten, I.H.¹ Moffat, A.² Bell, T.C.³

306
- 0033652331
- Effects of out of vocabulary words in spoken document retrieval
- P. C. Woodland, S. E. Johnson, P. Jourlin, and K. Spärck Jones, "Effects of out of vocabulary words in spoken document retrieval," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 372-374, 2000.
- (2000) Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval , pp. 372-374
- Woodland, P.C.¹ Johnson, S.E.² Jourlin, P.³ Spärck Jones, K.⁴

307
- 85009168880
- Spotting hot spots in meetings: Human judgments and prosodic cues
- B. Wrede and E. Shriberg, "Spotting "Hot Spots" in meetings: Human judgments and prosodic cues," in Proceedings of Eurospeech, pp. 2805-2808, 2003.
- (2003) Proceedings of Eurospeech , pp. 2805-2808
- Wrede, B.¹ Shriberg, E.²

308
- 58049207761
- Speech-annotated photo retrieval using syllable-transformed patterns
- C.-H. Wu, C.-L. Huang, W.-C. Lee, and Y.-S. Lai, "Speech-annotated photo retrieval using syllable-transformed patterns," IEEE Signal Processing Letters, vol. 16, no. 1, pp. 6-9, 2009.
- (2009) IEEE Signal Processing Letters , vol.16 , Issue.1 , pp. 6-9
- Wu, C.-H.¹ Huang, C.-L.² Lee, W.-C.³ Lai, Y.-S.⁴

309
- 44849088089
- A fast-match approach for robust faster than real-time speaker diarization
- H. Yan, O. Vinyals, G. Friedland, C. Muller, N. Mirghafori, and C. Wooters, "A fast-match approach for robust faster than real-time speaker diarization," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 693-698, 2007.
- (2007) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 693-698
- Yan, H.¹ Vinyals, O.² Friedland, G.³ Muller, C.⁴ Mirghafori, N.⁵ Wooters, C.⁶

310
- 2342453148
- VideoQA: Question answering on news video
- H. Yang, L. Chaisorn, Y. Zhao, S. Y. Neo, and T. S. Chua, "VideoQA: question answering on news video," in Proceedings of the ACM International Conference on Multimedia, pp. 632-641, 2003.
- (2003) Proceedings of the ACM International Conference on Multimedia , pp. 632-641
- Yang, H.¹ Chaisorn, L.² Zhao, Y.³ Neo, S.Y.⁴ Chua, T.S.⁵

311
- 85079240850
- Detecting misrecognitions and out-of-vocabulary words
- S. R. Young, "Detecting misrecognitions and out-of-vocabulary words," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. II/21-II/24, 1994.
- (1994) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
- Young, S.R.¹

312
- 85008054253
- Vocabulary-independent indexing of spontaneous speech
- P. Yu, K. Chen, C. Ma, and F. Seide, "Vocabulary-independent indexing of spontaneous speech," IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 635-643, 2005.
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 635-643
- Yu, P.¹ Chen, K.² Ma, C.³ Seide, F.⁴

313
- 0003801149
- Kluwer Academic Publishers
- T. Zhang and C. C. Kuo, Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing. Kluwer Academic Publishers, 2001.
- (2001) Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing
- Zhang, T.¹ Kuo, C.C.²

314
- 0033279432
- Heuristic approach for generic audio data segmentation and annotation
- T. Zhang and C.-C. J. Kuo, "Heuristic approach for generic audio data segmentation and annotation," in Proceedings of the ACM International Conference on Multimedia (Part 1), pp. 67-76, 1999. (Pubitemid 32262460)
- (1999) Proceedings of the ACM International Multimedia Conference & Exhibition , pp. 67-76
- Zhang, T.¹ Kuo, C.-C.J.²

315
- 84863337904
- Towards spoken-document retrieval for the internet: Lattice indexing for large-scale web-search architectures
- Z.-Y. Zhou, P. Yu, C. Chelba, and F. Seide, "Towards spoken-document retrieval for the internet: Lattice indexing for large-scale web-search architectures," in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pp. 415-422, 2006.
- (2006) Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics , pp. 415-422
- Zhou, Z.-Y.¹ Yu, P.² Chelba, C.³ Seide, F.⁴

316
- 70350349017
- Speaker diarization: From broadcast news to lectures
- Chapter 35, (S. Renals, S. Bengio, and J. G. Fiscus, eds.), Springer Berlin/Heidelberg
- X. Zhu, C. Barras, L. Lamel, and J.-L. Gauvain, "Speaker diarization: From broadcast news to lectures," in Machine Learning for Multimodal Interaction, vol. 4299 of Lecture Notes in Computer Science, Chapter 35, (S. Renals, S. Bengio, and J. G. Fiscus, eds.), pp. 396-406, Springer Berlin/Heidelberg, 2006.
- (2006) Lecture Notes in Computer Science , vol.4299 , pp. 396-406
- Zhu, X.¹ Barras, C.² Lamel, L.³ Gauvain, J.-L.⁴

317
- 77952349683
- Introduction to the special section on Rich Transcription
- G. Zweig, J. Makhoul, and A. Stolcke, "Introduction to the special section on Rich Transcription," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 5, pp. 1490-1491, 2006.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.5 , pp. 1490-1491
- Zweig, G.¹ Makhoul, J.² Stolcke, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.