메뉴 건너뛰기




Volumn 5, Issue 4-5, 2011, Pages 235-422

Spoken content retrieval: A survey of techniques and technologies

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; COMPONENT TECHNOLOGIES; CONTENT RETRIEVAL; DIGITAL AUDIO; INDEXING AND RETRIEVAL; RESEARCH AND DEVELOPMENT; SPEECH PROCESSING TECHNOLOGIES; SPEECH TECHNOLOGY; USER INTERACTION;

EID: 84865249159     PISSN: 15540669     EISSN: 15540677     Source Type: Journal    
DOI: 10.1561/1500000020     Document Type: Article
Times cited : (72)

References (317)
  • 3
    • 0013252919 scopus 로고    scopus 로고
    • Perspectives on information retrieval and speech
    • (A. R. Coden, E. W. Brown, and S. Srinivasan, eds.), Springer Berlin/Heidelberg
    • J. Allan, "Perspectives on information retrieval and speech," in Information Retrieval Techniques for Speech Applications, (A. R. Coden, E. W. Brown, and S. Srinivasan, eds.), pp. 323-326, Springer Berlin/Heidelberg, 2002.
    • (2002) Information Retrieval Techniques for Speech Applications , pp. 323-326
    • Allan, J.1
  • 4
    • 84865277132 scopus 로고    scopus 로고
    • Topic detection and tracking: Event-based information organization
    • Springer
    • J. Allan, "Topic detection and tracking: Event-based information organization," in The Kluwer International Series on Information Retrieval, vol. 12, Springer, 2002.
    • (2002) The Kluwer International Series on Information Retrieval , vol.12
    • Allan, J.1
  • 5
    • 0037300570 scopus 로고    scopus 로고
    • Robust techniques for organizing and retrieving spoken documents
    • J. Allan, "Robust techniques for organizing and retrieving spoken documents," EURASIP Journal on Advances in Signal Processing, vol. 2003, no. 1, pp. 103-114, 2003.
    • (2003) EURASIP Journal on Advances in Signal Processing , vol.2003 , Issue.1 , pp. 103-114
    • Allan, J.1
  • 10
    • 0002494419 scopus 로고    scopus 로고
    • SpeechSkimmer: A system for interactively skimming recorded speech
    • B. Arons, "SpeechSkimmer: A system for interactively skimming recorded speech," Transactions on Computer Human Interaction, vol. 4, no. 1, pp. 3-38, 1997.
    • (1997) Transactions on Computer Human Interaction , vol.4 , Issue.1 , pp. 3-38
    • Arons, B.1
  • 11
    • 0039737345 scopus 로고
    • The future of speech and audio in the interface: A CHI '94 workshop
    • B. Arons and E. Mynatt, "The future of speech and audio in the interface: A CHI '94 workshop," SIGCHI Bulletin, vol. 26, no. 4, pp. 44-48, 1994.
    • (1994) SIGCHI Bulletin , vol.26 , Issue.4 , pp. 44-48
    • Arons, B.1    Mynatt, E.2
  • 12
    • 0036460898 scopus 로고    scopus 로고
    • An overview of decoding techniques for large vocabulary continuous speech recognition
    • X. Aubert, "An overview of decoding techniques for large vocabulary continuous speech recognition," Computer Speech & Language, vol. 16, no. 1, pp. 89-114, 2002.
    • (2002) Computer Speech & Language , vol.16 , Issue.1 , pp. 89-114
    • Aubert, X.1
  • 17
    • 44949112191 scopus 로고    scopus 로고
    • A TextTiling based approach to topic boundary detection in meetings
    • S. Banerjee and A. Rudnicky, "A TextTiling based approach to topic boundary detection in meetings," in Proceedings of Interspeech, 2006.
    • (2006) Proceedings of Interspeech
    • Banerjee, S.1    Rudnicky, A.2
  • 18
    • 53149126088 scopus 로고    scopus 로고
    • Recovering capitalization and punctuation marks for automatic speech recognition: Case study for portuguese broadcast news
    • F. Batista, D. Caseiro, N. Mamede, and I. Trancoso, "Recovering capitalization and punctuation marks for automatic speech recognition: Case study for portuguese broadcast news," Speech Communication, vol. 50, no. 10, pp. 847-862, 2008.
    • (2008) Speech Communication , vol.50 , Issue.10 , pp. 847-862
    • Batista, F.1    Caseiro, D.2    Mamede, N.3    Trancoso, I.4
  • 19
    • 0029304819 scopus 로고
    • Combining the evidence of multiple query representations for information retrieval
    • N. J. Belkin, P. Kantor, E. A. Fox, and J. A. Shaw, "Combining the evidence of multiple query representations for information retrieval," Information Processing & Management, vol. 31, no. 3, pp. 431-448, 1995.
    • (1995) Information Processing & Management , vol.31 , Issue.3 , pp. 431-448
    • Belkin, N.J.1    Kantor, P.2    Fox, E.A.3    Shaw, J.A.4
  • 21
    • 77956629726 scopus 로고    scopus 로고
    • Podcast search: User goals and retrieval technologies
    • J. Besser, M. Larson, and K. Hofmann, "Podcast search: User goals and retrieval technologies," Online Information Review, vol. 34, p. 3, 2010.
    • (2010) Online Information Review , vol.34 , pp. 3
    • Besser, J.1    Larson, M.2    Hofmann, K.3
  • 22
    • 0030142722 scopus 로고    scopus 로고
    • Towards increasing speech recognition error rates
    • DOI 10.1016/0167-6393(96)00003-9, PII S0167639396000039
    • H. Bourlard, H. Hermansky, and N. Morgan, "Towards increasing speech recognition error rates," Speech Communication, vol. 18, pp. 205-231, May 1996. (Pubitemid 126362800)
    • (1996) Speech Communication , vol.18 , Issue.3 , pp. 205-231
    • Bourlard, H.1    Hermansky, H.2    Morgan, N.3
  • 23
    • 84862119541 scopus 로고    scopus 로고
    • Recognition and understanding of meetings overview of the European AMI and AMIDA projects
    • H. Bourlard and S. Renals, "Recognition and understanding of meetings overview of the European AMI and AMIDA projects," IDIAP-RR 27 Technical Report, 2008.
    • (2008) IDIAP-RR 27 Technical Report
    • Bourlard, H.1    Renals, S.2
  • 24
    • 0038589165 scopus 로고    scopus 로고
    • The anatomy of a large-scale hypertextual web search engine
    • S. Brin and L. Page, "The anatomy of a large-scale hypertextual web search engine," Computer Networks and ISDN Systems, vol. 30, no. 1-7, pp. 107-117, 1998.
    • (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 38
    • 85032751967 scopus 로고    scopus 로고
    • Retrieval and browsing of spoken content
    • DOI 10.1109/MSP.2008.917992
    • C. Chelba, T. J. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Processing Magazine, vol. 25, no. 3, pp. 39-49, 2008. (Pubitemid 351695639)
    • (2008) IEEE Signal Processing Magazine , vol.25 , Issue.3 , pp. 39-49
    • Chelba, C.1    Hazen, T.J.2    Saraclar, M.3
  • 39
    • 33847607574 scopus 로고    scopus 로고
    • Soft indexing of speech content for search in spoken documents
    • DOI 10.1016/j.csl.2006.09.001, PII S0885230806000313
    • C. Chelba, J. Silva, and A. Acero, "Soft indexing of speech content for search in spoken documents," Computer Speech and Language, vol. 21, no. 3, pp. 458-478, 2007. (Pubitemid 46367509)
    • (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 458-478
    • Chelba, C.1    Silva, J.2    Acero, A.3
  • 40
    • 27744494029 scopus 로고    scopus 로고
    • Exploring the use of latent topical information for statistical Chinese spoken document retrieval
    • DOI 10.1016/j.patrec.2005.06.010, PII S0167865505001704
    • B. Chen, "Exploring the use of latent topical information for statistical Chinese spoken document retrieval," Pattern Recognition Letters, vol. 27, no. 1, pp. 9-18, 2006. (Pubitemid 41625538)
    • (2006) Pattern Recognition Letters , vol.27 , Issue.1 , pp. 9-18
    • Chen, B.1
  • 41
    • 0036649836 scopus 로고    scopus 로고
    • Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese
    • DOI 10.1109/TSA.2002.802541, PII 1011092002802541
    • B. Chen, H.-M. Wang, and L.-S. Lee, "Discriminating capabilities of syllablebased features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese," IEEE Transactions on Speech and Audio Processing, vol. 10, no. 5, pp. 303-314, 2002. (Pubitemid 34950068)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.5 , pp. 303-314
    • Chen, B.1    Wang, H.-M.2    Lee, L.-S.3
  • 43
    • 0033329799 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Computer Speech and Language, vol. 13, no. 4, pp. 359-393, 1999.
    • (1999) Computer Speech and Language , vol.13 , Issue.4 , pp. 359-393
    • Chen, S.F.1    Goodman, J.2
  • 45
    • 67149133555 scopus 로고    scopus 로고
    • A probabilistic generative framework for extractive broadcast news speech summarization
    • Y.-T. Chen, B. Chen, and H.-M. Wang, "A probabilistic generative framework for extractive broadcast news speech summarization," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 1, pp. 95-106, 2009.
    • (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.1 , pp. 95-106
    • Chen, Y.-T.1    Chen, B.2    Wang, H.-M.3
  • 49
    • 36849085977 scopus 로고    scopus 로고
    • Merging storyboard strategies and automatic retrieval for improving interactive video search
    • DOI 10.1145/1282280.1282351, Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007
    • M. G. Christel and R. Yan, "Merging storyboard strategies and automatic retrieval for improving interactive video search," in Proceedings of the ACM International Conference on Image and Video Retrieval, pp. 486-493, 2007. (Pubitemid 350229663)
    • (2007) Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007 , pp. 486-493
    • Christel, M.G.1    Yan, R.2
  • 50
    • 22944486029 scopus 로고    scopus 로고
    • Speech and language processing: Can we use the past to predict the future?
    • (P. Sojka, I. Kopecek, and K. Pala, eds.), Springer Berlin/Heidelberg
    • K. W. Church, "Speech and language processing: Can we use the past to predict the future?," in Text, Speech and Dialogue, vol. 3206 of Lecture Notes in Computer Science, (P. Sojka, I. Kopecek, and K. Pala, eds.), pp. 3-13, Springer Berlin/Heidelberg, 2004.
    • (2004) Text, Speech and Dialogue 3206 of Lecture Notes in Computer Science , pp. 3-13
    • Church, K.W.1
  • 53
    • 84865221228 scopus 로고    scopus 로고
    • ACM SIGIR 2001 workshop "Information Retrieval Techniques for Speech Applications
    • A. R. Coden, E. W. Brown, and S. Srinivasan, "ACM SIGIR 2001 workshop "Information Retrieval Techniques for Speech Applications", " SIGIR Forum, vol. 36, no. 1, pp. 10-13, 2002.
    • (2002) SIGIR Forum , vol.36 , Issue.1 , pp. 10-13
    • Coden, A.R.1    Brown, E.W.2    Srinivasan, S.3
  • 56
    • 33646687756 scopus 로고    scopus 로고
    • Written versus spoken queries: A qualitative and quantitative comparative analysis
    • DOI 10.1002/asi.20350
    • F. Crestani and H. Du, "Written versus spoken queries: A qualitative and quantitative comparative analysis," Journal of the American Society for Information Science and Technology, vol. 57, no. 7, pp. 881-890, 2006. (Pubitemid 43734456)
    • (2006) Journal of the American Society for Information Science and Technology , vol.57 , Issue.7 , pp. 881-890
    • Crestani, F.1    Du, H.2
  • 62
    • 84885771294 scopus 로고    scopus 로고
    • Automated speech and audio analysis for semantic access to multimedia
    • Chapter 18, (Y. Avrithis, Y. Kompatsiaris, S. Staab, and N. O'Connor, eds.), Springer Berlin/Heidelberg: Berlin, Heidelberg
    • F. M. G. de Jong, R. J. F. Ordelman, and M. A. H. Huijbregts, "Automated speech and audio analysis for semantic access to multimedia," in Semantic Multimedia, vol. 4306 of Lecture Notes in Computer Science, Chapter 18, (Y. Avrithis, Y. Kompatsiaris, S. Staab, and N. O'Connor, eds.), pp. 226-240, Springer Berlin/Heidelberg: Berlin, Heidelberg, 2006.
    • (2006) Semantic Multimedia 4306 of Lecture Notes in Computer Science , pp. 226-240
    • De Jong, F.M.G.1    Ordelman, R.J.F.2    Huijbregts, M.A.H.3
  • 63
    • 33847768236 scopus 로고    scopus 로고
    • Multimedia search without visual analysis: The value of linguistic and contextual information
    • DOI 10.1109/TCSVT.2007.890834
    • F. M. G. de Jong, T. Westerveld, and A. P. de Vries, "Multimedia search without visual analysis: The value of linguistic and contextual information," IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 3, pp. 365-371, 2007. (Pubitemid 46393334)
    • (2007) IEEE Transactions on Circuits and Systems for Video Technology , vol.17 , Issue.3 , pp. 365-371
    • De Jong, F.M.G.1    Westerveld, T.2    De Vries, A.P.3
  • 64
    • 0345120094 scopus 로고
    • Improving information retrieval with latent semantic indexing
    • (C. L. Borgman and E. Y. H. Pai, eds.)
    • S. Deerwester, "Improving information retrieval with latent semantic indexing," in Proceedings of the 51st ASIS Annual Meeting, vol. 25, (C. L. Borgman and E. Y. H. Pai, eds.), 1988.
    • (1988) Proceedings of the 51st ASIS Annual Meeting , vol.25
    • Deerwester, S.1
  • 66
    • 36348937811 scopus 로고    scopus 로고
    • Topic segmentation algorithms for text summarization and passage retrieval: An exhaustive evaluation
    • AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
    • G. Dias, E. Alves, and J. G. P. Lopes, "Topic segmentation algorithms for text summarization and passage retrieval: An exhaustive evaluation," in Proceedings of the National Conference on Artificial Intelligence - Volume 2, pp. 1334-1339, 2007. (Pubitemid 350149752)
    • (2007) Proceedings of the National Conference on Artificial Intelligence , vol.2 , pp. 1334-1339
    • Dias, G.1    Alves, E.2    Lopes, J.G.P.3
  • 69
    • 78649328053 scopus 로고    scopus 로고
    • Survey on speech emotion recognition: Features, classification schemes, and databases
    • M. El Ayadi, M. S. Kamel, and F. Karray, "Survey on speech emotion recognition: Features, classification schemes, and databases," Pattern Recognition, vol. 44, no. 3, pp. 572-587, 2011.
    • (2011) Pattern Recognition , vol.44 , Issue.3 , pp. 572-587
    • El Ayadi, M.1    Kamel, M.S.2    Karray, F.3
  • 71
    • 0034274033 scopus 로고    scopus 로고
    • A system for the retrieval of Italian broadcast news
    • M. Federico, "A system for the retrieval of Italian broadcast news," Speech Communication, vol. 32, no. 1-2, pp. 37-47, 2000.
    • (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 37-47
    • Federico, M.1
  • 73
  • 74
    • 47749152568 scopus 로고    scopus 로고
    • The rich transcription 2007 meeting recognition evaluation
    • (R. Stiefelhagen, R. Bowers, and J. G. Fiscus, eds.) , Berlin/Heidelberg: Springer- Verlag
    • J. G. Fiscus, J. Ajot, and J. S. Garofolo, "The rich transcription 2007 meeting recognition evaluation," in Multimodal Technologies for Perception of Humans, (R. Stiefelhagen, R. Bowers, and J. G. Fiscus, eds.), pp. 373-389, Berlin/Heidelberg: Springer-Verlag, 2008.
    • (2008) Multimodal Technologies for Perception of Humans , pp. 373-389
    • Fiscus, J.G.1    Ajot, J.2    Garofolo, J.S.3
  • 76
    • 0032646977 scopus 로고    scopus 로고
    • An overview of audio information retrieval
    • J. T. Foote, "An overview of audio information retrieval," Multimedia Systems, vol. 7, no. 1, pp. 2-10, 1999.
    • (1999) Multimedia Systems , vol.7 , Issue.1 , pp. 2-10
    • Foote, J.T.1
  • 79
    • 77949405459 scopus 로고    scopus 로고
    • Transcription and distillation of spontaneous speech
    • Chapter 32, (J. Benesty, M. M. Sondhi, and Y. A. Huang, eds.) , Berlin/Heidelberg: Springer Berlin/Heidelberg
    • S. Furui and T. Kawahara, "Transcription and distillation of spontaneous speech," in Springer Handbook of Speech Processing, Chapter 32, (J. Benesty, M. M. Sondhi, and Y. A. Huang, eds.), pp. 627-652, Berlin/Heidelberg: Springer Berlin/Heidelberg, 2008.
    • (2008) Springer Handbook of Speech Processing , pp. 627-652
    • Furui, S.1    Kawahara, T.2
  • 84
    • 0003128543 scopus 로고    scopus 로고
    • Transcribing broadcast news for audio and video indexing
    • J.-L. Gauvain, L. Lamel, and G. Adda, "Transcribing broadcast news for audio and video indexing," Communications of the ACM, vol. 13, no. 2, pp. 64-70, 2000.
    • (2000) Communications of the ACM , vol.13 , Issue.2 , pp. 64-70
    • Gauvain, J.-L.1    Lamel, L.2    Adda, G.3
  • 87
    • 0026989462 scopus 로고
    • A system for retrieving speech documents
    • ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval
    • U. Glavitsch and P. Schäuble, "A system for retrieving speech documents," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 168-176, 1992.
    • (1992) Proceedings of the International , pp. 168-176
    • Glavitsch, U.1    Schäuble, P.2
  • 88
    • 84976842565 scopus 로고
    • Metadata for integrating speech documents in a text retrieval system
    • December
    • U. Glavitsch, P. Schäuble, and M. Wechsler, "Metadata for integrating speech documents in a text retrieval system," SIGMOD Record, vol. 23, no. 4, pp. 57-63, December 1994.
    • (1994) SIGMOD Record , vol.23 , Issue.4 , pp. 57-63
    • Glavitsch, U.1    Schäuble, P.2    Wechsler, M.3
  • 92
    • 84865716939 scopus 로고    scopus 로고
    • PodCastle: Recent advances of a spoken document retrieval service improved by anonymous user contributions
    • M. Goto and J. Ogata, "PodCastle: Recent advances of a spoken document retrieval service improved by anonymous user contributions," in Proceedings of Interspeech, pp. 3073-3076, 2011.
    • (2011) Proceedings of Interspeech , pp. 3073-3076
    • Goto, M.1    Ogata, J.2
  • 93
    • 67149104848 scopus 로고    scopus 로고
    • PodCastle: A Web 2.0 approach to speech recognition research
    • M. Goto, J. Ogata, and K. Eto, "PodCastle: A Web 2.0 approach to speech recognition research," in Proceedings of Interspeech, pp. 2397-2400, 2007.
    • (2007) Proceedings of Interspeech , pp. 2397-2400
    • Goto, M.1    Ogata, J.2    Eto, K.3
  • 97
    • 33746550354 scopus 로고    scopus 로고
    • Beyond ASR 1-best: Using word confusion networks in spoken language understanding
    • DOI 10.1016/j.csl.2005.07.005, PII S0885230805000495
    • D. Hakkani-Tür, F. Bechet, G. Riccardi, and G. Tür, "Beyond ASR 1-best: Using word confusion networks in spoken language understanding," Computer Speech and Language, vol. 20, no. 4, pp. 495-514, 2006. (Pubitemid 44142006)
    • (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 495-514
    • Hakkani-Tur, D.1    Bechet, F.2    Riccardi, G.3    Tur, G.4
  • 99
    • 13144279345 scopus 로고    scopus 로고
    • Affective video content representation and modeling
    • DOI 10.1109/TMM.2004.840618
    • A. Hanjalic and L.-Q. Xu, "Affective video content representation and modeling," IEEE Transactions on Multimedia, vol. 7, no. 1, pp. 143-154, 2005. (Pubitemid 40178377)
    • (2005) IEEE Transactions on Multimedia , vol.7 , Issue.1 , pp. 143-154
    • Hanjalic, A.1    Xu, L.-Q.2
  • 101
    • 36448995740 scopus 로고    scopus 로고
    • Selection and ranking of text from highly imperfect transcripts for retrieval of video content
    • DOI 10.1145/1277741.1277911, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
    • A. Haubold, "Selection and ranking of text from highly imperfect transcripts for retrieval of video content," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 791-792, 2007. (Pubitemid 350165072)
    • (2007) Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 , pp. 791-792
    • Haubold, A.1
  • 105
    • 0001374417 scopus 로고    scopus 로고
    • Informedia: News-on-demand multimedia information acquisition and retrieval
    • (M. T. Maybury, ed.) , The MIT Press
    • A. G. Hauptmann and M. J. Witbrock, "Informedia: News-on-demand multimedia information acquisition and retrieval," in Intelligent Multimedia Information Retrieval, (M. T. Maybury, ed.), pp. 215-239, The MIT Press, 1997.
    • (1997) Intelligent Multimedia Information Retrieval , pp. 215-239
    • Hauptmann, A.G.1    Witbrock, M.J.2
  • 110
    • 85135371918 scopus 로고
    • New words: Implications for continuous speech recognition
    • I. L. Hetherington and V. W. Zue, "New words: Implications for continuous speech recognition," in Proceedings of Eurospeech, pp. 2121-2124, 1993.
    • (1993) Proceedings of Eurospeech , pp. 2121-2124
    • Hetherington, I.L.1    Zue, V.W.2
  • 111
    • 0003754573 scopus 로고    scopus 로고
    • Using language models for information retrieval PhD thesis
    • D. Hiemstra, "Using language models for information retrieval," PhD thesis, University of Twente, 2001.
    • (2001) University of Twente
    • Hiemstra, D.1
  • 116
    • 48749103162 scopus 로고    scopus 로고
    • Automatic topic segmentation and labeling in multiparty dialogue
    • P.-Y. Hsueh and J. D. Moore, "Automatic topic segmentation and labeling in multiparty dialogue," in IEEE Spoken Language Technology Workshop, pp. 98-101, 2006.
    • (2006) IEEE Spoken Language Technology Workshop , pp. 98-101
    • Hsueh, P.-Y.1    Moore, J.D.2
  • 118
    • 70450194678 scopus 로고    scopus 로고
    • The majority wins: A method for combining speaker diarization systems
    • M. A. H. Huijbregts, D. A. Leeuwen, and F. M. G. Jong, "The majority wins: A method for combining speaker diarization systems," in Proceedings of Interspeech, pp. 924-927, 2009.
    • (2009) Proceedings of Interspeech , pp. 924-927
    • Huijbregts, M.A.H.1    Leeuwen, D.A.2    Jong, F.M.G.3
  • 125
    • 15844411850 scopus 로고    scopus 로고
    • Confidence measures for speech recognition: A survey
    • DOI 10.1016/j.specom.2004.12.004, PII S0167639305000051
    • H. Jiang, "Confidence measures for speech recognition: A survey," Speech Communication, vol. 45, no. 4, pp. 455-470, 2005. (Pubitemid 40423290)
    • (2005) Speech Communication , vol.45 , Issue.4 , pp. 455-470
    • Jiang, H.1
  • 138
  • 142
    • 85135146711 scopus 로고    scopus 로고
    • Estimating confidence using word lattices
    • T. Kemp and T. Schaaf, "Estimating confidence using word lattices," in Proceedings of Eurospeech, pp. 827-830, 1997.
    • (1997) Proceedings of Eurospeech , pp. 827-830
    • Kemp, T.1    Schaaf, T.2
  • 146
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: From features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, 2010.
    • (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 147
    • 0035426911 scopus 로고    scopus 로고
    • Multilingual phone models for vocabulary-independent speech recognition tasks
    • DOI 10.1016/S0167-6393(00)00093-5, PII S0167639300000935
    • J. Köhler, "Multilingual phone models for vocabulary- independent speech recognition tasks," Speech Communication, vol. 35, no. 1-2, pp. 21-30, 2001. (Pubitemid 32599644)
    • (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 21-30
    • Kohler, J.1
  • 148
    • 85032751882 scopus 로고    scopus 로고
    • Content-based access to spoken audio
    • DOI 10.1109/MSP.2005.1511824
    • K. Koumpis and S. Renals, "Content-based access to spoken audio," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 61-69, 2005. (Pubitemid 41488521)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 61-69
    • Koumpis, K.1    Renals, S.2
  • 149
    • 33947673332 scopus 로고    scopus 로고
    • Automatic summarization of voicemail messages using lexical and prosodic features
    • K. Koumpis and S. Renals, "Automatic summarization of voicemail messages using lexical and prosodic features," ACM Transactions on Speech and Language Processing, vol. 2, no. 1, pp. 1-24, 2005.
    • (2005) ACM Transactions on Speech and Language Processing , vol.2 , Issue.1 , pp. 1-24
    • Koumpis, K.1    Renals, S.2
  • 152
    • 0036722768 scopus 로고    scopus 로고
    • Thematic indexing of spoken documents by using self-organizing maps
    • DOI 10.1016/S0167-6393(01)00042-5, PII S0167639301000425
    • M. Kurimo, "Thematic indexing of spoken documents by using
    • (2002) Speech Communication , vol.38 , Issue.1-2 , pp. 29-45
    • Kurimo, M.1
  • 153
    • 85009133608 scopus 로고    scopus 로고
    • An evaluation of a spoken document retrieval baseline system in finnish
    • M. Kurimo and V. Turunen, "An evaluation of a spoken document retrieval baseline system in finnish," in Proceedings of Interspeech, pp. 1585-1588, 2004.
    • (2004) Proceedings of Interspeech , pp. 1585-1588
    • Kurimo, M.1    Turunen, V.2
  • 155
    • 33745217037 scopus 로고    scopus 로고
    • Using syllable-based indexing features and language models to improve German spoken document retrieval
    • M. Larson and S. Eickeler, "Using syllable-based indexing features and language models to improve German spoken document retrieval," in Proceedings of Interspeech, pp. 1217-1220, 2003.
    • (2003) Proceedings of Interspeech , pp. 1217-1220
    • Larson, M.1    Eickeler, S.2
  • 158
    • 70549113366 scopus 로고    scopus 로고
    • Overview of VideoCLEF 2008: Automatic generation of topic-based feeds for dual language audiovisual content
    • (C. Peters, T. Deselaers, N. Ferro, J. Gonzalo, A. Penas, G. J. F. Jones, M. Kurimo, T. Mandl, and V. Petras, eds.), Springer Berlin/Heidelberg
    • M. Larson, E. Newman, and G. J. F. Jones, "Overview of VideoCLEF 2008: Automatic generation of topic-based feeds for dual language audiovisual content," in Proceedings of the Cross-language Evaluation Forum Conference on Evaluating Systems for Multilingual and Multimodal Information Access, (C. Peters, T. Deselaers, N. Ferro, J. Gonzalo, A. Penas, G. J. F. Jones, M. Kurimo, T. Mandl, and V. Petras, eds.), pp. 906-917, Springer Berlin/Heidelberg, 2009.
    • (2009) Proceedings of the Cross-language Evaluation Forum Conference on Evaluating Systems for Multilingual and Multimodal Information Access , pp. 906-917
    • Larson, M.1    Newman, E.2    Jones, G.J.F.3
  • 159
    • 78049336944 scopus 로고    scopus 로고
    • Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment
    • (C. Peters, B. Caputo, J. Gonzalo, G. J. F. Jones, J. Kalpathy-Cramer, H. Muller, and T. Tsikrika, eds.), Springer Berlin/Heidelberg
    • M. Larson, E. Newman, and G. J. F. Jones, "Overview of VideoCLEF 2009: New perspectives on speech-based multimedia content enrichment," in Multilingual Information Access Evaluation II. Multimedia Experiments, vol. 6242 of Lecture Notes in Computer Science, (C. Peters, B. Caputo, J. Gonzalo, G. J. F. Jones, J. Kalpathy-Cramer, H. Müller, and T. Tsikrika, eds.), pp. 354-368, Springer Berlin/Heidelberg, 2010.
    • (2010) Multilingual Information Access Evaluation II. Multimedia Experiments 6242 of Lecture Notes in Computer Science , pp. 354-368
    • Larson, M.1    Newman, E.2    Jones, G.J.F.3
  • 160
    • 84865263098 scopus 로고    scopus 로고
    • The community and the crowd: Developing large-scale data collections for multimedia benchmarking
    • IEEE Computer Society Digital Library. IEEE Computer Society 15 May
    • M. Larson, M. Soleymani, M. Eskevich, P. Serdyukov, R. Ordelman, and G. J. F. Jones, "The community and the crowd: Developing large-scale data collections for multimedia benchmarking," IEEE Multimedia, IEEE Computer Society Digital Library. IEEE Computer Society, 15 May 2012.
    • (2012) IEEE Multimedia
    • Larson, M.1    Soleymani, M.2    Eskevich, M.3    Serdyukov, P.4    Ordelman, R.5    Jones, G.J.F.6
  • 164
    • 0034785304 scopus 로고    scopus 로고
    • Relevance based language models
    • ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval
    • V. Lavrenko and W. B. Croft, "Relevance based language models," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 120-127, 2001.
    • (2001) Proceedings of the International , pp. 120-127
    • Lavrenko, V.1    Croft, W.B.2
  • 166
    • 85032751176 scopus 로고    scopus 로고
    • Spoken document understanding and organization
    • DOI 10.1109/MSP.2005.1511823
    • L.-S. Lee and B. Chen, "Spoken document understanding and organization," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 42-60, 2005. (Pubitemid 41488520)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 42-60
    • Lee, L.-S.1    Chen, B.2
  • 170
    • 3042818033 scopus 로고    scopus 로고
    • Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion
    • W.-K. Lo, H. Meng, and P. C. Ching, "Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion," ACM Transactions on Asian Language Information Processing, vol. 2, no. 1, pp. 1-26, 2003.
    • (2003) ACM Transactions on Asian Language Information Processing , vol.2 , Issue.1 , pp. 1-26
    • Lo, W.-K.1    Meng, H.2    Ching, P.C.3
  • 173
    • 85009285063 scopus 로고    scopus 로고
    • Confusion-based query expansion for OOV words in spoken document retrieval
    • B. Logan and J. M. V. Thong, "Confusion-based query expansion for OOV words in spoken document retrieval," in Proceedings of Interspeech, pp. 1997-2000, 2002.
    • (2002) Proceedings of Interspeech , pp. 1997-2000
    • Logan, B.1    Thong, J.M.V.2
  • 174
    • 26844534218 scopus 로고    scopus 로고
    • Approaches to reduce the effects of OOV queries on indexed spoken audio
    • DOI 10.1109/TMM.2005.854429
    • B. Logan, J. M. Van Thong, and P. J. Moreno, "Approaches to reduce the effects of OOV queries on indexed spoken audio," IEEE Transactions on Multimedia, vol. 7, no. 5, pp. 899-906, 2005. (Pubitemid 41452518)
    • (2005) IEEE Transactions on Multimedia , vol.7 , Issue.5 , pp. 899-906
    • Logan, B.1    Van Thong, J.M.2    Moreno, P.J.3
  • 177
    • 0034296009 scopus 로고    scopus 로고
    • Finding consensus among words: Latticebased word error minimisation
    • L. Mangu, E. Brill, and A. Stolcke, "Finding consensus among words: Latticebased word error minimisation," Computer Speech and Language, vol. 14, no. 4, pp. 373-400, 2000.
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 187
    • 24644434943 scopus 로고    scopus 로고
    • Boosting Web retrieval through query operations
    • Advances in Information Retrieval - 27th European Conference on IR Research, ECIR 2005
    • G. Mishne and M. de Rijke, "Boosting web retrieval through query operations," in Advances in Information Retrieval, pp. 502-516, Springer, 2005. (Pubitemid 41272928)
    • (2005) Lecture Notes in Computer Science , vol.3408 , pp. 502-516
    • Mishne, G.1    De Rijke, M.2
  • 190
    • 33750546860 scopus 로고    scopus 로고
    • Infolink: Analysis of Dutch broadcast news and cross-media browsing
    • DOI 10.1109/ICME.2005.1521738, 1521738, IEEE International Conference on Multimedia and Expo, ICME 2005
    • J. Morang, R. J. F. Ordelman, F. M. G. de Jong, and A. J. van Hessen, "Infolink: Analysis of dutch broadcast news and cross-media browsing," in IEEE International Conference on Multimedia and Expo, pp. 1582-1585, 2005. (Pubitemid 44669182)
    • (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 1582-1585
    • Morang, J.1    Ordelman, R.2    De Jong, F.3    Van Hessen, A.4
  • 191
    • 33745218075 scopus 로고    scopus 로고
    • Comparison of different phone-based spoken document retrieval methods with text and spoken queries
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • N. Moreau, S. Jin, and T. Sikora, "Comparison of different phone-based spoken document retrieval methods with text and spoken queries," in Proceedings of Interspeech, pp. 641-644, 2005. (Pubitemid 43908144)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 641-644
    • Moreau, N.1    Jin, S.2    Sikora, T.3
  • 192
    • 33745186799 scopus 로고    scopus 로고
    • Phonetic confusion based document expansion for spoken document retrieval
    • N. Moreau, H.-G. Kim, and T. Sikora, "Phonetic confusion based document expansion for spoken document retrieval," in Proceedings of Interspeech, pp. 1593-1596, 2004.
    • (2004) Proceedings of Interspeech , pp. 1593-1596
    • Moreau, N.1    Kim, H.-G.2    Sikora, T.3
  • 193
    • 0036534571 scopus 로고    scopus 로고
    • From multimedia retrieval to knowledge management
    • P. J. Moreno, J. M. Van Thong, B. Logan, and G. J. F. Jones, "From multimedia retrieval to knowledge management," Computer, vol. 35, no. 4, pp. 58-66, 2002. (Pubitemid 34291867)
    • (2002) Computer , vol.35 , Issue.4 , pp. 58-66
    • Moreno, P.J.1    Van Thong, J.-M.2    Logan, B.3    Jones, G.J.F.4
  • 194
    • 33745856298 scopus 로고    scopus 로고
    • The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives
    • CHI 2006: Conference on Human Factors in Computing Systems, Conference Proceedings SIGCHI
    • C. Munteanu, R. Baecker, G. Penn, E. Toms, and D. James, "The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives," in Proceedings of the Special Interest Group on Computer-Human Interaction (SIGCHI) Conference on Human Factors in Computing Systems, pp. 493-502, 2006. (Pubitemid 44032136)
    • (2006) Conference on Human Factors in Computing Systems - Proceedings , vol.1 , pp. 493-502
    • Munteanu, C.1    Baecker, R.2    Penn, G.3    Toms, E.4    James, D.5
  • 195
    • 0034274806 scopus 로고    scopus 로고
    • Experiments in spoken document retrieval using phoneme n-grams
    • C. Ng, R. Wilkinson, and J. Zobel, "Experiments in spoken document retrieval using phoneme n-grams," Speech Communication, vol. 32, no. 1-2, pp. 61-77, 2000.
    • (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 61-77
    • Ng, C.1    Wilkinson, R.2    Zobel, J.3
  • 196
    • 0002470735 scopus 로고    scopus 로고
    • Subword unit representations for spoken document retrieval
    • K. Ng and V. W. Zue, "Subword unit representations for spoken document retrieval," in Proceedings of Eurospeech, pp. 1607-1610, 1997.
    • (1997) Proceedings of Eurospeech , pp. 1607-1610
    • Ng, K.1    Zue, V.W.2
  • 197
    • 0034300710 scopus 로고    scopus 로고
    • Subword-based approaches for spoken document retrieval
    • K. Ng and V. W. Zue, "Subword-based approaches for spoken document retrieval," Speech Communication, vol. 32, no. 3, pp. 157-186, 2000.
    • (2000) Speech Communication , vol.32 , Issue.3 , pp. 157-186
    • Ng, K.1    Zue, V.W.2
  • 198
    • 34250014992 scopus 로고    scopus 로고
    • Language-dependent state clustering for multilingual acoustic modelling
    • DOI 10.1016/j.specom.2007.04.001, PII S0167639307000611
    • T. Niesler, "Language-dependent state clustering for multilingual acoustic modelling," Speech Communication, vol. 49, no. 6, pp. 453-463, 2007. (Pubitemid 46891623)
    • (2007) Speech Communication , vol.49 , Issue.6 , pp. 453-463
    • Niesler, T.1
  • 201
    • 85121253341 scopus 로고
    • The application of dynamic programming techniques to non-word based topic spotting
    • P. Nowell and R. K. Moore, "The application of dynamic programming techniques to non-word based topic spotting," in Proceedings of Eurospeech, pp. 1355-1358, 1995.
    • (1995) Proceedings of Eurospeech , pp. 1355-1358
    • Nowell, P.1    Moore, R.K.2
  • 207
    • 67149138696 scopus 로고    scopus 로고
    • Automatic transcription for aWeb 2.0 service to search podcasts
    • J. Ogata, M. Goto, and K. Eto, "Automatic transcription for aWeb 2.0 service to search podcasts," in Proceedings of Interspeech, pp. 2617-2620, 2007.
    • (2007) Proceedings of Interspeech , pp. 2617-2620
    • Ogata, J.1    Goto, M.2    Eto, K.3
  • 208
    • 84867193207 scopus 로고    scopus 로고
    • Vocabulary independent discriminative term frequency estimation
    • J. S. Olsson, "Vocabulary independent discriminative term frequency estimation," in Proceedings of Interspeech, pp. 2187-2190, 2008.
    • (2008) Proceedings of Interspeech , pp. 2187-2190
    • Olsson, J.S.1
  • 214
    • 0012739214 scopus 로고    scopus 로고
    • Measurements in support of research accomplishments
    • D. S. Pallett, J. S. Garofolo, and J. G. Fiscus, "Measurements in support of research accomplishments," Communications of the ACM, vol. 43, no. 2, pp. 75-79, 2000.
    • (2000) Communications of the ACM , vol.43 , Issue.2 , pp. 75-79
    • Pallett, D.S.1    Garofolo, J.S.2    Fiscus, J.G.3
  • 215
    • 77955759248 scopus 로고    scopus 로고
    • Performance analysis for lattice-based speech indexing approaches using words and subword units
    • Y.-C. Pan and L.-S. Lee, "Performance analysis for lattice-based speech indexing approaches using words and subword units," IEEE Transactions on Speech and Audio Processing, vol. 18, no. 6, pp. 1562-1574, 2010.
    • (2010) IEEE Transactions on Speech and Audio Processing , vol.18 , Issue.6 , pp. 1562-1574
    • Pan, Y.-C.1    Lee, L.-S.2
  • 216
    • 0030657236 scopus 로고    scopus 로고
    • Cross-language speech retrieval: Establishing a baseline performance
    • S. Paraic, M. Wechsler, and P. Schäuble, "Cross-language speech retrieval: Establishing a baseline performance," in Proceedings of the International ACM Special Interest Group on Information Retrieval (SIGIR) Conference on Research and Development in Information Retrieval, pp. 99-108, 1997. (Pubitemid 127720310)
    • (1997) SIGIR Forum (ACM Special Interest Group on Information Retrieval) , vol.31 , Issue.1 SPEC. ISS. , pp. 99-108
    • Sheridan, P.1    Wechsler, M.2    Schauble, P.3
  • 222
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 223
    • 0016939166 scopus 로고
    • Speech recognition by machine: A review
    • D. R. Reddy, "Speech recognition by machine: A review," Proceedings of the IEEE, vol. 64, no. 4, pp. 501-531, 1976. (Pubitemid 8019231)
    • (1976) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 501-531
    • Reddy, D.R.1
  • 224
    • 84962787580 scopus 로고    scopus 로고
    • The ALERT system: Advanced broadcast speech recognition technology for selective dissemination of multimedia Information
    • G. Rigoll, "The ALERT system: Advanced broadcast speech recognition technology for selective dissemination of multimedia Information," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 301-306, 2001.
    • (2001) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 301-306
    • Rigoll, G.1
  • 225
    • 0001737422 scopus 로고
    • On term selection for query expansion
    • S. E. Robertson, "On term selection for query expansion," Journal of Documentation, vol. 46, no. 4, pp. 359-364, 1990.
    • (1990) Journal of Documentation , vol.46 , Issue.4 , pp. 359-364
    • Robertson, S.E.1
  • 230
    • 0039627177 scopus 로고
    • Techniques for information retrieval from speech messages
    • R. C. Rose, "Techniques for information retrieval from speech messages," Lincoln Laboratory Journal, vol. 4, no. 1, pp. 45-60, 1991.
    • (1991) Lincoln Laboratory Journal , vol.4 , Issue.1 , pp. 45-60
    • Rose, R.C.1
  • 235
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • G. Salton and C. Buckley, "Term-weighting approaches in automatic text retrieval," Information Processing and Management, vol. 24, no. 5, pp. 513-523, 1988.
    • (1988) Information Processing and Management , vol.24 , Issue.5 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 240
    • 2642521115 scopus 로고
    • Assessing the retrieval effectiveness of a speech retrieval system by simulating recognition errors
    • P. Schäuble and U. Glavitsch, "Assessing the retrieval effectiveness of a speech retrieval system by simulating recognition errors," in Proceedings of the Workshop on Human Language Technology, pp. 347-349, 1994.
    • (1994) Proceedings of the Workshop on Human Language Technology , pp. 347-349
    • Schäuble, P.1    Glavitsch, U.2
  • 252
    • 33745213806 scopus 로고    scopus 로고
    • Fast vocabulary-independent audio search using path-based graph indexing
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • O. Siohan and M. Bacchiani, "Fast vocabulary-independent audio search using path-based graph indexing," in Proceedings of Interspeech, pp. 53-56, 2005. (Pubitemid 43907999)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 53-56
    • Siohan, O.1    Bacchiani, M.2
  • 254
    • 84865231398 scopus 로고    scopus 로고
    • Taiscealai: Information Retrieval from an Archive of Spoken Radio News
    • Research and Advanced Technology for Digital Libraries
    • A. F. Smeaton, M. Morony, G. Quinn, and R. Scaife, "Taiscé ala?: Information retrieval from an archive of spoken radio news," in Research and Advanced Technology for Digital Libraries, vol. 1513 of Lecture Notes in Computer Science, (C. Nikolaou and C. Stephanidis, eds.), pp. 429-442, Springer Berlin/Heidelberg, 1998. (Pubitemid 128145539)
    • (1998) Lecture Notes in Computer Science , Issue.1513 , pp. 429-442
    • Sineaton, A.F.1    Morony, M.2    Quinn, G.3    Scaife, R.4
  • 256
  • 258
    • 0038206636 scopus 로고    scopus 로고
    • ASR satisficing: The effects of ASR accuracy on speech retrieval
    • L. A. Stark, S. Whittaker, and J. Hirschberg, "ASR satisficing: The effects of ASR accuracy on speech retrieval," in Proceedings of Interspeech, pp. 1069- 1072, 2000.
    • (2000) Proceedings of Interspeech , pp. 1069-1072
    • Stark, L.A.1    Whittaker, S.2    Hirschberg, J.3
  • 261
    • 0033335618 scopus 로고    scopus 로고
    • Modeling pronunciation variation for ASR: A survey of the literature
    • DOI 10.1016/S0167-6393(99)00038-2
    • H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature," Speech Communication, vol. 29, no. 2-4, pp. 225-246, 1999. (Pubitemid 30514833)
    • (1999) Speech Communication , vol.29 , Issue.2 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 262
    • 84865263089 scopus 로고    scopus 로고
    • Comparison of methods for language-dependent and language-independent Query-by- Example spoken term detection
    • J. Tejedor, M. Fapso, I. Szoke, J. Cernocky, and F. Grezl, "Comparison of methods for language-dependent and language-independent Query-by- Example spoken term detection," ACM Transactions on Information Systems, vol. 30, no. 3, 2012.
    • (2012) ACM Transactions on Information Systems , vol.30 , Issue.3
    • Tejedor, J.1    Fapso, M.2    Szoke, I.3    Cernocky, J.4    Grezl, F.5
  • 263
    • 54249088981 scopus 로고    scopus 로고
    • A comparison of grapheme and phoneme-based units for Spanish spoken term detection
    • J. Tejedor, D. Wang, J. Frankel, S. King, and J. Colas, "A comparison of grapheme and phoneme-based units for Spanish spoken term detection," Speech Communication, vol. 50, no. 11-12, pp. 980-991, 2008.
    • (2008) Speech Communication , vol.50 , Issue.11-12 , pp. 980-991
    • Tejedor, J.1    Wang, D.2    Frankel, J.3    King, S.4    Colas, J.5
  • 265
    • 84865263087 scopus 로고    scopus 로고
    • A study of users' perception of relevance of spoken documents
    • International Computer Science Institute
    • T. Tombros and F. Crestani, "A study of users' perception of relevance of spoken documents," Technical Report TR-99-013, International Computer Science Institute, 1999.
    • (1999) Technical Report TR , pp. 99-013
    • Tombros, T.1    Crestani, F.2
  • 266
    • 34047261805 scopus 로고    scopus 로고
    • An overview of automatic speaker diarization systems
    • DOI 10.1109/TASL.2006.878256
    • S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 5, pp. 1557-1565, 2006. (Pubitemid 46547580)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.E.1    Reynolds, D.A.2
  • 269
    • 57849126781 scopus 로고    scopus 로고
    • Time-compressing speech: ASR transcripts are an effective way to support gist extraction
    • Chapter 21, (A. Popescu-Belis and R. Stiefelhagen, eds.), Springer Berlin/Heidelberg
    • S. Tucker, N. Kyprianou, and S. Whittaker, "Time-compressing speech: ASR transcripts are an effective way to support gist extraction," in Machine Learning for Multimodal Interaction, vol. 5237 of Lecture Notes in Computer Science Chapter 21, (A. Popescu-Belis and R. Stiefelhagen, eds.), pp. 226-235, Springer Berlin/Heidelberg, 2008.
    • (2008) Machine Learning for Multimodal Interaction 5237 of Lecture Notes in Computer Science , pp. 226-235
    • Tucker, S.1    Kyprianou, N.2    Whittaker, S.3
  • 274
    • 34648854438 scopus 로고    scopus 로고
    • Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling
    • A. Vinciarelli, "Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling," IEEE Transactions on Multimedia, vol. 9, no. 6, pp. 1215-1226, 2007.
    • (2007) IEEE Transactions on Multimedia , vol.9 , Issue.6 , pp. 1215-1226
    • Vinciarelli, A.1
  • 276
    • 0001739133 scopus 로고    scopus 로고
    • Fusion via a linear combination of scores
    • C. C. Vogt and G. W. Cottrell, "Fusion via a linear combination of scores," Information Retrieval, vol. 1, no. 3, pp. 151-173, 1999.
    • (1999) Information Retrieval , vol.1 , Issue.3 , pp. 151-173
    • Vogt, C.C.1    Cottrell, G.W.2
  • 282
    • 0034275766 scopus 로고    scopus 로고
    • Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese
    • H.-M. Wang, "Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese," Speech Commununication, vol. 32, no. 1-2, pp. 49-60, 2000.
    • (2000) Speech Commununication , vol.32 , Issue.1-2 , pp. 49-60
    • Wang, H.-M.1
  • 283
    • 0039174218 scopus 로고    scopus 로고
    • Mandarin spoken document retrieval based on syllable lattice matching
    • H.-M. Wang, "Mandarin spoken document retrieval based on syllable lattice matching," Pattern Recognition Letters, vol. 21, no. 6-7, pp. 615-624, 2000.
    • (2000) Pattern Recognition Letters , vol.21 , Issue.6-7 , pp. 615-624
    • Wang, H.-M.1
  • 289
    • 1842797022 scopus 로고    scopus 로고
    • New approaches to spoken document retrieval
    • M. Wechsler, E. Munteanu, and P. Schäuble, "New approaches to spoken document retrieval," Information Retrieval, vol. 3, no. 3, pp. 173-188, 2000.
    • (2000) Information Retrieval , vol.3 , Issue.3 , pp. 173-188
    • Wechsler, M.1    Munteanu, E.2    Schäuble, P.3
  • 292
    • 24144440949 scopus 로고    scopus 로고
    • Browsing recorded meetings with ferret
    • Machine Learning for Multimodal Interaction - First International Workshop, MLMI 2004
    • P. Wellner, M. Flynn, and M. Guillemot, "Browsing recorded meetings with ferret," in Machine Learning for Multimodal Interaction, vol. 3361 of Lecture Notes in Computer Science, (S. Bengio and H. Bourlard, eds.), pp. 12-21, Springer Berlin/Heidelberg, 2005. (Pubitemid 41228874)
    • (2005) Lecture Notes in Computer Science , vol.3361 , pp. 12-21
    • Wellner, P.1    Flynn, M.2    Guillemot, M.3
  • 294
    • 0035278951 scopus 로고    scopus 로고
    • Confidence measures for large vocabulary continuous speech recognition
    • DOI 10.1109/89.906002, PII S1063667601013281
    • F. Wessel, R. Schluter, K. Macherey, and H. Ney, "Confidence measures for large vocabulary continuous speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 3, pp. 288-298, 2001. (Pubitemid 32286598)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.3 , pp. 288-298
    • Wessel, F.1    Schluter, R.2    Macherey, K.3    Ney, H.4
  • 299
    • 38749097384 scopus 로고    scopus 로고
    • Design and evaluation of systems to support interaction capture and retrieval
    • DOI 10.1007/s00779-007-0146-3, Special Issue: User-centred design and evaluation of ubiquitous groupware
    • S. Whittaker, S. Tucker, K. Swampillai, and R. Laban, "Design and evaluation of systems to support interaction capture and retrieval," Personal Ubiquitous Computing, vol. 12, no. 3, pp. 197-221, 2008. (Pubitemid 351176344)
    • (2008) Personal and Ubiquitous Computing , vol.12 , Issue.3 , pp. 197-221
    • Whittaker, S.1    Tucker, S.2    Swampillai, K.3    Laban, R.4
  • 301
    • 84947288844 scopus 로고
    • HMM-based wordspotting for voice editing and indexing
    • L. D. Wilcox and M. A. Bush., "HMM-based wordspotting for voice editing and indexing," in Proceedings of Eurospeech, pp. 25-28, 1991.
    • (1991) Proceedings of Eurospeech , pp. 25-28
    • Wilcox, L.D.1    Bush, M.A.2
  • 304
  • 307
    • 85009168880 scopus 로고    scopus 로고
    • Spotting hot spots in meetings: Human judgments and prosodic cues
    • B. Wrede and E. Shriberg, "Spotting "Hot Spots" in meetings: Human judgments and prosodic cues," in Proceeindgs of Eurospeech, pp. 2805-2808, 2003.
    • (2003) Proceeindgs of Eurospeech , pp. 2805-2808
    • Wrede, B.1    Shriberg, E.2
  • 308
    • 58049207761 scopus 로고    scopus 로고
    • Speech-annotated photo retrieval using syllable-transformed patterns
    • C.-H. Wu, C.-L. Huang, W.-C. Lee, and Y.-S. Lai, "Speech-annotated photo retrieval using syllable-transformed patterns," IEEE Signal Processing Letters, vol. 16, no. 1, pp. 6-9, 2009.
    • (2009) IEEE Signal Processing Letters , vol.16 , Issue.1 , pp. 6-9
    • Wu, C.-H.1    Huang, C.-L.2    Lee, W.-C.3    Lai, Y.-S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.