Volume 28, Issue 5, 2014, Pages 1066-1082

Language independent search in MediaEval's Spoken Web Search task

Author keywords

Evaluation; Low resource speech technology; Spoken term detection; Spoken web

Indexed keywords

INFORMATION RETRIEVAL;

EID: 84902547894     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2013.12.004     Document Type: Article
Times cited: 32

References (69)
  • 1
    • 84887114105
    • The L2F spoken web search system for MediaEval 2012
    • http://ceur-ws.org/Vol-927/
    • A. Abad, and R.F. Astudillo The L2F spoken web search system for MediaEval 2012 Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Abad, A.1    Astudillo, R.F.2
  • 6
    • 84887115089
    • Telefonica system for the spoken web search task at MediaEval 2011
    • http://ceur-ws.org/Vol-807/
    • X. Anguera Telefonica system for the spoken web search task at MediaEval 2011 Proc. MediaEval 2011 2012 http://www.multimediaeval.org/mediaeval2011/; http://ceur-ws.org/Vol-807/
    • (2012) Proc. MediaEval 2011
    • Anguera, X.1
  • 7
    • 84887115089
    • Telefonica research system for the spoken web search task at MediaEval 2012
    • http://ceur-ws.org/Vol-927/
    • X. Anguera Telefonica research system for the spoken web search task at MediaEval 2012 Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Anguera, X.1
  • 8
    • 70450219527
    • ASR corpus design for resource-scarce languages
    • ISCA Brighton, UK
    • E. Barnard, M.H. Davel, and C. van Heerden ASR corpus design for resource-scarce languages Proc. INTERSPEECH 2009 ISCA Brighton, UK 2847 2850
    • (2009) Proc. INTERSPEECH , pp. 2847-2850
    • Barnard, E.1    Davel, M.H.2    Van Heerden, C.3
  • 11
    • 84877672373
    • ARF@MediaEval 2012: A Romanian ASR-based approach to spoken term detection
    • Buzo, A., Cucu, H., Safta, M., Ionescu, B., Burileanu, C., 2012. ARF@MediaEval 2012: a Romanian ASR-based approach to spoken term detection. In: Proc. MediaEval 2012, http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/.
    • (2012) Proc. MediaEval 2012
    • Buzo, A.1    Cucu, H.2    Safta, M.3    Ionescu, B.4    Burileanu, C.5
  • 12
    • 85032751967
    • Retrieval and browsing of spoken content
    • DOI 10.1109/MSP.2008.917992
    • C. Chelba, T.J. Hazen, and M. Saraçlar Retrieval and browsing of spoken content IEEE Sign. Process. Mag. 25 3 2008 39 49 (Pubitemid 351695639)
    • (2008) IEEE Signal Processing Magazine , vol.25 , Issue.3 , pp. 39-49
    • Chelba, C.1    Hazen, T.J.2    Saraclar, M.3
  • 13
    • 84865706565
    • Woefzela - An open-source platform for ASR data collection in the developing world
    • ISCA Florence, Italy
    • N.J. de Vries, J. Badenhorst, M.H. Davel, E. Barnard, and A. de Waal Woefzela - an open-source platform for ASR data collection in the developing world Proc. INTERSPEECH 2011 ISCA Florence, Italy 3177 3180
    • (2011) Proc. INTERSPEECH , pp. 3177-3180
    • De Vries, N.J.1    Badenhorst, J.2    Davel, M.H.3    Barnard, E.4    De Waal, A.5
  • 16
    • 79951634009
    • Results of the 2006 spoken term detection evaluation
    • Amsterdam, Netherlands
    • J. Fiscus, J. Ajot, J. Garofolo, and G. Doddington Results of the 2006 spoken term detection evaluation Proc. SSCS Amsterdam, Netherlands 2007
    • (2007) Proc. SSCS
    • Fiscus, J.1    Ajot, J.2    Garofolo, J.3    Doddington, G.4
  • 17
    • 0030638031
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • IEEE Santa Barbara, CA, USA
    • J. Fiscus A post-processing system to yield reduced word error rates: recognizer output voting error reduction (ROVER) Proc. Automatic Speech Recognition and Understanding Workshop 1997 IEEE Santa Barbara, CA, USA 347 354
    • (1997) Proc. Automatic Speech Recognition and Understanding Workshop , pp. 347-354
    • Fiscus, J.1
  • 19
    • 77949351968
    • Query-by-example spoken term detection using phonetic posteriorgram templates
    • IEEE Merano, Italy
    • T.J. Hazen, W. Shen, and C. White Query-by-example spoken term detection using phonetic posteriorgram templates Proc. ASRU 2009 IEEE Merano, Italy
    • (2009) Proc. ASRU
    • Hazen, T.J.1    Shen, W.2    White, C.3
  • 21
    • 84890543656
    • IARPA-BAA-11-02. Last accessed: March 1, 2014
    • Intelligence Advanced Research Projects Activity, 2011. IARPA-BAA-11-02, http://www.iarpa.gov/Programs/ia/Babel/babel.html. Last accessed: March 1, 2014.
    • (2011) Intelligence Advanced Research Projects Activity
  • 22
    • 84902545376
    • Last accessed: March 1, 2014
    • Internet Usage World-Wide by Country, 2010. http://www.infoplease.com/ipa/A0933606.html. Last accessed: March 1, 2014.
    • (2010) Internet Usage World-Wide by Country
  • 23
    • 84878566254
    • Indexing raw acoustic features for scalable zero resource search
    • ISCA Portland, OR, USA
    • A. Jansen, and B.V. Durme Indexing raw acoustic features for scalable zero resource search Proc. INTERSPEECH 2012 ISCA Portland, OR, USA
    • (2012) Proc. INTERSPEECH
    • Jansen, A.1    Durme, B.V.2
  • 24
    • 84887082392
    • The JHU-HLTCOE spoken web search system for MediaEval 2012
    • http://ceur-ws.org/Vol-927/
    • A. Jansen, B. van Durme, and P. Clark The JHU-HLTCOE spoken web search system for MediaEval 2012 Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Jansen, A.1    Van Durme, B.2    Clark, P.3
  • 26
    • 84887060955
    • The TUM cumulative DTW approach for the MediaEval 2012 spoken web search task
    • http://ceur-ws.org/Vol-927/
    • C. Joder, F. Weninger, M. Wöllmer, and B. Schuller The TUM cumulative DTW approach for the MediaEval 2012 spoken web search task Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Joder, C.1    Weninger, F.2    Wöllmer, M.3    Schuller, B.4
  • 30
    • 84906248285
    • Formalizing expert knowledge for developing accurate speech recognizers
    • ISCA Lyon, France
    • A. Kumar, F. Metze, W. Wang, and M. Kam Formalizing expert knowledge for developing accurate speech recognizers Proc. INTERSPEECH 2013 ISCA Lyon, France
    • (2013) Proc. INTERSPEECH
    • Kumar, A.1    Metze, F.2    Wang, W.3    Kam, M.4
  • 32
    • 0034296009
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • L. Mangu, E. Brill, and A. Stolcke Finding consensus in speech recognition: word error minimization and other applications of confusion networks Comput. Speech Language 14 4 2000 373 400
    • (2000) Comput. Speech Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 33
    • 84890443127
    • Speed improvements to information retrieval-based dynamic time warping using hierarchical k-means clustering
    • IEEE Vancouver, Canada
    • G. Mantena, and X. Anguera Speed improvements to information retrieval-based dynamic time warping using hierarchical k-means clustering Proc. ICASSP 2013 IEEE Vancouver, Canada
    • (2013) Proc. ICASSP
    • Mantena, G.1    Anguera, X.2
  • 34
    • 84902547799
    • SWS task: Articulatory phonetic units and sliding DTW
    • http://ceur-ws.org/Vol-807/
    • G.V. Mantena, B. Babu, and K. Prahallad SWS task: articulatory phonetic units and sliding DTW Proc. MediaEval 2011 2011 http://www.multimediaeval.org/mediaeval2011/; http://ceur-ws.org/Vol-807/
    • (2011) Proc. MediaEval 2011
    • Mantena, G.V.1    Babu, B.2    Prahallad, K.3
  • 35
    • 84902542384
    • MediaEval Benchmark
    • MediaEval Benchmark, MediaEval 2011 Workshop, http://www.multimediaeval.org/mediaeval2011/; http://ceur-ws.org/Vol-807/.
    • MediaEval 2011 Workshop
  • 36
    • 84902542377
    • MediaEval Benchmark
    • MediaEval Benchmark, MediaEval 2012 Workshop, http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/.
    • MediaEval 2012 Workshop
  • 37
    • 84902542368
    • MediaEval Benchmark
    • MediaEval Benchmark, MediaEval 2013 Workshop, http://www.multimediaeval.org/mediaeval2013/; http://ceur-ws.org/Vol-1043/.
    • MediaEval 2013 Workshop
  • 38
    • 84902542369
    • MediaEval Benchmark
    • MediaEval Benchmark, 2014. http://www.multimediaeval.org/.
    • (2014)
  • 43
    • 84902542370
    • IRISA MediaEval 2011 spoken web search system
    • http://ceur-ws.org/Vol-807/
    • A. Muscariello, and G. Gravier IRISA MediaEval 2011 spoken web search system Proc. MediaEval 2011 2012 http://www.multimediaeval.org/mediaeval2011/; http://ceur-ws.org/Vol-807/
    • (2012) Proc. MediaEval 2011
    • Muscariello, A.1    Gravier, G.2
  • 44
    • 84865803305
    • A zero-resource system for audio-only spoken term detection using a combination of pattern matching techniques
    • ISCA Florence, Italy
    • A. Muscariello, G. Gravier, and F. Bimbot A zero-resource system for audio-only spoken term detection using a combination of pattern matching techniques Proc. INTERSPEECH 2011 ISCA Florence, Italy
    • (2011) Proc. INTERSPEECH
    • Muscariello, A.1    Gravier, G.2    Bimbot, F.3
  • 45
    • 84861498500
    • Unsupervised motif acquisition in speech via seeded discovery and template matching combination
    • A. Muscariello, G. Gravier, and F. Bimbot Unsupervised motif acquisition in speech via seeded discovery and template matching combination IEEE Trans. Audio Speech Language 20 7 2012 2031 2044
    • (2012) IEEE Trans. Audio Speech Language , vol.20 , Issue.7 , pp. 2031-2044
    • Muscariello, A.1    Gravier, G.2    Bimbot, F.3
  • 46
    • 84878537462
    • Exploiting discriminative point process models for spoken term detection
    • ISCA Portland, OR, USA
    • A. Norouzian, A. Jansen, R. Rose, and S. Thomas Exploiting discriminative point process models for spoken term detection Proc. INTERSPEECH 2012 ISCA Portland, OR, USA
    • (2012) Proc. INTERSPEECH
    • Norouzian, A.1    Jansen, A.2    Rose, R.3    Thomas, S.4
  • 48
    • 77949394249
    • Last accessed: March 1, 2014
    • Phoneme recognizer based on long temporal context, 2009. http://speech.fit.vutbr.cz/software/phoneme-recognizer-based-long-temporal-context. Last accessed: March 1, 2014.
    • (2009) Phoneme Recognizer Based on Long Temporal Context
  • 49
    • 84887107753
    • Spoken web search
    • http://ceur-ws.org/Vol-807/
    • N. Rajput, and F. Metze Spoken web search Proc. MediaEval 2011 2011 http://www.multimediaeval.org/mediaeval2011/; http://ceur-ws.org/Vol-807/
    • (2011) Proc. MediaEval 2011
    • Rajput, N.1    Metze, F.2
  • 50
    • 84883293124
    • Job opportunities through entertainment: Virally spread speech-based services for low-literate users
    • ACM Paris, France
    • A.A. Raza, F.U. Haq, Z. Tariq, M. Pervaiz, S. Razaq, U. Saif, and R. Rosenfeld Job opportunities through entertainment: virally spread speech-based services for low-literate users Proc. CHI 2013 ACM Paris, France
    • (2013) Proc. CHI
    • Raza, A.A.1    Haq, F.U.2    Tariq, Z.3    Pervaiz, M.4    Razaq, S.5    Saif, U.6    Rosenfeld, R.7
  • 52
    • 77949394249
    • Faculty of Information Technology, Brno University of Technology (BUT) (Ph.D. thesis).
    • P. Schwarz Phoneme recognition based on long temporal context 2009 Faculty of Information Technology, Brno University of Technology (BUT) (Ph.D. thesis). http://www.fit.vutbr.cz/research/view-pub.php?id=9132
    • (2009) Phoneme Recognition Based on Long Temporal Context
    • Schwarz, P.1
  • 53
    • 70450160623
    • A comparison of query-by-example methods for spoken term detection
    • ISCA Brighton, UK
    • W. Shen, C. White, and T.J. Hazen A comparison of query-by-example methods for spoken term detection Proc. INTERSPEECH 2009 ISCA Brighton, UK
    • (2009) Proc. INTERSPEECH
    • Shen, W.1    White, C.2    Hazen, T.J.3
  • 55
    • 33646041005
    • Phoneme based acoustics keyword spotting in informal continuous speech
    • I. Szöke, P. Schwarz, P. Matějka, L. Burget, M. Karafiát, and J. Černocký Phoneme based acoustics keyword spotting in informal continuous speech LNAI 3658 2005 302 309 http://www.fit.vutbr.cz/research/view-pub.php?id=7882.
    • (2005) LNAI , vol.3658 , pp. 302-309
    • Szöke, I.1    Schwarz, P.2    Matějka, P.3    Burget, L.4    Karafiát, M.5    Černocký, J.6
  • 56
    • 84906215350
    • BUT-HCTLab approaches for spoken web search
    • http://ceur-ws.org/Vol-807/
    • I. Szöke, J. Tejedor, M. Fapšo, and J. Colás BUT-HCTLab approaches for spoken web search Proc. MediaEval 2011 2011 http://www.multimediaeval.org/mediaeval2011/; http://ceur-ws.org/Vol-807/
    • (2011) Proc. MediaEval 2011
    • Szöke, I.1    Tejedor, J.2    Fapšo, M.3    Colás, J.4
  • 57
    • 84887087822
    • BUT 2012 approaches for spoken web search - MediaEval 2012
    • http://ceur-ws.org/Vol-927/
    • I. Szöke, M. Fapšo, and K. Veselý BUT 2012 approaches for spoken web search - MediaEval 2012 Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Szöke, I.1    Fapšo, M.2    Veselý, K.3
  • 58
    • 84867300870
    • Faculty of Information Technology BUT (Ph.D. thesis).
    • I. Szöke Hybrid word-subword spoken term detection 2010 Faculty of Information Technology BUT (Ph.D. thesis). http://www.fit.vutbr.cz/research/view-pub.php?id=9375
    • (2010) Hybrid Word-subword Spoken Term Detection
    • Szöke, I.1
  • 59
  • 60
    • 84877676391
    • TUKE MediaEval 2012: Spoken web search using DTW and unsupervised SVM
    • http://ceur-ws.org/Vol-927/
    • J. Vavrek, M. Pleva, and J. Juhár TUKE MediaEval 2012: spoken web search using DTW and unsupervised SVM Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Vavrek, J.1    Pleva, M.2    Juhár, J.3
  • 61
    • 56149122156
    • A phonetic search approach to the 2006 NIST spoken term detection evaluation
    • ISCA Antwerpen, Belgium
    • R.G. Wallace, R.J. Vogt, and S. Sridharan A phonetic search approach to the 2006 NIST spoken term detection evaluation Proc. INTERSPEECH 2007 ISCA Antwerpen, Belgium
    • (2007) Proc. INTERSPEECH
    • Wallace, R.G.1    Vogt, R.J.2    Sridharan, S.3
  • 62
    • 84887104587
    • CUHK system for the spoken web search task at MediaEval 2012
    • http://ceur-ws.org/Vol-927/
    • H. Wang, and T. Lee CUHK system for the spoken web search task at MediaEval 2012 Proc. MediaEval 2012 2012 http://www.multimediaeval.org/mediaeval2012/; http://ceur-ws.org/Vol-927/
    • (2012) Proc. MediaEval 2012
    • Wang, H.1    Lee, T.2
  • 63
    • 79951661301
    • Stochastic pronunciation modelling for out-of-vocabulary spoken term detection
    • D. Wang, S. King, and J. Frankel Stochastic pronunciation modelling for out-of-vocabulary spoken term detection IEEE Trans. Audio, Speech, Language Process. 9 4 2011 http://homepages.inf.ed.ac.uk/v1dwang2/public/tools/index.html
    • (2011) IEEE Trans. Audio, Speech, Language Process. , vol.9 , Issue.4
    • Wang, D.1    King, S.2    Frankel, J.3
  • 64
    • 84890452240
    • Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection
    • IEEE Vancouver, Canada
    • H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection Proc. ICASSP 2013 IEEE Vancouver, Canada
    • (2013) Proc. ICASSP
    • Wang, H.1    Lee, T.2    Leung, C.-C.3    Ma, B.4    Li, H.5
  • 67
    • 77949473673
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
    • Merano, Italy
    • Y. Zhang, and J. Glass Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams Proc. ASRU, IEEE Merano, Italy 2009
    • (2009) Proc. ASRU, IEEE
    • Zhang, Y.1    Glass, J.2
  • 69
    • 85050710669
    • Orthographic measures of language distances between the official South African languages
    • P.N. Zulu, G. Botha, and E. Barnard Orthographic measures of language distances between the official South African languages Literator 29 1 2008 1 20
    • (2008) Literator , vol.29 , Issue.1 , pp. 1-20
    • Zulu, P.N.1    Botha, G.2    Barnard, E.3


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.