메뉴 건너뛰기




Volumn 20, Issue 7, 2012, Pages 2095-2110

Integrating recognition and retrieval with relevance feedback for spoken term detection

Author keywords

Relevance feedback; spoken term detection

Indexed keywords

ACOUSTIC MODEL; AUTOMATIC SPEECH RECOGNITION; CONTENT RETRIEVAL; PSEUDO RELEVANCE FEEDBACK; RELEVANCE FEEDBACK; RETRIEVAL TECHNIQUES; TEXT INFORMATION RETRIEVALS; TEXT RETRIEVAL; TEXT SYMBOLS; USER RELEVANCE FEEDBACKS;

EID: 84862297277     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2196514     Document Type: Article
Times cited : (10)

References (74)
  • 1
    • 85032751176 scopus 로고    scopus 로고
    • Spoken document understanding and or ganization
    • Sep.
    • L.-S. Lee and B.-L. Chen, "Spoken document understanding and or ganization," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 42-60, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 42-60
    • Lee, L.-S.1    Chen, B.-L.2
  • 2
    • 85032751967 scopus 로고    scopus 로고
    • Retrieval and browsing of spoken content
    • DOI 10.1109/MSP.2008.917992
    • C. Chelba, T. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 39-49, May 2008. (Pubitemid 351695639)
    • (2008) IEEE Signal Processing Magazine , vol.25 , Issue.3 , pp. 39-49
    • Chelba, C.1    Hazen, T.J.2    Saraclar, M.3
  • 3
    • 44849122115 scopus 로고    scopus 로고
    • Lattice-based search for spoken utterance
    • M. Saraclar and R. Sproat, "Lattice-based search for spoken utterance," in Proc. HLT, 2004.
    • (2004) Proc. HLT
    • Saraclar, M.1    Sproat, R.2
  • 4
    • 33847607574 scopus 로고    scopus 로고
    • Soft indexing of speech content for search in spoken documents
    • DOI 10.1016/j.csl.2006.09.001, PII S0885230806000313
    • C. Chelba, J. Silva, and A. Acero, "Soft indexing of speech content for search in spoken documents," Comput. Speech Lang., vol. 21, pp. 458-478, 2007. (Pubitemid 46367509)
    • (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 458-478
    • Chelba, C.1    Silva, J.2    Acero, A.3
  • 5
    • 44849083548 scopus 로고    scopus 로고
    • Position specific posterior lattices for indexing speech
    • C. Chelba and A. Acero, "Position specific posterior lattices for indexing speech," in Proc. ACL, 2005.
    • (2005) Proc. ACL
    • Chelba, C.1    Acero, A.2
  • 6
    • 77955766670 scopus 로고    scopus 로고
    • Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing
    • H.-L. C. Y.-C. Pan and L.-S. Lee, "Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing," in Proc. ASRU, 2007.
    • (2007) Proc. ASRU
    • Pan, H.-L.C.Y.-C.1    Lee, L.-S.2
  • 7
    • 34547541175 scopus 로고    scopus 로고
    • Open vocabulary spoken utterance retrieval using confusion networks
    • T. Hori, I. Hetherington, T. Hazen, and J. Glass, "Open vocabulary spoken utterance retrieval using confusion networks," in Proc. ICASSP, 2007, pp. 73-76.
    • (2007) Proc. ICASSP , pp. 73-76
    • Hori, T.1    Hetherington, I.2    Hazen, T.3    Glass, J.4
  • 9
    • 51449117494 scopus 로고    scopus 로고
    • Subword-based position specific posterior lattices (S-PSPL) for indexing speech information
    • Y.-C. Pan, H.-L. Chang, and L.-S. Lee, "Subword-based position specific posterior lattices (S-PSPL) for indexing speech information," in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Pan, Y.-C.1    Chang, H.-L.2    Lee, L.-S.3
  • 10
    • 26844534218 scopus 로고    scopus 로고
    • Approaches to reduce the effects of OOV queries on indexed spoken audio
    • DOI 10.1109/TMM.2005.854429
    • B. Logan, J.-M. Van Thong, and P. Moreno, "Approaches to reduce the effects of OOV queries on indexed spoken audio," IEEE Trans. Multimedia, vol. 7, no. 5, pp. 899-906, Oct. 2005. (Pubitemid 41452518)
    • (2005) IEEE Transactions on Multimedia , vol.7 , Issue.5 , pp. 899-906
    • Logan, B.1    Van Thong, J.M.2    Moreno, P.J.3
  • 11
    • 56149122156 scopus 로고    scopus 로고
    • A phonetic search approach to the 2006 NIST spoken term detection evaluation
    • R. Wallace, R. Vogt, and S. Sridharan, "A phonetic search approach to the 2006 NIST spoken term detection evaluation," in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Wallace, R.1    Vogt, R.2    Sridharan, S.3
  • 12
    • 84867205123 scopus 로고    scopus 로고
    • Reducing the effect of OOV query words by using morph-based spoken document retrieval
    • V. T. Turunen, "Reducing the effect of OOV query words by using morph-based spoken document retrieval," in Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Turunen, V.T.1
  • 13
    • 51449115711 scopus 로고    scopus 로고
    • A comparison of phone and grapheme-based spoken term detection
    • D. Wang, J. Frankel, J. Tejedor, and S. King, "A comparison of phone and grapheme-based spoken term detection," in Proc. ICASSP, 2008, pp. 4969-4972.
    • (2008) Proc. ICASSP , pp. 4969-4972
    • Wang, D.1    Frankel, J.2    Tejedor, J.3    King, S.4
  • 14
    • 56149088648 scopus 로고    scopus 로고
    • An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval
    • Y. Itoh, K. Iwata, K. Kojima, M. Ishigame, K. Tanaka, and S. w. Lee, "An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval," in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Itoh, Y.1    Iwata, K.2    Kojima, K.3    Ishigame, M.4    Tanaka, K.5    W. Lee, S.6
  • 15
    • 33947644326 scopus 로고    scopus 로고
    • Keyword spotting of arbitrary words using minimal speech resources
    • A. Garcia and H. Gish, "Keyword spotting of arbitrary words using minimal speech resources," in Proc. ICASSP, 2006, pp. 949-952.
    • (2006) Proc. ICASSP , pp. 949-952
    • Garcia, A.1    Gish, H.2
  • 16
    • 33646815291 scopus 로고    scopus 로고
    • Combining multiple subword representations for open-vocabulary spoken document retrieval
    • S. w. Lee, K. Tanaka, and Y. Itoh, "Combining multiple subword representations for open-vocabulary spoken document retrieval," in Proc. ICASSP, 2005, pp. 505-508.
    • (2005) Proc. ICASSP , pp. 505-508
    • Lee, S.W.1    Tanaka, K.2    Itoh, Y.3
  • 17
    • 51449122583 scopus 로고    scopus 로고
    • Fusing multiple systems into a compact lattice index for Chinese spoken term detection
    • S. Meng, P. Yu, J. Liu, and F. Seide, "Fusing multiple systems into a compact lattice index for Chinese spoken term detection," in ICASSP, 2008, pp. 4345-4348.
    • (2008) ICASSP , pp. 4345-4348
    • Meng, S.1    Yu, P.2    Liu, J.3    Seide, F.4
  • 18
    • 84862293273 scopus 로고    scopus 로고
    • Type-II dialogue systems for information access from unstructured knowledge sources
    • Y. c. Pan, H. l. Chang, and L. s. Lee, "Type-II dialogue systems for information access from unstructured knowledge sources," in Proc. ASRU, 2007.
    • (2007) Proc. ASRU
    • Pan, Y.C.1    Chang, H.L.2    Lee, L.S.3
  • 19
    • 70349227602 scopus 로고    scopus 로고
    • Learning on demand-Course lecture distillation by information extraction
    • S.-Y. Kong, M.-R. Wu, C.-K. Lin, Y.-S. Fu, and L.-S. Lee, "Learning on demand-Course lecture distillation by information extraction," in Proc. ICASSP, 2009, pp. 4709-4712.
    • (2009) Proc. ICASSP , pp. 4709-4712
    • Kong, S.-Y.1    Wu, M.-R.2    Lin, C.-K.3    Fu, Y.-S.4    Lee, L.-S.5
  • 21
    • 67149104848 scopus 로고    scopus 로고
    • Podcastle: A web 2.0 approach to speech recognition research
    • M. Goto, J. Ogata, and K. Eto, "Podcastle: A web 2.0 approach to speech recognition research," in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Goto, M.1    Ogata, J.2    Eto, K.3
  • 23
    • 78049372541 scopus 로고    scopus 로고
    • Optimising figure of merit for phonetic spoken term detection
    • R. Wallace, R. Vogt, B. Baker, and S. Sridharan, "Optimising figure of merit for phonetic spoken term detection," in Proc. ICASSP, 2010, pp. 5298-5301.
    • (2010) Proc. ICASSP , pp. 5298-5301
    • Wallace, R.1    Vogt, R.2    Baker, B.3    Sridharan, S.4
  • 24
    • 0033658324 scopus 로고    scopus 로고
    • Phonetic confusion matrix based spoken document retrieval
    • S. Srinivasan and D. Petkovic, "Phonetic confusion matrix based spoken document retrieval," in Proc. SIGIR, 2000.
    • (2000) Proc. SIGIR
    • Srinivasan, S.1    Petkovic, D.2
  • 25
    • 33646767826 scopus 로고    scopus 로고
    • A new ASR evaluation measure and minimum Bayes-risk decoding for open-domain speech understanding
    • H. Nanjo and T. Kawahara, "A new ASR evaluation measure and minimum Bayes-risk decoding for open-domain speech understanding," in Proc. ICASSP, 2005, pp. 1053-1056.
    • (2005) Proc. ICASSP , pp. 1053-1056
    • Nanjo, H.1    Kawahara, T.2
  • 26
    • 51449115420 scopus 로고    scopus 로고
    • Minimum Bayes-risk decoding with presumed word significance for speech based information retrieval
    • T. Shichiri, H. Nanjo, and T. Yoshimi, "Minimum Bayes-risk decoding with presumed word significance for speech based information retrieval," in Proc. ICASSP, 2008, pp. 1557-1560.
    • (2008) Proc. ICASSP , pp. 1557-1560
    • Shichiri, T.1    Nanjo, H.2    Yoshimi, T.3
  • 27
    • 84862280224 scopus 로고    scopus 로고
    • Automatic speech recognition based on weighted minimum classification error (W-MCE) training method
    • Q. Fu and B.-H. Juang, "Automatic speech recognition based on weighted minimum classification error (W-MCE) training method," in Proc. ASRU, 2007.
    • (2007) Proc. ASRU
    • Fu, Q.1    Juang, B.-H.2
  • 28
    • 0031139839 scopus 로고    scopus 로고
    • Minimum classification error rate methods for speech recognition
    • PII S1063667697035937
    • B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997. (Pubitemid 127745998)
    • (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.-H.1    Chou, W.2    Lee, C.-H.3
  • 29
    • 84867197153 scopus 로고    scopus 로고
    • Towards vocabulary-independent speech indexing for large-scale repositories
    • J. Shao, R.-P. Yu, Q. Zhao, Y. Yan, and F. Seide, "Towards vocabulary-independent speech indexing for large-scale repositories," in Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Shao, J.1    Yu, R.-P.2    Zhao, Q.3    Yan, Y.4    Seide, F.5
  • 31
    • 0346907250 scopus 로고    scopus 로고
    • A survey on the use of relevance feedback for information access systems
    • I. Ruthven and M. Lalmas, "A survey on the use of relevance feedback for information access systems," Knowl. Eng. Rev., pp. 95-145, 2003.
    • (2003) Knowl. Eng. Rev. , pp. 95-145
    • Ruthven, I.1    Lalmas, M.2
  • 34
    • 0034441543 scopus 로고    scopus 로고
    • Incorporate support vector machines to content-based image retrieval with relevance feedback
    • P. Hong, Q. Tian, and T. Huang, "Incorporate support vector machines to content-based image retrieval with relevance feedback," in Proc. Int. Conf. Image Process., 2000.
    • (2000) Proc. Int. Conf. Image Process.
    • Hong, P.1    Tian, Q.2    Huang, T.3
  • 39
    • 2342601508 scopus 로고    scopus 로고
    • Multimedia search with pseudo-relevance feedback
    • R. Yan, A. Hauptmann, and R. Jin, "Multimedia search with pseudo-relevance feedback," in Proc. CIVR, 2003.
    • (2003) Proc. CIVR
    • Yan, R.1    Hauptmann, A.2    Jin, R.3
  • 41
    • 79959847574 scopus 로고    scopus 로고
    • Improved spoken term detection by discriminative training of acoustic models based on user relevance feedback
    • H.-Y. Lee, C.-P. Chen, C.-F. Yeh, and L.-S. Lee, "Improved spoken term detection by discriminative training of acoustic models based on user relevance feedback," in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Lee, H.-Y.1    Chen, C.-P.2    Yeh, C.-F.3    Lee, L.-S.4
  • 42
    • 78049408076 scopus 로고    scopus 로고
    • Integrating recognition and retrieval with user feedback: A new framework for spoken term detection
    • H.-Y. Lee and L.-S. Lee, "Integrating recognition and retrieval with user feedback: A new framework for spoken term detection," in Proc. ICASSP, 2010, pp. 5290-5293.
    • (2010) Proc. ICASSP , pp. 5290-5293
    • Lee, H.-Y.1    Lee, L.-S.2
  • 43
    • 79959850446 scopus 로고    scopus 로고
    • Improved spoken term detection by feature space pseudo-relevance feedback
    • C.-P. Chen, H.-Y. Lee, C.-F. Yeh, and L.-S. Lee, "Improved spoken term detection by feature space pseudo-relevance feedback," in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Chen, C.-P.1    Lee, H.-Y.2    Yeh, C.-F.3    Lee, L.-S.4
  • 45
    • 79951783781 scopus 로고    scopus 로고
    • A framework integrating different relevance feedback scenarios and approaches for spoken term detection
    • H.-Y. Lee, C.-P. Chen, C.-F. Yeh, and L.-S. Lee, "A framework integrating different relevance feedback scenarios and approaches for spoken term detection," in Proc. SLT, 2010.
    • (2010) Proc. SLT
    • Lee, H.-Y.1    Chen, C.-P.2    Yeh, C.-F.3    Lee, L.-S.4
  • 47
    • 84862293276 scopus 로고    scopus 로고
    • Optimizing search engines using clickthrough data
    • J. Thorsten, "Optimizing search engines using clickthrough data," in Proc. KDD, 2002.
    • (2002) Proc. KDD
    • Thorsten, J.1
  • 49
    • 84885632533 scopus 로고    scopus 로고
    • Context sensitive information retrieval using implicit feedback
    • X. Shen, B. Tan, and C. Zhia, "Context sensitive information retrieval using implicit feedback," in Proc. SIGIR, 2005.
    • (2005) Proc. SIGIR
    • Shen, X.1    Tan, B.2    Zhia, C.3
  • 50
    • 0037708475 scopus 로고    scopus 로고
    • Relevance feedback in image retrieval: A comprehensive review
    • X. S. Zhou and T. S. Huang, "Relevance feedback in image retrieval: A comprehensive review," Multimedia Syst., vol. 8, pp. 536-544, 2003.
    • (2003) Multimedia Syst. , vol.8 , pp. 536-544
    • Zhou, X.S.1    Huang, T.S.2
  • 51
    • 77949351968 scopus 로고    scopus 로고
    • Query-by-example spoken term detection using phonetic posteriorgram templetes
    • T. J. Hazen, W. Shen, and C. White, "Query-by-example spoken term detection using phonetic posteriorgram templetes," in Proc. ASRU, 2009.
    • (2009) Proc. ASRU
    • Hazen, T.J.1    Shen, W.2    White, C.3
  • 52
    • 84880475213 scopus 로고    scopus 로고
    • Improving pseudo-relevance feedback in web information retrieval using web page segmentation
    • S. Yu, D. Cai, J.-R. Wen, and W.-Y. Ma, "Improving pseudo-relevance feedback in web information retrieval using web page segmentation," in Proc. 12th Int. Conf. World Wide Web, 2003.
    • (2003) Proc. 12th Int. Conf. World Wide Web
    • Yu, S.1    Cai, D.2    Wen, J.-R.3    Ma, W.-Y.4
  • 59
    • 74549206716 scopus 로고    scopus 로고
    • A comparative study of methods for estimating query language models with pseudo feedback
    • Y. Lv and C. Zhai, "A comparative study of methods for estimating query language models with pseudo feedback," in Proc. 18th ACM Conf. Inf. Knowl. Manage., 2009.
    • (2009) Proc. 18th ACM Conf. Inf. Knowl. Manage.
    • Lv, Y.1    Zhai, C.2
  • 67
    • 51449084490 scopus 로고    scopus 로고
    • Spoken term detection for Turkish broadcast news
    • S. Parlak and M. Saraclar, "Spoken term detection for Turkish broadcast news," in Proc. ICASSP, 2008, pp. 5244-5247.
    • (2008) Proc. ICASSP , pp. 5244-5247
    • Parlak, S.1    Saraclar, M.2
  • 68
    • 78049362755 scopus 로고    scopus 로고
    • Towards multi-speaker unsupervised speech pattern discovery
    • Y. Zhang and J. Glass, "Towards multi-speaker unsupervised speech pattern discovery," in Proc. ICASSP, 2010, pp. 4366-4369.
    • (2010) Proc. ICASSP , pp. 4366-4369
    • Zhang, Y.1    Glass, J.2
  • 69
    • 77949473673 scopus 로고    scopus 로고
    • Unsupervised spoken keyword spotting via segmental Dtw on Gaussian posteriorgrams
    • Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental Dtw on Gaussian posteriorgrams," in Proc. ASRU, 2009.
    • (2009) Proc. ASRU
    • Zhang, Y.1    Glass, J.2
  • 70
    • 70450190034 scopus 로고    scopus 로고
    • Podcastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription
    • J. Ogata and M. Goto, "Podcastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription," in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Ogata, J.1    Goto, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.