메뉴 건너뛰기




Volumn 21, Issue 7, 2013, Pages 1330-1342

Model-based unsupervised spoken term detection with spoken queries

Author keywords

Acoustic segment model; dynamic time warping; unsupervised spoken term detection; zero resource

Indexed keywords

ACOUSTIC SEGMENT MODELS; DYNAMIC TIME WARPING; PRECISION IMPROVEMENT; PSEUDO-RELEVANCE FEEDBACKS; SELF-ORGANIZING MODEL; SPOKEN TERM DETECTION (STD); SPOKEN TERM DETECTIONS; ZERO-RESOURCE;

EID: 84875677338     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2248714     Document Type: Article
Times cited : (27)

References (48)
  • 1
    • 84875679044 scopus 로고    scopus 로고
    • The Spoken Term Detection (STD) NIST
    • The Spoken Term Detection (STD) 2006 Evaluation Plan10th ed. NIST [Online]. Available: http://www.nist.gov/speech/tests/std
    • 2006 Evaluation Plan10th Ed
  • 4
    • 56149122156 scopus 로고    scopus 로고
    • A phonetic search approach to the 2006 NIST spoken term detection evaluation
    • R. Wallace, R. Vogt, and S. Sridharan, "A phonetic search approach to the 2006 NIST spoken term detection evaluation," in Proc. INTERSPEECH, 2007.
    • (2007) Proc. INTERSPEECH
    • Wallace, R.1    Vogt, R.2    Sridharan, S.3
  • 5
    • 84865709671 scopus 로고    scopus 로고
    • Open-vocabulary spoken-document retrieval based on query expansion using related web documents
    • M. Terao, T. Koshinaka, S. Ando, R. Isotani, and A. Okumura, "Open-vocabulary spoken-document retrieval based on query expansion using related web documents," in Proc. INTERSPEECH, 2008.
    • (2008) Proc. INTERSPEECH
    • Terao, M.1    Koshinaka, T.2    Ando, S.3    Isotani, R.4    Okumura, A.5
  • 6
    • 70450160623 scopus 로고    scopus 로고
    • A comparison of query-by-example methods for spoken term detection
    • W. Shen, C. M. White, and T. J. Hazen, "A comparison of query-by-example methods for spoken term detection," in Proc. INTERSPEECH, 2009.
    • (2009) Proc. INTERSPEECH
    • Shen, W.1    White, C.M.2    Hazen, T.J.3
  • 8
    • 77955759248 scopus 로고    scopus 로고
    • Performance analysis for lattice-based speech indexing approaches using word and subword units
    • Aug
    • Y.-C. Pan and L.-S. Lee, "Performance analysis for lattice-based speech indexing approaches using word and subword units," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1562-1574, Aug. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1562-1574
    • Pan, Y.-C.1    Lee, L.-S.2
  • 9
    • 85050187568 scopus 로고    scopus 로고
    • Lattice-based search for spoken utterance retrieval
    • M. Saraclar and R. Sproat, "Lattice-based search for spoken utterance retrieval," in Proc. HLT-NAACL, 2004.
    • (2004) Proc. HLT-NAACL
    • Saraclar, M.1    Sproat, R.2
  • 12
    • 33947644326 scopus 로고    scopus 로고
    • Keyword spotting of arbitrary words using minimal speech resources
    • A. Garcia and H. Gish, "Keyword spotting of arbitrary words using minimal speech resources," in Proc. ICASSP, 2006, pp. 949-952.
    • (2006) Proc. ICASSP , pp. 949-952
    • Garcia, A.1    Gish, H.2
  • 13
    • 70349210894 scopus 로고    scopus 로고
    • Unsupervised acoustic and language model training with small amounts of labelled data
    • S. Novotney, R. Schwartz, and J. Ma, "Unsupervised acoustic and language model training with small amounts of labelled data," in Proc. ICASSP, 2009, pp. 4297-4300.
    • (2009) Proc. ICASSP , pp. 4297-4300
    • Novotney, S.1    Schwartz, R.2    Ma, J.3
  • 18
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • C.-H. Lee, F. K. Soong, and B.-H. Juang, "A segment model based approach to speech recognition," in Proc. ICASSP, 1988, pp. 501-504.
    • (1988) Proc. ICASSP , pp. 501-504
    • Lee, C.-H.1    Soong, F.K.2    Juang, B.-H.3
  • 19
    • 84867809023 scopus 로고    scopus 로고
    • A nonparametric Bayesian approach to acoustic model discovery
    • C.-Y. Lee and J. R. Glass, "A nonparametric Bayesian approach to acoustic model discovery," in Proc. ACL, 2012, pp. 40-49.
    • (2012) Proc. ACL , pp. 40-49
    • Lee, C.-Y.1    Glass, J.R.2
  • 20
    • 64849085294 scopus 로고    scopus 로고
    • Unsupervised pattern discovery in speech
    • Jan
    • A. S. Park and J. R. Glass, "Unsupervised pattern discovery in speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 1, pp. 186-197, Jan. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.1 , pp. 186-197
    • Park, A.S.1    Glass, J.R.2
  • 21
    • 77949473673 scopus 로고    scopus 로고
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
    • Y. Zhang and J. R. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," in Proc. IEEE Autom. Speech Recogn. Understand. Workshop, 2009, pp. 398-503.
    • (2009) Proc. IEEE Autom. Speech Recogn. Understand. Workshop , pp. 398-503
    • Zhang, Y.1    Glass, J.R.2
  • 22
    • 80051626575 scopus 로고    scopus 로고
    • Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection
    • M. Huijbregts, M. McLaren, and D. van Leeuwen, "Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection," in Proc. ICASSP, 2011, pp. 4436-4439.
    • (2011) Proc. ICASSP , pp. 4436-4439
    • Huijbregts, M.1    McLaren, M.2    Van Leeuwen, D.3
  • 23
    • 84867600320 scopus 로고    scopus 로고
    • An acoustic segment modeling approach to query-by-example spoken term detection
    • H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection," in Proc. ICASSP, 2012, pp. 5157-5160.
    • (2012) Proc. ICASSP , pp. 5157-5160
    • Wang, H.1    Leung, C.-C.2    Lee, T.3    Ma, B.4    Li, H.5
  • 26
    • 84865803305 scopus 로고    scopus 로고
    • Zero-resource audio-only spoken term detection based on a combination of template matching techniques
    • A. Muscariello, G. Gravier, and F. Bimbot, "Zero-resource audio-only spoken term detection based on a combination of template matching techniques," in Proc. INTERSPEECH, 2011.
    • (2011) Proc. INTERSPEECH
    • Muscariello, A.1    Gravier, G.2    Bimbot, F.3
  • 27
    • 79959823416 scopus 로고    scopus 로고
    • Unsupervised spoken-term detection with spoken queries using segment-based dynamic time warping
    • C.-A. Chan and L.-S. Lee, "Unsupervised spoken-term detection with spoken queries using segment-based dynamic time warping," in Proc. INTERSPEECH, 2010.
    • (2010) Proc. INTERSPEECH
    • Chan, C.-A.1    Lee, L.-S.2
  • 28
    • 80051622244 scopus 로고    scopus 로고
    • Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries
    • Prague, Czech Republic, May
    • C.-A. Chan and L.-S. Lee, "Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries," in Proc. ICASSP, Prague, Czech Republic, May 2011, pp. 5652-5655.
    • (2011) Proc. ICASSP , pp. 5652-5655
    • Chan, C.-A.1    Lee, L.-S.2
  • 30
    • 84865770619 scopus 로고    scopus 로고
    • A piecewise aggregate approximation lower-bound estimate for posteriorgram-based dynamic time warping
    • Y. Zhang and J. R. Glass, "A piecewise aggregate approximation lower-bound estimate for posteriorgram-based dynamic time warping," in Proc. INTERSPEECH, 2011.
    • (2011) Proc. INTERSPEECH
    • Zhang, Y.1    Glass, J.R.2
  • 32
    • 2342504481 scopus 로고    scopus 로고
    • Negative pseudo-relevance feedback in content-based video retrieval
    • R. Yan, A. G. Hauptmann, and R. Jin, "Negative pseudo-relevance feedback in content-based video retrieval," in Proc. ACM-Multimedia, 2003.
    • (2003) Proc. ACM-Multimedia
    • Yan, R.1    Hauptmann, A.G.2    Jin, R.3
  • 33
    • 84875672130 scopus 로고    scopus 로고
    • Improving pseudo-relevance feedback in web information retrieval using web page segmentation
    • S. Yu, D. Cai, J.-R. Wen, and W.-Y. Ma, "Improving pseudo-relevance feedback in web information retrieval using web page segmentation," in Proc. Int. World Wide Web Conf., 2008.
    • (2008) Proc. Int. World Wide Web Conf.
    • Yu, S.1    Cai, D.2    Wen, J.-R.3    Ma, W.-Y.4
  • 34
    • 84873444148 scopus 로고    scopus 로고
    • A study on music genre classification based on universal acoustic models
    • J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models," in Proc. ISMIR, 2006, pp. 89-94.
    • (2006) Proc. ISMIR , pp. 89-94
    • Reed, J.1    Lee, C.-H.2
  • 35
    • 34547502608 scopus 로고    scopus 로고
    • A vector space modeling approach to spoken language identification
    • Jan
    • H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 271-284
    • Li, H.1    Ma, B.2    Lee, C.-H.3
  • 36
    • 78049411640 scopus 로고    scopus 로고
    • An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
    • Y. Tsao, H. Sun, H. Li, and C.-H. Lee, "An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition," in Proc. ICASSP, 2010, pp. 4422-4425.
    • (2010) Proc. ICASSP , pp. 4422-4425
    • Tsao, Y.1    Sun, H.2    Li, H.3    Lee, C.-H.4
  • 37
    • 70450158585 scopus 로고    scopus 로고
    • Unsupervised training of an HMM-based speech recognizer for topic classification
    • H. Gish, M.-H. Siu, A. Chan, and W. Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification," in Proc. INTERSPEECH, 2009, pp. 1935-1938.
    • (2009) Proc. INTERSPEECH , pp. 1935-1938
    • Gish, H.1    Siu, M.-H.2    Chan, A.3    Belfield, W.4
  • 38
    • 84865747527 scopus 로고    scopus 로고
    • Unsupervised audio patterns discovery using HMM-based self-organized units
    • M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Unsupervised audio patterns discovery using HMM-based self-organized units," in Proc. INTERSPEECH, 2011, pp. 2333-2336.
    • (2011) Proc. INTERSPEECH , pp. 2333-2336
    • Siu, M.-H.1    Gish, H.2    Lowe, S.3    Chan, A.4
  • 39
    • 84865757470 scopus 로고    scopus 로고
    • Unsupervised hidden Markov modeling of spoken queries for spoken term detection without speech recognition
    • C.-A. Chan and L.-S. Lee, "Unsupervised hidden Markov modeling of spoken queries for spoken term detection without speech recognition," in Proc. INTERSPEECH, 2011.
    • (2011) Proc. INTERSPEECH
    • Chan, C.-A.1    Lee, L.-S.2
  • 40
    • 51449096712 scopus 로고    scopus 로고
    • Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
    • Y. Qiao, N. Shimomura, and N. Minematsu, "Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons," in Proc. ICASSP, 2008, pp. 3989-3992.
    • (2008) Proc. ICASSP , pp. 3989-3992
    • Qiao, Y.1    Shimomura, N.2    Minematsu, N.3
  • 42
    • 0030245363 scopus 로고    scopus 로고
    • From HMM's to segment models: A unified view of stochastic modeling for speech recognition
    • PII S1063667696067181
    • M. Ostendorf, V. Digalakis, and O. A. Kimball, "From HMMS to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 360-378, Sep. 1995. (Pubitemid 126753024)
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 360-378
    • Ostendorf, M.1    Digalakis, V.V.2    Kimball, O.A.3
  • 43
    • 0035509488 scopus 로고    scopus 로고
    • Speech recognition and utterance verification based on a generalized confidence score
    • DOI 10.1109/89.966085, PII S1063667601096651
    • M.-W. Koo, C.-H. Lee, and B.-H. Juang, "Speech recognition and utterance verification based on a generalized confidence score," IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 821-832, Nov. 2001. (Pubitemid 33137934)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.8 , pp. 821-832
    • Koo, M.-W.1    Lee, C.-H.2    Juang, B.-H.3
  • 46
    • 84890454850 scopus 로고    scopus 로고
    • Overview of the TREC 2006
    • E. M. Voorhees, "Overview of the TREC 2006," in Proc. TREC, 2006.
    • (2006) Proc. TREC
    • Voorhees, E.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.