SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 21, Issue 7, 2013, Pages 1330-1342

Model-based unsupervised spoken term detection with spoken queries

Authors (2): Chan, Chun An ᵃ; Lee, Lin Shan ᵃ

ᵃ National Taiwan University (Taiwan)

Author keywords

Acoustic segment model; dynamic time warping; unsupervised spoken term detection; zero resource

Indexed keywords

ACOUSTIC SEGMENT MODELS; DYNAMIC TIME WARPING; PRECISION IMPROVEMENT; PSEUDO-RELEVANCE FEEDBACKS; SELF-ORGANIZING MODEL; SPOKEN TERM DETECTION (STD); SPOKEN TERM DETECTIONS; ZERO-RESOURCE; VITERBI ALGORITHM; SPEECH RECOGNITION

EID: 84875677338 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2013.2248714 Document Type: Article

Times cited: 27

References (48)

1
- 84875679044
- The Spoken Term Detection (STD) NIST
- The Spoken Term Detection (STD) 2006 Evaluation Plan, 10th ed. NIST [Online]. Available: http://www.nist.gov/speech/tests/std
- 2006 Evaluation Plan, 10th Ed

2
- 56149113962
- Rapid and accurate spoken term detection
- D. R. H. Miller, M. Kleber, C.-L. Kao, O. Kimball, T. Colthurst, S. A. Lowe, R. M. Schwartz, and H. Gish, "Rapid and accurate spoken term detection," in Proc. INTERSPEECH, 2007.
- (2007) Proc. INTERSPEECH
- Miller, D.R.H.¹ Kleber, M.² Kao, C.-L.³ Kimball, O.⁴ Colthurst, T.⁵ Lowe, S.A.⁶ Schwartz, R.M.⁷ Gish, H.⁸

3
- 36448941168
- Vocabulary independent spoken term detection
- J. Mamou, B. Ramabhadran, and O. Siohan, "Vocabulary independent spoken term detection," in Proc. ACM-SIGIR, 2007.
- (2007) Proc. ACM-SIGIR
- Mamou, J.¹ Ramabhadran, B.² Siohan, O.³

4
- 56149122156
- A phonetic search approach to the 2006 NIST spoken term detection evaluation
- R. Wallace, R. Vogt, and S. Sridharan, "A phonetic search approach to the 2006 NIST spoken term detection evaluation," in Proc. INTERSPEECH, 2007.
- (2007) Proc. INTERSPEECH
- Wallace, R.¹ Vogt, R.² Sridharan, S.³

5
- 84865709671
- Open-vocabulary spoken-document retrieval based on query expansion using related web documents
- M. Terao, T. Koshinaka, S. Ando, R. Isotani, and A. Okumura, "Open-vocabulary spoken-document retrieval based on query expansion using related web documents," in Proc. INTERSPEECH, 2008.
- (2008) Proc. INTERSPEECH
- Terao, M.¹ Koshinaka, T.² Ando, S.³ Isotani, R.⁴ Okumura, A.⁵

6
- 70450160623
- A comparison of query-by-example methods for spoken term detection
- W. Shen, C. M. White, and T. J. Hazen, "A comparison of query-by-example methods for spoken term detection," in Proc. INTERSPEECH, 2009.
- (2009) Proc. INTERSPEECH
- Shen, W.¹ White, C.M.² Hazen, T.J.³

7
- 77949407432
- Query-by-example spoken term detection for OOV terms
- C. Parada, A. Sethy, and B. Ramabhadran, "Query-by-example spoken term detection for OOV terms," in Proc. IEEE Autom. Speech Recogn. Understand. Workshop, 2009.
- (2009) Proc. IEEE Autom. Speech Recogn. Understand. Workshop
- Parada, C.¹ Sethy, A.² Ramabhadran, B.³

8
- 77955759248
- Performance analysis for lattice-based speech indexing approaches using word and subword units
- Aug
- Y.-C. Pan and L.-S. Lee, "Performance analysis for lattice-based speech indexing approaches using word and subword units," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1562-1574, Aug. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1562-1574
- Pan, Y.-C.¹ Lee, L.-S.²

9
- 85050187568
- Lattice-based search for spoken utterance retrieval
- M. Saraclar and R. Sproat, "Lattice-based search for spoken utterance retrieval," in Proc. HLT-NAACL, 2004.
- (2004) Proc. HLT-NAACL
- Saraclar, M.¹ Sproat, R.²

10
- 70450180814
- Resources for speech research: Present and future infrastructure needs
- L. Boves, R. Carlson, E. W. Hinrichs, D. House, S. Krauwer, L. Lemnitzer, M. Vainio, and P. Wittenburg, "Resources for speech research: Present and future infrastructure needs," in Proc. INTERSPEECH, 2009.
- (2009) Proc. INTERSPEECH
- Boves, L.¹ Carlson, R.² Hinrichs, E.W.³ House, D.⁴ Krauwer, S.⁵ Lemnitzer, L.⁶ Vainio, M.⁷ Wittenburg, P.⁸

11
- 77954612828
- WWTW: The world wide telecom web
- A. Kumar, N. Rajput, D. Chakraborty, S. K. Agarwal, and A. A. Nanavati, "WWTW: The world wide telecom web," in Proc. 2007 Workshop Netw. Syst. for Develop. Regions, 2007.
- (2007) Proc. 2007 Workshop Netw. Syst. for Develop. Regions
- Kumar, A.¹ Rajput, N.² Chakraborty, D.³ Agarwal, S.K.⁴ Nanavati, A.A.⁵

12
- 33947644326
- Keyword spotting of arbitrary words using minimal speech resources
- A. Garcia and H. Gish, "Keyword spotting of arbitrary words using minimal speech resources," in Proc. ICASSP, 2006, pp. 949-952.
- (2006) Proc. ICASSP , pp. 949-952
- Garcia, A.¹ Gish, H.²

13
- 70349210894
- Unsupervised acoustic and language model training with small amounts of labelled data
- S. Novotney, R. Schwartz, and J. Ma, "Unsupervised acoustic and language model training with small amounts of labelled data," in Proc. ICASSP, 2009, pp. 4297-4300.
- (2009) Proc. ICASSP , pp. 4297-4300
- Novotney, S.¹ Schwartz, R.² Ma, J.³

14
- 84867316017
- The spoken web search task at MediaEval 2011
- F. Metze, N. Rajput, X. Anguera, M. Davel, G. Guillaume, C. v. Heerden, G. V. Mantena, A. Muscariello, K. Prahallad, I. Szöke, and J. Tejedor, "The spoken web search task at MediaEval 2011," in Proc. ICASSP, 2012, pp. 5165-5168.
- (2012) Proc. ICASSP , pp. 5165-5168
- Metze, F.¹ Rajput, N.² Anguera, X.³ Davel, M.⁴ Guillaume, G.⁵ Heerden, C.V.⁶ Mantena, G.V.⁷ Muscariello, A.⁸ Prahallad, K.⁹ Szöke, I.¹⁰ Tejedor, J.¹¹

15
- 0017930815
- Dynamic programming algorithm optimization for spoken word recognition
- H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-26, no. 1, pp. 43-49, Feb. 1978. (Pubitemid 8601900)
- (1978) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-26 , Issue.1 , pp. 43-49
- Sakoe, H.¹ Chiba, S.²

16
- 0019280090
- Performance tradeoffs in dynamic time warping algorithms for isolated word recognition
- C. Myers, L. Rabiner, and A. Rosenberg, "Performance tradeoffs in dynamic time warping algorithms for isolated word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 6, pp. 623-635, Dec. 1980. (Pubitemid 11475385)
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.6 , pp. 623-635
- Myers, C.¹ Rabiner, L.R.² Rosenberg, A.E.³

17
- 0022245547
- Keyword recognition using template concatenation
- A. Higgins and R. Wohlford, "Keyword recognition using template concatenation," in Proc. ICASSP, 1985, pp. 1233-1236. (Pubitemid 16511577)
- (1985) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , pp. 1233-1236
- Higgins, A.L.¹ Wohlford, R.E.²

18
- 0023800699
- A segment model based approach to speech recognition
- C.-H. Lee, F. K. Soong, and B.-H. Juang, "A segment model based approach to speech recognition," in Proc. ICASSP, 1988, pp. 501-504.
- (1988) Proc. ICASSP , pp. 501-504
- Lee, C.-H.¹ Soong, F.K.² Juang, B.-H.³

19
- 84867809023
- A nonparametric Bayesian approach to acoustic model discovery
- C.-Y. Lee and J. R. Glass, "A nonparametric Bayesian approach to acoustic model discovery," in Proc. ACL, 2012, pp. 40-49.
- (2012) Proc. ACL , pp. 40-49
- Lee, C.-Y.¹ Glass, J.R.²

20
- 64849085294
- Unsupervised pattern discovery in speech
- Jan
- A. S. Park and J. R. Glass, "Unsupervised pattern discovery in speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 1, pp. 186-197, Jan. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.1 , pp. 186-197
- Park, A.S.¹ Glass, J.R.²

21
- 77949473673
- Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
- Y. Zhang and J. R. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," in Proc. IEEE Autom. Speech Recogn. Understand. Workshop, 2009, pp. 398-403.
- (2009) Proc. IEEE Autom. Speech Recogn. Understand. Workshop , pp. 398-403
- Zhang, Y.¹ Glass, J.R.²

22
- 80051626575
- Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection
- M. Huijbregts, M. McLaren, and D. van Leeuwen, "Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection," in Proc. ICASSP, 2011, pp. 4436-4439.
- (2011) Proc. ICASSP , pp. 4436-4439
- Huijbregts, M.¹ McLaren, M.² Van Leeuwen, D.³

23
- 84867600320
- An acoustic segment modeling approach to query-by-example spoken term detection
- H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection," in Proc. ICASSP, 2012, pp. 5157-5160.
- (2012) Proc. ICASSP , pp. 5157-5160
- Wang, H.¹ Leung, C.-C.² Lee, T.³ Ma, B.⁴ Li, H.⁵

24
- 84865767134
- Rapid evaluation of speech representations for spoken term discovery
- M. A. Carlin, S. Thomas, A. Jansen, and H. Hermansky, "Rapid evaluation of speech representations for spoken term discovery," in Proc. INTERSPEECH, 2011.
- (2011) Proc. INTERSPEECH
- Carlin, M.A.¹ Thomas, S.² Jansen, A.³ Hermansky, H.⁴

25
- 77949351968
- Query-by-example spoken term detection using phonetic posteriorgram templates
- T. J. Hazen, W. Shen, and C. White, "Query-by-example spoken term detection using phonetic posteriorgram templates," in Proc. IEEE Autom. Speech Recognit. Understand. Workshop, 2009, pp. 421-426.
- (2009) Proc. IEEE Autom. Speech Recognit. Understand. Workshop , pp. 421-426
- Hazen, T.J.¹ Shen, W.² White, C.³

26
- 84865803305
- Zero-resource audio-only spoken term detection based on a combination of template matching techniques
- A. Muscariello, G. Gravier, and F. Bimbot, "Zero-resource audio-only spoken term detection based on a combination of template matching techniques," in Proc. INTERSPEECH, 2011.
- (2011) Proc. INTERSPEECH
- Muscariello, A.¹ Gravier, G.² Bimbot, F.³

27
- 79959823416
- Unsupervised spoken-term detection with spoken queries using segment-based dynamic time warping
- C.-A. Chan and L.-S. Lee, "Unsupervised spoken-term detection with spoken queries using segment-based dynamic time warping," in Proc. INTERSPEECH, 2010.
- (2010) Proc. INTERSPEECH
- Chan, C.-A.¹ Lee, L.-S.²

28
- 80051622244
- Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries
- Prague, Czech Republic, May
- C.-A. Chan and L.-S. Lee, "Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries," in Proc. ICASSP, Prague, Czech Republic, May 2011, pp. 5652-5655.
- (2011) Proc. ICASSP , pp. 5652-5655
- Chan, C.-A.¹ Lee, L.-S.²

29
- 0242497349
- Exact indexing of dynamic time warping
- E. Keogh, "Exact indexing of dynamic time warping," in Proc. 28th Int. Conf. Very Large Data Bases, 2002, pp. 406-417.
- (2002) Proc. 28th Int. Conf. Very Large Data Bases , pp. 406-417
- Keogh, E.¹

30
- 84865770619
- A piecewise aggregate approximation lower-bound estimate for posteriorgram-based dynamic time warping
- Y. Zhang and J. R. Glass, "A piecewise aggregate approximation lower-bound estimate for posteriorgram-based dynamic time warping," in Proc. INTERSPEECH, 2011.
- (2011) Proc. INTERSPEECH
- Zhang, Y.¹ Glass, J.R.²

31
- 2342601508
- Multimedia search with pseudo-relevance feedback
- R. Yan, A. Hauptmann, and R. Jin, "Multimedia search with pseudo-relevance feedback," in Proc. Int. Conf. Image and Video Retrieval, 2003.
- (2003) Proc. Int. Conf. Image and Video Retrieval
- Yan, R.¹ Hauptmann, A.² Jin, R.³

32
- 2342504481
- Negative pseudo-relevance feedback in content-based video retrieval
- R. Yan, A. G. Hauptmann, and R. Jin, "Negative pseudo-relevance feedback in content-based video retrieval," in Proc. ACM-Multimedia, 2003.
- (2003) Proc. ACM-Multimedia
- Yan, R.¹ Hauptmann, A.G.² Jin, R.³

33
- 84875672130
- Improving pseudo-relevance feedback in web information retrieval using web page segmentation
- S. Yu, D. Cai, J.-R. Wen, and W.-Y. Ma, "Improving pseudo-relevance feedback in web information retrieval using web page segmentation," in Proc. Int. World Wide Web Conf., 2008.
- (2008) Proc. Int. World Wide Web Conf.
- Yu, S.¹ Cai, D.² Wen, J.-R.³ Ma, W.-Y.⁴

34
- 84873444148
- A study on music genre classification based on universal acoustic models
- J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models," in Proc. ISMIR, 2006, pp. 89-94.
- (2006) Proc. ISMIR , pp. 89-94
- Reed, J.¹ Lee, C.-H.²

35
- 34547502608
- A vector space modeling approach to spoken language identification
- Jan
- H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 271-284
- Li, H.¹ Ma, B.² Lee, C.-H.³

36
- 78049411640
- An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
- Y. Tsao, H. Sun, H. Li, and C.-H. Lee, "An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition," in Proc. ICASSP, 2010, pp. 4422-4425.
- (2010) Proc. ICASSP , pp. 4422-4425
- Tsao, Y.¹ Sun, H.² Li, H.³ Lee, C.-H.⁴

37
- 70450158585
- Unsupervised training of an HMM-based speech recognizer for topic classification
- H. Gish, M.-H. Siu, A. Chan, and W. Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification," in Proc. INTERSPEECH, 2009, pp. 1935-1938.
- (2009) Proc. INTERSPEECH , pp. 1935-1938
- Gish, H.¹ Siu, M.-H.² Chan, A.³ Belfield, W.⁴

38
- 84865747527
- Unsupervised audio patterns discovery using HMM-based self-organized units
- M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Unsupervised audio patterns discovery using HMM-based self-organized units," in Proc. INTERSPEECH, 2011, pp. 2333-2336.
- (2011) Proc. INTERSPEECH , pp. 2333-2336
- Siu, M.-H.¹ Gish, H.² Lowe, S.³ Chan, A.⁴

39
- 84865757470
- Unsupervised hidden Markov modeling of spoken queries for spoken term detection without speech recognition
- C.-A. Chan and L.-S. Lee, "Unsupervised hidden Markov modeling of spoken queries for spoken term detection without speech recognition," in Proc. INTERSPEECH, 2011.
- (2011) Proc. INTERSPEECH
- Chan, C.-A.¹ Lee, L.-S.²

40
- 51449096712
- Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
- Y. Qiao, N. Shimomura, and N. Minematsu, "Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons," in Proc. ICASSP, 2008, pp. 3989-3992.
- (2008) Proc. ICASSP , pp. 3989-3992
- Qiao, Y.¹ Shimomura, N.² Minematsu, N.³

41
- 11244261680
- Apr.
- C. A. Bouman, "Cluster: An unsupervised algorithm for modeling Gaussian mixtures," Apr. 1997 [Online]. Available: http://www.ece.purdue.edu/~bouman
- (1997) Cluster: An Unsupervised Algorithm for Modeling Gaussian Mixtures
- Bouman, C.A.¹

42
- 0030245363
- From HMM's to segment models: A unified view of stochastic modeling for speech recognition
- PII S1063667696067181
- M. Ostendorf, V. Digalakis, and O. A. Kimball, "From HMM's to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 360-378, Sep. 1996. (Pubitemid 126753024)
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.V.² Kimball, O.A.³

43
- 0035509488
- Speech recognition and utterance verification based on a generalized confidence score
- DOI 10.1109/89.966085, PII S1063667601096651
- M.-W. Koo, C.-H. Lee, and B.-H. Juang, "Speech recognition and utterance verification based on a generalized confidence score," IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 821-832, Nov. 2001. (Pubitemid 33137934)
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.8 , pp. 821-832
- Koo, M.-W.¹ Lee, C.-H.² Juang, B.-H.³

44
- 0002583871
- Speech database development: Design and analysis of the acoustic-phonetic corpus
- L. F. Lamel, R. H. Kassel, and S. Seneff, "Speech database development: Design and analysis of the acoustic-phonetic corpus," in Proc. DARPA Speech Recognition Workshop, 1986.
- (1986) Proc. DARPA Speech Recognition Workshop
- Lamel, L.F.¹ Kassel, R.H.² Seneff, S.³

45
- 0003877861
- Ph.D. dissertation, Dept. of Elect. Eng. and Comput. Sci., Mass. Inst. of Technol., Cambridge, MA, USA
- A. K. Halberstadt, "Heterogeneous acoustic measurements and multiple classifiers for speech recognition," Ph.D. dissertation, Dept. of Elect. Eng. and Comput. Sci., Mass. Inst. of Technol., Cambridge, MA, USA, 1999.
- (1999) Heterogeneous Acoustic Measurements and Multiple Classifiers for Speech Recognition
- Halberstadt, A.K.¹

46
- 84890454850
- Overview of the TREC 2006
- E. M. Voorhees, "Overview of the TREC 2006," in Proc. TREC, 2006.
- (2006) Proc. TREC
- Voorhees, E.M.¹

47
- 79951634009
- Results of the 2006 spoken term detection evaluation
- J. G. Fiscus, J. Ajot, J. S. Garofolo, and G. Doddington, "Results of the 2006 spoken term detection evaluation," in Proc. INTERSPEECH, 2007.
- (2007) Proc. INTERSPEECH
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³ Doddington, G.⁴

48
- 84877656847
- The spoken web search task
- F. Metze, E. Barnard, M. Davel, C. van Heerden, X. Anguera, G. Gravier, and N. Rajput, "The spoken web search task," in Proc. MediaEval 2012 Workshop, 2012.
- (2012) Proc. MediaEval 2012 Workshop
- Metze, F.¹ Barnard, E.² Davel, M.³ Van Heerden, C.⁴ Anguera, X.⁵ Gravier, G.⁶ Rajput, N.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.