SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2012, Pages 5157-5160

An acoustic segment modeling approach to query-by-example spoken term detection

(5) Wang, Haipeng a Leung, Cheung Chi b Lee, Tan a Ma, Bin b Li, Haizhou b

a CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

b INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

acoustic segment model; posteriorgram based template matching; query by example; Spoken term detection

Indexed keywords

DYNAMIC TIME WARPING; HETEROGENEOUS TESTS; ITERATIVE PROCEDURES; QUERY TERMS; QUERY-BY-EXAMPLE; SEGMENT MODELING; SEGMENT MODELS; SPEAKER NORMALIZATION; TOKENIZER;

ITERATIVE METHODS; TEMPLATE MATCHING;

SIGNAL PROCESSING;

EID: 84867600320 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6289081 Document Type: Conference Paper

Times cited : (62)

References (15)

1
- 70450160623
- A comparison of query-by-example methods for spoken term detection
- W. Shen, C.M. White, and T.J. Hazen, "A comparison of query-by-example methods for spoken term detection," Proc. INTERSPEECH, pp. 2143-2146, 2009.
- (2009) Proc. INTERSPEECH , pp. 2143-2146
- Shen, W.¹ White, C.M.² Hazen, T.J.³

2
- 85032751967
- Retrieval and browsing of spoken content
- C. Chelba, T.J. Hazen, and M. Saraçlar, "Retrieval and browsing of spoken content," IEEE Signal Processing Magazine, vol. 25, no. 3, pp. 39-49, 2008.
- (2008) IEEE Signal Processing Magazine , vol.25 , Issue.3 , pp. 39-49
- Chelba, C.¹ Hazen, T.J.² Saraçlar, M.³

3
- 77949351968
- Query-by-example spoken term detection using phonetic posteriorgram templates
- T.J. Hazen, W. Shen, and C. White, "Query-by-example spoken term detection using phonetic posteriorgram templates," Proc. ASRU, pp. 421-426, 2009.
- (2009) Proc. ASRU , pp. 421-426
- Hazen, T.J.¹ Shen, W.² White, C.³

4
- 77949473673
- Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
- Y. Zhang and J.R. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," Proc. ASRU, pp. 398-403, 2009.
- (2009) Proc. ASRU , pp. 398-403
- Zhang, Y.¹ Glass, J.R.²

5
- 34547502608
- A vector space modeling approach to spoken language identification
- H. Li, B. Ma, and C.H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. ASLP, vol. 15, no. 1, pp. 271-284, 2007.
- (2007) IEEE Trans. ASLP , vol.15 , Issue.1 , pp. 271-284
- Li, H.¹ Ma, B.² Lee, C.H.³

6
- 0030245363
- From HMM's to segment models: A unified view of stochastic modeling for speech recognition
- M. Ostendorf, V.V. Digalakis, and O.A. Kimball, "From HMM's to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. SAP, vol. 4, no. 5, pp. 360-378, 1996.
- (1996) IEEE Trans. SAP , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.V.² Kimball, O.A.³

7
- 84865757470
- Unsupervised hidden markov modeling of spoken queries for spoken term detection without speech recognition
- C. Chan and L. Lee, "Unsupervised hidden markov modeling of spoken queries for spoken term detection without speech recognition," Proc. INTERSPEECH, pp. 2141-2144, 2011.
- (2011) Proc. INTERSPEECH , pp. 2141-2144
- Chan, C.¹ Lee, L.²

8
- 80051626575
- Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection
- M. Huijbregts, M. McLaren, and D.V. Leeuwen, "Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection," Proc. ICASSP, pp. 4436-4439, 2011.
- (2011) Proc. ICASSP , pp. 4436-4439
- Huijbregts, M.¹ McLaren, M.² Leeuwen, D.V.³

9
- 44949165663
- Using posterior-based features in template matching for speech recognition
- G. Aradilla, J. Vepa, and H. Bourlard, "Using posterior-based features in template matching for speech recognition," Proc. INTERSPEECH, pp. 1186-1189, 2006.
- (2006) Proc. INTERSPEECH , pp. 1186-1189
- Aradilla, G.¹ Vepa, J.² Bourlard, H.³

10
- 51449096712
- Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
- Y. Qiao, N. Shimomura, and N. Minematsu, "Unsupervised optimal phoneme segmentation: objectives, algorithm and comparisons," Proc. ICASSP, pp. 3989-3992, 2008.
- (2008) Proc. ICASSP , pp. 3989-3992
- Qiao, Y.¹ Shimomura, N.² Minematsu, N.³

11
- 0023800699
- A segment model based approach to speech recognition
- C.H. Lee, F.K. Soong, and B.H. Juang, "A segment model based approach to speech recognition," Proc. ICASSP, pp. 501-541, 1988.
- (1988) Proc. ICASSP , pp. 501-541
- Lee, C.H.¹ Soong, F.K.² Juang, B.H.³

12
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- D.A. Reynolds and R.C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. SAP, vol. 3, no. 1, pp. 72-83, 1995.
- (1995) IEEE Trans. SAP , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

13
- 17444453660
- Language identification using Gaussian mixture model tokenization
- P.A. Torres-Carrasquillo, D.A. Reynolds, and JR Deller Jr, "Language identification using Gaussian mixture model tokenization," Proc. ICSLP, pp. 757-760, 2002.
- (2002) Proc. ICSLP , pp. 757-760
- Torres-Carrasquillo, P.A.¹ Reynolds, D.A.² Deller Jr., J.R.³

14
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.J.F.¹

15
- 33947620115
- Hierarchical structures of neural networks for phoneme recognition
- P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," Proc. ICASSP, pp. 325-328, 2006.
- (2006) Proc. ICASSP , pp. 325-328
- Schwarz, P.¹ Matejka, P.² Cernocky, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.