메뉴 건너뛰기




Volumn , Issue , 2012, Pages 5157-5160

An acoustic segment modeling approach to query-by-example spoken term detection

Author keywords

acoustic segment model; posteriorgram based template matching; query by example; Spoken term detection

Indexed keywords

DYNAMIC TIME WARPING; HETEROGENEOUS TESTS; ITERATIVE PROCEDURES; QUERY TERMS; QUERY-BY-EXAMPLE; SEGMENT MODELING; SEGMENT MODELS; SPEAKER NORMALIZATION; TOKENIZER;

EID: 84867600320     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6289081     Document Type: Conference Paper
Times cited : (62)

References (15)
  • 1
    • 70450160623 scopus 로고    scopus 로고
    • A comparison of query-by-example methods for spoken term detection
    • W. Shen, C.M. White, and T.J. Hazen, "A comparison of query-by-example methods for spoken term detection," Proc. INTERSPEECH, pp. 2143-2146, 2009.
    • (2009) Proc. INTERSPEECH , pp. 2143-2146
    • Shen, W.1    White, C.M.2    Hazen, T.J.3
  • 3
    • 77949351968 scopus 로고    scopus 로고
    • Query-by-example spoken term detection using phonetic posteriorgram templates
    • T.J. Hazen, W. Shen, and C. White, "Query-by-example spoken term detection using phonetic posteriorgram templates," Proc. ASRU, pp. 421-426, 2009.
    • (2009) Proc. ASRU , pp. 421-426
    • Hazen, T.J.1    Shen, W.2    White, C.3
  • 4
    • 77949473673 scopus 로고    scopus 로고
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
    • Y. Zhang and J.R. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," Proc. ASRU, pp. 398-403, 2009.
    • (2009) Proc. ASRU , pp. 398-403
    • Zhang, Y.1    Glass, J.R.2
  • 5
    • 34547502608 scopus 로고    scopus 로고
    • A vector space modeling approach to spoken language identification
    • H. Li, B. Ma, and C.H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. ASLP, vol. 15, no. 1, pp. 271-284, 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.1 , pp. 271-284
    • Li, H.1    Ma, B.2    Lee, C.H.3
  • 6
    • 0030245363 scopus 로고    scopus 로고
    • From HMM's to segment models: A unified view of stochastic modeling for speech recognition
    • M. Ostendorf, V.V. Digalakis, and O.A. Kimball, "From HMM's to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. SAP, vol. 4, no. 5, pp. 360-378, 1996.
    • (1996) IEEE Trans. SAP , vol.4 , Issue.5 , pp. 360-378
    • Ostendorf, M.1    Digalakis, V.V.2    Kimball, O.A.3
  • 7
    • 84865757470 scopus 로고    scopus 로고
    • Unsupervised hidden markov modeling of spoken queries for spoken term detection without speech recognition
    • C. Chan and L. Lee, "Unsupervised hidden markov modeling of spoken queries for spoken term detection without speech recognition," Proc. INTERSPEECH, pp. 2141-2144, 2011.
    • (2011) Proc. INTERSPEECH , pp. 2141-2144
    • Chan, C.1    Lee, L.2
  • 8
    • 80051626575 scopus 로고    scopus 로고
    • Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection
    • M. Huijbregts, M. McLaren, and D.V. Leeuwen, "Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection," Proc. ICASSP, pp. 4436-4439, 2011.
    • (2011) Proc. ICASSP , pp. 4436-4439
    • Huijbregts, M.1    McLaren, M.2    Leeuwen, D.V.3
  • 9
    • 44949165663 scopus 로고    scopus 로고
    • Using posterior-based features in template matching for speech recognition
    • G. Aradilla, J. Vepa, and H. Bourlard, "Using posterior-based features in template matching for speech recognition," Proc. INTERSPEECH, pp. 1186-1189, 2006.
    • (2006) Proc. INTERSPEECH , pp. 1186-1189
    • Aradilla, G.1    Vepa, J.2    Bourlard, H.3
  • 10
    • 51449096712 scopus 로고    scopus 로고
    • Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
    • Y. Qiao, N. Shimomura, and N. Minematsu, "Unsupervised optimal phoneme segmentation: objectives, algorithm and comparisons," Proc. ICASSP, pp. 3989-3992, 2008.
    • (2008) Proc. ICASSP , pp. 3989-3992
    • Qiao, Y.1    Shimomura, N.2    Minematsu, N.3
  • 11
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • C.H. Lee, F.K. Soong, and B.H. Juang, "A segment model based approach to speech recognition," Proc. ICASSP, pp. 501-541, 1988.
    • (1988) Proc. ICASSP , pp. 501-541
    • Lee, C.H.1    Soong, F.K.2    Juang, B.H.3
  • 12
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D.A. Reynolds and R.C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. SAP, vol. 3, no. 1, pp. 72-83, 1995.
    • (1995) IEEE Trans. SAP , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 14
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 15
    • 33947620115 scopus 로고    scopus 로고
    • Hierarchical structures of neural networks for phoneme recognition
    • P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," Proc. ICASSP, pp. 325-328, 2006.
    • (2006) Proc. ICASSP , pp. 325-328
    • Schwarz, P.1    Matejka, P.2    Cernocky, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.