메뉴 건너뛰기




Volumn , Issue , 2014, Pages 4121-4125

Strategies for Vietnamese keyword search

Author keywords

audio indexing; deep neural networks (DNN); glottalization; large vocabulary continuous speech recognition (LVCSR); low resourced languages; spoken term detection

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; INDEXING (OF INFORMATION); SEARCH ENGINES;

EID: 84905234286     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854377     Document Type: Conference Paper
Times cited : (48)

References (32)
  • 1
    • 59649109709 scopus 로고    scopus 로고
    • Discriminative keyword spotting
    • Joseph Keshet, David Grangier, and Samy Bengio, "Discriminative keyword spotting," Speech Communication, vol. 51, no. 4, pp. 317-329, 2009.
    • (2009) Speech Communication , vol.51 , Issue.4 , pp. 317-329
    • Keshet, J.1    Grangier, D.2    Bengio, S.3
  • 2
    • 84906219389 scopus 로고    scopus 로고
    • A hybrid HMM/DNN Approach to keyword spotting for short words
    • I-Fan Chen and Chin-Hui Lee, "A hybrid HMM/DNN Approach to keyword spotting for short words," in Proc. of Interspeech, 2013, pp. 1574-1578.
    • (2013) Proc. of Interspeech , pp. 1574-1578
    • Chen, I.1    Lee, C.2
  • 7
    • 77949417963 scopus 로고    scopus 로고
    • Vietnamese large vocabulary continuous speech recognition
    • Ngoc Thang Vu and Tanja Schultz, "Vietnamese large vocabulary continuous speech recognition," in Proc. of IEEE ASRU. IEEE, 2009, pp. 333-338.
    • (2009) Proc. of IEEE ASRU. IEEE , pp. 333-338
    • Thang Vu, N.1    Schultz, T.2
  • 8
    • 51449101963 scopus 로고    scopus 로고
    • Openvocabulary spoken term detection using graphone-based hybrid recognition systems
    • Murat Akbacak, Dimitra Vergyri, and Andreas Stolcke, "Openvocabulary spoken term detection using graphone-based hybrid recognition systems," in Proc. of IEEE ICASSP, 2008, pp. 5240-5243.
    • (2008) Proc. of IEEE ICASSP , pp. 5240-5243
    • Akbacak, M.1    Vergyri, D.2    Stolcke, A.3
  • 11
    • 41049105254 scopus 로고    scopus 로고
    • Joint-sequence models for grapheme-to-phoneme conversion
    • Maximilian Bisani and Hermann Ney, "Joint-sequence models for grapheme-to-phoneme conversion," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
    • (2008) Speech Communication , vol.50 , Issue.5 , pp. 434-451
    • Bisani, M.1    Ney, H.2
  • 13
    • 78349290063 scopus 로고    scopus 로고
    • Comparative analysis of transliteration techniques based on statistical machine translation and joint-sequence model
    • Nam X Cao, Nhut M Pham, and Quan H Vu, "Comparative analysis of transliteration techniques based on statistical machine translation and joint-sequence model," in Proc. of Symposium on Information and Communication Technology. ACM, 2010, pp. 59-63.
    • (2010) Proc. of Symposium on Information and Communication Technology. ACM , pp. 59-63
    • Cao, N.X.1    Pham, N.M.2    Vu, Q.H.3
  • 20
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Hynek Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Am., vol. 87, pp. 1738, 1990.
    • (1990) J. Acoust. Soc. Am. , vol.87 , pp. 1738
    • Hermansky, H.1
  • 22
    • 70349209406 scopus 로고    scopus 로고
    • Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
    • Kornel Laskowski and Qin Jin, "Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum," in Proc. of IEEE ICASSP, 2009, pp. 4541-4544.
    • (2009) Proc. of IEEE ICASSP , pp. 4541-4544
    • Laskowski, K.1    Jin, Q.2
  • 25
    • 84905247416 scopus 로고    scopus 로고
    • A phonetic study of Vietnamese tones: Acoustic and electroglottographic measurements
    • Vu Ngoc Tuan, Christophe d'Alessandro, and Sophie Rosset, "A phonetic study of Vietnamese tones: Acoustic and electroglottographic measurements," in Proc. of Interspeech, 2002.
    • (2002) Proc. of Interspeech
    • Ngoc Tuan, V.1    D'alessandro, C.2    Rosset, S.3
  • 26
    • 0031023993 scopus 로고    scopus 로고
    • Glottal characteristics of female speakers: Acoustic correlates
    • Helen M Hanson, "Glottal characteristics of female speakers: Acoustic correlates," J. Acoust. Soc. Am., vol. 101, pp. 466, 1997.
    • (1997) J. Acoust. Soc. Am. , vol.101 , pp. 466
    • Hanson, H.M.1
  • 29
    • 0141589488 scopus 로고    scopus 로고
    • SRILM-an extensible language modeling toolkit
    • Andreas Stolcke et al., "SRILM-an extensible language modeling toolkit," in Proc. of Interspeech, 2002.
    • (2002) Proc. of Interspeech
    • Stolcke, A.1
  • 32
    • 2442562479 scopus 로고    scopus 로고
    • Segmental minimum Bayes-risk decoding for automatic speech recognition
    • Vaibhava Goel, Shankar Kumar, andWilliam Byrne, "Segmental minimum Bayes-risk decoding for automatic speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 12, no. 3, pp. 234-249, 2004.
    • (2004) Speech and Audio Processing, IEEE Transactions on , vol.12 , Issue.3 , pp. 234-249
    • Goel, V.1    Kumar, S.2    Byrne, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.