SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2014, Pages 4121-4125

Strategies for Vietnamese keyword search

Authors (8): Chen, Nancy F. (a); Sivadas, Sunil (a); Lim, Boon Pang (a); Ngo, Hoang Gia (b); Xu, Haihua (c); Pham, Van Tung (c); Ma, Bin (a); Li, Haizhou (a)

a INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

b NATIONAL UNIVERSITY OF SINGAPORE (Singapore)

c NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

Author keywords

audio indexing; deep neural networks (DNN); glottalization; large vocabulary continuous speech recognition (LVCSR); low resourced languages; spoken term detection

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; INDEXING (OF INFORMATION); SEARCH ENGINES;

AUDIO INDEXING; DEEP NEURAL NETWORKS; GLOTTALIZATION; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; SPOKEN TERM DETECTIONS;

SIGNAL PROCESSING;

EID: 84905234286 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854377 Document Type: Conference Paper

Times cited : (48)

References (32)

1
- 59649109709
- Discriminative keyword spotting
- Joseph Keshet, David Grangier, and Samy Bengio, "Discriminative keyword spotting," Speech Communication, vol. 51, no. 4, pp. 317-329, 2009.
- (2009) Speech Communication , vol.51 , Issue.4 , pp. 317-329
- Keshet, J.¹ Grangier, D.² Bengio, S.³

2
- 84906219389
- A hybrid HMM/DNN Approach to keyword spotting for short words
- I-Fan Chen and Chin-Hui Lee, "A hybrid HMM/DNN Approach to keyword spotting for short words," in Proc. of Interspeech, 2013, pp. 1574-1578.
- (2013) Proc. of Interspeech , pp. 1574-1578
- Chen, I.¹ Lee, C.²

3
- 79951634009
- Results of the 2006 spoken term detection evaluation
- Jonathan G Fiscus, Jerome Ajot, John S Garofolo, and George Doddington, "Results of the 2006 spoken term detection evaluation," in Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational, 2007, pp. 51-55.
- (2007) Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational , pp. 51-55
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³ Doddington, G.⁴

4
- 84874276847
- The Kaldi speech recognition toolkit
- Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, et al., "The Kaldi speech recognition toolkit," in Proc. of IEEE ASRU, 2011.
- (2011) Proc. of IEEE ASRU
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlicek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰

5
- 84905247417
- Voice quality dependent speech recognition
- Tae-Jin Yoon, Xiaodan Zhuang, Jennifer Cole, and Mark Hasegawa-Johnson, "Voice quality dependent speech recognition," in International Symposium on Linguistic Patterns in Spontaneous Speech, 2008.
- (2008) International Symposium on Linguistic Patterns in Spontaneous Speech
- Yoon, T.¹ Zhuang, X.² Cole, J.³ Hasegawa-Johnson, M.⁴

6
- 84906214232
- A study on LVCSR and keyword search for Tagalog
- Korbinian Riedhammer, Van Hai Do, and James Hieronymus, "A study on LVCSR and keyword search for Tagalog," in Proc. of Interspeech, 2013.
- (2013) Proc. of Interspeech
- Riedhammer, K.¹ Do, V.H.² Hieronymus, J.³

7
- 77949417963
- Vietnamese large vocabulary continuous speech recognition
- Ngoc Thang Vu and Tanja Schultz, "Vietnamese large vocabulary continuous speech recognition," in Proc. of IEEE ASRU. IEEE, 2009, pp. 333-338.
- (2009) Proc. of IEEE ASRU. IEEE , pp. 333-338
- Thang Vu, N.¹ Schultz, T.²

8
- 51449101963
- Open-vocabulary spoken term detection using graphone-based hybrid recognition systems
- Murat Akbacak, Dimitra Vergyri, and Andreas Stolcke, "Open-vocabulary spoken term detection using graphone-based hybrid recognition systems," in Proc. of IEEE ICASSP, 2008, pp. 5240-5243.
- (2008) Proc. of IEEE ICASSP , pp. 5240-5243
- Akbacak, M.¹ Vergyri, D.² Stolcke, A.³

9
- 70349211775
- Effect of pronounciations on OOV queries in spoken term detection
- Dogan Can, Erica Cooper, Abhinav Sethy, Chris White, Bhuvana Ramabhadran, and Murat Saraclar, "Effect of pronounciations on OOV queries in spoken term detection," in Proc. of IEEE ICASSP, 2009, pp. 3957-3960.
- (2009) Proc. of IEEE ICASSP , pp. 3957-3960
- Can, D.¹ Cooper, E.² Sethy, A.³ White, C.⁴ Ramabhadran, B.⁵ Saraclar, M.⁶

10
- 85149114440
- A joint source-channel model for machine transliteration
- Li Haizhou, Zhang Min, and Su Jian, "A joint source-channel model for machine transliteration," in Proc. of the 42nd Annual Meeting on Association for Computational Linguistics, 2004.
- (2004) Proc. of the 42nd Annual Meeting on Association for Computational Linguistics
- Haizhou, L.¹ Min, Z.² Jian, S.³

11
- 41049105254
- Joint-sequence models for grapheme-to-phoneme conversion
- Maximilian Bisani and Hermann Ney, "Joint-sequence models for grapheme-to-phoneme conversion," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
- (2008) Speech Communication , vol.50 , Issue.5 , pp. 434-451
- Bisani, M.¹ Ney, H.²

12
- 0011898389
- Translation
- Warren Weaver, "Translation," Machine translation of languages, vol. 14, pp. 15-23, 1955.
- (1955) Machine Translation of Languages , vol.14 , pp. 15-23
- Weaver, W.¹

13
- 78349290063
- Comparative analysis of transliteration techniques based on statistical machine translation and joint-sequence model
- Nam X Cao, Nhut M Pham, and Quan H Vu, "Comparative analysis of transliteration techniques based on statistical machine translation and joint-sequence model," in Proc. of Symposium on Information and Communication Technology. ACM, 2010, pp. 59-63.
- (2010) Proc. of Symposium on Information and Communication Technology. ACM , pp. 59-63
- Cao, N.X.¹ Pham, N.M.² Vu, Q.H.³

14
- 84890542302
- Exploiting diversity for spoken term detection
- Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, Brian Kingsbury, and George Saon, "Exploiting diversity for spoken term detection," in Proc. of IEEE ICASSP, 2013.
- (2013) Proc. of IEEE ICASSP
- Mangu, L.¹ Soltau, H.² Kuo, H.³ Kingsbury, B.⁴ Saon, G.⁵

15
- 84906281193
- On the calibration and fusion of heterogeneous spoken term detection systems
- Alberto Abad, Luis Javier Rodríguez-Fuentes, Mikel Penagarikano, Amparo Varona, and Germán Bordel, "On the calibration and fusion of heterogeneous spoken term detection systems," in Proc. of Interspeech, 2013.
- (2013) Proc. of Interspeech
- Abad, A.¹ Rodríguez-Fuentes, L.J.² Penagarikano, M.³ Varona, A.⁴ Bordel, G.⁵

16
- 70349253031
- Combination of multiple searches
- E. A. Fox and J. A. Shaw, "Combination of multiple searches," NIST Special Publication SP, pp. 243-243, 1994.
- (1994) NIST Special Publication SP , pp. 243-243
- Fox, E.A.¹ Shaw, J.A.²

17
- 84890537373
- A high performance Cantonese keyword search system
- Brian Kingsbury, Jia Cui, Xiaodong Cui, Mark JF Gales, Kate Knill, Jonathan Mamou, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schluter, Abhinav Sethy, and Phillip C. Woodland, "A High Performance Cantonese Keyword Search System," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Kingsbury, B.¹ Cui, J.² Cui, X.³ Gales, M.J.⁴ Knill, K.⁵ Mamou, J.⁶ Mangu, L.⁷ Nolden, D.⁸ Picheny, M.⁹ Ramabhadran, B.¹⁰ Schluter, R.¹¹ Sethy, A.¹² Woodland, P.C.¹³

18
- 78049409301
- Subspace Gaussian mixture models for speech recognition
- Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K Goel, Martin Karafiát, Ariya Rastrow, R. C. Rose, P. Schwarz, and S. Thomas, "Subspace Gaussian mixture models for speech recognition," in IEEE International Conference on Acoustics Speech and Signal Processing. IEEE, 2010, pp. 4330-4333.
- (2010) IEEE International Conference on Acoustics Speech and Signal Processing. IEEE , pp. 4330-4333
- Povey, D.¹ Burget, L.² Agarwal, M.³ Akyazi, P.⁴ Feng, K.⁵ Ghoshal, A.⁶ Glembek, O.⁷ Goel, N.K.⁸ Karafiát, M.⁹ Rastrow, A.¹⁰ Rose, R.C.¹¹ Schwarz, P.¹² Thomas, S.¹³

19
- 79958161623
- A robust real-time sound source localization system for Olivia robot
- Nguyen Trung Hieu, Shengkui Zhao, Eng Siong Chng, and Haizhou Li, "A Robust Real-time Sound Source Localization System for Olivia Robot," in 2010 APSIPA Annual Summit and Conference, 2010.
- (2010) 2010 APSIPA Annual Summit and Conference
- Nguyen, T.H.¹ Zhao, S.² Chng, E.S.³ Li, H.⁴

20
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hynek Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Am., vol. 87, pp. 1738, 1990.
- (1990) J. Acoust. Soc. Am. , vol.87 , pp. 1738
- Hermansky, H.¹

21
- 84905283451
- New methods in continuous Mandarin speech recognition
- C Julian Chen, Ramesh A Gopinath, Michael D Monkowski, Michael A Picheny, and Katherine Shen, "New methods in continuous Mandarin speech recognition," in Proc. of Eurospeech, 1997.
- (1997) Proc. of Eurospeech
- Chen, C.J.¹ Gopinath, R.A.² Monkowski, M.D.³ Picheny, M.A.⁴ Shen, K.⁵

22
- 70349209406
- Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
- Kornel Laskowski and Qin Jin, "Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum," in Proc. of IEEE ICASSP, 2009, pp. 4541-4544.
- (2009) Proc. of IEEE ICASSP , pp. 4541-4544
- Laskowski, K.¹ Jin, Q.²

23
- 84893656667
- Models of tone for tonal and non-tonal languages
- Florian Metze, Zaid A. W. Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, and Van Huy Nguyen, "Models of tone for tonal and non-tonal languages," in Proc. IEEE ASRU, Olomouc, Czech Republic, 2013.
- (2013) Proc. IEEE ASRU
- Metze, F.¹ Sheikh, Z.A.W.² Waibel, A.³ Gehring, J.⁴ Kilgour, K.⁵ Nguyen, Q.B.⁶ Nguyen, V.H.⁷

24
- 33745220757
- Influence of F0 on Vietnamese syllable perception
- Do Dat Tran, Eric Castelli, Jean-François Serignat, Van Loan Trinh, and Le Xuan Hung, "Influence of F0 on Vietnamese syllable perception," in Proc. of Interspeech, 2005, pp. 1697-1700.
- (2005) Proc. of Interspeech , pp. 1697-1700
- Dat Tran, D.¹ Castelli, E.² Serignat, J.³ Van Loan, Trinh.⁴ Xuan Hung, L.⁵

25
- 84905247416
- A phonetic study of Vietnamese tones: Acoustic and electroglottographic measurements
- Vu Ngoc Tuan, Christophe d'Alessandro, and Sophie Rosset, "A phonetic study of Vietnamese tones: Acoustic and electroglottographic measurements," in Proc. of Interspeech, 2002.
- (2002) Proc. of Interspeech
- Ngoc Tuan, V.¹ d'Alessandro, C.² Rosset, S.³

26
- 0031023993
- Glottal characteristics of female speakers: Acoustic correlates
- Helen M Hanson, "Glottal characteristics of female speakers: Acoustic correlates," J. Acoust. Soc. Am., vol. 101, pp. 466, 1997.
- (1997) J. Acoust. Soc. Am. , vol.101 , pp. 466
- Hanson, H.M.¹

27
- 84905247412
- A phonological contrastive study of Vietnamese and English
- Thi-Quynh-Hoa Hoang, "A phonological contrastive study of Vietnamese and English," MA Thesis, Texas Technology College, 1965.
- (1965) MA Thesis, Texas Technology College
- Hoang, T.¹

28
- 84906274730
- Sequence-discriminative training of deep neural networks
- K. Vesely, A. Ghoshal, L. Burget, and D. Povey, "Sequence-discriminative training of deep neural networks," in Proceedings of Interspeech, 2013.
- (2013) Proceedings of Interspeech
- Vesely, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

29
- 0141589488
- SRILM-an extensible language modeling toolkit
- Andreas Stolcke et al., "SRILM-an extensible language modeling toolkit," in Proc. of Interspeech, 2002.
- (2002) Proc. of Interspeech
- Stolcke, A.¹

30
- 80052042597
- Lattice indexing for spoken term detection
- Dogan Can and Murat Saraclar, "Lattice indexing for spoken term detection," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 8, pp. 2338-2347, 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.8 , pp. 2338-2347
- Can, D.¹ Saraclar, M.²

31
- 43849104109
- Rapid and accurate spoken term detection
- David RH Miller, Michael Kleber, Chia-Lin Kao, Owen Kimball, Thomas Colthurst, Stephen A Lowe, Richard M Schwartz, and Herbert Gish, "Rapid and accurate spoken term detection," in Proc. of Interspeech, 2007, pp. 314-317.
- (2007) Proc. of Interspeech , pp. 314-317
- Miller, D.R.H.¹ Kleber, M.² Kao, C.³ Kimball, O.⁴ Colthurst, T.⁵ Lowe, S.A.⁶ Schwartz, R.M.⁷ Gish, H.⁸

32
- 2442562479
- Segmental minimum Bayes-risk decoding for automatic speech recognition
- Vaibhava Goel, Shankar Kumar, and William Byrne, "Segmental minimum Bayes-risk decoding for automatic speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 3, pp. 234-249, 2004.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.3 , pp. 234-249
- Goel, V.¹ Kumar, S.² Byrne, W.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.