SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 5366-5370

Low-resource keyword search strategies for Tamil

(17) Chen, Nancy F a Ni, Chongjia a Chen, I Fan b Sivadas, Sunil a Pham, Van Tung c Xu, Haihua c Xiao, Xiong c Lau, Tze Siong c Leow, Su Jun c Lim, Boon Pang a Leung, Cheung Chi a Wang, Lei a Lee, Chin Hui b Goh, Alvina c Chng, Eng Siong c Ma, Bin a Li, Haizhou a

a INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

b GEORGIA INSTITUTE OF TECHNOLOGY (United States)

c NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

Author keywords

active learning; agglutinative languages; deep neural network (DNN); inflective languages; keyword spotting; morphology; semi supervised learning; Spoken term detection (STD); under resourced languages; unsupervised learning

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; MORPHOLOGY; SEARCH ENGINES; SPEECH COMMUNICATION; SPEECH RECOGNITION; SUPERVISED LEARNING; UNSUPERVISED LEARNING;

ACTIVE LEARNING; AGGLUTINATIVE LANGUAGE; KEYWORD SPOTTING; SEMI- SUPERVISED LEARNING; SPOKEN TERM DETECTION (STD); UNDER-RESOURCED LANGUAGES;

MODELING LANGUAGES;

EID: 84946036768 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178996 Document Type: Conference Paper

Times cited : (42)

References (31)

1
- 33646914432
- Speech and language technologies for audio indexing and retrieval
- John Makhoul, Francis Kubala, Timothy Leek, Daben Liu, Long Nguyen, Richard Schwartz, and Amit Srivastava, "Speech and language technologies for audio indexing and retrieval," Proceedings of the IEEE, vol. 88, no. 8, pp. 1338-1353, 2000.
- (2000) Proceedings of the IEEE , vol.88 , Issue.8 , pp. 1338-1353
- Makhoul, J.¹ Kubala, F.² Leek, T.³ Liu, D.⁴ Nguyen, L.⁵ Schwartz, R.⁶ Srivastava, A.⁷

2
- 0000763574
- Automatic recognition and understanding of spoken language-A first step toward natural human-machine communication
- Biing-Hwang Juang and Sadaoki Furui, "Automatic recognition and understanding of spoken language-A first step toward natural human-machine communication," Proceedings of the IEEE, vol. 88, no. 8, pp. 1142-1165, 2000.
- (2000) Proceedings of the IEEE , vol.88 , Issue.8 , pp. 1142-1165
- Juang, B.-H.¹ Furui, S.²

3
- 0025517070
- Automatic recognition of keywords in unconstrained speech using hidden markov models
- Jay G Wilpon, L Rabiner, Chin-Hui Lee, and ER Goldman, "Automatic recognition of keywords in unconstrained speech using hidden markov models," IEEE TASLP, vol. 38, no. 11, pp. 1870-1878, 1990.
- (1990) IEEE TASLP , vol.38 , Issue.11 , pp. 1870-1878
- Wilpon, J.G.¹ Rabiner, L.² Lee, C.-H.³ Goldman, E.R.⁴

4
- 0001596920
- Large-vocabulary continuous speech recognition: Advances and applications
- J Gauvain and Lori Lamel, "Large-vocabulary continuous speech recognition: advances and applications," Proceedings of the IEEE, vol. 88, no. 8, pp. 1181-1200, 2000.
- (2000) Proceedings of the IEEE , vol.88 , Issue.8 , pp. 1181-1200
- Gauvain, J.¹ Lamel, L.²

5
- 79951634009
- Results of the 2006 spoken term detection evaluation
- Jonathan G Fiscus, Jerome Ajot, John S Garofolo, and George Doddingtion, "Results of the 2006 spoken term detection evaluation," in Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational, 2007, pp. 51-55.
- (2007) Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational , pp. 51-55
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³ Doddingtion, G.⁴

6
- 0036293684
- Active learning for automatic speech recognition
- Dilek Hakkani-Tur, Giuseppe Riccardi, and Allen Gorin, "Active learning for automatic speech recognition," in Proc. IEEE ICASSP, 2002, vol. 4, pp. IV-3904.
- (2002) Proc. IEEE ICASSP , vol.4 , pp. IV-3904
- Dilek, H.-T.¹ Riccardi, G.² Gorin, A.³

7
- 0036460908
- Lightly supervised and unsupervised acoustic model training
- Lori Lamel, Jean-Luc Gauvain, and Gilles Adda, "Lightly supervised and unsupervised acoustic model training," Computer Speech &Language, vol. 16, no. 1, pp. 115-129, 2002.
- (2002) Computer Speech &Language , vol.16 , Issue.1 , pp. 115-129
- Lamel, L.¹ Gauvain, J.-L.² Adda, G.³

8
- 77950063604
- Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion
- Dong Yu, Balakrishnan Varadarajan, Li Deng, and Alex Acero, "Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion," Computer Speech &Language, vol. 24, no. 3, pp. 433-444, 2010.
- (2010) Computer Speech &Language , vol.24 , Issue.3 , pp. 433-444
- Yu, D.¹ Varadarajan, B.² Deng, L.³ Acero, A.⁴

9
- 84867616363
- N-best entropy based data selection for acoustic modeling
- Nobuyasu Itoh, Tara N Sainath, Dan Ning Jiang, Jie Zhou, and Bhuvana Ramabhadran, "N-best entropy based data selection for acoustic modeling," in Proc. IEEE ICASSP, 2012, pp. 4133-4136.
- (2012) Proc. IEEE ICASSP , pp. 4133-4136
- Itoh, N.¹ Sainath, T.N.² Jiang, D.N.³ Zhou, J.⁴ Ramabhadran, B.⁵

10
- 44849132070
- Data selection for speech recognition
- YiWu, Rong Zhang, and Alexander Rudnicky, "Data selection for speech recognition," in Proc. IEEE ASRU, 2007, pp. 562-565.
- (2007) Proc. IEEE ASRU , pp. 562-565
- Wu, Y.¹ Zhang, R.² Rudnicky, A.³

11
- 70450220373
- How to select a good training-data subset for transcription: Submodular active selection for sequences
- Hui Lin and Jeff Bilmes, "How to select a good training-data subset for transcription: Submodular active selection for sequences," in INTERSPEECH, 2009.
- (2009) INTERSPEECH
- Lin, H.¹ Bilmes, J.²

12
- 84926184611
- Using document summarization techniques for speech data subset selection
- Kai Wei, Yuzong Liu, Katrin Kirchhoff, and Jeff Bilmes, "Using document summarization techniques for speech data subset selection.," in HLT-NAACL, 2013, pp. 721-726.
- (2013) HLT-NAACL , pp. 721-726
- Wei, K.¹ Liu, Y.² Kirchhoff, K.³ Bilmes, J.⁴

13
- 84910089557
- A keyword-boosted smbr criterion to enhance keyword search performance in deep neural network based acoustic modeling
- I-Fan Chen, Nancy F Chen, and Chin-Hui Lee, "A Keyword-Boosted sMBR Criterion to Enhance Keyword Search Performance in Deep Neural Network Based Acoustic Modeling," in INTERSPEECH, 2014.
- (2014) INTERSPEECH
- Chen, I.-F.¹ Chen, N.F.² Lee, C.-H.³

14
- 84878585396
- White listing and score normalization for keyword spotting of noisy speech
- Bing Zhang, Richard M Schwartz, Stavros Tsakalidis, Long Nguyen, and Spyros Matsoukas, "White listing and score normalization for keyword spotting of noisy speech.," in INTERSPEECH, 2012.
- (2012) INTERSPEECH
- Zhang, B.¹ Schwartz, R.M.² Tsakalidis, S.³ Nguyen, L.⁴ Matsoukas, S.⁵

15
- 84912082128
- A novel keyword+lvcsr-filler based grammar network representation for spoken keyword search
- I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F Chen, and Chin-Hui Lee, "A novel keyword+lvcsr-filler based grammar network representation for spoken keyword search," in ISCSLP, 2014.
- (2014) ISCSLP
- Chen, I.-F.¹ Ni, C.² Lim, B.P.³ Chen, N.F.⁴ Lee, C.-H.⁵

16
- 36448941168
- Vocabulary independent spoken term detection
- Jonathan Mamou, Bhuvana Ramabhadran, and Olivier Siohan, "Vocabulary independent spoken term detection," in Proc. ACM SIGIR conference on Research and development in information retrieval, 2007, pp. 615-622.
- (2007) Proc. ACM SIGIR Conference on Research and Development in Information Retrieval , pp. 615-622
- Mamou, J.¹ Ramabhadran, B.² Siohan, O.³

17
- 84865351796
- Morpholexical and discriminative language models for Turkish automatic speech recognition
- Hasim Sak, Murat Saraçlar, and Tunga Gungor, "Morpholexical and discriminative language models for turkish automatic speech recognition," IEEE TASLP, vol. 20, no. 8, pp. 2341-2351, 2012.
- (2012) IEEE TASLP , vol.20 , Issue.8 , pp. 2341-2351
- Sak, H.¹ Saraçlar, M.² Gungor, T.³

18
- 84905277328
- Subwordbased modeling for handling oov words inkeyword spotting
- Yanzhang He, Brian Hutchinson, Peter Baumann, Mari Ostendorf, Eric Fosler-Lussier, and Janet Pierrehumbert, "Subwordbased modeling for handling oov words inkeyword spotting," in Proc. IEEE ICASSP, 2014, pp. 7864-7868.
- (2014) Proc. IEEE ICASSP , pp. 7864-7868
- He, Y.¹ Hutchinson, B.² Baumann, P.³ Ostendorf, M.⁴ Eric, F.-L.⁵ Pierrehumbert, J.⁶

19
- 84906247137
- Experiments towards a better lvcsr system for Tamil
- Melvin Jose Johnson Premkumar, Ngoc Thang Vu, and Tanja Schultz, "Experiments towards a better lvcsr system for tamil," in INTERSPEECH, 2013.
- (2013) INTERSPEECH
- Premkumar, M.J.J.¹ Vu, N.T.² Schultz, T.³

20
- 84938721921
- A keyword-aware grammar framework for lvcsr-based spoken keyword search
- I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F Chen, and Chin-Hui Lee, "A Keyword-Aware Grammar Framework for LVCSR-Based Spoken Keyword Search," in Proc. IEEE ICASSP, 2015.
- (2015) Proc. IEEE ICASSP
- Chen, I.-F.¹ Ni, C.² Lim, B.P.³ Chen, N.F.⁴ Lee, C.-H.⁵

21
- 84946074880
- "Morfessor 2.0.0: last accessed, August
- "Morfessor 2.0.0: http://www.cis.hut.fi/projects/morpho/morfessor2.shtml," last accessed, August 2014.
- (2014)

22
- 84858953642
- The kaldi speech recognition toolkit
- Daniel Povey et al., "The kaldi speech recognition toolkit," in Proc. of IEEE ASRU, 2011.
- (2011) Proc. of IEEE ASRU
- Povey, D.¹

23
- 84893656667
- Models of tone for tonal and non-tonal languages
- Florian Metze, Zaid A. W. Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, and Van Huy Nguyen, "Models of tone for tonal and non-tonal languages," in Proc. IEEE ASRU, Olomouc; Czech Republic, 2013.
- (2013) Proc. IEEE ASRU, Olomouc; Czech Republic
- Metze, F.¹ Sheikh, Z.A.W.² Waibel, A.³ Gehring, J.⁴ Kilgour, K.⁵ Nguyen, Q.B.⁶ Van Huy, N.⁷

24
- 84905234286
- Strategies for Vietnamese keyword search
- Nancy F Chen, Sunil Sivadas, Boon Pang Lim, Hoang Gia Ngo, Haihua Xu, Van Tung Pham, Bin Ma, and Haizhou Li, "Strategies for Vietnamese keyword search," in Proc. IEEE ICASSP, 2014, pp. 4121-4125.
- (2014) Proc. IEEE ICASSP , pp. 4121-4125
- Chen, N.F.¹ Sivadas, S.² Lim, B.P.³ Ngo, H.G.⁴ Xu, H.⁵ Van Tung, P.⁶ Ma, B.⁷ Li, H.⁸

25
- 0021645331
- Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
- Yariv Ephraim and David Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE TASLP, vol. 32, no. 6, pp. 1109-1121, 1984.
- (1984) IEEE TASLP , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

26
- 78049409301
- Subspace Gaussian mixture models for speech recognition
- Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K Goel, Martin Karafiát, Ariya Rastrow, R. C. Rose, P Schearz, and S. Thomas, "Subspace Gaussian mixture models for speech recognition," in Proc. IEEE ICASSP, 2010, pp. 4330-4333.
- (2010) Proc. IEEE ICASSP , pp. 4330-4333
- Povey, D.¹ Burget, L.² Agarwal, M.³ Akyazi, P.⁴ Feng, K.⁵ Ghoshal, A.⁶ Glembek, O.⁷ Goel, N.K.⁸ Karafiát, M.⁹ Rastrow, A.¹⁰ Rose, R.C.¹¹ Schearz, P.¹² Thomas, S.¹³

27
- 84878546297
- J. R. Novak, "Phoneticsaurus-A WFST-driven Phoneticizer. Available: https://code.google.com/p/phonetisaurus," 2012.
- (2012) Phoneticsaurus-A WFST-driven Phoneticizer
- Novak, J.R.¹

28
- 84893692703
- Score normalization and system combination for improved keyword spotting
- Damianos Karakos, Richard Schwartz, Stavros Tsakalidis, Le Zhang, Shivesh Ranjan, Tim Ng, Roger Hsiao, Guruprasad Saikumar, Ivan Bulyko, Long Nguyen, et al., "Score normalization and system combination for improved keyword spotting," in Proc. IEEE ASRU, 2013, pp. 210-215.
- (2013) Proc. IEEE ASRU , pp. 210-215
- Karakos, D.¹ Schwartz, R.² Tsakalidis, S.³ Zhang, L.⁴ Ranjan, S.⁵ Ng, T.⁶ Hsiao, R.⁷ Saikumar, G.⁸ Bulyko, I.⁹ Nguyen, L.¹⁰

29
- 84938721918
- Unsupervised data selection and word-morph mixed language model for Tamil low-resource keyword searh
- Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F Chen, and Bin Ma, "Unsupervised Data Selection and Word-Morph Mixed Language Model for Tamil Low-Resource Keyword Searh," in Proc. IEEE ICASSP, 2015.
- (2015) Proc. IEEE ICASSP
- Ni, C.¹ Leung, C.-C.² Wang, L.³ Chen, N.F.⁴ Ma, B.⁵

30
- 84946012970
- Submodular data selection with acoustic and phonetic features for automatic speech recogntion
- Chongjia Ni, LeiWang, Haibo Liu, Cheung-Chi Leung, Li Lu, and Bin M, "Submodular data selection with acoustic and phonetic features for automatic speech recogntion," in Proc. IEEE ICASSP, 2015.
- (2015) Proc. IEEE ICASSP
- Ni, C.¹ Wang, L.² Liu, H.³ Leung, C.-C.⁴ Lu, L.⁵ Bin, M.⁶

31
- 84946024082
- Language independent query-by-example spoken term detection using n-best phone sequences and partial matching
- Haihua Xu, Peng Yang, Xiao Xiong, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, and Haiz, "Language independent query-by-example spoken term detection using n-best phone sequences and partial matching," in ICASSP, 2015.
- (2015) ICASSP
- Xu, H.¹ Yang, P.² Xiong, X.³ Xie, L.⁴ Leung, C.-C.⁵ Chen, H.⁶ Yu, J.⁷ Lv, H.⁸ Wang, L.⁹ Leow, S.J.¹⁰ Ma, B.¹¹ Chng, E.S.¹² Haiz¹³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.