SCOPUS Information Retrieval Platform - View Paper

2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings

2014, Pages 530-535

A keyword search system using open source software

(12) Trmal, Jan (a); Chen, Guoguo (a); Povey, Dan (a); Khudanpur, Sanjeev (a); Ghahremani, Pegah (a); Zhang, Xiaohui (a); Manohar, Vimal (a); Liu, Chunxi (a); Jansen, Aren (a); Klakow, Dietrich (b); Yarowsky, David (a); Metze, Florian (c)

a JOHNS HOPKINS UNIVERSITY (United States)

b SAARLAND UNIVERSITY (Germany)

c CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

Deep neural networks; IARPA BABEL; Kaldi; Keyword search; OpenKWS; Pitch; Speech recognition; Spoken term detection

Indexed keywords

COMPUTATIONAL LINGUISTICS; CONTINUOUS SPEECH RECOGNITION; LINGUISTICS; OPEN SOURCE SOFTWARE; OPEN SYSTEMS; SEARCH ENGINES; SOFTWARE ENGINEERING;

DEEP NEURAL NETWORKS; IARPA BABEL; KALDI; KEYWORD SEARCH; OPENKWS; PITCH; SPOKEN TERM DETECTIONS;

SPEECH RECOGNITION;

EID: 84938721908 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SLT.2014.7078630 Document Type: Conference Paper

Times cited : (43)

References (14)

1
- 80051618754
- A symmetrization of the subspace gaussian mixture model
- D. Povey, M. Karafiat, A. Ghoshal, and P. Schwarz, "A symmetrization of the Subspace Gaussian Mixture Model," in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, May 2011, pp. 4504-4507.
- (2011) Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on , pp. 4504-4507
- Povey, D.¹ Karafiat, M.² Ghoshal, A.³ Schwarz, P.⁴

2
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for Model and Feature-space Discriminative Training," in Acoustics, Speech and Signal Processing (ICASSP), 2008 IEEE International Conference on, March 2008, pp. 4057-4060.
- (2008) Acoustics, Speech and Signal Processing (ICASSP), 2008 IEEE International Conference on , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

3
- 84905239342
- Improving deep neural network acoustic models using generalized maxout networks
- X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving Deep Neural Network Acoustic Models Using Generalized Maxout Networks," in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, Florence, Italy, 2014.
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
- Zhang, X.¹ Trmal, J.² Povey, D.³ Khudanpur, S.⁴

4
- 80052042597
- Lattice indexing for spoken term detection
- D. Can and M. Saraclar, "Lattice Indexing for Spoken Term Detection," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 8, pp. 2338-2347, Nov 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.8 , pp. 2338-2347
- Can, D.¹ Saraclar, M.²

5
- 84867616340
- Generating exact lattices in the wfst framework
- D. Povey, M. Hannemann, G. Boulianne, L. Burget, A. Ghoshal, M. Janda, M. Karafiat, S. Kombrink, P. Motlicek, Yanmin Qian, K. Riedhammer, K. Vesely, and Ngoc Thang Vu, "Generating Exact Lattices in the WFST framework," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, March 2012, pp. 4213-4216.
- (2012) Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , pp. 4213-4216
- Povey, D.¹ Hannemann, M.² Boulianne, G.³ Burget, L.⁴ Ghoshal, A.⁵ Janda, M.⁶ Karafiat, M.⁷ Kombrink, S.⁸ Motlicek, P.⁹ Qian, Y.¹⁰ Riedhammer, K.¹¹ Vesely, K.¹² Thang Vu, N.¹³

6
- 84893698649
- Using proxies for OOV keywords in the keyword search task
- G. Chen, O. Yilmaz, J. Trmal, D. Povey, and S. Khudanpur, "Using Proxies for OOV Keywords in the Keyword Search Task," in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on, Dec 2013, pp. 416-421.
- (2013) Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on , pp. 416-421
- Chen, G.¹ Yilmaz, O.² Trmal, J.³ Povey, D.⁴ Khudanpur, S.⁵

7
- 69249145662
- Point process models for spotting keywords in continuous speech
- A. Jansen and P. Niyogi, "Point Process Models for Spotting Keywords in Continuous Speech," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, no. 8, pp. 1457-1470, Nov 2009.
- (2009) Audio, Speech, and Language Processing, IEEE Transactions on , vol.17 , Issue.8 , pp. 1457-1470
- Jansen, A.¹ Niyogi, P.²

8
- 84905252790
- A pitch extraction algorithm tuned for automatic speech recognition
- P. Ghahremani, B. BabaAli, D. Povey, K. Riedhammer, J. Trmal, and S. Khudanpur, "A Pitch Extraction Algorithm Tuned for Automatic Speech Recognition," Proceeding of Int. Conf. ICASSP 2014, 2014.
- (2014) Proceeding of Int. Conf. ICASSP 2014
- Ghahremani, P.¹ Babaali, B.² Povey, D.³ Riedhammer, K.⁴ Trmal, J.⁵ Khudanpur, S.⁶

9
- 84893650076
- Semisupervised training of deep neural networks
- K. Veselý, M. Hannemann, and L. Burget, "Semisupervised Training of Deep Neural Networks," in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. 2013, pp. 267-272, IEEE Signal Processing Society.
- (2013) Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on , pp. 267-272
- Veselý, K.¹ Hannemann, M.² Burget, L.³

10
- 41049105254
- Joint-sequence models for grapheme-to-phoneme conversion
- M. Bisani and H. Ney, "Joint-sequence Models for Grapheme-to-phoneme Conversion," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
- (2008) Speech Communication , vol.50 , Issue.5 , pp. 434-451
- Bisani, M.¹ Ney, H.²

11
- 85022919385
- Class-Based N-Gram models of natural language
- P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai, "Class-based N-gram Models of Natural Language," Comput. Linguist., vol. 18, no. 4, pp. 467-479, Dec. 1992.
- (1992) Comput. Linguist. , vol.18 , Issue.4 , pp. 467-479
- Brown, P.F.¹ Desouza, P.V.² Mercer, R.L.³ Della Pietra, V.J.⁴ Lai, J.C.⁵

12
- 84905234179
- Featherweight phonetic keyword search for conversational speech
- K. Kintzley, A. Jansen, and H. Hermansky, "Featherweight Phonetic Keyword Search for Conversational Speech," in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, 2014.
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
- Kintzley, K.¹ Jansen, A.² Hermansky, H.³

13
- 79953250475
- Minimum bayes risk decoding and system combination based on a recursion for edit distance
- H. Xu, D. Povey, L. Mangu, and J. Zhu, "Minimum Bayes Risk Decoding and System Combination Based on a Recursion for Edit Distance," Computer Speech & Language, vol. 25, no. 4, pp. 802-828, 2011.
- (2011) Computer Speech & Language , vol.25 , Issue.4 , pp. 802-828
- Xu, H.¹ Povey, D.² Mangu, L.³ Zhu, J.⁴

14
- 4544253834
- Posterior probability decoding, confidence estimation and system combination
- G. Evermann and P. C. Woodland, "Posterior Probability Decoding, Confidence Estimation and System Combination," in Proc. Speech Transcription Workshop. Baltimore, 2000, vol. 27.
- (2000) Proc. Speech Transcription Workshop , vol.27
- Evermann, G.¹ Woodland, P.C.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.