-
1
-
-
59649109709
-
Discriminative keyword spotting
-
Joseph Keshet, David Grangier, and Samy Bengio, "Discriminative keyword spotting," Speech Communication, vol. 51, no. 4, pp. 317-329, 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.4
, pp. 317-329
-
-
Keshet, J.1
Grangier, D.2
Bengio, S.3
-
2
-
-
84906219389
-
A hybrid HMM/DNN approach to keyword spotting for short words
-
I-Fan Chen and Chin-Hui Lee, "A hybrid HMM/DNN approach to keyword spotting for short words," in Proc. of Interspeech, 2013, pp. 1574-1578.
-
(2013)
Proc. of Interspeech
, pp. 1574-1578
-
-
Chen, I.1
Lee, C.2
-
3
-
-
79951634009
-
Results of the 2006 spoken term detection evaluation
-
Jonathan G Fiscus, Jerome Ajot, John S Garofolo, and George Doddington, "Results of the 2006 spoken term detection evaluation," in Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational, 2007, pp. 51-55.
-
(2007)
Proceedings of ACM SIGIR Workshop on Searching Spontaneous Conversational
, pp. 51-55
-
-
Fiscus, J.G.1
Ajot, J.2
Garofolo, J.S.3
Doddington, G.4
-
4
-
-
84874276847
-
The Kaldi speech recognition toolkit
-
Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, et al., "The Kaldi speech recognition toolkit," in Proc. of IEEE ASRU, 2011.
-
(2011)
Proc. of IEEE ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
-
5
-
-
84905247417
-
Voice quality dependent speech recognition
-
Tae-Jin Yoon, Xiaodan Zhuang, Jennifer Cole, and Mark Hasegawa-Johnson, "Voice quality dependent speech recognition," in International Symposium on Linguistic Patterns in Spontaneous Speech, 2008.
-
(2008)
International Symposium on Linguistic Patterns in Spontaneous Speech
-
-
Yoon, T.1
Zhuang, X.2
Cole, J.3
Hasegawa-Johnson, M.4
-
7
-
-
77949417963
-
Vietnamese large vocabulary continuous speech recognition
-
Ngoc Thang Vu and Tanja Schultz, "Vietnamese large vocabulary continuous speech recognition," in Proc. of IEEE ASRU. IEEE, 2009, pp. 333-338.
-
(2009)
Proc. of IEEE ASRU. IEEE
, pp. 333-338
-
-
Thang Vu, N.1
Schultz, T.2
-
8
-
-
51449101963
-
Open-vocabulary spoken term detection using graphone-based hybrid recognition systems
-
Murat Akbacak, Dimitra Vergyri, and Andreas Stolcke, "Open-vocabulary spoken term detection using graphone-based hybrid recognition systems," in Proc. of IEEE ICASSP, 2008, pp. 5240-5243.
-
(2008)
Proc. of IEEE ICASSP
, pp. 5240-5243
-
-
Akbacak, M.1
Vergyri, D.2
Stolcke, A.3
-
9
-
-
70349211775
-
Effect of pronunciations on OOV queries in spoken term detection
-
Dogan Can, Erica Cooper, Abhinav Sethy, Chris White, Bhuvana Ramabhadran, and Murat Saraclar, "Effect of pronunciations on OOV queries in spoken term detection," in Proc. of IEEE ICASSP, 2009, pp. 3957-3960.
-
(2009)
Proc. of IEEE ICASSP
, pp. 3957-3960
-
-
Can, D.1
Cooper, E.2
Sethy, A.3
White, C.4
Ramabhadran, B.5
Saraclar, M.6
-
11
-
-
41049105254
-
Joint-sequence models for grapheme-to-phoneme conversion
-
Maximilian Bisani and Hermann Ney, "Joint-sequence models for grapheme-to-phoneme conversion," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
-
(2008)
Speech Communication
, vol.50
, Issue.5
, pp. 434-451
-
-
Bisani, M.1
Ney, H.2
-
13
-
-
78349290063
-
Comparative analysis of transliteration techniques based on statistical machine translation and joint-sequence model
-
Nam X Cao, Nhut M Pham, and Quan H Vu, "Comparative analysis of transliteration techniques based on statistical machine translation and joint-sequence model," in Proc. of Symposium on Information and Communication Technology. ACM, 2010, pp. 59-63.
-
(2010)
Proc. of Symposium on Information and Communication Technology. ACM
, pp. 59-63
-
-
Cao, N.X.1
Pham, N.M.2
Vu, Q.H.3
-
14
-
-
84890542302
-
Exploiting diversity for spoken term detection
-
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, Brian Kingsbury, and George Saon, "Exploiting diversity for spoken term detection," in Proc. of IEEE ICASSP, 2013.
-
(2013)
Proc. of IEEE ICASSP
-
-
Mangu, L.1
Soltau, H.2
Kuo, H.3
Kingsbury, B.4
Saon, G.5
-
15
-
-
84906281193
-
On the calibration and fusion of heterogeneous spoken term detection systems
-
Alberto Abad, Luis Javier Rodríguez-Fuentes, Mikel Penagarikano, Amparo Varona, and Germán Bordel, "On the calibration and fusion of heterogeneous spoken term detection systems," in Proc. of Interspeech, 2013.
-
(2013)
Proc. of Interspeech
-
-
Abad, A.1
Rodríguez-Fuentes, L.J.2
Penagarikano, M.3
Varona, A.4
Bordel, G.5
-
17
-
-
84890537373
-
A high performance Cantonese keyword search system
-
Brian Kingsbury, Jia Cui, Xiaodong Cui, Mark JF Gales, Kate Knill, Jonathan Mamou, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, and Philip C. Woodland, "A high performance Cantonese keyword search system," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Kingsbury, B.1
Cui, J.2
Cui, X.3
Gales, M.J.4
Knill, K.5
Mamou, J.6
Mangu, L.7
Nolden, D.8
Picheny, M.9
Ramabhadran, B.10
Schlüter, R.11
Sethy, A.12
Woodland, P.C.13
-
18
-
-
78049409301
-
Subspace Gaussian mixture models for speech recognition
-
Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K Goel, Martin Karafiát, Ariya Rastrow, R. C. Rose, Petr Schwarz, and S. Thomas, "Subspace Gaussian mixture models for speech recognition," in IEEE International Conference on Acoustics Speech and Signal Processing. IEEE, 2010, pp. 4330-4333.
-
(2010)
IEEE International Conference on Acoustics Speech and Signal Processing. IEEE
, pp. 4330-4333
-
-
Povey, D.1
Burget, L.2
Agarwal, M.3
Akyazi, P.4
Feng, K.5
Ghoshal, A.6
Glembek, O.7
Goel, N.K.8
Karafiát, M.9
Rastrow, A.10
Rose, R.C.11
Schwarz, P.12
Thomas, S.13
-
20
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
Hynek Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Am., vol. 87, pp. 1738, 1990.
-
(1990)
J. Acoust. Soc. Am.
, vol.87
, pp. 1738
-
-
Hermansky, H.1
-
21
-
-
84905283451
-
New methods in continuous Mandarin speech recognition
-
C Julian Chen, Ramesh A Gopinath, Michael D Monkowski, Michael A Picheny, and Katherine Shen, "New methods in continuous Mandarin speech recognition," in Proc. of Eurospeech, 1997.
-
(1997)
Proc. of Eurospeech
-
-
Julian Chen, C.1
Gopinath, R.A.2
Monkowski, M.D.3
Picheny, M.A.4
Shen, K.5
-
22
-
-
70349209406
-
Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
-
Kornel Laskowski and Qin Jin, "Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum," in Proc. of IEEE ICASSP, 2009, pp. 4541-4544.
-
(2009)
Proc. of IEEE ICASSP
, pp. 4541-4544
-
-
Laskowski, K.1
Jin, Q.2
-
23
-
-
84893656667
-
Models of tone for tonal and non-tonal languages
-
Olomouc; Czech Republic
-
Florian Metze, Zaid A. W. Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, and Van Huy Nguyen, "Models of tone for tonal and non-tonal languages," in Proc. IEEE ASRU, Olomouc; Czech Republic, 2013.
-
(2013)
Proc. IEEE ASRU
-
-
Metze, F.1
Sheikh, Z.A.W.2
Waibel, A.3
Gehring, J.4
Kilgour, K.5
Bao Nguyen, Q.6
Huy Nguyen, V.7
-
24
-
-
33745220757
-
Influence of F0 on Vietnamese syllable perception
-
Do Dat Tran, Eric Castelli, Jean-François Serignat, Van Loan Trinh, and Le Xuan Hung, "Influence of F0 on Vietnamese syllable perception," in Proc. of Interspeech, 2005, pp. 1697-1700.
-
(2005)
Proc. of Interspeech
, pp. 1697-1700
-
-
Dat Tran, D.1
Castelli, E.2
Serignat, J.3
Loan Trinh, V.4
Xuan Hung, L.5
-
25
-
-
84905247416
-
A phonetic study of Vietnamese tones: Acoustic and electroglottographic measurements
-
Vu Ngoc Tuan, Christophe d'Alessandro, and Sophie Rosset, "A phonetic study of Vietnamese tones: Acoustic and electroglottographic measurements," in Proc. of Interspeech, 2002.
-
(2002)
Proc. of Interspeech
-
-
Ngoc Tuan, V.1
D'alessandro, C.2
Rosset, S.3
-
26
-
-
0031023993
-
Glottal characteristics of female speakers: Acoustic correlates
-
Helen M Hanson, "Glottal characteristics of female speakers: Acoustic correlates," J. Acoust. Soc. Am., vol. 101, pp. 466, 1997.
-
(1997)
J. Acoust. Soc. Am.
, vol.101
, pp. 466
-
-
Hanson, H.M.1
-
29
-
-
0141589488
-
SRILM-an extensible language modeling toolkit
-
Andreas Stolcke et al., "SRILM-an extensible language modeling toolkit," in Proc. of Interspeech, 2002.
-
(2002)
Proc. of Interspeech
-
-
Stolcke, A.1
-
30
-
-
80052042597
-
Lattice indexing for spoken term detection
-
Dogan Can and Murat Saraclar, "Lattice indexing for spoken term detection," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 8, pp. 2338-2347, 2011.
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, Issue.8
, pp. 2338-2347
-
-
Can, D.1
Saraclar, M.2
-
31
-
-
43849104109
-
Rapid and accurate spoken term detection
-
David RH Miller, Michael Kleber, Chia-Lin Kao, Owen Kimball, Thomas Colthurst, Stephen A Lowe, Richard M Schwartz, and Herbert Gish, "Rapid and accurate spoken term detection," in Proc. of Interspeech, 2007, pp. 314-317.
-
(2007)
Proc. of Interspeech
, pp. 314-317
-
-
Miller, D.R.H.1
Kleber, M.2
Kao, C.3
Kimball, O.4
Colthurst, T.5
Lowe, S.A.6
Schwartz, R.M.7
Gish, H.8
-
32
-
-
2442562479
-
Segmental minimum Bayes-risk decoding for automatic speech recognition
-
Vaibhava Goel, Shankar Kumar, and William Byrne, "Segmental minimum Bayes-risk decoding for automatic speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 12, no. 3, pp. 234-249, 2004.
-
(2004)
Speech and Audio Processing, IEEE Transactions on
, vol.12
, Issue.3
, pp. 234-249
-
-
Goel, V.1
Kumar, S.2
Byrne, W.3