-
1
-
-
84890542302
-
Exploitingdiversity for spoken term detection
-
L. Mangu, H. Soltau, H.-K. Kuo, B. Kingsbury, and G. Saon, "Exploitingdiversity for spoken term detection, " in Proc. ICASSP, 2013, pp. 8282-8286.
-
(2013)
Proc. ICASSP
, pp. 8282-8286
-
-
Mangu, L.1
Soltau, H.2
Kuo, H.-K.3
Kingsbury, B.4
Saon, G.5
-
2
-
-
84893692703
-
Score normalization and system combinationfor improved keyword spotting
-
D. Karakos, R. Schwartz, S. Tsakalidis, L. Zhang, S. Ranjan, T. Ng, and R. Hsiao, "Score normalization and system combinationfor improved keyword spotting, " in Proc. ASRU, 2013, pp. 210-215.
-
(2013)
Proc. ASRU
, pp. 210-215
-
-
Karakos, D.1
Schwartz, R.2
Tsakalidis, S.3
Zhang, L.4
Ranjan, S.5
Ng, T.6
Hsiao, R.7
-
3
-
-
84910031125
-
Data augmentationfor low resource languages
-
A. Ragni, K. Knill, S. Rath, and M. Gales, "Data augmentationfor low resource languages, " in Proc. Interspeech, 2014, pp. 810-814.
-
(2014)
Proc. Interspeech
, pp. 810-814
-
-
Ragni, A.1
Knill, K.2
Rath, S.3
Gales, M.4
-
4
-
-
84910067354
-
Language independentand unsupervised acoustic models for speech recognition and keyword spotting
-
K. Knill, M. Gales, A. Ragni, and S. Rath, "Language independentand unsupervised acoustic models for speech recognition and keyword spotting, " in Proc. INTERSPEECH, 2014, pp. 20-26.
-
(2014)
Proc. INTERSPEECH
, pp. 20-26
-
-
Knill, K.1
Gales, M.2
Ragni, A.3
Rath, S.4
-
6
-
-
0030657238
-
Analyses of multiple evidence combination
-
J. Lee, "Analyses of multiple evidence combination, " in ACM SIGIR, 1997, pp. 267-276.
-
(1997)
ACM SIGIR
, pp. 267-276
-
-
Lee, J.1
-
7
-
-
84890489531
-
Systemcombination and score normalization for spoken term detection
-
J. Mamou, J. Cui, X. Cui, M. Gales, B. Kingsbury, K. Knill, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran et al., "Systemcombination and score normalization for spoken term detection, "in Proc. ICASSP, 2013, pp. 8272-8276.
-
(2013)
Proc. ICASSP
, pp. 8272-8276
-
-
Mamou, J.1
Cui, J.2
Cui, X.3
Gales, M.4
Kingsbury, B.5
Knill, K.6
Mangu, L.7
Nolden, D.8
Picheny, M.9
Ramabhadran, B.10
-
8
-
-
84946036768
-
Low-resource keyword search strategies forTAMIL
-
N. Chen et al., "Low-resource keyword search strategies forTAMIL, " in Proc. ICASSP, 2015, pp. 5366-5370.
-
(2015)
Proc. ICASSP
, pp. 5366-5370
-
-
Chen, N.1
-
9
-
-
0030638031
-
A Post-processing System to Yield ReducedWord Error Rates: Recogniser Output Voting Error Reduction(ROVER)
-
J. G. Fiscus, "A Post-processing System to Yield ReducedWord Error Rates: Recogniser Output Voting Error Reduction(ROVER), " in Proc. ASRU, 1997, pp. 347-354.
-
(1997)
Proc. ASRU
, pp. 347-354
-
-
Fiscus, J.G.1
-
10
-
-
4544253834
-
Posterior probability decoding, confidence estimation and system combination
-
G. Evermann and P. Woodland, "Posterior Probability Decoding, Confidence Estimation and System Combination, " in Proc. Speech Transcription Workshop, vol. 27, 2000.
-
(2000)
Proc. Speech Transcription Workshop
, vol.27
-
-
Evermann, G.1
Woodland, P.2
-
11
-
-
56149113962
-
Rapid and accurate spokenterm detection
-
D. Miller, M. Kleber, C.-L. Kao, O. Kimball, T. Colthurst, S. Lowe, R. Schwartz, and H. Gish, "Rapid and accurate spokenterm detection, " in Proc. Interspeech, 2007.
-
(2007)
Proc. Interspeech
-
-
Miller, D.1
Kleber, M.2
Kao, C.-L.3
Kimball, O.4
Colthurst, T.5
Lowe, S.6
Schwartz, R.7
Gish, H.8
-
12
-
-
43849107771
-
The SRI/OGI 2006 spoken term detection system
-
D. Vergyri, I. Shafran, A. Stolcke, V. Gadde, M. Akbacak et al., "The SRI/OGI 2006 spoken term detection system, " in Proc. Interspeech, 2007, pp. 2393-2396.
-
(2007)
Proc. Interspeech
, pp. 2393-2396
-
-
Vergyri, D.1
Shafran, I.2
Stolcke, A.3
Gadde, V.4
Akbacak, M.5
-
13
-
-
67649518727
-
Sub-word modelingof out of vocabulary words in spoken term detection
-
I. Szoke, L. Burget, J. Cernocky, and M. Fapso, "Sub-word modelingof out of vocabulary words in spoken term detection, " Proc. SLT, 2008, pp. 273-276.
-
(2008)
Proc. SLT
, pp. 273-276
-
-
Szoke, I.1
Burget, L.2
Cernocky, J.3
Fapso, M.4
-
14
-
-
84890537373
-
A high-performance Cantonese keywordsearch system
-
B. Kingsbury et al., "A high-performance Cantonese keywordsearch system, " in Proc. ICASSP, 2013, pp. 8277-8281.
-
(2013)
Proc. ICASSP
, pp. 8277-8281
-
-
Kingsbury, B.1
-
15
-
-
84910068314
-
Combining tand emand hybrid systems for improved speech recognition and keywordspotting on low resource languages
-
S. Rath, K. Knill, A. Ragni, and M. Gales, "Combining tand emand hybrid systems for improved speech recognition and keywordspotting on low resource languages, " in Proc. Interspeech, 2014, pp. 835-839.
-
(2014)
Proc. Interspeech
, pp. 835-839
-
-
Rath, S.1
Knill, K.2
Ragni, A.3
Gales, M.4
-
16
-
-
79251574977
-
Theefficient incorporation of MLP features into automatic speechrecognition systems
-
J. Park, F. Diehl, M. Gales, M. Tomalin, and P. C. Woodland, "Theefficient incorporation of MLP features into automatic speechrecognition systems, " Computer Speech and Language, vol. 25, no. 3, pp. 519-534, 2010.
-
(2010)
Computer Speech and Language
, vol.25
, Issue.3
, pp. 519-534
-
-
Park, J.1
Diehl, F.2
Gales, M.3
Tomalin, M.4
Woodland, P.C.5
-
17
-
-
84055211743
-
Acoustic modeling usingdeep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling usingdeep belief networks, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
18
-
-
85032751458
-
Deep neuralnetworks for acoustic modeling in speech recognition
-
Nov 2012
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath et al., "Deep neuralnetworks for acoustic modeling in speech recognition, " IEEESignal Processing Magazine, vol. 29, no. 6, pp. 82-97, Nov 2012.
-
IEEESignal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
-
20
-
-
0034825241
-
Multi-streamadaptive evidence combination for noise robust ASR
-
A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multi-streamadaptive evidence combination for noise robust ASR, " SpeechCommunication, vol. 34, no. 1, pp. 25-40, 2001.
-
(2001)
SpeechCommunication
, vol.34
, Issue.1
, pp. 25-40
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
21
-
-
0141676589
-
New entropy based combinationrules in HMM/ANN multi-stream ASR
-
H. Misra, H. Bourlard, and V. Tyagi, "New entropy based combinationrules in HMM/ANN multi-stream ASR, " in Proc. ICASSP, 2003, pp. 738-741.
-
(2003)
Proc. ICASSP
, pp. 738-741
-
-
Misra, H.1
Bourlard, H.2
Tyagi, V.3
-
22
-
-
0028194709
-
Connectionist probability estimators in hmm speech recognition
-
S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco, "Connectionist probability estimators in hmm speech recognition, "IEEE Trans. Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994.
-
(1994)
IEEE Trans. Speech and Audio Processing
, vol.2
, Issue.1
, pp. 161-174
-
-
Renals, S.1
Morgan, N.2
Bourlard, H.3
Cohen, M.4
Franco, H.5
-
23
-
-
0028204660
-
Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition
-
C. Dugast, L. Devillers, and X. Aubert, "Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition, "IEEE Trans. Speech and Audio Processing, vol. 2, no. 1, pp. 217-223, 1994.
-
(1994)
IEEE Trans. Speech and Audio Processing
, vol.2
, Issue.1
, pp. 217-223
-
-
Dugast, C.1
Devillers, L.2
Aubert, X.3
-
24
-
-
84890492591
-
Revisiting hybridand gmm-hmm system combination techniques
-
P. Swietojanski, A. Ghoshal, and S. Renals, "Revisiting hybridand gmm-hmm system combination techniques, " in Proc. ICASSP, 2013, pp. 6744-6748.
-
(2013)
Proc. ICASSP
, pp. 6744-6748
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
25
-
-
80053417853
-
Joint optimization for machine translationsystem combination
-
X. He and K. Toutanova, "Joint optimization for machine translationsystem combination, " in Proc. EMNLP, 2009, pp. 1202-1211.
-
(2009)
Proc. EMNLP
, pp. 1202-1211
-
-
He, X.1
Toutanova, K.2
-
26
-
-
84905265980
-
Joint training of convolutionaland non-convolutional neural networks
-
H. Soltau, G. Saon, and T. Sainath, "Joint training of convolutionaland non-convolutional neural networks, " Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Soltau, H.1
Saon, G.2
Sainath, T.3
-
27
-
-
84976253431
-
Results of the2006 spoken term detection evaluation
-
J. Fiscus, J. Ajot, J. Garofolo, and G. Doddingtion, "Results of the2006 Spoken Term Detection Evaluation, " in Proc. SIGIR, 2007, pp. 51-57.
-
(2007)
Proc. SIGIR
, pp. 51-57
-
-
Fiscus, J.1
Ajot, J.2
Garofolo, J.3
Doddingtion, G.4
-
28
-
-
0003571976
-
-
S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw, X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey et al., The HTK Book(for HTK version 3. 4. 1). http: //htk. eng. cam. ac. uk: CambridgeUniversity, 2009.
-
(2009)
The HTK Book(for HTK Version 3. 4. 1)
-
-
Young, S.1
Evermann, G.2
Gales, M.3
Hain, T.4
Kershaw, D.5
Liu, X.6
Moore, G.7
Odell, J.8
Ollason, D.9
Povey, D.10
-
29
-
-
84959142742
-
A general artificial neural networkextension for HTK
-
C. Zhang and P. Woodland, "A general artificial neural networkextension for HTK, " in Submission to InterSpeech, 2015.
-
(2015)
Submission to InterSpeech
-
-
Zhang, C.1
Woodland, P.2
-
30
-
-
84946055405
-
Unicode-based graphemic systemsfor limited resource languages
-
M. Gales, K. Knill, and A. Ragni, "Unicode-based graphemic systemsfor limited resource languages, " in Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Gales, M.1
Knill, K.2
Ragni, A.3
-
31
-
-
0036460908
-
Lightly supervised and unsupervisedacoustic model training
-
L. Lamel and J.-L. Gauvain, "Lightly supervised and unsupervisedacoustic model training, " Computer speech and language, vol. 16, pp. 115-129, 2013.
-
(2013)
Computer Speech and Language
, vol.16
, pp. 115-129
-
-
Lamel, L.1
Gauvain, J.-L.2
-
32
-
-
84890474716
-
Deepneural network features and semi-supervised training for low resourcespeech recognition
-
S. Thomas, M. L. Seltzer, K. Church, and H. Hermansky, "Deepneural network features and semi-supervised training for low resourcespeech recognition, " in Proc. ICASSP, 2013, pp. 6704-6708.
-
(2013)
Proc. ICASSP
, pp. 6704-6708
-
-
Thomas, S.1
Seltzer, M.L.2
Church, K.3
Hermansky, H.4
-
33
-
-
84893705111
-
Discriminative semi-supervised training forkeyword search in low resource languages
-
R. Hsiao, T. Ng, F. Grézl, D. Karakos, S. Tsakalidis, L. Nguyen, and R. Schwartz, "Discriminative semi-supervised training forkeyword search in low resource languages, " in Proc. ASRU, 2013, pp. 440-445.
-
(2013)
Proc. ASRU
, pp. 440-445
-
-
Hsiao, R.1
Ng, T.2
Grézl, F.3
Karakos, D.4
Tsakalidis, S.5
Nguyen, L.6
Schwartz, R.7
-
34
-
-
84890474441
-
Investigation oncross-and multilingual MLP features under matched and mismatchedacoustical conditions
-
Z. Tuske, J. Pinto, D. Willett, and R. Schluter, "Investigation oncross-and multilingual MLP features under matched and mismatchedacoustical conditions, " in Proc. ICASSP, 2013, pp. 6970-6974.
-
(2013)
Proc. ICASSP
, pp. 6970-6974
-
-
Tuske, Z.1
Pinto, J.2
Willett, D.3
Schluter, R.4
-
35
-
-
84905215475
-
MultilingualMRASTA features for low-resource keyword search and speechrecognition systems
-
Z. Tuske, D. Nolden, R. Schluter, and H. Ney, "MultilingualMRASTA features for low-resource keyword search and speechrecognition systems, " in Proc. ICASSP, 2014, pp. 7854-7858.
-
(2014)
Proc. ICASSP
, pp. 7854-7858
-
-
Tuske, Z.1
Nolden, D.2
Schluter, R.3
Ney, H.4
-
36
-
-
84858953642
-
The Kaldi speech recognition toolkit
-
D. Povey et al., "The Kaldi speech recognition toolkit, " in Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Povey, D.1
-
37
-
-
0032638856
-
Semi-tied covariance matrices for hidden markovmodels
-
M. Gales, "Semi-tied covariance matrices for hidden markovmodels, " Speech and Audio Processing, IEEE Transactions on, vol. 7, no. 3, pp. 272-281, 1999.
-
(1999)
Speech and Audio Processing, IEEE Transactions on
, vol.7
, Issue.3
, pp. 272-281
-
-
Gales, M.1
-
38
-
-
33646773785
-
Feature space gaussianization
-
G. Saon, S. Dharanipragada, and D. Povey, "Feature space gaussianization, "in Proc. ICASSP, 2004, p. 326329.
-
(2004)
Proc. ICASSP
, pp. 326329
-
-
Saon, G.1
Dharanipragada, S.2
Povey, D.3
-
39
-
-
0036296863
-
Minimum Phone Error and I-smoothing for improved discriminative training
-
D. Povey and P. C. Woodland, "Minimum Phone Error and I-smoothing for improved discriminative training, " in Proc. ICASSP, 2002, pp. 101-105.
-
(2002)
Proc. ICASSP
, pp. 101-105
-
-
Povey, D.1
Woodland, P.C.2
-
40
-
-
0030362995
-
Acompact model for speaker adaptive training
-
T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "Acompact model for speaker adaptive training, " in Proc. ICSLP, 1996, pp. 1137-1140.
-
(1996)
Proc. ICSLP
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
41
-
-
0032050110
-
Maximum likelihood linear transformations forHMM-based speech recognition
-
M. Gales, "Maximum likelihood linear transformations forHMM-based speech recognition, " Computer speech & language, vol. 12, no. 2, pp. 75-98, 1998.
-
(1998)
Computer Speech & Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.1
-
42
-
-
84906274730
-
Sequencediscriminativetraining of deep neural networks
-
K. Vesely, A. Ghoshal, L. Burget, and D. Povey, "Sequencediscriminativetraining of deep neural networks. " in Proc. Interspeech, 2013, pp. 2345-2349.
-
(2013)
Proc. Interspeech
, pp. 2345-2349
-
-
Vesely, K.1
Ghoshal, A.2
Burget, L.3
Povey, D.4
-
43
-
-
33745219793
-
General indexation ofweighted automata-application to spoken utterance retrieval
-
M. Mohri, C. Allauzen, and M. Saraclar, "General indexation ofweighted automata-application to spoken utterance retrieval, " Proc. HLT/NAACL, 2004, pp. 33-40.
-
(2004)
Proc. HLT/NAACL
, pp. 33-40
-
-
Mohri, M.1
Allauzen, C.2
Saraclar, M.3
|