-
1
-
-
84055222005
-
Context dependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context dependent pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20(1), pp. 30-42, 2012.
-
(2012)
IEEE Transactions on Audio, Speech and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
2
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. ASRU, pp. 24-29, 2011.
-
(2011)
Proc. ASRU
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
3
-
-
0032050110
-
Maximum likelihood linear transformations for hmm-based speech recognition
-
M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, vol. 12, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, pp. 75-98
-
-
Gales, M.1
-
4
-
-
79959849500
-
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid nn/hmm systems
-
B. Li, and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems, " in Proc. Inter speech, pp. 526-529, 2010.
-
(2010)
Proc. Inter Speech
, pp. 526-529
-
-
Li, B.1
Sim, K.C.2
-
5
-
-
84874226579
-
Adaptation of context-dependent deep neural networks for automatic speech recognition
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition, " in Proc. IEEE Spoken Language Technology Workshop, pp. 366-369, 2012.
-
(2012)
Proc. IEEE Spoken Language Technology Workshop
, pp. 366-369
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
6
-
-
84878606732
-
Hermitian-based hidden activation functions for adaptation of hybrid hmm/ann models
-
S. M. Siniscalchi, J. Li, and C.-H. Lee, "Hermitian-based hidden activation functions for adaptation of hybrid HMM/ANN models, " in Proc. Inter speech, pp. 526-529, 2012.
-
(2012)
Proc. Inter Speech
, pp. 526-529
-
-
Siniscalchi, S.M.1
Li, J.2
Lee, C.-H.3
-
7
-
-
84906241049
-
Improved feature processing for deep neural networks
-
S. P. Rath, D. Povey, K. Vesely, and J. Cernocky, "Improved feature processing for deep neural networks, " in Proc. Inter speech, 2013.
-
(2013)
Proc. Inter Speech
-
-
Rath, S.P.1
Povey, D.2
Vesely, K.3
Cernocky, J.4
-
8
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors, " in Proc. ASRU, pp. 55-59, 2013.
-
(2013)
Proc. ASRU
, pp. 55-59
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
9
-
-
70450180849
-
Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification
-
N. Dehak, R. Dehak, P. Kenny, N. Brummer, P. Ouellet, and P. Dumouchel, "Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification, " in Proc. Inter speech, pp. 1559- 1562, 2009.
-
(2009)
Proc. Inter Speech
, pp. 1559-1562
-
-
Dehak, N.1
Dehak, R.2
Kenny, P.3
Brummer, N.4
Ouellet, P.5
Dumouchel, P.6
-
10
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification, " IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 4, pp. 788-798, 2011.
-
(2011)
IEEE Transactions on Audio, Speech and Language Processing
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.J.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
11
-
-
80051634401
-
Simplification and optimization of i-vector extraction
-
O. Glembek, L. Burget, P. Matejka, M. Karafiat, and P. Kenny, "Simplification and optimization of i-vector extraction, " in Proc. ICASSP, pp. 4516-4519, 2011.
-
(2011)
Proc. ICASSP
, pp. 4516-4519
-
-
Glembek, O.1
Burget, L.2
Matejka, P.3
Karafiat, M.4
Kenny, P.5
-
12
-
-
0030677475
-
Speaker adaptive training: A maximum likelihood approach to speaker normalization
-
T. Anastasakos, J. McDonough, and J. Makhoul, "Speaker adaptive training: A maximum likelihood approach to speaker normalization, " in Proc. ICASSP, pp. 1043-1046, 1997.
-
(1997)
Proc. ICASSP
, pp. 1043-1046
-
-
Anastasakos, T.1
McDonough, J.2
Makhoul, J.3
-
13
-
-
84890452886
-
Fast speaker adaptation of hybrid nn/hmm model for speech recognition based on discriminative learning of speaker code
-
O. Abdel-Hamid, and H. Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code, " in Proc. ICASSP, pp. 7942-7946, 2013.
-
(2013)
Proc. ICASSP
, pp. 7942-7946
-
-
Abdel-Hamid, O.1
Jiang, H.2
-
14
-
-
84906225505
-
Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
-
O. Abdel-Hamid, and H. Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition, " in Proc. Inter speech, 2013.
-
(2013)
Proc. Inter Speech
-
-
Abdel-Hamid, O.1
Jiang, H.2
-
15
-
-
58349106697
-
A study of inter speaker variability in speaker verification
-
P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of interspeaker variability in speaker verification, " IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 5, pp. 980- 988, 2008.
-
(2008)
IEEE Transactions on Audio, Speech and Language Processing
, vol.16
, Issue.5
, pp. 980-988
-
-
Kenny, P.1
Ouellet, P.2
Dehak, N.3
Gupta, V.4
Dumouchel, P.5
-
16
-
-
84858984756
-
Ivector-based discriminative adaptation for automatic speech recognition
-
M. Karafiat, L. Burget, P. Matejka, O. Glembek, and J. Cernocky, "iVector-based discriminative adaptation for automatic speech recognition, " in Proc. ASRU, pp. 152- 157, 2011.
-
(2011)
Proc. ASRU
, pp. 152-157
-
-
Karafiat, M.1
Burget, L.2
Matejka, P.3
Glembek, O.4
Cernocky, J.5
-
17
-
-
33646788786
-
Fmpe: Discriminatively trained features for speech recognition
-
D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig, "fMPE: discriminatively trained features for speech recognition, " in Proc. ICASSP, pp. 961-964, 2005.
-
(2005)
Proc. ICASSP
, pp. 961-964
-
-
Povey, D.1
Kingsbury, B.2
Mangu, L.3
Saon, G.4
Soltau, H.5
Zweig, G.6
-
18
-
-
84890483211
-
Learning discriminative basis coefficients for eigen space mllr unsupervised adaptation
-
Y. Miao, F. Metze, and A. Waibel, "Learning discriminative basis coefficients for eigen space MLLR unsupervised adaptation, " in Proc. ICASSP, pp. 7927- 7931, 2013.
-
(2013)
Proc. ICASSP
, pp. 7927-7931
-
-
Miao, Y.1
Metze, F.2
Waibel, A.3
-
19
-
-
44949102463
-
Recent progress on the discriminative region-dependent transform for speech feature extraction
-
B. Zhang, S. Matsoukas, and R. Schwartz, "Recent progress on the discriminative region-dependent transform for speech feature extraction, " in Proc. Inter speech, 2006.
-
(2006)
Proc. Inter Speech
-
-
Zhang, B.1
Matsoukas, S.2
Schwartz, R.3
-
20
-
-
84867605836
-
Applying convolutional neural networks concepts to hybrid nn-hmm model for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition, " in Proc. ICASSP, pp. 4277-4280, 2012.
-
(2012)
Proc. ICASSP
, pp. 4277-4280
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
21
-
-
84890525984
-
Deep convolutional neural networks for lvcsr
-
T. N. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR, " in Proc. ICASSP, pp. 8614-8618, 2013.
-
(2013)
Proc. ICASSP
, pp. 8614-8618
-
-
Sainath, T.N.1
Mohamed, A.2
Kingsbury, B.3
Ramabhadran, B.4
-
22
-
-
84858953642
-
The kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, et al., "The Kaldi speech recognition toolkit, " in Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Povey, D.1
Ghoshal, A.2
-
23
-
-
85084012167
-
Alize/spkdet: A state-of-the-art open source software for speaker recognition
-
J.-F. Bonastre, N. Scheffer, D. Matrouf, C. Fredouille, A. Larcher, A. Preti, G. Pouchoulin, N. Evans, B. Fauve, and J. Mason, "ALIZE/SpkDet: A state-of-the-art open source software for speaker recognition, " in Proc. ISCA/IEEE Speaker Odyssey 2008.
-
(2008)
Proc. ISCA/IEEE Speaker Odyssey
-
-
Bonastre, J.-F.1
Scheffer, N.2
Matrouf, D.3
Fredouille, C.4
Larcher, A.5
Preti, A.6
Pouchoulin, G.7
Evans, N.8
Fauve, B.9
Mason, J.10
-
25
-
-
79551480483
-
Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
-
P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. Manzagol, "Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion, " Journal of Machine Learning Research, vol. 11, pp. 3371-3408, 2010.
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 3371-3408
-
-
Vincent, P.1
Larochelle, H.2
Lajoie, I.3
Bengio, Y.4
Manzagol, P.5
-
26
-
-
84890482429
-
Extracting deep bottleneck features using stacked auto encoders
-
J. Gehring, Y. Miao, F. Metze, and A. Waibel, "Extracting deep bottleneck features using stacked auto encoders, " in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Gehring, J.1
Miao, Y.2
Metze, F.3
Waibel, A.4
-
27
-
-
84906273176
-
Modular combination of deep neural networks for acoustic modeling
-
J. Gehring, W. Lee, K. Kilgour, I. Lane, Y. Miao, and A. Waibel, "Modular combination of deep neural networks for acoustic modeling, " in Proc. Inter speech, pp. 94-98, 2013.
-
(2013)
Proc. Inter Speech
, pp. 94-98
-
-
Gehring, J.1
Lee, W.2
Kilgour, K.3
Lane, I.4
Miao, Y.5
Waibel, A.6
-
28
-
-
84893701756
-
Deep maxout networks for low-resource speech recognition
-
Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for low-resource speech recognition, " in Proc. ASRU, pp. 398-403, 2013.
-
(2013)
Proc. ASRU
, pp. 398-403
-
-
Miao, Y.1
Metze, F.2
Rawat, S.3
-
29
-
-
84906283232
-
Using conversational word bursts in spoken term detection
-
J. Chiu, and A. Rudnicky, "Using conversational word bursts in spoken term detection, " in Proc. Inter speech, 2013.
-
(2013)
Proc. Inter Speech
-
-
Chiu, J.1
Rudnicky, A.2
-
30
-
-
84906273501
-
Improving low-resource cddnn- hmm using dropout and multilingual dnn training
-
Y. Miao, and F. Metze, "Improving low-resource CDDNN- HMM using dropout and multilingual DNN training, " in Proc. Inter speech, pp. 2237-2241, 2013.
-
(2013)
Proc. Inter Speech
, pp. 2237-2241
-
-
Miao, Y.1
Metze, F.2
-
31
-
-
84910068044
-
Distributed learning of multilingual dnn feature extractors using gpus
-
to appear
-
Y. Miao, H. Zhang, and F. Metze, "Distributed learning of multilingual DNN feature extractors using GPUs, " to appear in Proc. Inter speech, 2014.
-
(2014)
Proc. Inter Speech
-
-
Miao, Y.1
Zhang, H.2
Metze, F.3
-
32
-
-
84910028405
-
Improving language-universal feature extraction with deep maxout and convolutional neural networks
-
to appear
-
Y. Miao, and F. Metze, "Improving language-universal feature extraction with deep maxout and convolutional neural networks, " to appear in Proc. Inter speech, 2014.
-
(2014)
Proc. Inter Speech
-
-
Miao, Y.1
Metze, F.2
|