-
1
-
-
0035426931
-
Language independent and language adaptive acoustic modeling for speech recognition
-
T Schultz and A Waibel, "Language independent and language adaptive acoustic modeling for speech recognition," Speech Communication, vol. 35, pp. 31-51, 2001.
-
(2001)
Speech Communication
, vol.35
, pp. 31-51
-
-
Schultz, T.1
Waibel, A.2
-
2
-
-
85009101138
-
Experiments on cross-language acoustic modeling
-
T Schultz and A Waibel, "Experiments on cross-language acoustic modeling," in Proc. Eurospeech, 2001.
-
(2001)
Proc. Eurospeech
-
-
Schultz, T.1
Waibel, A.2
-
3
-
-
84858955616
-
Study of probabilistic and bottle-neck features in multilingual environment
-
F Grézl, M Karafiát, and M Janda, "Study of probabilistic and bottle-neck features in multilingual environment," in Proc. IEEE ASRU, 2011.
-
(2011)
Proc. IEEE ASRU
-
-
Grézl, F.1
Karafiát, M.2
Janda, M.3
-
5
-
-
84867616349
-
Using KL-divergence and multilingual information to improve ASR for under-resourced languages
-
D Imseng, H Bourlard, and PN Garner, "Using KL-divergence and multilingual information to improve ASR for under-resourced languages," in Proc. IEEE ICASSP, 2012.
-
(2012)
Proc. IEEE ICASSP
-
-
Imseng, D.1
Bourlard, H.2
Garner, P.N.3
-
6
-
-
78049394188
-
Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models
-
L Burget, P Schwarz, M Agarwal, P Akyaz, K Feng, A Ghoshal, O Glembek, N Goel, M Karafiát, D Povey, A Rastrow, RC Rose, and S Thomas, "Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models," in Proc. IEEE ICASSP, 2010.
-
(2010)
Proc. IEEE ICASSP
-
-
Burget, L.1
Schwarz, P.2
Agarwal, M.3
Akyaz, P.4
Feng, K.5
Ghoshal, A.6
Glembek, O.7
Goel, N.8
Karafiát, M.9
Povey, D.10
Rastrow, A.11
Rose, R.C.12
Thomas, S.13
-
7
-
-
84858952433
-
Regularized subspace Gaussian mixture models for cross-lingual speech recognition
-
L Lu, A Ghoshal, and S Renals, "Regularized subspace Gaussian mixture models for cross-lingual speech recognition," in Proc. IEEE ASRU, 2011.
-
(2011)
Proc. IEEE ASRU
-
-
Lu, L.1
Ghoshal, A.2
Renals, S.3
-
8
-
-
80051617867
-
Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil
-
NT Vu, F Kraus, and T Schultz, "Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil," in Proc. IEEE ICASSP, 2011.
-
(2011)
Proc. IEEE ICASSP
-
-
Vu, N.T.1
Kraus, F.2
Schultz, T.3
-
9
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G Hinton, S Osindero, and Y Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, pp. 1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.1
Osindero, S.2
Teh, Y.3
-
10
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
H Hermansky, DPW Ellis, and S Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE ICASSP, 2000.
-
(2000)
Proc. IEEE ICASSP
-
-
Hermansky, H.1
Ellis, D.2
Sharma, S.3
-
11
-
-
34547548235
-
Probabilistic and bottle-neck features for LVCSR of meetings
-
F Grézl, M Karafiát, S Kontár, and J Černocký, "Probabilistic and bottle-neck features for LVCSR of meetings," in Proc. IEEE ICASSP, 2007.
-
(2007)
Proc. IEEE ICASSP
-
-
Grézl, F.1
Karafiát, M.2
Kontár, S.3
Černocký, J.4
-
13
-
-
0028194709
-
Connectionist probability estimators in HMM speech recognition
-
S Renals, N Morgan, H Bourlard, M Cohen, and H Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994.
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.1
, pp. 161-174
-
-
Renals, S.1
Morgan, N.2
Bourlard, H.3
Cohen, M.4
Franco, H.5
-
14
-
-
0028392167
-
An application of recurrent nets to phone probability estimation
-
AJ Robinson, "An application of recurrent nets to phone probability estimation," IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 298-305, 1994.
-
(1994)
IEEE Transactions on Neural Networks
, vol.5
, Issue.2
, pp. 298-305
-
-
Robinson, A.J.1
-
15
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A Mohamed, GE Dahl, and G Hinton, "Acoustic modeling using deep belief networks," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.E.2
Hinton, G.3
-
16
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
GE Dahl, D Yu, L Deng, and A Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
17
-
-
33947619591
-
Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
-
A Stolcke, F Grézl, M-Y Hwang, X Lei, N Morgan, and D Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in Proc. IEEE ICASSP, 2006.
-
(2006)
Proc. IEEE ICASSP
-
-
Stolcke, A.1
Grézl, F.2
Hwang, M.-Y.3
Lei, X.4
Morgan, N.5
Vergyri, D.6
-
18
-
-
79959819891
-
Cross-lingual and multistream posterior features for low resource LVCSR systems
-
S Thomas and H Hermansky, "Cross-lingual and multistream posterior features for low resource LVCSR systems," in Proc. Interspeech, 2010.
-
(2010)
Proc. Interspeech
-
-
Thomas, S.1
Hermansky, H.2
-
19
-
-
44849132075
-
Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs
-
O Çetin, M Magimai-Doss, K Livescu, A Kantor, S King, C Bartels, and J Frankel, "Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs," in Proc. IEEE ASRU, 2007.
-
(2007)
Proc. IEEE ASRU
-
-
Çetin, O.1
Magimai-Doss, M.2
Livescu, K.3
Kantor, A.4
King, S.5
Bartels, C.6
Frankel, J.7
-
20
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
MIT Press
-
Y Bengio, P Lamblin, D Popovici, and H Larochelle, "Greedy layer-wise training of deep networks," in Advances in Neural Information Processing Systems 19 (NIPS'06), pp. 153-160. MIT Press, 2007.
-
(2007)
Advances in Neural Information Processing Systems 19 (NIPS'06)
, pp. 153-160
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
21
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F Seide, G Li, X Chen, and D Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE ASRU, 2011.
-
(2011)
Proc. IEEE ASRU
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
22
-
-
84905257065
-
GlobalPhone: A multilingual speech and text database developed at Karlsruhe University
-
T Schultz, "GlobalPhone: a multilingual speech and text database developed at Karlsruhe University," in Proc. ICSLP, 2002.
-
(2002)
Proc. ICSLP
-
-
Schultz, T.1
-
24
-
-
84874276847
-
The Kaldi speech recognition toolkit
-
December
-
D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, M Hannemann, P Motlíček, Y Qian, P Schwarz, J Silovský, G Stemmer, and K Veselý, "The Kaldi speech recognition toolkit," in Proc. IEEE ASRU, December 2011.
-
(2011)
Proc. IEEE ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlíček, P.8
Qian, Y.9
Schwarz, P.10
Silovský, J.11
Stemmer, G.12
Veselý, K.13
-
25
-
-
84892187452
-
Maximum likelihood modeling with Gaussian distributions for classification
-
May
-
R Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. IEEE ICASSP, May 1998, vol. 2, pp. 661-664.
-
(1998)
Proc. IEEE ICASSP
, vol.2
, pp. 661-664
-
-
Gopinath, R.1
-
26
-
-
51449120120
-
Boosted MMI for model and feature-space discriminative training
-
D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, and K Visweswariah, "Boosted MMI for model and feature-space discriminative training," in Proc. IEEE ICASSP, 2008, pp. 4057-4060.
-
(2008)
Proc. IEEE ICASSP
, pp. 4057-4060
-
-
Povey, D.1
Kanevsky, D.2
Kingsbury, B.3
Ramabhadran, B.4
Saon, G.5
Visweswariah, K.6
-
27
-
-
84873443879
-
Theano: A CPU and GPU math expression compiler
-
J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, J Turian, D Warde-Farley, and Y Bengio, "Theano: a CPU and GPU math expression compiler," in Proc. SciPy, 2010.
-
(2010)
Proc. SciPy
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
28
-
-
77949522811
-
Why does unsupervised pre-training help deep learning?
-
February
-
D Erhan, Y Bengio, A Courville, P-A Manzagol, P Vincent, and S Bengio, "Why does unsupervised pre-training help deep learning?," Journal of Machine Learning Research, vol. 11, pp. 625-660, February 2010.
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 625-660
-
-
Erhan, D.1
Bengio, Y.2
Courville, A.3
Manzagol, P.-A.4
Vincent, P.5
Bengio, S.6