-
1
-
-
84976225254
-
-
http: //htk. eng. cam. ac. uk
-
-
-
-
2
-
-
0003571976
-
-
Cambridge University Engineering Department
-
S. J. Young, G. Evermann, M. J. F. Gales, T. Hain., D. Kershaw, X.-Y. Liu, G. Moore, J. J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK book (for HTK version3. 4). Cambridge University Engineering Department, 2006.
-
(2006)
The HTK Book (For HTK version3. 4)
-
-
Young, S.J.1
Evermann, G.2
Gales, M.J.F.3
Hain, T.4
Kershaw, D.5
Liu, X.-Y.6
Moore, G.7
Odell, J.J.8
Ollason, D.9
Povey, D.10
Valtchev, V.11
Woodland, P.C.12
-
3
-
-
0002144369
-
Tree-basedstate tying for high accuracy acoustic modelling
-
Plainsboro, NJ, USA: MorganKaufman Publishers Inc
-
S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-basedstate tying for high accuracy acoustic modelling, " Proc. HumanLanguage Technology Workshop, Plainsboro, NJ, USA: MorganKaufman Publishers Inc, 1994.
-
(1994)
Proc. HumanLanguage Technology Workshop
-
-
Young, S.J.1
Odell, J.J.2
Woodland, P.C.3
-
4
-
-
0029288633
-
Maximum likelihood linearregression for speaker adaptation of continuous density hiddenMarkov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linearregression for speaker adaptation of continuous density hiddenMarkov models, " Computer Speech & Language, Vol. 9, No. 2, pp. 171-185, 1995.
-
(1995)
Computer Speech & Language
, vol.9
, Issue.2
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
5
-
-
0036296863
-
Minimum phone errorand I-smoothing for improved discriminative training
-
Orland o, FL, USA
-
D. Povey and P. C. Woodland, "Minimum phone errorand I-smoothing for improved discriminative training, " Proc. ICASSP'02, Orland o, FL, USA, 2002.
-
(2002)
Proc. ICASSP'02
-
-
Povey, D.1
Woodland, P.C.2
-
6
-
-
85133720638
-
The HMM-based speech synthesis system (HTS)version 2. 0
-
Bonn, Germany
-
H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko, A. W. Black, and K. Tokuda, "The HMM-based speech synthesis system (HTS)version 2. 0, " Proc. 6th ISCA Workshop on Speech Synthesis, Bonn, Germany, 2007.
-
(2007)
Proc. 6th ISCA Workshop on Speech Synthesis
-
-
Zen, H.1
Nose, T.2
Yamagishi, J.3
Sako, S.4
Masuko, T.5
Black, A.W.6
Tokuda, K.7
-
7
-
-
84858976070
-
Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription
-
Waikoloa, HI, USA
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription, " Proc. ASRU'11, Waikoloa, HI, USA, 2011.
-
(2011)
Proc. ASRU'11
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
8
-
-
85032751458
-
Deep neural networks foracoustic modeling in speech recognition
-
Nov.
-
G. E. Hinton, L. Deng, D. Yu et al., "Deep neural networks foracoustic modeling in speech recognition, " IEEE Signal ProcessingMagazine, pp. 2-17, Nov. 2012.
-
(2012)
IEEE Signal ProcessingMagazine
, pp. 2-17
-
-
Hinton, G.E.1
Deng, L.2
Yu, D.3
-
9
-
-
33746600649
-
Reducing the dimensionalityof data with neural networks
-
Jul
-
G. E. Hinton and R. Salakhutdinov, "Reducing the dimensionalityof data with neural networks, " Science, vol. 313, no. 5786, pp. 504-507, Jul 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.2
-
10
-
-
69349090197
-
Learning deep architectures for AI
-
Y. Bengio, "Learning deep architectures for AI, " Foundations and trends® in Machine Learning, Vol. 2, No. 1, pp. 1-127, 2009.
-
(2009)
Foundations and Trends® in Machine Learning
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
11
-
-
84878379108
-
Scalable minimumBayes risk training of deep neural network acoustic models usingdistributed Hessian-free optimization
-
Portland, OR, USA
-
B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimumBayes risk training of deep neural network acoustic models usingdistributed Hessian-free optimization, " Proc. Interspeech'12, Portland, OR, USA, 2012.
-
(2012)
Proc. Interspeech'12
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
12
-
-
84910072497
-
Unfoldedrecurrent neural networks for speech recognition
-
Singapore
-
G. Saon, H. Soltau, A. Emami, and M. Picheny, "Unfoldedrecurrent neural networks for speech recognition, " Proc. Interspeech'14, Singapore, 2014.
-
(2014)
Proc. Interspeech'14
-
-
Saon, G.1
Soltau, H.2
Emami, A.3
Picheny, M.4
-
13
-
-
84906225757
-
A scalable approach to usingDNN-derived features in GMM-HMM based acoustic modelingfor LVCSR
-
Lyon, France
-
Z.-J. Yan, Q. Huo, and J. Xu, "A scalable approach to usingDNN-derived features in GMM-HMM based acoustic modelingfor LVCSR, " Proc. Interspeech'13, Lyon, France, 2013.
-
(2013)
Proc. Interspeech'13
-
-
Yan, Z.-J.1
Huo, Q.2
Xu, J.3
-
15
-
-
84910067710
-
Efficient GPU-based training of recurrent neural networklanguage models using spliced sentence bunch
-
Singapore
-
X. Chen, Y.-Q. Wang, X.-Y. Liu, M. J. F. Gales, and P. C. Woodland, "Efficient GPU-based training of recurrent neural networklanguage models using spliced sentence bunch, " Proc. Interspeech'14, Singapore, 2014.
-
(2014)
Proc. Interspeech'14
-
-
Chen, X.1
Wang, Y.-Q.2
Liu, X.-Y.3
Gales, M.J.F.4
Woodland, P.C.5
-
16
-
-
84858953642
-
The kaldi speech recognitiontoolkit
-
Waikoloa, HI, USA
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlcek, Y.-M. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognitiontoolkit, " Proc. ASRU'11, Waikoloa, HI, USA, 2011.
-
(2011)
Proc. ASRU'11
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlcek, P.8
Qian, Y.-M.9
Schwarz, P.10
Silovský, J.11
Stemmer, G.12
Veselý, K.13
-
17
-
-
84905222840
-
RASR/NN: The RWTH neural network toolkit for speech recognition
-
Florence, Italy
-
S. Wiesler, A. Richard, P. Golik, R. Schlüter, and H. Ney, "RASR/NN: The RWTH neural network toolkit for speech recognition, "Proc. ICASSP'14, Florence, Italy, 2014.
-
(2014)
Proc. ICASSP'14
-
-
Wiesler, S.1
Richard, A.2
Golik, P.3
Schlüter, R.4
Ney, H.5
-
18
-
-
84893712779
-
-
D. Johnson, "Quicknet, " http: //www1. icsi. berkeley. edu/speech/qn. html.
-
Quicknet
-
-
Johnson, D.1
-
19
-
-
84959109976
-
The Cambridge university 2014 BOLT conversationaltelephone mand arin Chinese LVCSR system for speechtranslation
-
Dresden, Germany
-
X.-Y. Liu, F. Flego, L.-L. Wang, C. Zhang, M. J. F. Gales, and P. C. Woodland, "The Cambridge University 2014 BOLT conversationaltelephone Mand arin Chinese LVCSR system for speechtranslation, " Proc. Interspeech'15, Dresden, Germany, 2015.
-
(2015)
Proc. Interspeech'15
-
-
Liu, X.-Y.1
Flego, F.2
Wang, L.-L.3
Zhang, C.4
Gales, M.J.F.5
Woodland, P.C.6
-
20
-
-
84959166110
-
Joint decoding of tand em and hybrid systemsfor improved keyword spotting on low resource languages
-
Dresden, Germany
-
H.-P. Wang, A. Ragni, M. J. F. Gales, K. M. Knill, P. C. Woodland, and C. Zhang, "Joint decoding of tand em and hybrid systemsfor improved keyword spotting on low resource languages, " Proc. Interspeech'15, Dresden, Germany, 2015.
-
(2015)
Proc. Interspeech'15
-
-
Wang, H.-P.1
Ragni, A.2
Gales, M.J.F.3
Knill, K.M.4
Woodland, P.C.5
Zhang, C.6
-
21
-
-
84890543852
-
Error back propagation forsequence training of context-dependent deep networks for conversationalspeech transcription
-
Vancouver, Canada
-
H. Su, G. Li, D. Yu, and F. Seide, "Error back propagation forsequence training of context-dependent deep networks for conversationalspeech transcription, " Proc. ICASSP'13, Vancouver, Canada, 2013.
-
(2013)
Proc. ICASSP'13
-
-
Su, H.1
Li, G.2
Yu, D.3
Seide, F.4
-
22
-
-
84906274730
-
Sequencediscriminativetraining of deep neural networks
-
Lyon, France
-
K. Veselý, A. Ghoshal, L. Burget, and D. Povey, "Sequencediscriminativetraining of deep neural networks, " Proc. Interspeech'13, Lyon, France, 2013.
-
(2013)
Proc. Interspeech'13
-
-
Veselý, K.1
Ghoshal, A.2
Burget, L.3
Povey, D.4
-
23
-
-
84983119674
-
Learning hidden unit contributionsfor unsupervised speaker adaptation of neural networkacoustic models
-
Lake Tahoe, USA, Dec.
-
P. Swietojanski and S. Renals, "Learning hidden unit contributionsfor unsupervised speaker adaptation of neural networkacoustic models, " Proc. IWSLT'14, Lake Tahoe, USA, Dec. 2014.
-
(2014)
Proc. IWSLT'14
-
-
Swietojanski, P.1
Renals, S.2
-
24
-
-
84890542079
-
KL-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition
-
Vancouver, Canada
-
D. Yu, K. Yao, H. Su, G. Li, and F. Seide, "KL-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition, " Proc. ICASSP'13, Vancouver, Canada, 2013.
-
(2013)
Proc. ICASSP'13
-
-
Yu, D.1
Yao, K.2
Su, H.3
Li, G.4
Seide, F.5
-
25
-
-
84959174678
-
Parameterised sigmoid and ReLUhidden activation functions for DNN acoustic modelling
-
Dresden, Germany
-
C. Zhang and P. C. Woodland, "Parameterised sigmoid and ReLUhidden activation functions for DNN acoustic modelling, " Proc. Interspeech'15, Dresden, Germany, 2015.
-
(2015)
Proc. Interspeech'15
-
-
Zhang, C.1
Woodland, P.C.2
-
26
-
-
84893691530
-
Speaker adaptationof neural network acoustic models using i-vectors
-
Olomouc, Czech Republic
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptationof neural network acoustic models using i-vectors, " Proc. ASRU'13, Olomouc, Czech Republic, 2013.
-
(2013)
Proc. ASRU'13
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
28
-
-
84897544737
-
Theano: New featuresand speed improvements
-
Lake Tahoe, USA
-
F. Bastien, P. Lamblin, R. Pascanu, J. Bergstra, I. J. Goodfellow, A. Bergeron, N. Bouchard, and Y. Bengio, "Theano: new featuresand speed improvements, " Proc. Deep Learning and UnsupervisedFeature Learning NIPS Workshop, Lake Tahoe, USA, 2012.
-
(2012)
Proc. Deep Learning and UnsupervisedFeature Learning NIPS Workshop
-
-
Bastien, F.1
Lamblin, P.2
Pascanu, R.3
Bergstra, J.4
Goodfellow, I.J.5
Bergeron, A.6
Bouchard, N.7
Bengio, Y.8
-
29
-
-
84928146953
-
An introduction to computational networks and thecomputational network toolkit
-
Tech. Rep. MSR-TR-2014-112
-
D. Yu, A. Eversole, M. Seltzer, K. Yao, Z. Huang, B. Guenter, O. Kuchaiev, Y. Zhang, F. Seide, H. Wang, J. Droppo, G. Zweig, C. Rossbach, J. Currey, J. Gao, A. May, B. Peng, A. Stolcke, and M. Slaney, "An introduction to computational networks and thecomputational network toolkit, " Microsoft, Tech. Rep. MSR-TR-2014-112, 2014.
-
(2014)
Microsoft
-
-
Yu, D.1
Eversole, A.2
Seltzer, M.3
Yao, K.4
Huang, Z.5
Guenter, B.6
Kuchaiev, O.7
Zhang, Y.8
Seide, F.9
Wang, H.10
Droppo, J.11
Zweig, G.12
Rossbach, C.13
Currey, J.14
Gao, J.15
May, A.16
Peng, B.17
Stolcke, A.18
Slaney, M.19
-
30
-
-
84890471125
-
On rectified linear units for speech processing
-
Vancouver, Canada
-
M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean, and G. E. Hinton, "On rectified linear units for speech processing, " Proc. ICASSP'13, Vancouver, Canada, 2013.
-
(2013)
Proc. ICASSP'13
-
-
Zeiler, M.D.1
Ranzato, M.2
Monga, R.3
Mao, M.4
Yang, K.5
Le, Q.V.6
Nguyen, P.7
Senior, A.8
Vanhoucke, V.9
Dean, J.10
Hinton, G.E.11
-
31
-
-
84905284804
-
Fine context, low-rank, softplus deep neuralnetworks for mobile speech recognition
-
Florence, Italy
-
A. Senior and X. Lei, "Fine context, low-rank, softplus deep neuralnetworks for mobile speech recognition, " Proc. ICASSP'14, Florence, Italy, 2014.
-
(2014)
Proc. ICASSP'14
-
-
Senior, A.1
Lei, X.2
-
32
-
-
84904163933
-
Dropout: A simple way to prevent neural networksfrom overfitting
-
N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Dropout: A simple way to prevent neural networksfrom overfitting, " The Journal of Machine Learning Research, vol. 15, no. 1, pp. 1929-1958, 2014.
-
(2014)
The Journal of Machine Learning Research
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.E.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
33
-
-
80052250414
-
Adaptive subgradient methodsfor online learning and stochastic optimization
-
J. Duchi, E. Hazan, and Y. Singer, "Adaptive subgradient methodsfor online learning and stochastic optimization, " The Journal ofMachine Learning Research, vol. 12, pp. 2121-2159, 2011.
-
(2011)
The Journal OfMachine Learning Research
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
34
-
-
84910072353
-
Asynchronous stochastic optimization for sequence trainingof deep neural networks: Towards big data
-
Singapore
-
E. McDermott, G. Heigold, P. J. Moreno, A. Senior, and M. Bacchiani, "Asynchronous stochastic optimization for sequence trainingof deep neural networks: Towards big data, " Proc. Interspeech'14, Singapore, 2014.
-
(2014)
Proc. Interspeech'14
-
-
McDermott, E.1
Heigold, G.2
Moreno, P.J.3
Senior, A.4
Bacchiani, M.5
-
36
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markovmodels
-
M. J. F. Gales, "Semi-tied covariance matrices for hidden Markovmodels, " IEEE Transactions on Speech and Audio Processing, vol. 7, no. 3, pp. 272-281, 1999.
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, Issue.3
, pp. 272-281
-
-
Gales, M.J.F.1
-
37
-
-
0032050110
-
Maximum likelihood linear transformations forHMM-based speech recognition
-
M. J. F. Gales, "Maximum likelihood linear transformations forHMM-based speech recognition, " Computer speech & language, vol. 12, no. 2, pp. 75-98, 1998.
-
(1998)
Computer Speech & Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
38
-
-
84867614591
-
Scalable stacking and learningfor building deep architectures
-
Kyoto, Japan
-
L. Deng, D. Yu, and J. Platt, "Scalable stacking and learningfor building deep architectures, " Proc. ICASSP'12, Kyoto, Japan, 2012.
-
(2012)
Proc. ICASSP'12
-
-
Deng, L.1
Yu, D.2
Platt, J.3
-
39
-
-
84905222971
-
Stand alone training ofcontext-dependent deep neural network acoustic models
-
Florence, Italy
-
C. Zhang and P. C. Woodland, "Stand alone training ofcontext-dependent deep neural network acoustic models, " Proc. ICASSP'14, Florence, Italy, 2014.
-
(2014)
Proc. ICASSP'14
-
-
Zhang, C.1
Woodland, P.C.2
-
40
-
-
84890492591
-
Revisiting hybridand GMM-HMM system combination techniques
-
Vancouver, Canada
-
P. Swietojanski, A. Ghoshal, and S. Renals, "Revisiting hybridand GMM-HMM system combination techniques, " Proc. ICASSP'13, Vancouver, Canada, 2013.
-
(2013)
Proc. ICASSP'13
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
|