-
1
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Processing Magazine, Vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
2
-
-
0031573117
-
Long short-term memory
-
Nov.
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Comput., Vol. 9, no. 8, pp. 1735-1780, Nov. 1997. [Online]. Available: http://dx.doi.org/10.1162/neco.1997.9.8.1735
-
(1997)
Neural Comput.
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
4
-
-
84910046405
-
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
-
H. Sak, A. W. Senior, and F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling." in Interspeech, 2014, pp. 338-342.
-
(2014)
Interspeech
, pp. 338-342
-
-
Sak, H.1
Senior, A.W.2
Beaufays, F.3
-
5
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors." in ASRU, 2013, pp. 55-59.
-
(2013)
ASRU
, pp. 55-59
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
6
-
-
84905259145
-
I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription
-
V. Gupta, P. Kenny, P. Ouellet, and T. Stafylakis, "I-vector-based speaker adaptation of deep neural networks for french broadcast audio transcription," in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014, pp. 6334-6338.
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference On. IEEE
, pp. 6334-6338
-
-
Gupta, V.1
Kenny, P.2
Ouellet, P.3
Stafylakis, T.4
-
7
-
-
84973380342
-
Speaker-aware training of lstm-rnns for acoustic modelling
-
T. Tan, Y. Qian, D. Yu, S. Kundu, L. Lu, K. C. Sim, X. Xiao, and Y Zhang, "Speaker-aware training of lstm-rnns for acoustic modelling," in Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on. IEEE, 2016, pp. 5280-5284.
-
(2016)
Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference On. IEEE
, pp. 5280-5284
-
-
Tan, T.1
Qian, Y.2
Yu, D.3
Kundu, S.4
Lu, L.5
Sim, K.C.6
Xiao, X.7
Zhang, Y.8
-
8
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech & Language Processing, Vol. 19, no. 4, pp. 788-798, 2011. [Online]. Available: http://dx.doi.org/10.1109/TASL.2010.2064307
-
(2011)
IEEE Trans. Audio, Speech & Language Processing
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
9
-
-
84890521103
-
Speaker adaptation of context dependent deep neural networks
-
H. Liao, "Speaker adaptation of context dependent deep neural networks," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7947-7951.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 7947-7951
-
-
Liao, H.1
-
10
-
-
84983119674
-
Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
-
IEEE
-
P. Swietojanski and S. Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models," in Spoken Language Technology Workshop (SLT), 2014 IEEE. IEEE, 2014, pp. 171-176.
-
(2014)
Spoken Language Technology Workshop (SLT), 2014 IEEE
, pp. 171-176
-
-
Swietojanski, P.1
Renals, S.2
-
11
-
-
84976435936
-
Learning hidden unit contributions for unsupervised acoustic model adaptation
-
P. Swietojanski, J. Li, and S. Renals, "Learning hidden unit contributions for unsupervised acoustic model adaptation," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 24, no. 8, pp. 1450-1463, 2016.
-
(2016)
IEEE/ACM Transactions on Audio, Speech, and Language Processing
, vol.24
, Issue.8
, pp. 1450-1463
-
-
Swietojanski, P.1
Li, J.2
Renals, S.3
-
12
-
-
84938688160
-
Speaker adaptive training of deep neural network acoustic models using i-vectors
-
Y Miao, H. Zhang, and F. Metze, "Speaker adaptive training of deep neural network acoustic models using i-vectors," IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Vol. 23, no. 11, pp. 1938-1949, 2015.
-
(2015)
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
, vol.23
, Issue.11
, pp. 1938-1949
-
-
Miao, Y.1
Zhang, H.2
Metze, F.3
-
13
-
-
85039174342
-
Layer Normalization
-
L. J. Ba, R. Kiros, and G. E. Hinton, "Layer normalization," CoRR, Vol. abs/1607.06450, 2016. [Online]. Available: http://arxiv.org/abs/1607.06450
-
(2016)
CoRR
-
-
Ba, L.J.1
Kiros, R.2
Hinton, G.E.3
-
14
-
-
84990067826
-
Texture networks: Feed-forward synthesis of textures and stylized images
-
D. Ulyanov, V. Lebedev, A. Vedaldi, and V. S. Lempitsky, "Texture networks: Feed-forward synthesis of textures and stylized images," CoRR, Vol. abs/1603.03417, 2016. [Online]. Available: http://arxiv.org/abs/1603.03417
-
(2016)
CoRR
-
-
Ulyanov, D.1
Lebedev, V.2
Vedaldi, A.3
Lempitsky, V.S.4
-
15
-
-
84990034290
-
Perceptual losses for real-time style transfer and super-resolution
-
J. Johnson, A. Alahi, and F. Li, "Perceptual losses for real-time style transfer and super-resolution," CoRR, Vol. abs/1603.08155, 2016. [Online]. Available: http://arxiv.org/abs/1603.08155
-
(2016)
CoRR
-
-
Johnson, J.1
Alahi, A.2
Li, F.3
-
16
-
-
85039172195
-
Instance Normalization: The missing ingredient for fast stylization
-
D. Ulyanov, A. Vedaldi, and V. S. Lempitsky, "Instance normalization: The missing ingredient for fast stylization," CoRR, Vol. abs/1607.08022, 2016. [Online]. Available: http://arxiv.org/abs/1607.08022
-
(2016)
CoRR
-
-
Ulyanov, D.1
Vedaldi, A.2
Lempitsky, V.S.3
-
17
-
-
85028600965
-
A learned representation for artistic style
-
V. Dumoulin, J. Shlens, and M. Kudlur, "A learned representation for artistic style," CoRR, Vol. abs/1610.07629, 2016. [Online]. Available: http://arxiv.org/abs/1610.07629
-
(2016)
CoRR
-
-
Dumoulin, V.1
Shlens, J.2
Kudlur, M.3
-
18
-
-
0012330750
-
The design for the wall street journal-based csr corpus
-
Association for Computational Linguistics
-
D. B. Paul and J. M. Baker, "The design for the wall street journal-based csr corpus," in Proceedings of the workshop on Speech and Natural Language. Association for Computational Linguistics, 1992, pp. 357-362.
-
(1992)
Proceedings of the Workshop on Speech and Natural Language
, pp. 357-362
-
-
Paul, D.B.1
Baker, J.M.2
-
19
-
-
85020205851
-
Enhancing the tedlium corpus with selected data for language modeling and more ted talks
-
A. Rousseau, P. Deléglise, and Y Estève, "Enhancing the tedlium corpus with selected data for language modeling and more ted talks." in LREC, 2014, pp. 3935-3939.
-
(2014)
LREC
, pp. 3935-3939
-
-
Rousseau, A.1
Deléglise, P.2
Estève, Y.3
-
20
-
-
84969584486
-
Batch Normalization: Accelerating deep network training by reducing internal covariate shift
-
F. R. Bach and D. M. Blei, Eds. JMLR.org
-
S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift." in ICML, ser. JMLR Workshop and Conference Proceedings, F. R. Bach and D. M. Blei, Eds., Vol. 37. JMLR.org, 2015, pp. 448-456.
-
(2015)
ICML, Ser. JMLR Workshop and Conference Proceedings
, vol.37
, pp. 448-456
-
-
Ioffe, S.1
Szegedy, C.2
-
22
-
-
0031268931
-
Bidirectional recurrent neural networks
-
Nov
-
M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," IEEE Transactions on Signal Processing, Vol. 45, no. 11, pp. 2673-2681, Nov 1997.
-
(1997)
IEEE Transactions on Signal Processing
, vol.45
, Issue.11
, pp. 2673-2681
-
-
Schuster, M.1
Paliwal, K.K.2
-
23
-
-
84893701254
-
Hybrid speech recognition with deep bidirectional lstm
-
A. Graves, N. Jaitly, and A.-r. Mohamed, "Hybrid speech recognition with deep bidirectional lstm," in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 273-278.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop On. IEEE
, pp. 273-278
-
-
Graves, A.1
Jaitly, N.2
Mohamed, A.-R.3
-
24
-
-
85083951076
-
Adam: A method for stochastic optimization
-
D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," CoRR, Vol. abs/1412.6980, 2014. [Online]. Available: http://arxiv.org/abs/1412.6980
-
(2014)
CoRR
-
-
Kingma, D.P.1
Ba, J.2
-
27
-
-
84973384984
-
-
Aug.
-
S. Dieleman, J. Schlüter, C. Raffel, E. Olson, S. K. Sønderby, D. Nouri, D. Maturana, M. Thoma, E. Battenberg, J. Kelly, J. D. Fauw, M. Heilman, D. M. de Almeida, B. McFee, H. Weideman, G. Takâcs, P. de Rivaz, J. Crall, G. Sanders, K. Rasul, C. Liu, G. French, and J. Degrave, "Lasagne: First release." Aug. 2015. [Online]. Available: http://dx.doi.org/10.5281/zenodo.27878
-
(2015)
Lasagne: First Release
-
-
Dieleman, S.1
Schlüter, J.2
Raffel, C.3
Olson, E.4
Sønderby, S.K.5
Nouri, D.6
Maturana, D.7
Thoma, M.8
Battenberg, E.9
Kelly, J.10
Fauw, J.D.11
Heilman, M.12
De Almeida, D.M.13
McFee, B.14
Weideman, H.15
Takâcs, G.16
De Rivaz, P.17
Crall, J.18
Sanders, G.19
Rasul, K.20
Liu, C.21
French, G.22
Degrave, J.23
more..
-
28
-
-
84893696682
-
The kaldi speech recognition toolkit
-
IEEE Signal Processing Society
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y Qian, P. Schwarz et al, "The kaldi speech recognition toolkit," in IEEE 2011 workshop on automatic speech recognition and understanding, no. EPFLCONF-192584. IEEE Signal Processing Society, 2011.
-
(2011)
IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, No. EPFLCONF-192584
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
|