-
1
-
-
84928545733
-
-
CoRR, abs/1412. 5567
-
A. Hannun, C. Case, J. Casper, B. Catanzaro, G. Diamos, E. Elsen, R. Prenger, S. Satheesh, S. Sengupta, A. Coates, and A. Y. Ng, "Deep speech: Scaling up end-to-end speech recognition, " CoRR, abs/1412. 5567, 2014.
-
(2014)
Deep Speech: Scaling Up End-to-end Speech Recognition
-
-
Hannun, A.1
Case, C.2
Casper, J.3
Catanzaro, B.4
Diamos, G.5
Elsen, E.6
Prenger, R.7
Satheesh, S.8
Sengupta, S.9
Coates, A.10
Ng, A.Y.11
-
2
-
-
77949375556
-
Support vector machines for noise robust asr
-
M. J. F. Gales, A. Ragni, H. AlDamarki, and C. Gautier, "Support vector machines for noise robust asr, " in ASRU, 2009, pp. 205-210.
-
(2009)
ASRU
, pp. 205-210
-
-
Gales, M.J.F.1
Ragni, A.2
AlDamarki, H.3
Gautier, C.4
-
4
-
-
84905247925
-
Data augmentation for deep neural network acoustic modeling
-
X. Cui, V. Goel, and B. Kingsbury, "Data augmentation for deep neural network acoustic modeling, " in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014, pp. 100-104.
-
(2014)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 100-104
-
-
Cui, X.1
Goel, V.2
Kingsbury, B.3
-
5
-
-
84910031125
-
Data augmentation for low resource languages
-
A. Ragni, K. M. Knill, S. P. Rath, and M. J. F. Gales, "Data augmentation for low resource languages, " in Interspeech, 2014.
-
(2014)
Interspeech
-
-
Ragni, A.1
Knill, K.M.2
Rath, S.P.3
Gales, M.J.F.4
-
6
-
-
84893642825
-
Elastic spectral distortion for low resource speech recognition with deep neural networks
-
N. Kanda, R. Takeda, and Y. Obuchi, "Elastic spectral distortion for low resource speech recognition with deep neural networks, " in ASRU, 2013.
-
(2013)
ASRU
-
-
Kanda, N.1
Takeda, R.2
Obuchi, Y.3
-
7
-
-
84959115289
-
A time delay neural network architecture for efficient modeling of long temporal contexts
-
V. Peddinti, D. Povey, and S. Khudanpur, "A time delay neural network architecture for efficient modeling of long temporal contexts, " in Proceedings of INTERSPEECH, 2015.
-
(2015)
Proceedings of INTERSPEECH
-
-
Peddinti, V.1
Povey, D.2
Khudanpur, S.3
-
8
-
-
84896734479
-
Deep scattering spectrum
-
Aug
-
J. Andén and S. Mallat, "Deep scattering spectrum, " Signal Processing, IEEE Transactions on, vol. 62, no. 16, pp. 4114-4128, Aug 2014.
-
(2014)
Signal Processing, IEEE Transactions on
, vol.62
, Issue.16
, pp. 4114-4128
-
-
Andén, J.1
Mallat, S.2
-
9
-
-
84959085793
-
-
accessed March 25, 2015
-
SoX, audio manipulation tool, (accessed March 25, 2015). [Online]. Available: http: //sox. sourceforge. net/
-
Audio Manipulation Tool
-
-
-
10
-
-
84946076428
-
Ted-lium: An automatic speech recognition dedicated corpus
-
A. Rousseau, P. Deléglise, and Y. Estève, "Ted-lium: An automatic speech recognition dedicated corpus. " in LREC, 2012, pp. 125-129.
-
(2012)
LREC
, pp. 125-129
-
-
Rousseau, A.1
Deléglise, P.2
Estève, Y.3
-
11
-
-
84946015916
-
Librispeech: An ASR corpus based on public domain audio books
-
V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, "Librispeech: An ASR corpus based on public domain audio books, " in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015.
-
(2015)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE
-
-
Panayotov, V.1
Chen, G.2
Povey, D.3
Khudanpur, S.4
-
12
-
-
84905239342
-
Improving deep neural network acoustic models using generalized maxout networks
-
May
-
X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks, " in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, May 2014, pp. 215-219.
-
(2014)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE
, pp. 215-219
-
-
Zhang, X.1
Trmal, J.2
Povey, D.3
Khudanpur, S.4
-
13
-
-
0019053271
-
Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences
-
S. B. Davis and P. Mermelstein, "Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
-
(1980)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
14
-
-
84858984756
-
-
IEEE, Dec.
-
M. Karafiat, L. Burget, P. Matejka, O. Glembek, and J. Cernocky, in 2011 IEEE Workshop on Automatic Speech Recognition & Understanding. IEEE, Dec., pp. 152-157.
-
2011 IEEE Workshop on Automatic Speech Recognition & Understanding
, pp. 152-157
-
-
Karafiat, M.1
Burget, L.2
Matejka, P.3
Glembek, O.4
Cernocky, J.5
-
16
-
-
0027252181
-
An overlap-add technique based on waveform similarity (wsola) for high quality time-scale modification of speech
-
April
-
W. Verhelst and M. Roelands, "An overlap-add technique based on waveform similarity (wsola) for high quality time-scale modification of speech, " in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, April 1993, pp. 554-557 vol. 2.
-
(1993)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, vol.2
, pp. 554-557
-
-
Verhelst, W.1
Roelands, M.2
-
17
-
-
84905252790
-
A pitch extraction algorithm tuned for automatic speech recognition
-
P. Ghahremani, B. BabaAli, D. Povey, K. Riedhammer, J. Trmal, and S. Khudanpur, "A pitch extraction algorithm tuned for automatic speech recognition, " in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2014, pp. 2494-2498.
-
(2014)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE
, pp. 2494-2498
-
-
Ghahremani, P.1
BabaAli, B.2
Povey, D.3
Riedhammer, K.4
Trmal, J.5
Khudanpur, S.6
|