-
1
-
-
84874241818
-
Deep-level acoustic-to-articulatory mapping for DBN-HMM based phone recognition
-
Miami, Florida
-
Badino, L., Canevari, C., Fadiga, L., Metta, G., "Deep-level acoustic-to-articulatory mapping for DBN-HMM based phone recognition", in Proceedings of IEEE SLT 2012, Miami, Florida, 2012.
-
(2012)
Proceedings of IEEE SLT 2012
-
-
Badino, L.1
Canevari, C.2
Fadiga, L.3
Metta, G.4
-
2
-
-
0004113976
-
Mixture density networks
-
Department of Computer Science, Aston University, Birmingham, B4 7ET, UK, February
-
Bishop, C. M., "Mixture density networks", Technical Report NCRG/4288, Neural Computing Research Group, Department of Computer Science, Aston University, Birmingham, B4 7ET, UK, February, 1994.
-
(1994)
Technical Report NCRG/4288, Neural Computing Research Group
-
-
Bishop, C.M.1
-
3
-
-
33846516584
-
-
Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10012, USA
-
Bishop, C. M., "Pattern recognition and machine learning", Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10012, USA, 2009.
-
(2009)
Pattern Recognition and Machine Learning
-
-
Bishop, C.M.1
-
4
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Hinton, G. E., Osindero, S. and Teh, Y., "A fast learning algorithm for deep belief nets", Neural Computation, vol. 18, pp. 1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
5
-
-
33846680938
-
Speech production knowledge in automatic speech recognition
-
King, S., Frankel, J., Livescu, K., McDermott, E., Richmond, K. and Wester, M., "Speech production knowledge in automatic speech recognition", Journal of the Acoustical Society of America, vol. 121(2), pp. 723-742, 2007.
-
(2007)
Journal of the Acoustical Society of America
, vol.121
, Issue.2
, pp. 723-742
-
-
King, S.1
Frankel, J.2
Livescu, K.3
McDermott, E.4
Richmond, K.5
Wester, M.6
-
6
-
-
0002539638
-
Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation
-
Lindblom, B., Lubker, J. and Gay, T., "Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation", Journal of Phonetics, vol. 7, pp. 146-161, 1979.
-
(1979)
Journal of Phonetics
, vol.7
, pp. 146-161
-
-
Lindblom, B.1
Lubker, J.2
Gay, T.3
-
7
-
-
29444436962
-
Integration of articulatory and spectrum features based on the hybrid HMM/BN modelling framework
-
Markov, K., Dang, J. and Nakamura, S., "Integration of articulatory and spectrum features based on the hybrid HMM/BN modelling framework", Speech Communication, vol. 48, pp. 161-175, 2006.
-
(2006)
Speech Communication
, vol.48
, pp. 161-175
-
-
Markov, K.1
Dang, J.2
Nakamura, S.3
-
8
-
-
78649390043
-
Retrieving tract variables from acoustics: A comparison of different machine learning strategies
-
Mitra, V., Nam, H., Espy-Wilson, C., Saltzman, E. and Goldstein, L., "Retrieving tract variables from acoustics: A comparison of different machine learning strategies", IEEE J. of Selected Topics in Signal Processing, vol. 4(6), pp. 1027-1045, 2010.
-
(2010)
IEEE J. of Selected Topics in Signal Processing
, vol.4
, Issue.6
, pp. 1027-1045
-
-
Mitra, V.1
Nam, H.2
Espy-Wilson, C.3
Saltzman, E.4
Goldstein, L.5
-
9
-
-
78649297301
-
Deep belief networks for phone recognition
-
Mohamed, R., Dahl, G. E. and Hinton, G. E., "Deep belief networks for phone recognition", NIPS 22, Workshop on Deep Learning for Speech Recognition, 2011.
-
(2011)
NIPS 22, Workshop on Deep Learning for Speech Recognition
-
-
Mohamed, R.1
Dahl, G.E.2
Hinton, G.E.3
-
10
-
-
0026675669
-
Inferring articulation and recognising gestures from acoustics with a neural network trained on x-ray microbeam data
-
Papcun, G., Hochberg, J., Thomas, T. R., Laroche, F., Zachs, J. and Levy, S., "Inferring articulation and recognising gestures from acoustics with a neural network trained on x-ray microbeam data", The Journal of the Acoustical Society of America, vol. 92(2), pp. 688-700, 1992.
-
(1992)
The Journal of the Acoustical Society of America
, vol.92
, Issue.2
, pp. 688-700
-
-
Papcun, G.1
Hochberg, J.2
Thomas, T.R.3
Laroche, F.4
Zachs, J.5
Levy, S.6
-
11
-
-
0038359547
-
Modeling the uncertainty in recovering articulation from acoustics
-
Richmond, K., King, S. and Taylor, P., "Modeling the uncertainty in recovering articulation from acoustics", Computer Speech and Language, vol. 17(2), pp. 153-172, 2003.
-
(2003)
Computer Speech and Language
, vol.17
, Issue.2
, pp. 153-172
-
-
Richmond, K.1
King, S.2
Taylor, P.3
-
12
-
-
0030008004
-
The potential role of speech production models in automatic speech recognition
-
Rose, R. C., Schroeter, J. and Sondhi, M. M., "The potential role of speech production models in automatic speech recognition", The Journal of the Acoustical Society of America, vol. 99(3), pp. 1699-1709, 1996.
-
(1996)
The Journal of the Acoustical Society of America
, vol.99
, Issue.3
, pp. 1699-1709
-
-
Rose, R.C.1
Schroeter, J.2
Sondhi, M.M.3
-
13
-
-
84874282835
-
A deep neural network for acoustic-articulatory speech inversion
-
Sierra Nevada, Spain
-
Uria, B., Renals, S. and Richmond, K., "A deep neural network for acoustic-articulatory speech inversion", in Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning, Sierra Nevada, Spain, 2011.
-
(2011)
Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning
-
-
Uria, B.1
Renals, S.2
Richmond, K.3
-
14
-
-
84878403872
-
Deep architectures for articulatory inversion
-
Portland, Oregon, USA, September
-
Uria, B., Murray, I., Renals, S., Richmond, K., "Deep architectures for articulatory inversion", in Proc. Interspeech, Portland, Oregon, USA, September 2012.
-
(2012)
Proc. Interspeech
-
-
Uria, B.1
Murray, I.2
Renals, S.3
Richmond, K.4
-
16
-
-
84906213644
-
-
Available at http://data.cstr.ed.ac.uk/mocha/.
-
-
-
-
17
-
-
84906245796
-
-
Available at http://htk.eng.cam.ac.uk/.
-
-
-