SCOPUS Information Search Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

2013, Pages 1297-1301

Relevance-weighted-reconstruction of articulatory features in deep-neural-network-based acoustic-to-articulatory mapping

Authors (4): Canevari, Claudia (a); Badino, Leonardo (a); Fadiga, Luciano (a); Metta, Giorgio (a)

(a) Istituto Italiano di Tecnologia (Italy)

Author keywords

Acoustic to articulatory mapping; Critical articulators; Deep neural networks; Phone recognition

Indexed keywords

TELEPHONE SETS;

ACOUSTIC-TO-ARTICULATORY MAPPING; ARTICULATORY FEATURES; CRITICAL ARTICULATORS; DEEP NEURAL NETWORKS; ERROR REDUCTION; MIXTURE DENSITY; PHONE RECOGNITION; RECONSTRUCTION ERROR;

SPEECH PROCESSING;

EID: 84906219170
PISSN: 2308457X
EISSN: 19909772
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 16

References (17)

1
- 84874241818
- Deep-level acoustic-to-articulatory mapping for DBN-HMM based phone recognition
- Miami, Florida
- Badino, L., Canevari, C., Fadiga, L., Metta, G., "Deep-level acoustic-to-articulatory mapping for DBN-HMM based phone recognition", in Proceedings of IEEE SLT 2012, Miami, Florida, 2012.
- (2012) Proceedings of IEEE SLT 2012
- Badino, L.¹ Canevari, C.² Fadiga, L.³ Metta, G.⁴

2
- 0004113976
- Mixture density networks
- Department of Computer Science, Aston University, Birmingham, B4 7ET, UK, February
- Bishop, C. M., "Mixture density networks", Technical Report NCRG/4288, Neural Computing Research Group, Department of Computer Science, Aston University, Birmingham, B4 7ET, UK, February, 1994.
- (1994) Technical Report NCRG/4288, Neural Computing Research Group
- Bishop, C.M.¹

3
- 33846516584
- Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10012, USA
- Bishop, C. M., "Pattern recognition and machine learning", Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10012, USA, 2009.
- (2009) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

4
- 33745805403
- A fast learning algorithm for deep belief nets
- Hinton, G. E., Osindero, S. and Teh, Y., "A fast learning algorithm for deep belief nets", Neural Computation, 18, pp. 1527-1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.³

5
- 33846680938
- Speech production knowledge in automatic speech recognition
- King, S., Frankel, J., Livescu, K., McDermott, E., Richmond, K. and Wester, M., "Speech production knowledge in automatic speech recognition", Journal of the Acoustical Society of America, vol. 121(2), pp. 723-742, 2007.
- (2007) Journal of the Acoustical Society of America , vol.121 , Issue.2 , pp. 723-742
- King, S.¹ Frankel, J.² Livescu, K.³ McDermott, E.⁴ Richmond, K.⁵ Wester, M.⁶

6
- 0002539638
- Formant frequencies of some fixed-mandible vowels and model of speech motor programming by predictive simulation
- Lindblom, B., Lubker, J. and Gay, T., "Formant frequencies of some fixed-mandible vowels and model of speech motor programming by predictive simulation", Journal of Phonetics, vol. 7, pp. 146-161, 1979.
- (1979) Journal of Phonetics , vol.7 , pp. 146-161
- Lindblom, B.¹ Lubker, J.² Gay, T.³

7
- 29444436962
- Integration of articulatory and spectrum features based on the hybrid HMM/BN modelling framework
- Markov, K., Dang, J. and Nakamura, S., "Integration of articulatory and spectrum features based on the hybrid HMM/BN modelling framework", Speech Communication, vol. 48, pp. 161-175, 2006.
- (2006) Speech Communication , vol.48 , pp. 161-175
- Markov, K.¹ Dang, J.² Nakamura, S.³

8
- 78649390043
- Retrieving tract variables from acoustics: A comparison of different machine learning strategies
- Mitra, V., Nam, H., Espy-Wilson, C., Saltzman, E. and Goldstein, L., "Retrieving tract variables from acoustics: A comparison of different machine learning strategies", IEEE J. of Selected Topics in Signal Processing, vol. 4(6), pp. 1027-1045, 2010.
- (2010) IEEE J. of Selected Topics in Signal Processing , vol.4 , Issue.6 , pp. 1027-1045
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

9
- 78649297301
- Deep belief networks for phone recognition
- Mohamed, R., Dahl, G. E. and Hinton, G. E., "Deep belief networks for phone recognition", NIPS 22, workshop on deep learning for speech recognition, 2011.
- (2011) NIPS 22, Workshop on Deep Learning for Speech Recognition
- Mohamed, R.¹ Dahl, G.E.² Hinton, G.E.³

10
- 0026675669
- Inferring articulation and recognising gestures from acoustics with a neural network trained on x-ray microbeam data
- Papcun, G., Hochberg, J., Thomas, T. R., Laroche, F., Zachs, J. and Levy, S., "Inferring Articulation and Recognising Gestures from Acoustics with a Neural Network Trained on X-ray Microbeam Data", The Journal of the Acoustical Society of America, 92(2), pp. 688-700, 1992.
- (1992) The Journal of the Acoustical Society of America , vol.92 , Issue.2 , pp. 688-700
- Papcun, G.¹ Hochberg, J.² Thomas, T.R.³ Laroche, F.⁴ Zachs, J.⁵ Levy, S.⁶

11
- 0038359547
- Modeling the uncertainty in recovering articulation from acoustics
- Richmond, K., King, S. and Taylor, P., "Modeling the uncertainty in recovering articulation from acoustics", Computer Speech and Language, vol. 17(2), pp. 153-172, 2003.
- (2003) Computer Speech and Language , vol.17 , Issue.2 , pp. 153-172
- Richmond, K.¹ King, S.² Taylor, P.³

12
- 0030008004
- The potential role of speech production models in automatic speech recognition
- Rose, R. C., Schroeter, J. and Sondhi, M. M., "The potential role of speech production models in automatic speech recognition", The Journal of the Acoustical Society of America, 99(3), pp. 1699-1709, 1996.
- (1996) The Journal of the Acoustical Society of America , vol.99 , Issue.3 , pp. 1699-1709
- Rose, R.C.¹ Schroeter, J.² Sondhi, M.M.³

13
- 84874282835
- A deep neural network for acoustic-articulatory speech inversion
- Sierra Nevada, Spain
- Uria, B., Renals, S. and Richmond, K., "A deep neural network for acoustic-articulatory speech inversion", In Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning, Sierra Nevada, Spain, 2011.
- (2011) Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning
- Uria, B.¹ Renals, S.² Richmond, K.³

14
- 84878403872
- Deep architectures for articulatory inversion
- Portland, Oregon, USA, September
- Uria, B., Murray, I., Renals, S., Richmond, K., "Deep architectures for articulatory inversion", In Proc. Interspeech, Portland, Oregon, USA, September 2012.
- (2012) Proc. Interspeech
- Uria, B.¹ Murray, I.² Renals, S.³ Richmond, K.⁴

15
- 85009089757
- Continuous speech recognition using articulatory data
- Wrench, A. A. and Richmond, K., "Continuous speech recognition using articulatory data", in Proceedings of the International Conference on Spoken Language Processing, pp. 145-148, 2000.
- (2000) Proceedings of the International Conference on Spoken Language Processing , pp. 145-148
- Wrench, A.A.¹ Richmond, K.²

16
- 84906213644
- Available at http://data.cstr.ed.ac.uk/mocha/.

17
- 84906245796
- Available at http://htk.eng.cam.ac.uk/.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.