SCOPUS Information Retrieval Platform

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Volume 1, 2012, Pages 866-869

Deep architectures for articulatory inversion

Authors (4): Uria, Benigno (a); Murray, Iain (a); Renals, Steve (a); Richmond, Korin (a)

(a) University of Edinburgh (United Kingdom)

Author keywords

Articulatory inversion; Deep belief network; Deep neural network; Deep regression network; Pretraining

Indexed keywords

ACCURATE PREDICTION; ADJUSTABLE PARAMETERS; ARTICULATORY INVERSION; DEEP BELIEF NETWORKS; DEEP NEURAL NETWORKS; INVERSION ACCURACY; PRE-TRAINING; ROOT MEAN SQUARE ERRORS; MEAN SQUARE ERROR; NEURAL NETWORKS; STATISTICAL TESTS; NETWORK ARCHITECTURE

EID: 84878403872
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 70

References (23)

1
- 33846680938
- Speech production knowledge in automatic speech recognition
- S. King, J. Frankel, K. Livescu, E. McDermott, K. Richmond, and M. Wester, "Speech production knowledge in automatic speech recognition," Journal of the Acoustical Society of America, vol. 121, pp. 723-743, 2007.
- (2007) Journal of the Acoustical Society of America , vol.121 , pp. 723-743
- King, S.¹ Frankel, J.² Livescu, K.³ McDermott, E.⁴ Richmond, K.⁵ Wester, M.⁶

2
- 68149157315
- Integrating articulatory features into HMM-based parametric speech synthesis
- Z. Ling, K. Richmond, J. Yamagishi, and R. Wang, "Integrating articulatory features into HMM-based parametric speech synthesis," IEEE Transactions on Audio, Speech and Language Processing, vol. 17, no. 6, pp. 1171-1185, 2009.
- (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.6 , pp. 1171-1185
- Ling, Z.¹ Richmond, K.² Yamagishi, J.³ Wang, R.⁴

3
- 79959851452
- Comparison of HMM and TMDN methods for lip synchronisation
- G. Hofer and K. Richmond, "Comparison of HMM and TMDN methods for lip synchronisation," in Proc. Interspeech, Makuhari, Japan, September 2010, pp. 454-457.
- (2010) Proc. Interspeech , pp. 454-457
- Hofer, G.¹ Richmond, K.²

4
- 0038359547
- Modelling the uncertainty in recovering articulation from acoustics
- K. Richmond, S. King, and P. Taylor, "Modelling the uncertainty in recovering articulation from acoustics," Computer Speech and Language, vol. 17, pp. 153-172, 2003.
- (2003) Computer Speech and Language , vol.17 , pp. 153-172
- Richmond, K.¹ King, S.² Taylor, P.³

5
- 67650153217
- Acoustic-articulatory modeling with the trajectory HMM
- L. Zhang and S. Renals, "Acoustic-articulatory modeling with the trajectory HMM," IEEE Signal Processing Letters, vol. 15, pp. 245-248, 2008.
- (2008) IEEE Signal Processing Letters , vol.15 , pp. 245-248
- Zhang, L.¹ Renals, S.²

6
- 84055211743
- Acoustic modelling using deep belief networks
- A. Mohamed, G. E. Dahl, and G. Hinton, "Acoustic modelling using deep belief networks," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

7
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

8
- 84865778430
- Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus
- K. Richmond, P. Hoole, and S. King, "Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Richmond, K.¹ Hoole, P.² King, S.³

9
- 70450219528
- Preliminary inversion mapping results with a new EMA corpus
- K. Richmond, "Preliminary inversion mapping results with a new EMA corpus," in Proc. Interspeech, Brighton, UK, Sep. 2009, pp. 2835-2838.
- (2009) Proc. Interspeech , pp. 2835-2838
- Richmond, K.¹

10
- 0024880831
- Multilayer feedforward networks are universal approximators
- K. Hornik, M. Stinchcombe, and H. White, "Multilayer feedforward networks are universal approximators," Neural Networks, vol. 2, no. 5, pp. 359-366, 1989.
- (1989) Neural Networks , vol.2 , Issue.5 , pp. 359-366
- Hornik, K.¹ Stinchcombe, M.² White, H.³

11
- 84864073449
- Greedy layer-wise training of deep networks
- Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, "Greedy layer-wise training of deep networks," in Advances in Neural Information Processing Systems, vol. 19. Vancouver, Canada: The MIT Press, Dec. 2007, p. 153.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 153
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

12
- 69349090197
- Learning deep architectures for AI
- Y. Bengio, "Learning deep architectures for AI," Foundations and Trends in Machine Learning, vol. 2, no. 1, pp. 1-127, 2009.
- (2009) Foundations and Trends in Machine Learning , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

13
- 84858955616
- Study of probabilistic and bottle-neck features in multilingual environment
- F. Grézl, M. Karafiát, and M. Janda, "Study of probabilistic and bottle-neck features in multilingual environment," in Proc. IEEE ASRU, 2011.
- (2011) Proc. IEEE ASRU
- Grézl, F.¹ Karafiát, M.² Janda, M.³

14
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE ASRU, 2011.
- (2011) Proc. IEEE ASRU
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

15
- 33745805403
- A fast learning algorithm for deep belief nets
- G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.¹ Osindero, S.² Teh, Y.³

16
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Computation, vol. 14, no. 8, pp. 1771-1800, 2002.
- (2002) Neural Computation , vol.14 , Issue.8 , pp. 1771-1800
- Hinton, G.¹

17
- 84862612564
- On contrastive divergence learning
- M. Carreira-Perpiñán and G. Hinton, "On contrastive divergence learning," in Proc. Artificial Intelligence and Statistics, 2005, pp. 33-40.
- (2005) Proc. Artificial Intelligence and Statistics , pp. 33-40
- Carreira-Perpiñán, M.¹ Hinton, G.²

18
- 44949185845
- A trajectory mixture density network for the acoustic-articulatory inversion mapping
- K. Richmond, "A trajectory mixture density network for the acoustic-articulatory inversion mapping," in Proc. Interspeech, Pittsburgh, USA, Sep. 2006.
- (2006) Proc. Interspeech
- Richmond, K.¹

19
- 0004113976
- Mixture density networks
- C. Bishop, "Mixture density networks," Neural Computing Research Group, Aston University, Tech. Rep. NCRG/94/004, 1994.
- (1994) Neural Computing Research Group
- Bishop, C.¹

20
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, Istanbul, Turkey, Jun. 2000, pp. 1315-1318.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

21
- 0026491198
- Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements
- J. Perkell, M. Cohen, M. Svirsky, M. Matthies, I. Garabieta, and M. Jackson, "Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements," The Journal of the Acoustical Society of America, vol. 92, p. 3078, 1992.
- (1992) The Journal of the Acoustical Society of America , vol.92 , pp. 3078
- Perkell, J.¹ Cohen, M.² Svirsky, M.³ Matthies, M.⁴ Garabieta, I.⁵ Jackson, M.⁶

22
- 0019068177
- Linear prediction on a warped frequency scale
- H. Strube, "Linear prediction on a warped frequency scale," The Journal of the Acoustical Society of America, vol. 68, p. 1071, 1980.
- (1980) The Journal of the Acoustical Society of America , vol.68 , pp. 1071
- Strube, H.¹

23
- 84878399327
- Stop consonant production: An articulation and acoustic study
- K. L. Poort, "Stop consonant production: An articulation and acoustic study," Master's thesis, MIT, 1995.
- (1995) Stop Consonant Production: An Articulation and Acoustic Study
- Poort, K.L.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.