SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume 2017-August, 2017, Pages 3986-3990

Evaluation of a silent speech interface based on magnetic sensing and deep learning for a phonetically rich vocabulary

(7) Gonzalez, Jose A (a); Cheah, Lam A (b); Green, Phil D (a); Gilbert, James M (b); Ell, Stephen R (c); Moore, Roger K (a); Holdsworth, Ed (d)

a UNIVERSITY OF SHEFFIELD (United Kingdom)

b UNIVERSITY OF HULL (United Kingdom)

c CASTLE HILL HOSPITAL (United Kingdom)

d Practical Control Limited (United Kingdom)

Author keywords

Articulatory to acoustic mapping; Recurrent neural network; Speech rehabilitation; Speech synthesis

Indexed keywords

DEEP LEARNING; DEEP NEURAL NETWORKS; MAPPING; METADATA; QUALITY CONTROL; RECURRENT NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION; SPEECH RECOGNITION; SPEECH SYNTHESIS;

ACOUSTIC MAPPING; GAUSSIAN MIXTURE MODEL (GMMS); REALTIME PROCESSING; RECURRENT NEURAL NETWORK (RNN); SILENT SPEECH INTERFACES; SIMULTANEOUS RECORDING; SPEECH REHABILITATION; TOTAL LARYNGECTOMIES;

AUDIO RECORDINGS;

EID: 85039155335
PISSN: 2308457X
EISSN: 19909772
Source Type: Conference Proceeding
DOI: 10.21437/Interspeech.2017-802
Document Type: Conference Paper

Times cited : (18)

References (34)

1
- 42949175762
- Development of a (silent) speech recognition system for patients following laryngectomy
- M. J. Fagan, S. R. Ell, J. M. Gilbert, E. Sarrazin, and P. M. Chapman, "Development of a (silent) speech recognition system for patients following laryngectomy," Med. Eng. Phys., vol. 30, no. 4, pp. 419-425, 2008.
- (2008) Med. Eng. Phys. , vol.30 , Issue.4 , pp. 419-425
- Fagan, M.J.¹ Ell, S.R.² Gilbert, J.M.³ Sarrazin, E.⁴ Chapman, P.M.⁵

2
- 78449253410
- Isolated word recognition of silent speech using magnetic implants and sensors
- J. M. Gilbert, S. I. Rybchenko, R. Hofe, S. R. Ell, M. J. Fagan, R. K. Moore, and P. Green, "Isolated word recognition of silent speech using magnetic implants and sensors," Med. Eng. Phys., vol. 32, no. 10, pp. 1189-1197, 2010.
- (2010) Med. Eng. Phys. , vol.32 , Issue.10 , pp. 1189-1197
- Gilbert, J.M.¹ Rybchenko, S.I.² Hofe, R.³ Ell, S.R.⁴ Fagan, M.J.⁵ Moore, R.K.⁶ Green, P.⁷

3
- 84870292488
- Small-vocabulary speech recognition using a silent speech interface based on magnetic sensing
- R. Hofe, S. R. Ell, M. J. Fagan, J. M. Gilbert, P. D. Green, R. K. Moore, and S. I. Rybchenko, "Small-vocabulary speech recognition using a silent speech interface based on magnetic sensing," Speech Commun., vol. 55, no. 1, pp. 22-32, 2013.
- (2013) Speech Commun. , vol.55 , Issue.1 , pp. 22-32
- Hofe, R.¹ Ell, S.R.² Fagan, M.J.³ Gilbert, J.M.⁴ Green, P.D.⁵ Moore, R.K.⁶ Rybchenko, S.I.⁷

4
- 84962110277
- A silent speech system based on permanent magnet articulography and direct synthesis
- J. A. Gonzalez, L. A. Cheah, J. M. Gilbert, J. Bai, S. R. Ell, P. D. Green, and R. K. Moore, "A silent speech system based on permanent magnet articulography and direct synthesis," Comput. Speech Lang., vol. 39, pp. 67-87, 2016.
- (2016) Comput. Speech Lang. , vol.39 , pp. 67-87
- Gonzalez, J.A.¹ Cheah, L.A.² Gilbert, J.M.³ Bai, J.⁴ Ell, S.R.⁵ Green, P.D.⁶ Moore, R.K.⁷

5
- 85016157373
- Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics
- J. M. Gilbert, J. A. Gonzalez, L. A. Cheah, S. R. Ell, P. Green, R. K. Moore, and E. Holdsworth, "Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics," J. Acoust. Soc. Am., vol. 141, no. 3, pp. EL307-EL313, 2017.
- (2017) J. Acoust. Soc. Am. , vol.141 , Issue.3 , pp. EL307-EL313
- Gilbert, J.M.¹ Gonzalez, J.A.² Cheah, L.A.³ Ell, S.R.⁴ Green, P.⁵ Moore, R.K.⁶ Holdsworth, E.⁷

6
- 84930630277
- Deep learning
- Y. LeCun, Y. Bengio, and G. Hinton, "Deep learning," Nature, vol. 521, no. 7553, pp. 436-444, May 2015.
- (2015) Nature , vol.521 , Issue.7553 , pp. 436-444
- LeCun, Y.¹ Bengio, Y.² Hinton, G.³

7
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A.-r. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks," in Proc. ICASSP, 2013, pp. 6645-6649.
- (2013) Proc. ICASSP , pp. 6645-6649
- Graves, A.¹ Mohamed, A.-R.² Hinton, G.³

8
- 84910047819
- TTS synthesis with bidirectional LSTM based recurrent neural networks
- Y. Fan, Y. Qian, F.-L. Xie, and F. K. Soong, "TTS synthesis with bidirectional LSTM based recurrent neural networks," in Proc. Interspeech, 2014, pp. 1964-1968.
- (2014) Proc. Interspeech , pp. 1964-1968
- Fan, Y.¹ Qian, Y.² Xie, F.-L.³ Soong, F.K.⁴

9
- 84946045510
- Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis
- H. Zen and H. Sak, "Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis," in Proc. ICASSP, 2015, pp. 4470-4474.
- (2015) Proc. ICASSP , pp. 4470-4474
- Zen, H.¹ Sak, H.²

10
- 84946027999
- Voice conversion using deep bidirectional long short-term memory based recurrent neural networks
- L. Sun, S. Kang, K. Li, and H. Meng, "Voice conversion using deep bidirectional long short-term memory based recurrent neural networks," in Proc. ICASSP, 2015, pp. 4869-4873.
- (2015) Proc. ICASSP , pp. 4869-4873
- Sun, L.¹ Kang, S.² Li, K.³ Meng, H.⁴

11
- 84938864664
- A user-centric design of permanent magnetic articulography based assistive speech technology
- L. A. Cheah, J. Bai, J. A. Gonzalez, S. R. Ell, J. M. Gilbert, R. K. Moore, and P. D. Green, "A user-centric design of permanent magnetic articulography based assistive speech technology," in Proc. BioSignals, 2015, pp. 109-116.
- (2015) Proc. BioSignals , pp. 109-116
- Cheah, L.A.¹ Bai, J.² Gonzalez, J.A.³ Ell, S.R.⁴ Gilbert, J.M.⁵ Moore, R.K.⁶ Green, P.D.⁷

12
- 84969850648
- Preliminary evaluation of a silent speech interface based on intra-oral magnetic sensing
- L. A. Cheah, J. Bai, J. A. Gonzalez, J. M. Gilbert, S. R. Ell, P. D. Green, and R. K. Moore, "Preliminary evaluation of a silent speech interface based on intra-oral magnetic sensing," in Proc. Biodevices, 2016, pp. 108-116.
- (2016) Proc. Biodevices , pp. 108-116
- Cheah, L.A.¹ Bai, J.² Gonzalez, J.A.³ Gilbert, J.M.⁴ Ell, S.R.⁵ Green, P.D.⁶ Moore, R.K.⁷

13
- 84949568676
- Data driven articulatory synthesis with deep neural networks
- S. Aryal and R. Gutierrez-Osuna, "Data driven articulatory synthesis with deep neural networks," Comput. Speech Lang., vol. 36, pp. 260-273, 2016.
- (2016) Comput. Speech Lang. , vol.36 , pp. 260-273
- Aryal, S.¹ Gutierrez-Osuna, R.²

14
- 0000877063
- Delayed auditory feedback
- A. J. Yates, "Delayed auditory feedback," Psychological bulletin, vol. 60, no. 3, p. 213, 1963.
- (1963) Psychological Bulletin , vol.60 , Issue.3 , pp. 213
- Yates, A.J.¹

15
- 0036096888
- Effect of delayed auditory feedback on normal speakers at two speech rates
- A. Stuart, J. Kalinowski, M. P. Rastatter, and K. Lynch, "Effect of delayed auditory feedback on normal speakers at two speech rates," J. Acoust. Soc. Am., vol. 111, no. 5, pp. 2237-2241, 2002.
- (2002) J. Acoust. Soc. Am. , vol.111 , Issue.5 , pp. 2237-2241
- Stuart, A.¹ Kalinowski, J.² Rastatter, M.P.³ Lynch, K.⁴

16
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

17
- 84961291190
- Learning phrase representations using RNN encoder-decoder for statistical machine translation
- K. Cho, B. Van Merriënboer, Ç. Gülçehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, "Learning phrase representations using RNN encoder-decoder for statistical machine translation," in Proc. EMNLP, 2014, pp. 1724-1734.
- (2014) Proc. EMNLP , pp. 1724-1734
- Cho, K.¹ Van Merriënboer, B.² Gülçehre, C.³ Bahdanau, D.⁴ Bougares, F.⁵ Schwenk, H.⁶ Bengio, Y.⁷

18
- 0025503558
- Backpropagation through time: What it does and how to do it
- P. J. Werbos, "Backpropagation through time: what it does and how to do it," Proceedings of the IEEE, vol. 78, no. 10, pp. 1550-1560, 1990.
- (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
- Werbos, P.J.¹

19
- 85090475413
- The CMU Arctic speech databases
- J. Kominek and A. W. Black, "The CMU Arctic speech databases," in Fifth ISCA Workshop on Speech Synthesis, 2004, pp. 223-224.
- (2004) Fifth ISCA Workshop on Speech Synthesis , pp. 223-224
- Kominek, J.¹ Black, A.W.²

20
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. De Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech communication, vol. 27, no. 3, pp. 187-207, Apr. 1999.
- (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

21
- 85016140477
- An adaptive algorithm for Mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for Mel-cepstral analysis of speech," in Proc. ICASSP, 1992, pp. 137-140.
- (1992) Proc. ICASSP , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

22
- 0027530250
- SIMPLS: An alternative approach to partial least squares regression
- S. De Jong, "SIMPLS: an alternative approach to partial least squares regression," Chemometrics Intell. Lab. Syst., vol. 18, no. 3, pp. 251-263, 1993.
- (1993) Chemometrics Intell. Lab. Syst. , vol.18 , Issue.3 , pp. 251-263
- De Jong, S.¹

23
- 84958264664
- M. Abadi, A. Agarwal, P. Barham, E. Brevdo, Z. Chen, C. Citro, G. S. Corrado, A. Davis, J. Dean, M. Devin, S. Ghemawat, I. Goodfellow, A. Harp, G. Irving, M. Isard, Y. Jia, R. Jozefowicz, L. Kaiser, M. Kudlur, J. Levenberg, D. Mane, R. Monga, S. Moore, D. Murray, C. Olah, M. Schuster, J. Shlens, B. Steiner, I. Sutskever, K. Talwar, P. Tucker, V. Vanhoucke, V. Vasudevan, F. Viegas, O. Vinyals, P. Warden, M. Wattenberg, M. Wicke, Y. Yu, and X. Zheng. (2015) Tensorflow: Large-scale machine learning on heterogeneous distributed systems. [Online]. Available: www.tensorflow.org
- (2015) Tensorflow: Large-scale Machine Learning on Heterogeneous Distributed Systems
- Abadi, M.¹ Agarwal, A.² Barham, P.³ Brevdo, E.⁴ Chen, Z.⁵ Citro, C.⁶ Corrado, G.S.⁷ Davis, A.⁸ Dean, J.⁹ Devin, M.¹⁰ Ghemawat, S.¹¹ Goodfellow, I.¹² Harp, A.¹³ Irving, G.¹⁴ Isard, M.¹⁵ Jia, Y.¹⁶ Jozefowicz, R.¹⁷ Kaiser, L.¹⁸ Kudlur, M.¹⁹ Levenberg, J.²⁰ et al.

24
- 85083951076
- Adam: A method for stochastic optimization
- D. Kingma and J. Ba, "Adam: A method for stochastic optimization," in International Conference on Learning Representations, 2015.
- (2015) International Conference on Learning Representations
- Kingma, D.¹ Ba, J.²

25
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

26
- 38649140222
- Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
- T. Toda, A. W. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model," Speech Commun., vol. 50, no. 3, pp. 215-227, Mar. 2008.
- (2008) Speech Commun. , vol.50 , Issue.3 , pp. 215-227
- Toda, T.¹ Black, A.W.² Tokuda, K.³

27
- 84999828343
- Real-time control of an articulatory-based speech synthesizer for brain computer interfaces
- F. Bocquelet, T. Hueber, L. Girin, C. Savariaux, and B. Yvert, "Real-time control of an articulatory-based speech synthesizer for brain computer interfaces," PLOS Computational Biology, vol. 12, no. 11, p. e1005119, 2016.
- (2016) PLOS Computational Biology , vol.12 , Issue.11 , pp. e1005119
- Bocquelet, F.¹ Hueber, T.² Girin, L.³ Savariaux, C.⁴ Yvert, B.⁵

28
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, 2000, pp. 1315-1318.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

29
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks," in Proc. ICASSP. IEEE, 2013, pp. 7962-7966.
- (2013) Proc. ICASSP , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

30
- 84946074523
- The effect of neural networks in statistical parametric speech synthesis
- K. Hashimoto, K. Oura, Y. Nankaku, and K. Tokuda, "The effect of neural networks in statistical parametric speech synthesis," in Proc. ICASSP, 2015, pp. 4455-4459.
- (2015) Proc. ICASSP , pp. 4455-4459
- Hashimoto, K.¹ Oura, K.² Nankaku, Y.³ Tokuda, K.⁴

31
- 0031268931
- Bidirectional recurrent neural networks
- M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," IEEE Trans. Signal Process., vol. 45, no. 11, pp. 2673-2681, 1997.
- (1997) IEEE Trans. Signal Process. , vol.45 , Issue.11 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

32
- 27744588611
- Framewise phoneme classification with bidirectional LSTM and other neural network architectures
- A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5, pp. 602-610, 2005.
- (2005) Neural Networks , vol.18 , Issue.5 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

33
- 0027247004
- Mel-cepstral distance measure for objective speech quality assessment
- R. Kubichek, "Mel-cepstral distance measure for objective speech quality assessment," in Proc. IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, 1993, pp. 125-128.
- (1993) Proc. IEEE Pacific Rim Conference on Communications, Computers and Signal Processing , pp. 125-128
- Kubichek, R.¹

34
- 84910067727
- Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography
- J. A. Gonzalez, L. A. Cheah, J. Bai, S. R. Ell, J. M. Gilbert, R. K. Moore, and P. D. Green, "Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography," in Proc. Interspeech, 2014, pp. 1018-1022.
- (2014) Proc. Interspeech , pp. 1018-1022
- Gonzalez, J.A.¹ Cheah, L.A.² Bai, J.³ Ell, S.R.⁴ Gilbert, J.M.⁵ Moore, R.K.⁶ Green, P.D.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.