



Volume 39, 2016, Pages 67-87

A silent speech system based on permanent magnet articulography and direct synthesis

Author keywords

Augmentative and alternative communication; Permanent magnet articulography; Silent speech interfaces; Speech rehabilitation; Speech synthesis

Indexed keywords

HUMAN REHABILITATION ENGINEERING; LINEAR TRANSFORMATIONS; MAGNETS; MATHEMATICAL TRANSFORMATIONS; METADATA; PERMANENT MAGNETS; SPEECH; SPEECH COMMUNICATION; SPEECH INTELLIGIBILITY; SPEECH RECOGNITION; SPEECH SYNTHESIS;

EID: 84962110277     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2016.02.002     Document Type: Article
Times cited: 58

References (62)
  • 1
    • 84867169172
    • Exploring the predictability of non-unique acoustic-to-articulatory mappings
    • G. Ananthakrishnan, O. Engwall, and D. Neiberg Exploring the predictability of non-unique acoustic-to-articulatory mappings IEEE Trans. Audio Speech Lang. Process. 20 December (10) 2012 2672 2682
    • (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.10 December , pp. 2672-2682
    • Ananthakrishnan, G.1    Engwall, O.2    Neiberg, D.3
  • 3
    • 0017968519
    • Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique
    • B.S. Atal, J.J. Chang, M.V. Mathews, and J.W. Tukey Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique J. Acoust. Soc. Am. 63 5 1978 1535 1555
    • (1978) J. Acoust. Soc. Am. , vol.63 , Issue.5 , pp. 1535-1555
    • Atal, B.S.1    Chang, J.J.2    Mathews, M.V.3    Tukey, J.W.4
  • 4
    • 0036656541
    • Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images
    • P. Badin, G. Bailly, L. Revéret, M. Baciu, C. Segebarth, and C. Savariaux Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images J. Phon. 30 July (3) 2002 533 553
    • (2002) J. Phon. , vol.30 , Issue.3 July , pp. 533-553
    • Badin, P.1    Bailly, G.2    Revéret, L.3    Baciu, M.4    Segebarth, C.5    Savariaux, C.6
  • 10
    • 80051905631
    • Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex
    • 2011, May
    • J.S. Brumberg, E.J. Wright, D.S. Andreasen, F.H. Guenther, and P.R. Kennedy Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex Front. Neurosci. 5 2011, May 1 12
    • (2011) Front. Neurosci. , vol.5 , pp. 1-12
    • Brumberg, J.S.1    Wright, E.J.2    Andreasen, D.S.3    Guenther, F.H.4    Kennedy, P.R.5
  • 13
    • 0027530250
    • SIMPLS: An alternative approach to partial least squares regression
    • S. De Jong SIMPLS: an alternative approach to partial least squares regression Chemomet. Intell. Lab. Syst. 18 3 1993 251 263
    • (1993) Chemomet. Intell. Lab. Syst. , vol.18 , Issue.3 , pp. 251-263
    • De Jong, S.1
  • 15
    • 84910028724
    • Towards a practical silent speech recognition system
    • Y. Deng, J.T. Heaton, and G.S. Meltzner Towards a practical silent speech recognition system Proc. Interspeech. 2014 1164 1168
    • (2014) Proc. Interspeech. , pp. 1164-1168
    • Deng, Y.1    Heaton, J.T.2    Meltzner, G.S.3
  • 17
    • 42949175762
    • Development of a (silent) speech recognition system for patients following laryngectomy
    • M.J. Fagan, S.R. Ell, J.M. Gilbert, E. Sarrazin, and P.M. Chapman Development of a (silent) speech recognition system for patients following laryngectomy Med. Eng. Phys. 30 4 2008 419 425
    • (2008) Med. Eng. Phys. , vol.30 , Issue.4 , pp. 419-425
    • Fagan, M.J.1    Ell, S.R.2    Gilbert, J.M.3    Sarrazin, E.4    Chapman, P.M.5
  • 18
    • 85016140477
    • An adaptive algorithm for Mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai An adaptive algorithm for Mel-cepstral analysis of speech Proc. ICASSP. 1992 137 140
    • (1992) Proc. ICASSP. , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 19
    • 0003744820
    • The EM algorithm for mixtures of factor analyzers
    • University of Toronto
    • Z. Ghahramani, and G.E. Hinton The EM algorithm for mixtures of factor analyzers. Tech. Rep. CRG-TR-96-1 1996 University of Toronto
    • (1996) Tech. Rep. CRG-TR-96-1
    • Ghahramani, Z.1    Hinton, G.E.2
  • 21
    • 84910067727
    • Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography
    • J.A. Gonzalez, L.A. Cheah, J. Bai, S.R. Ell, J.M. Gilbert, R.K. Moore, and P.D. Green Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography Proc. Interspeech. 2014 1018 1022
    • (2014) Proc. Interspeech. , pp. 1018-1022
    • Gonzalez, J.A.1    Cheah, L.A.2    Bai, J.3    Ell, S.R.4    Gilbert, J.M.5    Moore, R.K.6    Green, P.D.7
  • 22
    • 84962118183
    • A non-parametric articulatory-to-acoustic conversion system for silent speech using shared Gaussian process dynamical models
    • J.A. Gonzalez, P.D. Green, R.K. Moore, L.A. Cheah, and J.M. Gilbert A non-parametric articulatory-to-acoustic conversion system for silent speech using shared Gaussian process dynamical models Proc. UK Speech. 2015 11
    • (2015) Proc. UK Speech. , pp. 11
    • Gonzalez, J.A.1    Green, P.D.2    Moore, R.K.3    Cheah, L.A.4    Gilbert, J.M.5
  • 27
    • 84878391491
    • Continuous articulatory-to-acoustic mapping using phone-based trajectory HMM for a silent speech interface
    • T. Hueber, G. Bailly, and B. Denby Continuous articulatory-to-acoustic mapping using phone-based trajectory HMM for a silent speech interface Proc. Interspeech. 2012 723 726
    • (2012) Proc. Interspeech. , pp. 723-726
    • Hueber, T.1    Bailly, G.2    Denby, B.3
  • 28
    • 76849104115
    • Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips
    • T. Hueber, E.-L. Benaroya, G. Chollet, B. Denby, G. Dreyfus, and M. Stone Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips Speech Commun. 52 4 2010 288 300
    • (2010) Speech Commun. , vol.52 , Issue.4 , pp. 288-300
    • Hueber, T.1    Benaroya, E.-L.2    Chollet, G.3    Denby, B.4    Dreyfus, G.5    Stone, M.6
  • 29
    • 84865772217
    • Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface
    • T. Hueber, E.-L. Benaroya, B. Denby, and G. Chollet Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface Proc. Interspeech. 2011 593 596
    • (2011) Proc. Interspeech. , pp. 593-596
    • Hueber, T.1    Benaroya, E.-L.2    Denby, B.3    Chollet, G.4
  • 30
    • 84867195703
    • Phone recognition from ultrasound and optical video sequences for a silent speech interface
    • T. Hueber, G. Chollet, B. Denby, G. Dreyfus, and M. Stone Phone recognition from ultrasound and optical video sequences for a silent speech interface Proc. Interspeech. 2008 2032 2035
    • (2008) Proc. Interspeech. , pp. 2032-2035
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 31
  • 33
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. De Cheveigne Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: possible role of a repetitive structure in sounds Speech Commun. 27 April (3) 1999 187 207
    • (1999) Speech Commun. , vol.27 , Issue.3 April , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 35
    • 0002560960
    • A database for speaker-independent digit recognition
    • R. Leonard A database for speaker-independent digit recognition Proc. ICASSP. 1984 328 331
    • (1984) Proc. ICASSP. , pp. 328-331
    • Leonard, R.1
  • 36
    • 0020281396
    • A digital simulation method of the vocal-tract system
    • S. Maeda A digital simulation method of the vocal-tract system Speech Commun. 1 3 1982 199 229
    • (1982) Speech Commun. , vol.1 , Issue.3 , pp. 199-229
    • Maeda, S.1
  • 38
    • 84894161045
    • A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion
    • T. Moriguchi, T. Toda, M. Sano, H. Sato, G. Neubig, S. Sakti, and S. Nakamura A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion Proc. Interspeech. 2013 3072 3076
    • (2013) Proc. Interspeech. , pp. 3072-3076
    • Moriguchi, T.1    Toda, T.2    Sano, M.3    Sato, H.4    Neubig, G.5    Sakti, S.6    Nakamura, S.7
  • 39
    • 80052698826
    • Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech Speech Commun. 54 January (1) 2012 134 146
    • (2012) Speech Commun. , vol.54 , Issue.1 January , pp. 134-146
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 40
    • 84867222549
    • The acoustic to articulation mapping: Non-linear or non-unique?
    • D. Neiberg, G. Ananthakrishnan, and O. Engwall The acoustic to articulation mapping: non-linear or non-unique? Proc. Interspeech. 2008 1485 1488
    • (2008) Proc. Interspeech. , pp. 1485-1488
    • Neiberg, D.1    Ananthakrishnan, G.2    Engwall, O.3
  • 43
    • 51449098747
    • An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping
    • C. Qin, and M.Á. Carreira-Perpiñán An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping Proc. Interspeech. 2007 74 77
    • (2007) Proc. Interspeech. , pp. 74-77
    • Qin, C.1    Carreira-Perpiñán, M.Á.2
  • 44
    • 0019606728
    • An articulatory synthesizer for perceptual research
    • P. Rubin, T. Baer, and P. Mermelstein An articulatory synthesizer for perceptual research J. Acoust. Soc. Am. 70 2 1981 321 328
    • (1981) J. Acoust. Soc. Am. , vol.70 , Issue.2 , pp. 321-328
    • Rubin, P.1    Baer, T.2    Mermelstein, P.3
  • 45
    • 0028259480
    • Techniques for estimating vocal-tract shapes from the speech signal
    • J. Schroeter, and M.M. Sondhi Techniques for estimating vocal-tract shapes from the speech signal IEEE Trans. Speech Audio Process. 2 1 1994 133 150
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.1 , pp. 133-150
    • Schroeter, J.1    Sondhi, M.M.2
  • 46
    • 76849099234
    • Modeling coarticulation in EMG-based continuous speech recognition
    • T. Schultz, and M. Wand Modeling coarticulation in EMG-based continuous speech recognition Speech Commun. 52 April (4) 2010 341 353
    • (2010) Speech Commun. , vol.52 , Issue.4 April , pp. 341-353
    • Schultz, T.1    Wand, M.2
  • 47
  • 48
    • 33646779506
    • Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    • T. Toda, A.W. Black, and K. Tokuda Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter Proc. ICASSP. 2005 9 12
    • (2005) Proc. ICASSP. , pp. 9-12
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 49
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Lang. Process. 15 November (8) 2007 2222 2235
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.8 November , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 50
    • 38649140222
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • T. Toda, A.W. Black, and K. Tokuda Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model Speech Commun. 50 March (3) 2008 215 227
    • (2008) Speech Commun. , vol.50 , Issue.3 March , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 51
    • 84878390910
    • Implementation of computationally efficient real-time voice conversion
    • T. Toda, T. Muramatsu, and H. Banno Implementation of computationally efficient real-time voice conversion Proc. Interspeech 2012 94 97
    • (2012) Proc. Interspeech , pp. 94-97
    • Toda, T.1    Muramatsu, T.2    Banno, H.3
  • 52
    • 84865698185
    • Statistical voice conversion techniques for body-conducted unvoiced speech enhancement
    • T. Toda, M. Nakagiri, and K. Shikano Statistical voice conversion techniques for body-conducted unvoiced speech enhancement IEEE Trans. Audio Speech Lang. Process. 20 November (9) 2012 2505 2517
    • (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.9 November , pp. 2505-2517
    • Toda, T.1    Nakagiri, M.2    Shikano, K.3
  • 55
    • 33745288610
    • A support vector approach to the acoustic-to-articulatory mapping
    • A. Toutios, and K.G. Margaritis A support vector approach to the acoustic-to-articulatory mapping Proc. Interspeech. 2005 3221 3224
    • (2005) Proc. Interspeech. , pp. 3221-3224
    • Toutios, A.1    Margaritis, K.G.2
  • 56
    • 84906226748
    • Articulatory synthesis of French connected speech from EMA data
    • A. Toutios, and S. Narayanan Articulatory synthesis of French connected speech from EMA data Proc. Interspeech. 2013 2738 2742
    • (2013) Proc. Interspeech. , pp. 2738-2742
    • Toutios, A.1    Narayanan, S.2
  • 57
    • 79959617904
    • Estimating the control parameters of an articulatory model from electromagnetic articulograph data
    • A. Toutios, S. Ouni, and Y. Laprie Estimating the control parameters of an articulatory model from electromagnetic articulograph data J. Acoust. Soc. Am. 129 5 2011 3245 3257
    • (2011) J. Acoust. Soc. Am. , vol.129 , Issue.5 , pp. 3245-3257
    • Toutios, A.1    Ouni, S.2    Laprie, Y.3
  • 58
    • 84907468717
    • Tackling speaking mode varieties in EMG-based speech recognition
    • M. Wand, M. Janke, and T. Schultz Tackling speaking mode varieties in EMG-based speech recognition IEEE Trans. Bio-Med. Eng. 61 October (10) 2014 2515 2526
    • (2014) IEEE Trans. Bio-Med. Eng. , vol.61 , Issue.10 October , pp. 2515-2526
    • Wand, M.1    Janke, M.2    Schultz, T.3
  • 59
    • 80051603386
    • Analysis of phone confusion in EMG-based speech recognition
    • M. Wand, and T. Schultz Analysis of phone confusion in EMG-based speech recognition Proc. ICASSP. 2011 757 760
    • (2011) Proc. ICASSP. , pp. 757-760
    • Wand, M.1    Schultz, T.2
  • 61
    • 84910091101
    • Conversion from facial myoelectric signals to speech: A unit selection approach
    • M. Zahner, M. Janke, M. Wand, and T. Schultz Conversion from facial myoelectric signals to speech: a unit selection approach Proc. Interspeech. 2014 1184 1188
    • (2014) Proc. Interspeech. , pp. 1184-1188
    • Zahner, M.1    Janke, M.2    Wand, M.3    Schultz, T.4
  • 62
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A.W. Black Statistical parametric speech synthesis Speech Commun. 51 November (11) 2009 1039 1064
    • (2009) Speech Commun. , vol.51 , Issue.11 November , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.