Volume 36, 2016, Pages 274-293

Statistical conversion of silent articulation into audible speech using full-covariance HMM

Author keywords

Articulatory acoustic mapping; GMM; HMM; Silent speech interface; Ultrasound

Indexed keywords

LINGUISTICS; MAPPING; SPEECH; SPEECH PROCESSING; ULTRASONIC APPLICATIONS; ULTRASONICS

EID: 84949568613     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2015.03.005     Document Type: Article
Times cited: 44

References (47)
  • 1
    • 33947682260
    • Construction and control of a three-dimensional vocal tract model
    • Toulouse, France
    • P. Birkholz, D. Jackèl, and B. Kroger Construction and control of a three-dimensional vocal tract model Proceedings of ICASSP Toulouse, France 2006 873 876
    • (2006) Proceedings of ICASSP , pp. 873-876
    • Birkholz, P.1    Jackèl, D.2    Kroger, B.3
  • 3
    • 85032752352
    • Audiovisual speech processing
    • T. Chen Audiovisual speech processing Signal Process. Mag. IEEE 18 1 2001 9 21
    • (2001) Signal Process. Mag. IEEE , vol.18 , Issue.1 , pp. 9-21
    • Chen, T.1
  • 4
    • 85118743743
    • Statistical language modeling using the CMU-Cambridge toolkit
    • Rhodes, Greece
    • P. Clarkson, and R. Rosenfeld Statistical language modeling using the CMU-Cambridge toolkit Proceedings of Eurospeech Rhodes, Greece 1997 2707 2710
    • (1997) Proceedings of Eurospeech , pp. 2707-2710
    • Clarkson, P.1    Rosenfeld, R.2
  • 5
    • 0040319993
    • Vingt listes de dix phrases phonétiquement équilibrées
    • P. Combescure Vingt listes de dix phrases phonétiquement équilibrées Rev. Acoust. 14 56 1981 34 38
    • (1981) Rev. Acoust. , vol.14 , Issue.56 , pp. 34-38
    • Combescure, P.1
  • 6
    • 33947642146
    • Prospects for a silent speech interface using ultrasound imaging
    • Toulouse, France
    • B. Denby, Y. Oussar, G. Dreyfus, and M. Stone Prospects for a silent speech interface using ultrasound imaging Proceedings of ICASSP Toulouse, France 2006 365 368
    • (2006) Proceedings of ICASSP , pp. 365-368
    • Denby, B.1    Oussar, Y.2    Dreyfus, G.3    Stone, M.4
  • 8
    • 42949175762
    • Development of a (silent) speech recognition system for patients following laryngectomy
    • M.J. Fagan, S.R. Ell, J.M. Gilbert, E. Sarrazin, and P.M. Chapman Development of a (silent) speech recognition system for patients following laryngectomy Med. Eng. Phys. 30 4 2008 419 425
    • (2008) Med. Eng. Phys. , vol.30 , Issue.4 , pp. 419-425
    • Fagan, M.J.1    Ell, S.R.2    Gilbert, J.M.3    Sarrazin, E.4    Chapman, P.M.5
  • 10
    • 2142659020
    • Estimation of articulatory movements from speech acoustics using an HMM-based speech production model
    • S. Hiroya, and M. Honda Estimation of articulatory movements from speech acoustics using an HMM-based speech production model IEEE Trans. Speech Audio Process. 12 2 2004 175 185
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.2 , pp. 175-185
    • Hiroya, S.1    Honda, M.2
  • 13
    • 76849104115
    • Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips
    • T. Hueber, E.-L. Benaroya, G. Chollet, B. Denby, and M. Stone Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips Speech Commun. 52 4 2010 288 300
    • (2010) Speech Commun. , vol.52 , Issue.4 , pp. 288-300
    • Hueber, T.1    Benaroya, E.-L.2    Chollet, G.3    Denby, B.4    Stone, M.5
  • 14
    • 84865772217
    • Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface
    • Firenze, Italia
    • T. Hueber, E.-L. Benaroya, B. Denby, and G. Chollet Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface Proceedings of Interspeech Firenze, Italia 2011 593 596
    • (2011) Proceedings of Interspeech , pp. 593-596
    • Hueber, T.1    Benaroya, E.-L.2    Denby, B.3    Chollet, G.4
  • 15
    • 84878395809
    • Cross-speaker acoustic-to-articulatory inversion using phone-based trajectory HMM for pronunciation training
    • Portland, USA
    • T. Hueber, A. Ben Youssef, G. Bailly, P. Badin, and F. Elisei Cross-speaker acoustic-to-articulatory inversion using phone-based trajectory HMM for pronunciation training Proceedings of Interspeech Portland, USA 2012
    • (2012) Proceedings of Interspeech
    • Hueber, T.1    Ben Youssef, A.2    Bailly, G.3    Badin, P.4    Elisei, F.5
  • 16
    • 70450206214
    • Visuo-phonetic decoding using multi-stream and context-dependent models for an ultrasound-based silent speech interface
    • Brighton, England
    • T. Hueber, G. Chollet, B. Denby, G. Dreyfus, and M. Stone Visuo-phonetic decoding using multi-stream and context-dependent models for an ultrasound-based silent speech interface Proceedings of Interspeech Brighton, England 2009 640 643
    • (2009) Proceedings of Interspeech , pp. 640-643
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 17
    • 79956290540
    • Acquisition of ultrasound, video and acoustic speech data for a silent-speech interface application
    • Strasbourg, France
    • T. Hueber, G. Chollet, B. Denby, and M. Stone Acquisition of ultrasound, video and acoustic speech data for a silent-speech interface application Proceedings of International Seminar on Speech Production Strasbourg, France 2008 365 369
    • (2008) Proceedings of International Seminar on Speech Production , pp. 365-369
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Stone, M.4
  • 19
    • 79959839217
    • Impact of lack of acoustic feedback in EMG-based silent speech recognition
    • Makuhari, Japan
    • M. Janke, M. Wand, and T. Schultz Impact of lack of acoustic feedback in EMG-based silent speech recognition Proceedings of Interspeech Makuhari, Japan 2010 2686 2689
    • (2010) Proceedings of Interspeech , pp. 2686-2689
    • Janke, M.1    Wand, M.2    Schultz, T.3
  • 20
    • 6344254321
    • A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters
    • C.T. Kello, and D.C. Plaut A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters J. Acoust. Soc. Am. 116 4 2004 2354 2364
    • (2004) J. Acoust. Soc. Am. , vol.116 , Issue.4 , pp. 2354-2364
    • Kello, C.T.1    Plaut, D.C.2
  • 21
    • 77955426622
    • An analysis of HMM-based prediction of articulatory movements
    • Z.-H. Ling, K. Richmond, and J. Yamagishi An analysis of HMM-based prediction of articulatory movements Speech Commun. 52 10 2010 834 846
    • (2010) Speech Commun. , vol.52 , Issue.10 , pp. 834-846
    • Ling, Z.-H.1    Richmond, K.2    Yamagishi, J.3
  • 23
    • 0001792343
    • Compensatory articulation during speech: Evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model
    • Springer
    • S. Maeda Compensatory articulation during speech: evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model Speech Production and Speech Modelling 1990 Springer 131 149
    • (1990) Speech Production and Speech Modelling , pp. 131-149
    • Maeda, S.1
  • 24
    • 84867211725
    • Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Brisbane, Australia
    • T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory Proceedings of Interspeech Brisbane, Australia 2008 1076 1079
    • (2008) Proceedings of Interspeech , pp. 1076-1079
    • Muramatsu, T.1    Ohtani, Y.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 26
    • 0141520383
    • Non-audible murmur recognition input interface using stethoscopic microphone attached to the skin
    • Hong Kong, Hong Kong
    • Y. Nakajima, H. Kashioka, K. Shikano, and N. Campbell Non-audible murmur recognition input interface using stethoscopic microphone attached to the skin Proceedings of ICASSP Hong Kong, Hong Kong 2003 708 711
    • (2003) Proceedings of ICASSP , pp. 708-711
    • Nakajima, Y.1    Kashioka, H.2    Shikano, K.3    Campbell, N.4
  • 27
    • 0030245363
    • From HMM's to segment models: A unified view of stochastic modeling for speech recognition
    • M. Ostendorf, V.V. Digalakis, and O.A. Kimball From HMM's to segment models: a unified view of stochastic modeling for speech recognition IEEE Trans. Speech Audio Process. 4 5 1996 360 378
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 360-378
    • Ostendorf, M.1    Digalakis, V.V.2    Kimball, O.A.3
  • 28
    • 44949185845
    • A trajectory mixture density network for the acoustic-articulatory inversion mapping
    • Pittsburgh, PA, USA
    • K. Richmond A trajectory mixture density network for the acoustic-articulatory inversion mapping Proceedings of Interspeech Pittsburgh, PA, USA 2006 577 580
    • (2006) Proceedings of Interspeech , pp. 577-580
    • Richmond, K.1
  • 29
    • 0022234383
    • Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition
    • Detroit, MI, USA
    • M. Russell, and R. Moore Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition Proceedings of ICASSP Detroit, MI, USA 1985 5 8
    • (1985) Proceedings of ICASSP , pp. 5-8
    • Russell, M.1    Moore, R.2
  • 30
    • 76849099234
    • Modeling coarticulation in EMG-based continuous speech recognition
    • T. Schultz, and M. Wand Modeling coarticulation in EMG-based continuous speech recognition Speech Commun. 52 4 2010 341 353
    • (2010) Speech Commun. , vol.52 , Issue.4 , pp. 341-353
    • Schultz, T.1    Wand, M.2
  • 31
    • 84890495160
    • Fast, low-artifact speech synthesis considering global variance
    • Vancouver, British Columbia, Canada
    • M. Shannon, and W. Byrne Fast, low-artifact speech synthesis considering global variance Proceedings of ICASSP Vancouver, British Columbia, Canada 2013 7869 7873
    • (2013) Proceedings of ICASSP , pp. 7869-7873
    • Shannon, M.1    Byrne, W.2
  • 34
    • 21844437086
    • A guide to analysing tongue motion from ultrasound images
    • M. Stone A guide to analysing tongue motion from ultrasound images Clin. Linguist. Phon. 19 6-7 2005 455 501
    • (2005) Clin. Linguist. Phon. , vol.19 , Issue.6-7 , pp. 455-501
    • Stone, M.1
  • 35
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Lang. Process. 15 8 2007 2222 2235
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 36
    • 38649140222
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • T. Toda, A.W. Black, and K. Tokuda Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model Speech Commun. 50 3 2008 215 227
    • (2008) Speech Commun. , vol.50 , Issue.3 , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 37
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inf. Syst. E90-D 2007 816 824
    • (2007) IEICE Trans. Inf. Syst. , vol.E90-D , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 38
  • 40
    • 79960267600
    • Session-independent EMG-based speech recognition
    • Rome, Italy
    • M. Wand, and T. Schultz Session-independent EMG-based speech recognition Proceedings of Biosignals Rome, Italy 2011 295 300
    • (2011) Proceedings of Biosignals , pp. 295-300
    • Wand, M.1    Schultz, T.2
  • 44
    • 84865795783
    • Toward a multi-speaker visual articulatory feedback system
    • Firenze, Italia
    • A.B. Youssef, T. Hueber, P. Badin, and G. Bailly Toward a multi-speaker visual articulatory feedback system Proceedings of Interspeech Firenze, Italia 2011 589 592
    • (2011) Proceedings of Interspeech , pp. 589-592
    • Youssef, A.B.1    Hueber, T.2    Badin, P.3    Bailly, G.4
  • 45
    • 0036870577
    • Speckle reducing anisotropic diffusion
    • Y.J. Yu, and S.T. Acton Speckle reducing anisotropic diffusion IEEE Trans. Image Process. 11 11 2002 1260 1270
    • (2002) IEEE Trans. Image Process. , vol.11 , Issue.11 , pp. 1260-1270
    • Yu, Y.J.1    Acton, S.T.2
  • 46
    • 78149260085
    • Continuous stochastic feature mapping based on trajectory HMMs
    • H. Zen, Y. Nankaku, and K. Tokuda Continuous stochastic feature mapping based on trajectory HMMs IEEE Trans. Audio Speech Lang. Process. 19 2 2011 417 430
    • (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.2 , pp. 417-430
    • Zen, H.1    Nankaku, Y.2    Tokuda, K.3
  • 47
    • 67650153217
    • Acoustic-articulatory modelling with the trajectory HMM
    • L. Zhang, and S. Renals Acoustic-articulatory modelling with the trajectory HMM IEEE Signal Process. Lett. 15 2008 245 248
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 245-248
    • Zhang, L.1    Renals, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.