SCOPUS 정보 검색 플랫폼

Computer Speech and Language

Volumn 36, Issue , 2016, Pages 274-293

Statistical conversion of silent articulation into audible speech using full-covariance HMM

(2) Hueber, Thomas a,b Bailly, Gérard a,b

a UNIV GRENOBLE ALPES (France)

b CNRS (France)

Author keywords

Articulatory acoustic mapping; GMM; HMM; Silent speech interface; Ultrasound

Indexed keywords

LINGUISTICS; MAPPING; SPEECH; SPEECH PROCESSING; ULTRASONIC APPLICATIONS; ULTRASONICS;

ACOUSTIC MAPPING; DISCRIMINATION TESTS; GMM; HMM; LINGUISTIC KNOWLEDGE; OBJECTIVE EVALUATION; SILENT SPEECH INTERFACES; SPECTRAL DISTORTIONS;

SPEECH INTELLIGIBILITY;

EID: 84949568613 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2015.03.005 Document Type: Article

Times cited : (44)

References (47)

1
- 33947682260
- Construction and control of a three-dimensional vocal tract model
- Toulouse, France
- P. Birkholz, D. Jackèl, and B. Kroger Construction and control of a three-dimensional vocal tract model Proceedings of ICASSP Toulouse, France 2006 873 876
- (2006) Proceedings of ICASSP , pp. 873-876
- Birkholz, P.¹ Jackèl, D.² Kroger, B.³

2
- 77953171897
- Selecting training inputs via greedy rank covering
- Philadelphia, PA, USA
- A.L. Buchsbaum, and J.P. van Santen Selecting training inputs via greedy rank covering Proceedings of the 7th Annual ACM-SIAM Symposium on Discrete Algorithms Philadelphia, PA, USA 1996 288 295
- (1996) Proceedings of the 7th Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 288-295
- Buchsbaum, A.L.¹ Van Santen, J.P.²

3
- 85032752352
- Audiovisual speech processing
- T. Chen Audiovisual speech processing Signal Process. Mag. IEEE 18 1 2001 9 21
- (2001) Signal Process. Mag. IEEE , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

4
- 85118743743
- Statistical language modeling using the CMU-Cambridge toolkit
- Rhodes, Greece
- P. Clarkson, and R. Rosenfeld Statistical language modeling using the CMU-Cambridge toolkit Proceedings of Eurospeech Rhodes, Greece 1997 2707 2710
- (1997) Proceedings of Eurospeech , pp. 2707-2710
- Clarkson, P.¹ Rosenfeld, R.²

5
- 0040319993
- Vingt listes de dix phrases phonétiquement équilibrées
- P. Combescure Vingt listes de dix phrases phonétiquement équilibrées Rev. Acoust. 14 56 1981 34 38
- (1981) Rev. Acoust. , vol.14 , Issue.56 , pp. 34-38
- Combescure, P.¹

6
- 33947642146
- Prospects for a silent speech interface using ultrasound imaging
- Toulouse, France
- B. Denby, Y. Oussar, G. Dreyfus, and M. Stone Prospects for a silent speech interface using ultrasound imaging Proceedings of ICASSP Toulouse, France 2006 365 368
- (2006) Proceedings of ICASSP , pp. 365-368
- Denby, B.¹ Oussar, Y.² Dreyfus, G.³ Stone, M.⁴

7
- 76849116340
- Silent speech interfaces
- B. Denby, T. Schultz, K. Honda, T. Hueber, J. Gilbert, and J. Brumberg Silent speech interfaces Speech Commun. 52 4 2010 270 287
- (2010) Speech Commun. , vol.52 , Issue.4 , pp. 270-287
- Denby, B.¹ Schultz, T.² Honda, K.³ Hueber, T.⁴ Gilbert, J.⁵ Brumberg, J.⁶

8
- 42949175762
- Development of a (silent) speech recognition system for patients following laryngectomy
- M.J. Fagan, S.R. Ell, J.M. Gilbert, E. Sarrazin, and P.M. Chapman Development of a (silent) speech recognition system for patients following laryngectomy Med. Eng. Phys. 30 4 2008 419 425
- (2008) Med. Eng. Phys. , vol.30 , Issue.4 , pp. 419-425
- Fagan, M.J.¹ Ell, S.R.² Gilbert, J.M.³ Sarrazin, E.⁴ Chapman, P.M.⁵

9
- 78449253410
- Isolated word recognition of silent speech using magnetic implants and sensors
- J.M. Gilbert, S.I. Rybchenko, R. Hofe, S.R. Ell, M.J. Fagan, R.K. Moore, and P. Green Isolated word recognition of silent speech using magnetic implants and sensors Med. Eng. Phys. 32 10 2010 1189 1197
- (2010) Med. Eng. Phys. , vol.32 , Issue.10 , pp. 1189-1197
- Gilbert, J.M.¹ Rybchenko, S.I.² Hofe, R.³ Ell, S.R.⁴ Fagan, M.J.⁵ Moore, R.K.⁶ Green, P.⁷

10
- 2142659020
- Estimation of articulatory movements from speech acoustics using an HMM-based speech production model
- S. Hiroya, and M. Honda Estimation of articulatory movements from speech acoustics using an HMM-based speech production model IEEE Trans. Speech Audio Process. 12 2 2004 175 185
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.2 , pp. 175-185
- Hiroya, S.¹ Honda, M.²

11
- 34547554405
- Eigentongue feature extraction for an ultrasound-based silent speech interface
- Honolulu, USA
- T. Hueber, G. Aversano, G. Chollet, B. Denby, G. Dreyfus, Y. Oussar, P. Roussel, and M. Stone Eigentongue feature extraction for an ultrasound-based silent speech interface Proceedings of ICASSP Honolulu, USA 2007 1245 1248
- (2007) Proceedings of ICASSP , pp. 1245-1248
- Hueber, T.¹ Aversano, G.² Chollet, G.³ Denby, B.⁴ Dreyfus, G.⁵ Oussar, Y.⁶ Roussel, P.⁷ Stone, M.⁸

12
- 84949628676
- Differences in articulatory strategies between silent, whispered and normal speech? A pilot study using electromagnetic articulography
- Montreal, Canada
- T. Hueber, P. Badin, C. Savariaux, C. Vilain, and G. Bailly Differences in articulatory strategies between silent, whispered and normal speech? A pilot study using electromagnetic articulography Proceedings of International Seminar on Speech Production (ISSP) Montreal, Canada 2010
- (2010) Proceedings of International Seminar on Speech Production (ISSP)
- Hueber, T.¹ Badin, P.² Savariaux, C.³ Vilain, C.⁴ Bailly, G.⁵

13
- 76849104115
- Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips
- T. Hueber, E.-L. Benaroya, G. Chollet, B. Denby, and M. Stone Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips Speech Commun. 52 4 2010 288 300
- (2010) Speech Commun. , vol.52 , Issue.4 , pp. 288-300
- Hueber, T.¹ Benaroya, E.-L.² Chollet, G.³ Denby, B.⁴ Stone, M.⁵

14
- 84865772217
- Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface
- Firenze, Italia
- T. Hueber, E.-L. Benaroya, B. Denby, and G. Chollet Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface Proceedings of Interspeech Firenze, Italia 2011 593 596
- (2011) Proceedings of Interspeech , pp. 593-596
- Hueber, T.¹ Benaroya, E.-L.² Denby, B.³ Chollet, G.⁴

15
- 84878395809
- Cross-speaker acoustic-to-articulatory inversion using phone-based trajectory HMM for pronunciation training
- Portland, USA
- T. Hueber, A. Ben Youssef, G. Bailly, P. Badin, and F. Elisei Cross-speaker acoustic-to-articulatory inversion using phone-based trajectory HMM for pronunciation training Proceedings of Interspeech Portland, USA 2012
- (2012) Proceedings of Interspeech
- Hueber, T.¹ Ben Youssef, A.² Bailly, G.³ Badin, P.⁴ Elisei, F.⁵

16
- 70450206214
- Visuo-phonetic decoding using multi-stream and context-dependent models for an ultrasound-based silent speech interface
- Brighton, England
- T. Hueber, G. Chollet, B. Denby, G. Dreyfus, and M. Stone Visuo-phonetic decoding using multi-stream and context-dependent models for an ultrasound-based silent speech interface Proceedings of Interspeech Brighton, England 2009 640 643
- (2009) Proceedings of Interspeech , pp. 640-643
- Hueber, T.¹ Chollet, G.² Denby, B.³ Dreyfus, G.⁴ Stone, M.⁵

17
- 79956290540
- Acquisition of ultrasound, video and acoustic speech data for a silent-speech interface application
- Strasbourg, France
- T. Hueber, G. Chollet, B. Denby, and M. Stone Acquisition of ultrasound, video and acoustic speech data for a silent-speech interface application Proceedings of International Seminar on Speech Production Strasbourg, France 2008 365 369
- (2008) Proceedings of International Seminar on Speech Production , pp. 365-369
- Hueber, T.¹ Chollet, G.² Denby, B.³ Stone, M.⁴

18
- 0020703324
- Mel log spectrum approximation (MLSA) filter for speech synthesis
- S. Imai, K. Sumita, and C. Furuichi Mel log spectrum approximation (MLSA) filter for speech synthesis Electron. Commun. Jpn. Part I Commun. 66 1983 10 18
- (1983) Electron. Commun. Jpn. Part i Commun. , vol.66 , pp. 10-18
- Imai, S.¹ Sumita, K.² Furuichi, C.³

19
- 79959839217
- Impact of lack of acoustic feedback in EMG-based silent speech recognition
- Makuhari, Japan
- M. Janke, M. Wand, and T. Schultz Impact of lack of acoustic feedback in EMG-based silent speech recognition Proceedings of Interspeech Makuhari, Japan 2010 2686 2689
- (2010) Proceedings of Interspeech , pp. 2686-2689
- Janke, M.¹ Wand, M.² Schultz, T.³

20
- 6344254321
- A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters
- C.T. Kello, and D.C. Plaut A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters J. Acoust. Soc. Am. 116 4 2004 2354 2364
- (2004) J. Acoust. Soc. Am. , vol.116 , Issue.4 , pp. 2354-2364
- Kello, C.T.¹ Plaut, D.C.²

21
- 77955426622
- An analysis of HMM-based prediction of articulatory movements
- Z.-H. Ling, K. Richmond, and J. Yamagishi An analysis of HMM-based prediction of articulatory movements Speech Commun. 52 10 2010 834 846
- (2010) Speech Commun. , vol.52 , Issue.10 , pp. 834-846
- Ling, Z.-H.¹ Richmond, K.² Yamagishi, J.³

22
- 68149157315
- Integrating articulatory features into HMM-based parametric speech synthesis
- Z.-H. Ling, K. Richmond, J. Yamagishi, and R.-H. Wang Integrating articulatory features into HMM-based parametric speech synthesis IEEE Trans. Audio Speech Lang. Process. 17 6 2009 1171 1185
- (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , Issue.6 , pp. 1171-1185
- Ling, Z.-H.¹ Richmond, K.² Yamagishi, J.³ Wang, R.-H.⁴

23
- 0001792343
- Compensatory articulation during speech: Evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model
- Springer
- S. Maeda Compensatory articulation during speech: evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model Speech Production and Speech Modelling 1990 Springer 131 149
- (1990) Speech Production and Speech Modelling , pp. 131-149
- Maeda, S.¹

24
- 84867211725
- Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Brisbane, Australia
- T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory Proceedings of Interspeech Brisbane, Australia 2008 1076 1079
- (2008) Proceedings of Interspeech , pp. 1076-1079
- Muramatsu, T.¹ Ohtani, Y.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

25
- 32244438249
- Non-audible murmur (NAM) recognition
- Y. Nakajima, H. Kashioka, N. Campbell, and K. Shikano Non-audible murmur (NAM) recognition IEICE Trans. Inf. Syst. 89 1 2006 1 8
- (2006) IEICE Trans. Inf. Syst. , vol.89 , Issue.1 , pp. 1-8
- Nakajima, Y.¹ Kashioka, H.² Campbell, N.³ Shikano, K.⁴

26
- 0141520383
- Non-audible murmur recognition input interface using stethoscopic microphone attached to the skin
- Hong Kong, Hong Kong
- Y. Nakajima, H. Kashioka, K. Shikano, and N. Campbell Non-audible murmur recognition input interface using stethoscopic microphone attached to the skin Proceedings of ICASSP Hong Kong, Hong Kong 2003 708 711
- (2003) Proceedings of ICASSP , pp. 708-711
- Nakajima, Y.¹ Kashioka, H.² Shikano, K.³ Campbell, N.⁴

27
- 0030245363
- From HMM's to segment models: A unified view of stochastic modeling for speech recognition
- M. Ostendorf, V.V. Digalakis, and O.A. Kimball From HMM's to segment models: a unified view of stochastic modeling for speech recognition IEEE Trans. Speech Audio Process. 4 5 1996 360 378
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.V.² Kimball, O.A.³

28
- 44949185845
- A trajectory mixture density network for the acoustic-articulatory inversion mapping
- Pittsburgh, PA, USA
- K. Richmond A trajectory mixture density network for the acoustic-articulatory inversion mapping Proceedings of Interspeech Pittsburgh, PA, USA 2006 577 580
- (2006) Proceedings of Interspeech , pp. 577-580
- Richmond, K.¹

29
- 0022234383
- Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition
- Detroit, MI, USA
- M. Russell, and R. Moore Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition Proceedings of ICASSP Detroit, MI, USA 1985 5 8
- (1985) Proceedings of ICASSP , pp. 5-8
- Russell, M.¹ Moore, R.²

30
- 76849099234
- Modeling coarticulation in EMG-based continuous speech recognition
- T. Schultz, and M. Wand Modeling coarticulation in EMG-based continuous speech recognition Speech Commun. 52 4 2010 341 353
- (2010) Speech Commun. , vol.52 , Issue.4 , pp. 341-353
- Schultz, T.¹ Wand, M.²

31
- 84890495160
- Fast, low-artifact speech synthesis considering global variance
- Vancouver, British Columbia, Canada
- M. Shannon, and W. Byrne Fast, low-artifact speech synthesis considering global variance Proceedings of ICASSP Vancouver, British Columbia, Canada 2013 7869 7873
- (2013) Proceedings of ICASSP , pp. 7869-7873
- Shannon, M.¹ Byrne, W.²

32
- 84878384520
- Ways to implement global variance in statistical speech synthesis
- Portland, USA
- H. Silén, E. Helander, J. Nurminen, and M. Gabbouj Ways to implement global variance in statistical speech synthesis Proceedings of Interspeech Portland, USA 2012
- (2012) Proceedings of Interspeech
- Silén, H.¹ Helander, E.² Nurminen, J.³ Gabbouj, M.⁴

33
- 0023165217
- A hybrid time-frequency domain articulatory speech synthesizer
- M.M. Sondhi, and J. Schroeter A hybrid time-frequency domain articulatory speech synthesizer IEEE Trans. Acoust. Speech Signal Process. 35 7 1987 955 967
- (1987) IEEE Trans. Acoust. Speech Signal Process. , vol.35 , Issue.7 , pp. 955-967
- Sondhi, M.M.¹ Schroeter, J.²

34
- 21844437086
- A guide to analysing tongue motion from ultrasound images
- M. Stone A guide to analysing tongue motion from ultrasound images Clin. Linguist. Phon. 19 6-7 2005 455 501
- (2005) Clin. Linguist. Phon. , vol.19 , Issue.6-7 , pp. 455-501
- Stone, M.¹

35
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Lang. Process. 15 8 2007 2222 2235
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

36
- 38649140222
- Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
- T. Toda, A.W. Black, and K. Tokuda Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model Speech Commun. 50 3 2008 215 227
- (2008) Speech Commun. , vol.50 , Issue.3 , pp. 215-227
- Toda, T.¹ Black, A.W.² Tokuda, K.³

37
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inf. Syst. E90-D 2007 816 824
- (2007) IEICE Trans. Inf. Syst. , vol.90 E -D , pp. 816-824
- Toda, T.¹ Tokuda, K.²

38
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- Istanbul, Turkey
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura Speech parameter generation algorithms for HMM-based speech synthesis Proceedings of ICASSP Istanbul, Turkey 2000 1315 1318
- (2000) Proceedings of ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

39
- 0026065565
- Eigenfaces for recognition
- M. Turk, and A. Pentland Eigenfaces for recognition J. Cogn. Neurosci. 3 1991 71 86
- (1991) J. Cogn. Neurosci. , vol.3 , pp. 71-86
- Turk, M.¹ Pentland, A.²

40
- 79960267600
- Session-independent EMG-based speech recognition
- Rome, Italy
- M. Wand, and T. Schultz Session-independent EMG-based speech recognition Proceedings of Biosignals Rome, Italy 2011 295 300
- (2011) Proceedings of Biosignals , pp. 295-300
- Wand, M.¹ Schultz, T.²

41
- 76849102615
- Evaluation of a helmet to hold an ultrasound probe
- New York, USA
- A. Wrench, J. Scobbie, and M. Linden Evaluation of a helmet to hold an ultrasound probe Presented at the Ultrafest IV New York, USA 2007
- (2007) Presented at the Ultrafest IV
- Wrench, A.¹ Scobbie, J.² Linden, M.³

42
- 84871767214
- Spatio-temporal inaccuracies of video-based ultrasound images of the tongue
- Ubatuba, Bresil
- A. Wrench, and J.M. Scobbie Spatio-temporal inaccuracies of video-based ultrasound images of the tongue Proceedings of the International Seminar on Speech Production Ubatuba, Bresil 2006 451 458
- (2006) Proceedings of the International Seminar on Speech Production , pp. 451-458
- Wrench, A.¹ Scobbie, J.M.²

43
- 0003822743
- S. Young The HTK Book 2005 http://htk.eng.cam.ac.uk/
- (2005) The HTK Book
- Young, S.¹

44
- 84865795783
- Toward a multi-speaker visual articulatory feedback system
- Firenze, Italia
- A.B. Youssef, T. Hueber, P. Badin, and G. Bailly Toward a multi-speaker visual articulatory feedback system Proceedings of Interspeech Firenze, Italia 2011 589 592
- (2011) Proceedings of Interspeech , pp. 589-592
- Youssef, A.B.¹ Hueber, T.² Badin, P.³ Bailly, G.⁴

45
- 0036870577
- Speckle reducing anisotropic diffusion
- Y.J. Yu, and S.T. Acton Speckle reducing anisotropic diffusion IEEE Trans. Image Process. 11 11 2002 1260 1270
- (2002) IEEE Trans. Image Process. , vol.11 , Issue.11 , pp. 1260-1270
- Yu, Y.J.¹ Acton, S.T.²

46
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMS
- H. Zen, Y. Nankaku, and K. Tokuda Continuous stochastic feature mapping based on trajectory HMMS IEEE Trans. Audio Speech Lang. Process. 19 2 2011 417 430
- (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

47
- 67650153217
- Acoustic-articulatory modelling with the trajectory HMM
- L. Zhang, and S. Renals Acoustic-articulatory modelling with the trajectory HMM IEEE Signal Process. Lett. 15 2008 245 248
- (2008) IEEE Signal Process. Lett. , vol.15 , pp. 245-248
- Zhang, L.¹ Renals, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.