SCOPUS Information Search Platform

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Volume 1, 2012, Pages 86-89

Iterative MMSE estimation of vocal tract length normalization factors for voice transformation

Authors (3): Erro, Daniel (a); Navas, Eva (a); Hernáez, Inma (a)

(a) University of the Basque Country UPV/EHU (Spain)

Author keywords

Frequency warping plus amplitude scaling; Speech synthesis; Vocal tract length normalization; Voice conversion

Indexed keywords

AMPLITUDE CORRECTION; AMPLITUDE SCALING; CEPSTRAL DOMAIN; CONVERSION ACCURACIES; ITERATIVE PROCEDURES; SINGLE PARAMETER; VOCAL TRACT LENGTH NORMALIZATION; VOICE CONVERSION;

ITERATIVE METHODS; SPEECH PROCESSING; SPEECH SYNTHESIS;

SPEECH RECOGNITION;

EID: 84878409257
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 9

References (16)

1
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, E. Moulines, "Continuous probabilistic transform for voice conversion", IEEE Trans. Speech and Audio Process., vol. 6, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Process. , vol.6 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

2
- 4444285698
- High resolution voice transformation
- A. Kain, "High resolution voice transformation", Ph.D. thesis, Oregon Health & Science University, 2001.
- (2001) Ph.D. thesis, Oregon Health & Science University
- Kain, A.¹

3
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A.W. Black, K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory", IEEE Trans. Audio, Speech, Lang. Process., vol. 15(8), pp. 2222-2235, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

4
- 85010815133
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines, J.P. Tubach, "Voice transformation using PSOLA technique", Speech Commun., vol. 1, pp. 145-148, 1992.
- (1992) Speech Commun. , vol.1 , pp. 145-148
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

5
- 84948175540
- VTLN-based voice conversion
- D. Sündermann, H. Ney, "VTLN-based voice conversion", Proc. IEEE Symp. Signal Process. Inf. Technol., pp. 556-559, 2003.
- (2003) Proc. IEEE Symp. Signal Process. Inf. Technol. , pp. 556-559
- Sündermann, D.¹ Ney, H.²

6
- 77953727123
- Voice conversion based on weighted frequency warping
- D. Erro, A. Moreno, A. Bonafonte, "Voice conversion based on weighted frequency warping", IEEE Trans. Audio, Speech, Lang. Process., vol. 18(5), pp. 922-931, 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

7
- 80051619373
- One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model
- M. Tamura, M. Morita, T. Kagoshima, M. Akamine, "One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model", Proc. ICASSP, pp. 5124-5127, 2011.
- (2011) Proc. ICASSP , pp. 5124-5127
- Tamura, M.¹ Morita, M.² Kagoshima, T.³ Akamine, M.⁴

8
- 84865717274
- Spectral envelope transformation using DFW and amplitude scaling for voice conversion with parallel or nonparallel corpora
- E. Godoy, O. Rosec, T. Chonavel, "Spectral envelope transformation using DFW and amplitude scaling for voice conversion with parallel or nonparallel corpora", Proc. Interspeech, pp. 673-676, 2011.
- (2011) Proc. Interspeech , pp. 673-676
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

9
- 4544373000
- Voice characteristics conversion for TTS using reverse VTLN
- M. Eichner, M. Wolff, R. Hoffmann, "Voice characteristics conversion for TTS using reverse VTLN", Proc. ICASSP, pp. 17-20, 2004.
- (2004) Proc. ICASSP , pp. 17-20
- Eichner, M.¹ Wolff, M.² Hoffmann, R.³

10
- 0009589496
- Vocal tract length normalization for large vocabulary continuous speech recognition
- P. Zhan, A. Waibel, "Vocal tract length normalization for large vocabulary continuous speech recognition", CMU computer science technical reports, 1997.
- (1997) CMU Computer Science Technical Reports
- Zhan, P.¹ Waibel, A.²

11
- 0032657747
- Speaker adaptation with all-pass transforms
- J. McDonough, W. Byrne, "Speaker adaptation with all-pass transforms", Proc. ICASSP, pp. 757-760, 1999.
- (1999) Proc. ICASSP , pp. 757-760
- McDonough, J.¹ Byrne, W.²

12
- 27644522706
- Vocal tract normalization equals linear transformation in cepstral space
- M. Pitz, H. Ney, "Vocal tract normalization equals linear transformation in cepstral space", IEEE Trans. Speech and Audio Process., vol. 13(5), pp. 930-944, 2005.
- (2005) IEEE Trans. Speech and Audio Process. , vol.13 , Issue.5 , pp. 930-944
- Pitz, M.¹ Ney, H.²

13
- 51449094035
- Rapid vocal tract length normalization using maximum likelihood estimation
- T. Emori, K. Shinoda, "Rapid vocal tract length normalization using maximum likelihood estimation", Proc. Eurospeech, pp. 1649-1652, 2001.
- (2001) Proc. Eurospeech , pp. 1649-1652
- Emori, T.¹ Shinoda, K.²

14
- 0004319970
- Acoustical and environmental robustness for automatic speech recognition
- A. Acero, "Acoustical and environmental robustness for automatic speech recognition", Ph.D. dissertation, Carnegie Mellon Univ., 1990.
- (1990) Ph.D. dissertation, Carnegie Mellon Univ.
- Acero, A.¹

15
- 84870724307
- Online, "CMU ARCTIC speech synthesis databases", available in http://festvox.org/cmu-arctic/
- CMU ARCTIC Speech Synthesis Databases

16
- 80051629671
- HNM-based MFCC+f0 extractor applied to statistical speech synthesis
- D. Erro, I. Sainz, E. Navas, I. Hernaez, "HNM-based MFCC+f0 extractor applied to statistical speech synthesis", Proc. ICASSP, pp. 4728-4731, 2011. Available at: http://aholab.ehu.es/ahocoder
- (2011) Proc. ICASSP , pp. 4728-4731
- Erro, D.¹ Sainz, I.² Navas, E.³ Hernaez, I.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.