SCOPUS 정보 검색 플랫폼

Circuits, Systems, and Signal Processing

Volumn 35, Issue 4, 2016, Pages 1283-1311

A Multi-level GMM-Based Cross-Lingual Voice Conversion Using Language-Specific Mixture Weights for Polyglot Synthesis

(4) Ramani, B a Actlin Jeeva, M P a Vijayalakshmi, P a Nagarajan, T a

a SSN COLLEGE OF ENGINEERING (India)

Author keywords

ABX listening test; Cross lingual voice conversion; GMM; Multilingual; Oversmoothing; Polyglot

Indexed keywords

COMPUTATIONAL LINGUISTICS; HIDDEN MARKOV MODELS; MARKOV PROCESSES; MIXTURES; SPEECH; SPEECH INTELLIGIBILITY; SPEECH RECOGNITION;

LISTENING TESTS; MULTILINGUAL; OVERSMOOTHING; POLYGLOT; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84959297010 PISSN: 0278081X EISSN: 15315878 Source Type: Journal
DOI: 10.1007/s00034-015-0118-1 Document Type: Article

Times cited : (17)

References (33)

1
- 0023739214
- Voice conversion through vector quantization, in International Conference on Acoustics
- M. Abe, S. Nakamura, K. Shikano, H. Kuwabara, Voice conversion through vector quantization, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1988), pp. 655–658
- (1988) Speech, and Signal Processing (ICASSP) , vol.1 , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 0025590356
- Cross-language voice conversion, in International Conference on Acoustics
- M. Abe, K. Shikano, H. Kuwabara, Cross-language voice conversion, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1990), pp. 345–348
- (1990) Speech, and Signal Processing (ICASSP) , vol.1 , pp. 345-348
- Abe, M.¹ Shikano, K.² Kuwabara, H.³

3
- 70450205902
- M. Charlier, Y. Ohtani, T. Toda, A. Moinet, T. Dutoit, Cross-language voice conversion based on eigenvoices, in INTERSPEECH (2009), pp. 1635–1638
- (2009) T. Dutoit, Cross-language voice conversion based on eigenvoices, in INTERSPEECH , pp. 1635-1638
- Charlier, M.¹ Ohtani, Y.² Toda, T.³ Moinet, A.⁴

4
- 77953727123
- Voice conversion based on weighted frequency warping
- D. Erro, A. Moreno, A. Bonafonte, Voice conversion based on weighted frequency warping. IEEE Trans. Audio Speech Lang. Process. 18(5), 922–931 (2010)
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

5
- 84857498745
- Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
- E. Godoy, O. Rosec, T. Chonavel, Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora. IEEE Trans. Audio Speech Lang. Process. 20(4), 1313–1323 (2012)
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.4 , pp. 1313-1323
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

6
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database, in International Conference on Acoustics
- A.J. Hunt, A.W. Black, Unit selection in a concatenative speech synthesis system using a large speech database, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1996), pp. 373–376
- (1996) Speech, and Signal Processing (ICASSP) , vol.1 , pp. 373-376
- Hunt, A.J.¹ Black, A.W.²

7
- 84906281888
- H.T. Hwang, Y. Tsao, H.M. Wang, Y.R. Wang, S.H. Chen, Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training, in INTERSPEECH (2013), pp. 3062–3066
- (2013) S.H. Chen, Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training, in INTERSPEECH , pp. 3062-3066
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴

8
- 0031623661
- Spectral voice conversion for text-to-speech synthesis. In International Conference on Acoustics
- A. Kain, M. Macon, Spectral voice conversion for text-to-speech synthesis. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1998), pp. 285–288
- (1998) Speech, and Signal Processing (ICASSP) , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.²

9
- 0030677481
- Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited, in International Conference on Acoustics
- H. Kawahara, Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2 (1997), pp. 1303–1306
- (1997) Speech, and Signal Processing (ICASSP) , vol.2 , pp. 1303-1306
- Kawahara, H.¹

10
- 85135141647
- E.K. Kim, S. Lee, Y.H. Oh, Hidden Markov model based voice conversion using dynamic characteristics of speaker, in EUROSPEECH (1997), pp. 2519–2522
- (1997) Y.H. Oh, Hidden Markov model based voice conversion using dynamic characteristics of speaker, in EUROSPEECH , pp. 2519-2522
- Kim, E.K.¹ Lee, S.²

11
- 33748468338
- New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer
- J. Latorre, K. Iwano, S. Furui, New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer. Speech Commun. 48(10), 1227–1242 (2006)
- (2006) Speech Commun. , vol.48 , Issue.10 , pp. 1227-1242
- Latorre, J.¹ Iwano, K.² Furui, S.³

12
- 84910097523
- Voice conversion: a critical survey
- A.F. Machado, M. Quieroz, Voice conversion: a critical survey. In: Sound and Music Computing, pp. 291–298 (2010)
- (2010) Sound and Music Computing , pp. 291-298
- Machado, A.F.¹ Quieroz, M.²

13
- 84892362620
- Ph.D, Dissertation
- T. Masuko, HMM-based speech synthesis and its applications. Ph.D. Dissertation, (2002)
- (2002) HMM-based speech synthesis and its applications
- Masuko, T.¹

14
- 41049089736
- Estimation of glottal closure instants in voiced speech using the DYPSA algorithm
- P.A. Naylor, A. Kounoudes, J. Gudnason, M. Brookes, Estimation of glottal closure instants in voiced speech using the DYPSA algorithm. IEEE Trans. Audio Speech Lang. Process. 15, 34–43 (2007)
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 34-43
- Naylor, P.A.¹ Kounoudes, A.² Gudnason, J.³ Brookes, M.⁴

15
- 0009435105
- Numerical recipes in C: the art of scientific computing (Chapter 14), 2nd edn. (Cambridge University Press
- W.H. Press, S.A. Teukolsky, W.T. Vetterling, B.P. Flannery, Numerical recipes in C: the art of scientific computing (Chapter 14), 2nd edn. (Cambridge University Press, Cambridge, 1992), pp. 615–619
- Cambridge , vol.1992 , pp. 615-619
- Press, W.H.¹ Teukolsky, S.A.² Vetterling, W.T.³ Flannery, B.P.⁴

16
- 84910051772
- B. Ramani, M.P. Actlin Jeeva, P. Vijayalakshmi, T. Nagarajan, Cross-lingual voice conversion-based polyglot speech synthesizer for Indian languages, in INTERSPEECH (2014), pp. 775–779
- (2014) T. Nagarajan, Cross-lingual voice conversion-based polyglot speech synthesizer for Indian languages, in INTERSPEECH , pp. 775-779
- Ramani, B.¹ Actlin Jeeva, M.P.² Vijayalakshmi, P.³

17
- 84977851321
- B. Ramani, S.L. Christina, G.A. Rachel, V.S. Solomi, M.K. Nandwana, A. Prakash, A. Shanmugam, R. Krishnan, S. Kishore, K. Samudravijaya, P. Vijayalakshmi, T. Nagarajan, H.A. Murthy, A common attribute based unified HTS framework for speech synthesis in Indian languages, in ISCA Workshop on Speech Synthesis (2013), pp. 291–296
- (2013) H.A. Murthy, A common attribute based unified HTS framework for speech synthesis in Indian languages, in ISCA Workshop on Speech Synthesis , pp. 291-296
- Ramani, B.¹ Christina, S.L.² Rachel, G.A.³ Solomi, V.S.⁴ Nandwana, M.K.⁵ Prakash, A.⁶ Shanmugam, A.⁷ Krishnan, R.⁸ Kishore, S.⁹ Samudravijaya, K.¹⁰ Vijayalakshmi, P.¹¹ Nagarajan, T.¹²

18
- 84872076059
- A computational phonetic model for indian language scripts
- A.K. Singh, A computational phonetic model for indian language scripts, in Constraints on Spelling Changes: Fifth International Workshop on Writing Systems (2006)
- (2006) in Constraints on Spelling Changes: Fifth International Workshop on Writing Systems
- Singh, A.K.¹

19
- 84894111243
- V.S. Solomi, S.L. Christina, G.A. Rachel, B. Ramani, P. Vijayalakshmi, T. Nagarajan, Analysis on acoustic similarities between tamil and english phonemes using product of likelihood-Gaussians for an HMM-based mixed-language synthesizer, in COCOSDA (2013), pp. 1–5
- (2013) T. Nagarajan, Analysis on acoustic similarities between tamil and english phonemes using product of likelihood-Gaussians for an HMM-based mixed-language synthesizer, in COCOSDA , pp. 1-5
- Solomi, V.S.¹ Christina, S.L.² Rachel, G.A.³ Ramani, B.⁴ Vijayalakshmi, P.⁵

20
- 70349197715
- Voice transformation: a survey, in International Conference on Acoustics
- Y. Stylianou, Voice transformation: a survey, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009), pp. 3585–3588
- (2009) Speech, and Signal Processing (ICASSP) , pp. 3585-3588
- Stylianou, Y.¹

21
- 85135175982
- Y. Stylianou, O. Cappe, E. Moulines, Statistical methods for voice quality transformation, in EUROSPEECH (1995), pp. 447–450
- (1995) E. Moulines, Statistical methods for voice quality transformation, in EUROSPEECH , pp. 447-450
- Stylianou, Y.¹ Cappe, O.²

22
- 33947623206
- Text-independent voice conversion based on unit selection, in International Conference on Acoustics
- D. Sundermann, H. Hoge, A. Bonafonte, H. Ney, A. Black, S. Narayanan, Text-independent voice conversion based on unit selection, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (2006), pp. I81–I84
- (2006) Speech, and Signal Processing (ICASSP) , vol.1 , pp. I81-I84
- Sundermann, D.¹ Hoge, H.² Bonafonte, A.³ Ney, H.⁴ Black, A.⁵ Narayanan, S.⁶

23
- 84946753271
- VTLN-based cross-language voice conversion, in IEEE Workshop on Automatic Speech Recognition and Understanding
- D. Sundermann, H. Ney, H. Hoge, VTLN-based cross-language voice conversion, in IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU’03 (2003), pp. 676–681
- (2003) ASRU’03 , pp. 676-681
- Sundermann, D.¹ Ney, H.² Hoge, H.³

24
- 84959258193
- Technology Development for Indian Languages Programme, DeitY (2013),. Last Accessed on 06 Sept 2014
- Technology Development for Indian Languages Programme, DeitY (2013), http://tdil.mit.gov.in/AboutUs.aspx. Last Accessed on 06 Sept 2014

25
- 0034842552
- Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of straight spectrum, in International Conference on Acoustics
- T. Toda, H. Saruwatari, K. Shikano, Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of straight spectrum, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2 (2001), pp. 841–844
- (2001) Speech, and Signal Processing (ICASSP) , vol.2 , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

26
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A. Black, K. Tokuda, Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory. IEEE Trans. Audio Speech Lang. Process. 15, 2222–2235 (2007)
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 2222-2235
- Toda, T.¹ Black, A.² Tokuda, K.³

27
- 17444453660
- Torres-carrasquillo, D.A. Reynolds, J. Deller Jr, Language identification using Gaussian mixture model tokenization, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp
- P.A. Torres-carrasquillo, D.A. Reynolds, J. Deller Jr, Language identification using Gaussian mixture model tokenization, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. I-757–I-760 (2002)
- (2002) I-757–I-760

28
- 85010815133
- Voice transformation using PSOLA technique, in International Conference on Acoustics
- H. Valbret, E. Moulines, J.P. Tubach, Voice transformation using PSOLA technique, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1992), pp. 145–148
- (1992) Speech, and Signal Processing (ICASSP) , vol.1 , pp. 145-148
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

29
- 84857131313
- Improving speech intelligibility in cochlear implants using acoustic models
- P. Vijayalakshmi, T. Nagarajan, P. Mahadevan, Improving speech intelligibility in cochlear implants using acoustic models. WSEAS Trans. Signal Process. 7(4), 131–144 (2011)
- (2011) WSEAS Trans. Signal Process. , vol.7 , Issue.4 , pp. 131-144
- Vijayalakshmi, P.¹ Nagarajan, T.² Mahadevan, P.³

30
- 67650854725
- Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
- J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, J. Isogai, Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm. IEEE Trans. Audio Speech Lang. Process. 17(1), 66–83 (2009)
- (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , Issue.1 , pp. 66-83
- Yamagishi, J.¹ Kobayashi, T.² Nakano, Y.³ Ogata, K.⁴ Isogai, J.⁵

31
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, A.W. Black, Statistical parametric speech synthesis. Speech Commun. 51, 1039–1064 (2009)
- (2009) Speech Commun. , vol.51 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

32
- 51449121435
- Text-independent voice conversion based on state mapped codebook, in International Conference on Acoustics
- M. Zhang, J. Tao, J. Tian, X. Wang, Text-independent voice conversion based on state mapped codebook, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2008), pp. 4605–4608
- (2008) Speech, and Signal Processing (ICASSP) , pp. 4605-4608
- Zhang, M.¹ Tao, J.² Tian, J.³ Wang, X.⁴

33
- 85079102131
- M.A. Zissman, E. Singer, Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1994)
- M.A. Zissman, E. Singer, Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling, in International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1 (1994), pp. I-305–I-308

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.