메뉴 건너뛰기




Volumn 60, Issue , 2014, Pages 30-43

A unit selection approach for voice transformation

Author keywords

Hidden Markov model; Unit selection; Voice conversion

Indexed keywords

HIDDEN MARKOV MODELS; OPTIMIZATION;

EID: 84896464538     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2014.02.002     Document Type: Article
Times cited : (3)

References (45)
  • 2
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • L.M. Arslan Speaker transformation algorithm using segmental codebooks (STASC) Speech Commun. 28 1999 211 226
    • (1999) Speech Commun. , vol.28 , pp. 211-226
    • Arslan, L.M.1
  • 4
    • 0031104132 scopus 로고    scopus 로고
    • Application of Speech Conversion to Alaryngeal Speech Enhancement
    • PII S1063667697018944
    • N. Bi, and Y. Qi Application of speech conversion to alaryngeal speech enhancement IEEE Trans. Acoust. Speech Signal Process. 5 2 1997 97 105 (Pubitemid 127746041)
    • (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.2 , pp. 97-105
    • Bi, N.1    Qi, Y.2
  • 6
    • 0028016265 scopus 로고
    • Measuring and modeling vocal source-tract interaction
    • D.G. Childers, and C.F. Wong Measuring and modeling vocal source-tract interaction IEEE Trans. Biomed. Eng. 41 7 1994 663 671
    • (1994) IEEE Trans. Biomed. Eng. , vol.41 , Issue.7 , pp. 663-671
    • Childers, D.G.1    Wong, C.F.2
  • 10
    • 1542408811 scopus 로고    scopus 로고
    • The interaction of formant frequency and pitch in the perception of voice category and jaw opening in female singers
    • DOI 10.1016/j.jvoice.2003.08.001, PII S0892199703001243
    • Erickson, M.L.; 2003. The interaction of formant frequency and pitch in the perception of voice category and jaw opening in female singers. In: The 31st Annual Symposium: Care of the Professional Voice, pp. 24-37. (Pubitemid 38333008)
    • (2004) Journal of Voice , vol.18 , Issue.1 , pp. 24-37
    • Erickson, M.L.1
  • 11
    • 84872177757 scopus 로고    scopus 로고
    • Parametric voice conversion based on bilinear frequency warping plus amplitude scaling
    • D. Erro, E. Navas, and I. Hernáez Parametric voice conversion based on bilinear frequency warping plus amplitude scaling IEEE Trans. Audio Speech Lang. Process. 21 3 2013 556 566
    • (2013) IEEE Trans. Audio Speech Lang. Process. , vol.21 , Issue.3 , pp. 556-566
    • Erro, D.1    Navas, E.2    Hernáez, I.3
  • 12
  • 13
    • 84867950508 scopus 로고    scopus 로고
    • Personalized spectral and prosody conversion using frame-based codeword distribution and adaptive CRF
    • Y.C. Huang, C.H. Wu, and Y.T. Chao Personalized spectral and prosody conversion using frame-based codeword distribution and adaptive CRF IEEE Trans. Audio Speech Lang. Process. 21 1 2013 51 52
    • (2013) IEEE Trans. Audio Speech Lang. Process. , vol.21 , Issue.1 , pp. 51-52
    • Huang, Y.C.1    Wu, C.H.2    Chao, Y.T.3
  • 14
    • 0029251946 scopus 로고
    • Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks
    • N. Iwahashi, and Y. Sagisaka Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks Speech Commun. 16 2 1995 139 152
    • (1995) Speech Commun. , vol.16 , Issue.2 , pp. 139-152
    • Iwahashi, N.1    Sagisaka, Y.2
  • 16
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • Kain, A.; Macon, M.W.; 1998. Spectral voice conversion for text-to-speech synthesis. In: Proc. ICASSP, Seattle, pp. 285-288.
    • (1998) Proc. ICASSP, Seattle , pp. 285-288
    • Kain, A.1    Macon M., .W.2
  • 18
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds Speech Commun. 27 3-4 1999 187 207
    • (1999) Speech Commun. , vol.27 , Issue.34 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 20
    • 38149065136 scopus 로고    scopus 로고
    • Statistical approach for voice personality transformation
    • K.S. Lee Statistical approach for voice personality transformation IEEE Trans. Audio Speech Lang. Process. 15 2 2007 641 651
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.2 , pp. 641-651
    • Lee, K.S.1
  • 21
    • 39749106069 scopus 로고    scopus 로고
    • EMG-based speech recognition using hidden Markov models with global control variables
    • K.S. Lee EMG-based speech recognition using hidden Markov models with global control variables IEEE Trans. Biomed. Eng. 55 3 2008 930 940
    • (2008) IEEE Trans. Biomed. Eng. , vol.55 , Issue.3 , pp. 930-940
    • Lee, K.S.1
  • 22
    • 0030365550 scopus 로고    scopus 로고
    • A new voice personality transformation based on both linear and nonlinear prediction analysis
    • Lee, K.S.; Youn, D.H.; Cha, I.W.; 1996. A new voice personality transformation based on both linear and nonlinear prediction analysis. In: Proc. ICSLP, pp. 1401-1404.
    • (1996) Proc. ICSLP , pp. 1401-1404
    • Lee K., .S.1    Youn D., .H.2    Cha I., .W.3
  • 23
    • 0036670960 scopus 로고    scopus 로고
    • Voice conversion using a low dimensional vector mapping
    • K.S. Lee, D.H. Youn, and I.W. Cha Voice conversion using a low dimensional vector mapping IEICE Trans. Inf. Syst. E85-D 8 2002 1297 1305
    • (2002) IEICE Trans. Inf. Syst. , vol.85 E -D , Issue.8 , pp. 1297-1305
    • Lee, K.S.1    Youn, D.H.2    Cha, I.W.3
  • 24
    • 0018918171 scopus 로고
    • An algorithm for vector quantizer design
    • Y. Linde, A. Buzo, and R.M. Gray An algorithm for vector quantizer design IEEE Trans. Commun. 28 1980 84 95
    • (1980) IEEE Trans. Commun. , vol.28 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3
  • 25
    • 33847332065 scopus 로고    scopus 로고
    • Voice conversion based on joint pitch and spectral transformation with component group-GMM
    • Ma, J.; Liu, W.; 2005. Voice conversion based on joint pitch and spectral transformation with component group-GMM. In: Proc. IEEE NLP-KE, pp. 199-203.
    • (2005) Proc. IEEE NLP-KE , pp. 199-203
    • Ma, J.1    Liu, W.2
  • 27
    • 0029256372 scopus 로고
    • Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectral tilt
    • H. Mizuno, and M. Abe Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectral tilt Speech Commun. 16 2 1995 153 164
    • (1995) Speech Commun. , vol.16 , Issue.2 , pp. 153-164
    • Mizuno, H.1    Abe, M.2
  • 28
    • 0025543906 scopus 로고
    • Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines, and F. Charpentier Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones Speech Commun. 9 5/6 1990 453 467
    • (1990) Speech Commun. , vol.9 , Issue.5-6 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 29
    • 0029254176 scopus 로고
    • Transformation of formants of voice conversion using artificial neural networks
    • M. Narendranath, H.A. Murthy, S. Rajendran, and B. Yegnanarayana Transformation of formants of voice conversion using artificial neural networks Speech Commun. 16 2 1995 207 216
    • (1995) Speech Commun. , vol.16 , Issue.2 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 30
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audiovisual speech
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A.W. Senior Recent advances in the automatic recognition of audiovisual speech Proc. IEEE 91 9 2003 1306 1326
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 33
    • 77950029338 scopus 로고    scopus 로고
    • Voice conversion by mapping the speaker-specific features using pitch synchronous approach
    • K.S. Rao Voice conversion by mapping the speaker-specific features using pitch synchronous approach Comput. Speech Lang. 24 2010 474 494
    • (2010) Comput. Speech Lang. , vol.24 , pp. 474-494
    • Rao, K.S.1
  • 34
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D.A. Reynolds, and R.C. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Trans. Acoust. Speech Signal Process. 3 1 1995 72 83
    • (1995) IEEE Trans. Acoust. Speech Signal Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 36
    • 0001503040 scopus 로고
    • Voice personality transformation
    • M. Savic, and I.H. Nam Voice personality transformation Digital Signal Process. 4 1991 107 110
    • (1991) Digital Signal Process. , vol.4 , pp. 107-110
    • Savic, M.1    Nam, I.H.2
  • 37
    • 51449112440 scopus 로고    scopus 로고
    • Voice conversion by combining frequency warping with unit selection
    • Shuang, Z.; Meng, F.; Qin, Y.; 2008. Voice conversion by combining frequency warping with unit selection. In: Proc. ICASSP, pp. 4661-4664.
    • (2008) Proc. ICASSP , pp. 4661-4664
    • Shuang, Z.1    Meng, F.2    Qin, Y.3
  • 39
    • 0027128576 scopus 로고
    • Lipreading and audio-visual speech perception
    • A.Q. Summerfield Lipreading and audio-visual speech perception Philos. Trans. R. Soc. Lond. B 335 1992 71 78
    • (1992) Philos. Trans. R. Soc. Lond. B , vol.335 , pp. 71-78
    • Summerfield, A.Q.1
  • 42
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, Black, and K. Tokuda Voice conversion based on maximum likelihood estimation of spectral parameter trajectory IEEE Trans. Acoust. Speech Lang. Process. 15 8 2007 2222 2235
    • (2007) IEEE Trans. Acoust. Speech Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2
  • 43
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J.P. Tubach Voice transformation using PSOLA technique Speech Commun. 11 1992 175 187 (Pubitemid 23572497)
    • (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 44
    • 4143120860 scopus 로고
    • Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming
    • G.M. White, and R.B. Neely Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming IEEE Trans. Acoust. Speech Signal Process. 24 2 1976 183 188
    • (1976) IEEE Trans. Acoust. Speech Signal Process. , vol.24 , Issue.2 , pp. 183-188
    • White, G.M.1    Neely, R.B.2
  • 45
    • 34047254509 scopus 로고    scopus 로고
    • Quality-enhanced voice morphing using maximum likelihood transformations
    • DOI 10.1109/TSA.2005.860839
    • H. Ye, and S. Young Quality-enhanced voice morphing using maximum likelihood transformations IEEE Trans. Audio Speech Lang. Process. 14 4 2006 1301 1312 (Pubitemid 46547625)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1301-1312
    • Ye, H.1    Young, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.