SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 22, Issue 10, 2014, Pages 1506-1521

Exemplar-based sparse representation with residual compensation for voice conversion

(4) Wu, Zhizheng a,b Virtanen, Tuomas c Chng, Eng Siong a,b Li, Haizhou a,d

a NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

b NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

c TAMPERE UNIVERSITY OF TECHNOLOGY (Finland)

d INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

Exemplar; Nonnegative matrix factorization; Residual compensation; Sparse representation; Voice conversion

Indexed keywords

MATRIX ALGEBRA; MAXIMUM LIKELIHOOD; SPEECH PROCESSING;

DIMENSIONALITY REDUCTION; EXEMPLAR; NONNEGATIVE MATRIX FACTORIZATION; PARTIAL LEAST-SQUARES REGRESSION; SPARSE REPRESENTATION; SUBJECTIVE LISTENING TEST; VOICE CONVERSION; WEIGHTED LINEAR COMBINATIONS;

LEAST SQUARES APPROXIMATIONS;

EID: 84911369131 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2014.2333242 Document Type: Article

Times cited : (188)

References (54)

1
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: From features to supervectors," Speech Commun., vol. 52, no. 1, pp. 12-40, 2010.
- (2010) Speech Commun , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

2
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1998, pp. 285-288.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1998 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

3
- 0033692729
- Narrowband to wideband conversion of speech using GMM based transformation
- K. Park and H. Kim, "Narrowband to wideband conversion of speech using GMM based transformation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2000, pp. 1843-1846.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2000 , pp. 1843-1846
- Park, K.¹ Kim, H.²

4
- 58149308063
- A spectral conversion approach to single-channel speech enhancement
- May
- A. Mouchtaris, J. Van der Spiegel, P. Mueller, and P. Tsakalides, "A spectral conversion approach to single-channel speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1180-1193, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1180-1193
- Mouchtaris, A.¹ Van Der Spiegel, J.² Mueller, P.³ Tsakalides, P.⁴

5
- 34547550766
- Stereo-based stochastic mapping for robust speech recognition
- M. Afify, X. Cui, and Y. Gao, "Stereo-based stochastic mapping for robust speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 377-380.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007 , pp. 377-380
- Afify, M.¹ Cui, X.² Gao, Y.³

6
- 84867591125
- Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition
- X. Cui, M. Afify, and B. Zhou, "Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012, pp. 4705-4708.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012 , pp. 4705-4708
- Cui, X.¹ Afify, M.² Zhou, B.³

7
- 38649140222
- Statistical mapping between articulatory movements and acoustic spectrum using a gaussian mixture model
- T. Toda, A. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a gaussian mixture model," Speech Commun., vol. 50, no. 3, pp. 215-227, 2008.
- (2008) Speech Commun , vol.50 , Issue.3 , pp. 215-227
- Toda, T.¹ Black, A.² Tokuda, K.³

8
- 70349200844
- Voice conversion for various types of body transmitted speech
- T. Toda, K. Nakamura, H. Sekimoto, and K. Shikano, "Voice conversion for various types of body transmitted speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp. 3601-3604.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009 , pp. 3601-3604
- Toda, T.¹ Nakamura, K.² Sekimoto, H.³ Shikano, K.⁴

9
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1988, pp. 655-658.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1988 , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

10
- 0026394044
- Speaker adaptation and voice conversion by codebook mapping
- K. Shikano, S. Nakamura, and M. Abe, "Speaker adaptation and voice conversion by codebook mapping," in Proc. IEEE Int. Symp. Circuits Syst., 1991, pp. 594-597.
- Proc. IEEE Int. Symp. Circuits Syst., 1991 , pp. 594-597
- Shikano, K.¹ Nakamura, S.² Abe, M.³

11
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar.
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

12
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov.
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

13
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMs
- Feb.
- H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 2, pp. 417-430, Feb. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

14
- 84911386426
- Perceptually weighted linear transformations for voice conversion
- H. Ye and S. Young, "Perceptually weighted linear transformations for voice conversion," in Proc. Interspeech, 2003.
- Proc. Interspeech, 2003
- Ye, H.¹ Young, S.²

15
- 34047254509
- Quality-enhanced voice morphing using maximum likelihood transformations
- Jul.
- H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 4, pp. 1301-1312, Jul. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1301-1312
- Ye, H.¹ Young, S.²

16
- 77953712499
- Voice conversion using partial least squares regression
- Jul.
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 912-921, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

17
- 84867594339
- Local linear transformation for voice conversion
- V. Popa, H. Silen, J. Nurminen, and M. Gabbouj, "Local linear transformation for voice conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012, pp. 4517-4520.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012 , pp. 4517-4520
- Popa, V.¹ Silen, H.² Nurminen, J.³ Gabbouj, M.⁴

18
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- M. Narendranath, H. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol. 16, no. 2, pp. 207-216, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
- Narendranath, M.¹ Murthy, H.² Rajendran, S.³ Yegnanarayana, B.⁴

19
- 70349197691
- Voice conversion using artificial neural networks
- S. Desai, E. V. Raghavendra, B. Yegnanarayana, A. W. Black, and K. Prahallad, "Voice conversion using artificial neural networks," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), 2009, pp. 3893-3896.
- Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), 2009 , pp. 3893-3896
- Desai, S.¹ Raghavendra, E.V.² Yegnanarayana, B.³ Black, A.W.⁴ Prahallad, K.⁵

20
- 80053068819
- Voice conversion using support vector regression
- P. Song, Y. Bao, L. Zhao, and C. Zou, "Voice conversion using support vector regression," Electron. Lett., vol. 47, no. 18, pp. 1045-1046, 2011.
- (2011) Electron. Lett. , vol.47 , Issue.18 , pp. 1045-1046
- Song, P.¹ Bao, Y.² Zhao, L.³ Zou, C.⁴

21
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- Mar.
- E. Helander, H. Silén, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial least squares regression," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 806-817, Mar. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.3 , pp. 806-817
- Helander, E.¹ Silén, H.² Virtanen, T.³ Gabbouj, M.⁴

22
- 84906225084
- Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
- L.-H. Chen, Z.-H. Ling, Y. Song, and L.-R. Dai, "Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion," in Proc. Interspeech, 2013.
- Proc. Interspeech, 2013
- Chen, L.-H.¹ Ling, Z.-H.² Song, Y.³ Dai, L.-R.⁴

23
- 84948175540
- VTLN-based voice conversion
- D. Sundermann and H. Ney, "VTLN-based voice conversion," in Proc. 3rd IEEE Int. Symp. Signal Process. Inf. Technol., 2003, pp. 556-559.
- Proc. 3rd IEEE Int. Symp. Signal Process. Inf. Technol., 2003 , pp. 556-559
- Sundermann, D.¹ Ney, H.²

24
- 84946753271
- VTLN-based cross-language voice conversion
- D. Sundermann, H. Ney, and H. Hoge, "VTLN-based cross-language voice conversion," in IEEE Workshop Autom. Speech Recogn. Understand., 2003, pp. 676-681.
- IEEE Workshop Autom. Speech Recogn. Understand., 2003 , pp. 676-681
- Sundermann, D.¹ Ney, H.² Hoge, H.³

25
- 77953727123
- Voice conversion based on weighted frequency warping
- Jul.
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 922-931, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

26
- 84857498745
- Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
- May
- E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1313-1323, May 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.4 , pp. 1313-1323
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

27
- 84872177757
- Parametric voice conversion based on bilinear frequency warping plus amplitude scaling
- Mar.
- D. Erro, E. Navas, and I. Hernaez, "Parametric voice conversion based on bilinear frequency warping plus amplitude scaling," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 3, pp. 556-566, Mar. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.3 , pp. 556-566
- Erro, D.¹ Navas, E.² Hernaez, I.³

28
- 84901803470
- Exemplar-based voice conversion using non-negative spectrogram deconvolution
- Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar-based voice conversion using non-negative spectrogram deconvolution," in Proc. 8th ISCA Speech Synth. Workshop (SSW8), 2013.
- Proc. 8th ISCA Speech Synth. Workshop (SSW8), 2013
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

29
- 84898964201
- Algorithms for non-negative matrix factorization
- D. Seung and L. Lee, "Algorithms for non-negative matrix factorization," Adv. Neural Inf. Process. Syst., vol. 13, pp. 556-562, 2001.
- (2001) Adv. Neural Inf. Process. Syst. , vol.13 , pp. 556-562
- Seung, D.¹ Lee, L.²

30
- 79960657803
- Exemplar-based sparse representations for noise robust automatic speech recognition
- Sep.
- J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, Sep. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.7 , pp. 2067-2080
- Gemmeke, J.¹ Virtanen, T.² Hurmalainen, A.³

31
- 80051620372
- Non-negative matrix deconvolution in noise robust speech recognition
- A. Hurmalainen, J. Gemmeke, and T. Virtanen, "Non-negative matrix deconvolution in noise robust speech recognition," in Proc. ICASSP, 2011, pp. 4588-4591.
- Proc. ICASSP, 2011 , pp. 4588-4591
- Hurmalainen, A.¹ Gemmeke, J.² Virtanen, T.³

32
- 84874248255
- Exemplar-based voice conversion in noisy environment
- R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. IEEE Spoken Lang. Technol. Workshop (SLT), 2012, pp. 313-317.
- Proc. IEEE Spoken Lang. Technol. Workshop (SLT), 2012 , pp. 313-317
- Takashima, R.¹ Takiguchi, T.² Ariki, Y.³

33
- 84870706588
- Optimal cost function and magnitude power for NMF-based speech separation and music interpolation
- B. King, C. Févotte, and P. Smaragdis, "Optimal cost function and magnitude power for NMF-based speech separation and music interpolation," in Proc. IEEE Int. Workshop Mach. Learn. Signal Process. (MLSP), 2012, pp. 1-6.
- Proc. IEEE Int. Workshop Mach. Learn. Signal Process. (MLSP), 2012 , pp. 1-6
- King, B.¹ Févotte, C.² Smaragdis, P.³

34
- 77953725318
- INCA algorithm for training voice conversion systems from nonparallel corpora
- D. Erro, A. Moreno, and A. Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 944-953, 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 944-953
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

35
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," in Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech), 2003.
- Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech), 2003
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

36
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1992, pp. 137-140.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1992 , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

37
- 0021157408
- Line spectrum pair (LSP) and speech data compression
- F. Soong and B.-H. Juang, "Line spectrum pair (LSP) and speech data compression," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1984, pp. 137-140.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1984 , pp. 137-140
- Soong, F.¹ Juang, B.-H.²

38
- 85008006694
- Robust speaker-adaptive HMM-based text-to-speech synthesis
- Aug.
- J. Yamagishi, T. Nose, H. Zen, Z.-H. Ling, T. Toda, K. Tokuda, S. King, and S. Renals, "Robust speaker-adaptive HMM-based text-to-speech synthesis," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 6, pp. 1208-1230, Aug. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.6 , pp. 1208-1230
- Yamagishi, J.¹ Nose, T.² Zen, H.³ Ling, Z.-H.⁴ Toda, T.⁵ Tokuda, K.⁶ King, S.⁷ Renals, S.⁸

39
- 77953728395
- Measuring the gap between HMM-based ASR and TTS
- Aug.
- J. Dines, J. Yamagishi, and S. King, "Measuring the gap between HMM-based ASR and TTS," IEEE J. Sel. Topics Signal Process., vol. 4, no. 6, pp. 1046-1058, Aug. 2010.
- (2010) IEEE J. Sel. Topics Signal Process. , vol.4 , Issue.6 , pp. 1046-1058
- Dines, J.¹ Yamagishi, J.² King, S.³

40
- 64849096680
- Unsupervised learning methods for source separation in monaural music signals
- A. Klapuri and M. Davy, Eds. New York, NY, USA: Springer
- T. Virtanen, "Unsupervised learning methods for source separation in monaural music signals," in Signal Processing Methods for Music Transcription, A. Klapuri and M. Davy, Eds. New York, NY, USA: Springer, 2006, pp. 267-296.
- (2006) Signal Processing Methods for Music Transcription , pp. 267-296
- Virtanen, T.¹

41
- 79959812724
- A super-resolution spectrogram using coupled PLCA
- J. Nam, G. J. Mysore, J. Ganseman, K. Lee, and J. S. Abel, "A super-resolution spectrogram using coupled PLCA," in Proc. Interspeech, 2010.
- Proc. Interspeech, 2010
- Nam, J.¹ Mysore, G.J.² Ganseman, J.³ Lee, K.⁴ Abel, J.S.⁵

42
- 84863766226
- Bandwidth expansion of narrow-band speech using non-negative matrix factorization
- D. Bansal, B. Raj, and P. Smaragdis, "Bandwidth expansion of narrow-band speech using non-negative matrix factorization," in Proc. Interspeech, 2005.
- Proc. Interspeech, 2005
- Bansal, D.¹ Raj, B.² Smaragdis, P.³

43
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3, pp. 187-207, 1999.
- (1999) Speech Commun , vol.27 , Issue.3 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

44
- 85131821539
- Mel-generalized cepstral analysis-a unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis-a unified approach to speech spectral estimation," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 1994.
- Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 1994
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

45
- 4444285698
- Ph.D. dissertation, OGI School of Sci. & Eng., Oregon Health and Science Univ., Beaverton, OR, USA
- A. B. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. & Eng., Oregon Health and Science Univ., Beaverton, OR, USA, 2001.
- (2001) High Resolution Voice Transformation
- Kain, A.B.¹

46
- 84878390910
- Implementation of computationally efficient real-time voice conversion
- T. Toda, T. Muramatsu, and H. Banno, "Implementation of computationally efficient real-time voice conversion," in Proc. Interspeech, 2012.
- Proc. Interspeech, 2012
- Toda, T.¹ Muramatsu, T.² Banno, H.³

47
- 84861098169
- Evaluating speech synthesis intelligibility using amazon mechanical turk
- M. K. Wolters, K. B. Isaac, and S. Renals, "Evaluating speech synthesis intelligibility using amazon mechanical turk," in Proc. 7th ISCA Speech Synth. Workshop (SSW7).
- Proc. 7th ISCA Speech Synth. Workshop (SSW7)
- Wolters, M.K.¹ Isaac, K.B.² Renals, S.³

48
- 80051607565
- CROWDMOS: An approach for crowdsourcing mean opinion score studies
- F. Ribeiro, D. Florêncio, C. Zhang, and M. Seltzer, "CROWDMOS: An approach for crowdsourcing mean opinion score studies," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011, pp. 2416-2419.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011 , pp. 2416-2419
- Ribeiro, F.¹ Florêncio, D.² Zhang, C.³ Seltzer, M.⁴

49
- 84865713971
- Crowdsourcing preference tests, and how to detect cheating
- S. Buchholz and J. Latorre, "Crowdsourcing preference tests, and how to detect cheating," in Proc. Interspeech, 2011.
- Proc. Interspeech, 2011
- Buchholz, S.¹ Latorre, J.²

50
- 34547496196
- Towards a voice conversion system based on frame selection
- T. Dutoit, A. Holzapfel, M. Jottrand, A. Moinet, J. Perez, and Y. Stylianou, "Towards a voice conversion system based on frame selection," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 513-516.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007 , pp. 513-516
- Dutoit, T.¹ Holzapfel, A.² Jottrand, M.³ Moinet, A.⁴ Perez, J.⁵ Stylianou, Y.⁶

51
- 33947623206
- Text-independent voice conversion based on unit selection
- D. Sundermann, H. Hoge, A. Bonafonte, H. Ney, A. Black, and S. Narayanan, "Text-independent voice conversion based on unit selection," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2006, pp. 81-84.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2006 , pp. 81-84
- Sundermann, D.¹ Hoge, H.² Bonafonte, A.³ Ney, H.⁴ Black, A.⁵ Narayanan, S.⁶

52
- 0034854702
- Perceptual and objective detection of discontinuities in concatenative speech synthesis
- Y. Stylianou and A. K. Syrdal, "Perceptual and objective detection of discontinuities in concatenative speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2001, pp. 837-840.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2001 , pp. 837-840
- Stylianou, Y.¹ Syrdal, A.K.²

53
- 0033592606
- Learning the parts of objects by non-negative matrix factorization
- D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, vol. 401, no. 6755, pp. 788-791, 1999.
- (1999) Nature , vol.401 , Issue.6755 , pp. 788-791
- Lee, D.D.¹ Seung, H.S.²

54
- 80052600921
- Large margin based nonnegative matrix factorization and partial least squares regression for face recognition
- J.-Y. Pan and J.-S. Zhang, "Large margin based nonnegative matrix factorization and partial least squares regression for face recognition," Pattern Recogn. Lett., vol. 32, no. 14, pp. 1822-1835, 2011.
- (2011) Pattern Recogn. Lett. , vol.32 , Issue.14 , pp. 1822-1835
- Pan, J.-Y.¹ Zhang, J.-S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.