-
1
-
-
70350125882
-
An overview of text-independent speaker recognition: From features to supervectors
-
T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: From features to supervectors," Speech Commun., vol. 52, no. 1, pp. 12-40, 2010.
-
(2010)
Speech Commun
, vol.52
, Issue.1
, pp. 12-40
-
-
Kinnunen, T.1
Li, H.2
-
2
-
-
0031623661
-
Spectral voice conversion for text-to-speech synthesis
-
A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1998, pp. 285-288.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1998
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
3
-
-
0033692729
-
Narrowband to wideband conversion of speech using GMM based transformation
-
K. Park and H. Kim, "Narrowband to wideband conversion of speech using GMM based transformation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2000, pp. 1843-1846.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2000
, pp. 1843-1846
-
-
Park, K.1
Kim, H.2
-
4
-
-
58149308063
-
A spectral conversion approach to single-channel speech enhancement
-
May
-
A. Mouchtaris, J. Van der Spiegel, P. Mueller, and P. Tsakalides, "A spectral conversion approach to single-channel speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1180-1193, May 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.4
, pp. 1180-1193
-
-
Mouchtaris, A.1
Van Der Spiegel, J.2
Mueller, P.3
Tsakalides, P.4
-
5
-
-
34547550766
-
Stereo-based stochastic mapping for robust speech recognition
-
M. Afify, X. Cui, and Y. Gao, "Stereo-based stochastic mapping for robust speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 377-380.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007
, pp. 377-380
-
-
Afify, M.1
Cui, X.2
Gao, Y.3
-
6
-
-
84867591125
-
Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition
-
X. Cui, M. Afify, and B. Zhou, "Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012, pp. 4705-4708.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012
, pp. 4705-4708
-
-
Cui, X.1
Afify, M.2
Zhou, B.3
-
7
-
-
38649140222
-
Statistical mapping between articulatory movements and acoustic spectrum using a gaussian mixture model
-
T. Toda, A. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a gaussian mixture model," Speech Commun., vol. 50, no. 3, pp. 215-227, 2008.
-
(2008)
Speech Commun
, vol.50
, Issue.3
, pp. 215-227
-
-
Toda, T.1
Black, A.2
Tokuda, K.3
-
8
-
-
70349200844
-
Voice conversion for various types of body transmitted speech
-
T. Toda, K. Nakamura, H. Sekimoto, and K. Shikano, "Voice conversion for various types of body transmitted speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp. 3601-3604.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009
, pp. 3601-3604
-
-
Toda, T.1
Nakamura, K.2
Sekimoto, H.3
Shikano, K.4
-
9
-
-
0023739214
-
Voice conversion through vector quantization
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1988, pp. 655-658.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1988
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
10
-
-
0026394044
-
Speaker adaptation and voice conversion by codebook mapping
-
K. Shikano, S. Nakamura, and M. Abe, "Speaker adaptation and voice conversion by codebook mapping," in Proc. IEEE Int. Symp. Circuits Syst., 1991, pp. 594-597.
-
Proc. IEEE Int. Symp. Circuits Syst., 1991
, pp. 594-597
-
-
Shikano, K.1
Nakamura, S.2
Abe, M.3
-
11
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Mar.
-
Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, Mar. 1998.
-
(1998)
IEEE Trans. Speech Audio Process.
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
12
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
Nov.
-
T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
13
-
-
78149260085
-
Continuous stochastic feature mapping based on trajectory HMMs
-
Feb.
-
H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 2, pp. 417-430, Feb. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.2
, pp. 417-430
-
-
Zen, H.1
Nankaku, Y.2
Tokuda, K.3
-
14
-
-
84911386426
-
Perceptually weighted linear transformations for voice conversion
-
H. Ye and S. Young, "Perceptually weighted linear transformations for voice conversion," in Proc. Interspeech, 2003.
-
Proc. Interspeech, 2003
-
-
Ye, H.1
Young, S.2
-
15
-
-
34047254509
-
Quality-enhanced voice morphing using maximum likelihood transformations
-
Jul.
-
H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 4, pp. 1301-1312, Jul. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.14
, Issue.4
, pp. 1301-1312
-
-
Ye, H.1
Young, S.2
-
16
-
-
77953712499
-
Voice conversion using partial least squares regression
-
Jul.
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 912-921, Jul. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
17
-
-
84867594339
-
Local linear transformation for voice conversion
-
V. Popa, H. Silen, J. Nurminen, and M. Gabbouj, "Local linear transformation for voice conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012, pp. 4517-4520.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012
, pp. 4517-4520
-
-
Popa, V.1
Silen, H.2
Nurminen, J.3
Gabbouj, M.4
-
18
-
-
0029254176
-
Transformation of formants for voice conversion using artificial neural networks
-
M. Narendranath, H. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol. 16, no. 2, pp. 207-216, 1995.
-
(1995)
Speech Commun
, vol.16
, Issue.2
, pp. 207-216
-
-
Narendranath, M.1
Murthy, H.2
Rajendran, S.3
Yegnanarayana, B.4
-
19
-
-
70349197691
-
Voice conversion using artificial neural networks
-
S. Desai, E. V. Raghavendra, B. Yegnanarayana, A. W. Black, and K. Prahallad, "Voice conversion using artificial neural networks," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), 2009, pp. 3893-3896.
-
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), 2009
, pp. 3893-3896
-
-
Desai, S.1
Raghavendra, E.V.2
Yegnanarayana, B.3
Black, A.W.4
Prahallad, K.5
-
20
-
-
80053068819
-
Voice conversion using support vector regression
-
P. Song, Y. Bao, L. Zhao, and C. Zou, "Voice conversion using support vector regression," Electron. Lett., vol. 47, no. 18, pp. 1045-1046, 2011.
-
(2011)
Electron. Lett.
, vol.47
, Issue.18
, pp. 1045-1046
-
-
Song, P.1
Bao, Y.2
Zhao, L.3
Zou, C.4
-
21
-
-
84856141218
-
Voice conversion using dynamic kernel partial least squares regression
-
Mar.
-
E. Helander, H. Silén, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial least squares regression," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 806-817, Mar. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.3
, pp. 806-817
-
-
Helander, E.1
Silén, H.2
Virtanen, T.3
Gabbouj, M.4
-
22
-
-
84906225084
-
Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
-
L.-H. Chen, Z.-H. Ling, Y. Song, and L.-R. Dai, "Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion," in Proc. Interspeech, 2013.
-
Proc. Interspeech, 2013
-
-
Chen, L.-H.1
Ling, Z.-H.2
Song, Y.3
Dai, L.-R.4
-
24
-
-
84946753271
-
VTLN-based cross-language voice conversion
-
D. Sundermann, H. Ney, and H. Hoge, "VTLN-based cross-language voice conversion," in IEEE Workshop Autom. Speech Recogn. Understand., 2003, pp. 676-681.
-
IEEE Workshop Autom. Speech Recogn. Understand., 2003
, pp. 676-681
-
-
Sundermann, D.1
Ney, H.2
Hoge, H.3
-
25
-
-
77953727123
-
Voice conversion based on weighted frequency warping
-
Jul.
-
D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 922-931, Jul. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 922-931
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
26
-
-
84857498745
-
Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
-
May
-
E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1313-1323, May 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.4
, pp. 1313-1323
-
-
Godoy, E.1
Rosec, O.2
Chonavel, T.3
-
27
-
-
84872177757
-
Parametric voice conversion based on bilinear frequency warping plus amplitude scaling
-
Mar.
-
D. Erro, E. Navas, and I. Hernaez, "Parametric voice conversion based on bilinear frequency warping plus amplitude scaling," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 3, pp. 556-566, Mar. 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.21
, Issue.3
, pp. 556-566
-
-
Erro, D.1
Navas, E.2
Hernaez, I.3
-
28
-
-
84901803470
-
Exemplar-based voice conversion using non-negative spectrogram deconvolution
-
Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar-based voice conversion using non-negative spectrogram deconvolution," in Proc. 8th ISCA Speech Synth. Workshop (SSW8), 2013.
-
Proc. 8th ISCA Speech Synth. Workshop (SSW8), 2013
-
-
Wu, Z.1
Virtanen, T.2
Kinnunen, T.3
Chng, E.S.4
Li, H.5
-
29
-
-
84898964201
-
Algorithms for non-negative matrix factorization
-
D. Seung and L. Lee, "Algorithms for non-negative matrix factorization," Adv. Neural Inf. Process. Syst., vol. 13, pp. 556-562, 2001.
-
(2001)
Adv. Neural Inf. Process. Syst.
, vol.13
, pp. 556-562
-
-
Seung, D.1
Lee, L.2
-
30
-
-
79960657803
-
Exemplar-based sparse representations for noise robust automatic speech recognition
-
Sep.
-
J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, Sep. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.7
, pp. 2067-2080
-
-
Gemmeke, J.1
Virtanen, T.2
Hurmalainen, A.3
-
31
-
-
80051620372
-
Non-negative matrix deconvolution in noise robust speech recognition
-
A. Hurmalainen, J. Gemmeke, and T. Virtanen, "Non-negative matrix deconvolution in noise robust speech recognition," in Proc. ICASSP, 2011, pp. 4588-4591.
-
Proc. ICASSP, 2011
, pp. 4588-4591
-
-
Hurmalainen, A.1
Gemmeke, J.2
Virtanen, T.3
-
32
-
-
84874248255
-
Exemplar-based voice conversion in noisy environment
-
R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. IEEE Spoken Lang. Technol. Workshop (SLT), 2012, pp. 313-317.
-
Proc. IEEE Spoken Lang. Technol. Workshop (SLT), 2012
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
33
-
-
84870706588
-
Optimal cost function and magnitude power for NMF-based speech separation and music interpolation
-
B. King, C. Févotte, and P. Smaragdis, "Optimal cost function and magnitude power for NMF-based speech separation and music interpolation," in Proc. IEEE Int. Workshop Mach. Learn. Signal Process. (MLSP), 2012, pp. 1-6.
-
Proc. IEEE Int. Workshop Mach. Learn. Signal Process. (MLSP), 2012
, pp. 1-6
-
-
King, B.1
Févotte, C.2
Smaragdis, P.3
-
34
-
-
77953725318
-
INCA algorithm for training voice conversion systems from nonparallel corpora
-
D. Erro, A. Moreno, and A. Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 944-953, 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 944-953
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
35
-
-
84905560807
-
Voice conversion with smoothed GMM and MAP adaptation
-
Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," in Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech), 2003.
-
Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech), 2003
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
36
-
-
85016140477
-
An adaptive algorithm for mel-cepstral analysis of speech
-
T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1992, pp. 137-140.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1992
, pp. 137-140
-
-
Fukada, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
37
-
-
0021157408
-
Line spectrum pair (LSP) and speech data compression
-
F. Soong and B.-H. Juang, "Line spectrum pair (LSP) and speech data compression," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1984, pp. 137-140.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1984
, pp. 137-140
-
-
Soong, F.1
Juang, B.-H.2
-
38
-
-
85008006694
-
Robust speaker-adaptive HMM-based text-to-speech synthesis
-
Aug.
-
J. Yamagishi, T. Nose, H. Zen, Z.-H. Ling, T. Toda, K. Tokuda, S. King, and S. Renals, "Robust speaker-adaptive HMM-based text-to-speech synthesis," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 6, pp. 1208-1230, Aug. 2009.
-
(2009)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.17
, Issue.6
, pp. 1208-1230
-
-
Yamagishi, J.1
Nose, T.2
Zen, H.3
Ling, Z.-H.4
Toda, T.5
Tokuda, K.6
King, S.7
Renals, S.8
-
39
-
-
77953728395
-
Measuring the gap between HMM-based ASR and TTS
-
Aug.
-
J. Dines, J. Yamagishi, and S. King, "Measuring the gap between HMM-based ASR and TTS," IEEE J. Sel. Topics Signal Process., vol. 4, no. 6, pp. 1046-1058, Aug. 2010.
-
(2010)
IEEE J. Sel. Topics Signal Process.
, vol.4
, Issue.6
, pp. 1046-1058
-
-
Dines, J.1
Yamagishi, J.2
King, S.3
-
40
-
-
64849096680
-
Unsupervised learning methods for source separation in monaural music signals
-
A. Klapuri and M. Davy, Eds. New York, NY, USA: Springer
-
T. Virtanen, "Unsupervised learning methods for source separation in monaural music signals," in Signal Processing Methods for Music Transcription, A. Klapuri and M. Davy, Eds. New York, NY, USA: Springer, 2006, pp. 267-296.
-
(2006)
Signal Processing Methods for Music Transcription
, pp. 267-296
-
-
Virtanen, T.1
-
41
-
-
79959812724
-
A super-resolution spectrogram using coupled PLCA
-
J. Nam, G. J. Mysore, J. Ganseman, K. Lee, and J. S. Abel, "A super-resolution spectrogram using coupled PLCA," in Proc. Interspeech, 2010.
-
Proc. Interspeech, 2010
-
-
Nam, J.1
Mysore, G.J.2
Ganseman, J.3
Lee, K.4
Abel, J.S.5
-
42
-
-
84863766226
-
Bandwidth expansion of narrow-band speech using non-negative matrix factorization
-
D. Bansal, B. Raj, and P. Smaragdis, "Bandwidth expansion of narrow-band speech using non-negative matrix factorization," in Proc. Interspeech, 2005.
-
Proc. Interspeech, 2005
-
-
Bansal, D.1
Raj, B.2
Smaragdis, P.3
-
43
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3, pp. 187-207, 1999.
-
(1999)
Speech Commun
, vol.27
, Issue.3
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
-
44
-
-
85131821539
-
Mel-generalized cepstral analysis-a unified approach to speech spectral estimation
-
K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis-a unified approach to speech spectral estimation," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 1994.
-
Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 1994
-
-
Tokuda, K.1
Kobayashi, T.2
Masuko, T.3
Imai, S.4
-
45
-
-
4444285698
-
-
Ph.D. dissertation, OGI School of Sci. & Eng., Oregon Health and Science Univ., Beaverton, OR, USA
-
A. B. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. & Eng., Oregon Health and Science Univ., Beaverton, OR, USA, 2001.
-
(2001)
High Resolution Voice Transformation
-
-
Kain, A.B.1
-
46
-
-
84878390910
-
Implementation of computationally efficient real-time voice conversion
-
T. Toda, T. Muramatsu, and H. Banno, "Implementation of computationally efficient real-time voice conversion," in Proc. Interspeech, 2012.
-
Proc. Interspeech, 2012
-
-
Toda, T.1
Muramatsu, T.2
Banno, H.3
-
48
-
-
80051607565
-
CROWDMOS: An approach for crowdsourcing mean opinion score studies
-
F. Ribeiro, D. Florêncio, C. Zhang, and M. Seltzer, "CROWDMOS: An approach for crowdsourcing mean opinion score studies," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011, pp. 2416-2419.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011
, pp. 2416-2419
-
-
Ribeiro, F.1
Florêncio, D.2
Zhang, C.3
Seltzer, M.4
-
50
-
-
34547496196
-
Towards a voice conversion system based on frame selection
-
T. Dutoit, A. Holzapfel, M. Jottrand, A. Moinet, J. Perez, and Y. Stylianou, "Towards a voice conversion system based on frame selection," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007, pp. 513-516.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2007
, pp. 513-516
-
-
Dutoit, T.1
Holzapfel, A.2
Jottrand, M.3
Moinet, A.4
Perez, J.5
Stylianou, Y.6
-
51
-
-
33947623206
-
Text-independent voice conversion based on unit selection
-
D. Sundermann, H. Hoge, A. Bonafonte, H. Ney, A. Black, and S. Narayanan, "Text-independent voice conversion based on unit selection," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2006, pp. 81-84.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2006
, pp. 81-84
-
-
Sundermann, D.1
Hoge, H.2
Bonafonte, A.3
Ney, H.4
Black, A.5
Narayanan, S.6
-
52
-
-
0034854702
-
Perceptual and objective detection of discontinuities in concatenative speech synthesis
-
Y. Stylianou and A. K. Syrdal, "Perceptual and objective detection of discontinuities in concatenative speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2001, pp. 837-840.
-
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2001
, pp. 837-840
-
-
Stylianou, Y.1
Syrdal, A.K.2
-
53
-
-
0033592606
-
Learning the parts of objects by non-negative matrix factorization
-
D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, vol. 401, no. 6755, pp. 788-791, 1999.
-
(1999)
Nature
, vol.401
, Issue.6755
, pp. 788-791
-
-
Lee, D.D.1
Seung, H.S.2
-
54
-
-
80052600921
-
Large margin based nonnegative matrix factorization and partial least squares regression for face recognition
-
J.-Y. Pan and J.-S. Zhang, "Large margin based nonnegative matrix factorization and partial least squares regression for face recognition," Pattern Recogn. Lett., vol. 32, no. 14, pp. 1822-1835, 2011.
-
(2011)
Pattern Recogn. Lett.
, vol.32
, Issue.14
, pp. 1822-1835
-
-
Pan, J.-Y.1
Zhang, J.-S.2
|