-
1
-
-
0023739214
-
Voice conversion through vector quantization
-
Abe, M.; Nakamura, S.; Shikano, K.; Kuwabara, H.; 1988. Voice conversion through vector quantization. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(1988)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
2
-
-
0141521592
-
Modeling prosodic dynamics for speaker recognition
-
Adami, A.G.; Mihaescu, R.; Reynolds, D.A.; Godfrey, J.J.; 2003. Modeling prosodic dynamics for speaker recognition. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2003)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Adami, A.G.1
Mihaescu, R.2
Reynolds, D.A.3
Godfrey, J.J.4
-
5
-
-
84890542394
-
Spoofing countermeasures to protect automatic speaker verification from voice conversion
-
Alegre, F.; Amehraye, A.; Evans, N.; 2013b. Spoofing countermeasures to protect automatic speaker verification from voice conversion. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Alegre, F.1
Amehraye, A.2
Evans, N.3
-
6
-
-
84856662834
-
Anti-spoofing: Voice databases
-
S.Z. Li, A.K. Jain, Springer-Verlag US
-
F. Alegre, N. Evans, T. Kinnunen, Z. Wu, and J. Yamagishi Anti-spoofing: voice databases S.Z. Li, A.K. Jain, Encyclopedia of Biometrics 2014 Springer-Verlag US
-
(2014)
Encyclopedia of Biometrics
-
-
Alegre, F.1
Evans, N.2
Kinnunen, T.3
Wu, Z.4
Yamagishi, J.5
-
7
-
-
84906244272
-
A new speaker verification spoofing countermeasure based on local binary patterns
-
Alegre, F.; Vipperla, R.; Amehraye, A.; Evans, N.; 2013c. A new speaker verification spoofing countermeasure based on local binary patterns. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Alegre, F.1
Vipperla, R.2
Amehraye, A.3
Evans, N.4
-
9
-
-
84878412793
-
Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals
-
Alegre, F.; Vipperla, R.; Evans, N.; et al.; 2012b. Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals. In: Proc. Interspeech.
-
(2012)
Proc. Interspeech
-
-
Alegre, F.1
Vipperla, R.2
Evans, N.3
-
10
-
-
84995407786
-
Detecting voice disguise from speech variability: Analysis of three glottal and vocal tract measures
-
4068-4068
-
T.B. Amin, J.S. German, and P. Marziliano Detecting voice disguise from speech variability: analysis of three glottal and vocal tract measures J. Acoust. Soc. Am. 134 2013 4068-4068
-
(2013)
J. Acoust. Soc. Am.
, vol.134
-
-
Amin, T.B.1
German, J.S.2
Marziliano, P.3
-
11
-
-
84896907805
-
Glottal and vocal tract characteristics of voice impersonators
-
T.B. Amin, P. Marziliano, and J.S. German Glottal and vocal tract characteristics of voice impersonators IEEE Trans. Multimedia 16 2014 668 678
-
(2014)
IEEE Trans. Multimedia
, vol.16
, pp. 668-678
-
-
Amin, T.B.1
Marziliano, P.2
German, J.S.3
-
12
-
-
84871393354
-
Bob: A free signal processing and machine learning toolbox for researchers
-
Anjos, A.; El-Shafey, L.; Wallace, R.; Günther, M.; McCool, C.; Marcel, S.; 2012. Bob: a free signal processing and machine learning toolbox for researchers. In: Proc. the 20th ACM Int. Conf. on Multimedia.
-
(2012)
Proc. The 20th ACM Int. Conf. on Multimedia
-
-
Anjos, A.1
El-Shafey, L.2
Wallace, R.3
Günther, M.4
McCool, C.5
Marcel, S.6
-
13
-
-
0002425861
-
The AT&T Next-Gen TTS system
-
Beutnagel, B.; Conkie, A.; Schroeter, J.; Stylianou, Y.; Syrdal, A.; 1999. The AT&T Next-Gen TTS system. In: Proc. Joint ASA, EAA and DAGA Meeting.
-
(1999)
Proc. Joint ASA, EAA and DAGA Meeting
-
-
Beutnagel, B.1
Conkie, A.2
Schroeter, J.3
Stylianou, Y.4
Syrdal, A.5
-
14
-
-
2942594475
-
A tutorial on text-independent speaker verification
-
F. Bimbot, J.F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-García, D. Petrovska-Delacrétaz, and D.A. Reynolds A tutorial on text-independent speaker verification EURASIP J. Appl. Signal Process. 2004 2004 430 451
-
(2004)
EURASIP J. Appl. Signal Process.
, vol.2004
, pp. 430-451
-
-
Bimbot, F.1
Bonastre, J.F.2
Fredouille, C.3
Gravier, G.4
Magrin-Chagnolleau, I.5
Meignier, S.6
Merlin, T.7
Ortega-García, J.8
Petrovska-Delacrétaz, D.9
Reynolds, D.A.10
-
15
-
-
44949232373
-
CLUSTERGEN: A statistical parametric synthesizer using trajectory modeling
-
Black, A.W.; 2006. CLUSTERGEN: A statistical parametric synthesizer using trajectory modeling. In: Proc. Interspeech.
-
(2006)
Proc. Interspeech
-
-
Black, A.W.1
-
16
-
-
84890478111
-
Speaker verification scores and acoustic analysis of a professional impersonator
-
Blomberg, M.; Elenius, D.; Zetterholm, E.; 2004. Speaker verification scores and acoustic analysis of a professional impersonator. In: Proc. FONETIK.
-
(2004)
Proc. FONETIK
-
-
Blomberg, M.1
Elenius, D.2
Zetterholm, E.3
-
17
-
-
84974570703
-
Praat: Doing phonetics by computer
-
retrieved 12 February 2014
-
Boersma, P.; Weenink, D.; 2014. Praat: doing phonetics by computer. Computer program. Version 5.3.64, retrieved 12 February 2014 from < http://www.praat.org/ >.
-
(2014)
Computer Program. Version 5.3.64
-
-
Boersma, P.1
Weenink, D.2
-
19
-
-
33947662555
-
Detecting replay attacks in audiovisual identity verification
-
Bredin, H.; Miguel, A.; Witten, I.H.; Chollet, G.; 2006. Detecting replay attacks in audiovisual identity verification. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2006)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Bredin, H.1
Miguel, A.2
Witten, I.H.3
Chollet, G.4
-
21
-
-
51449086024
-
Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006
-
N. Brümmer, L. Burget, J. Černocký, O. Glembek, F. Grézl, M. Karafiát, D. Leeuwen, P. Matějka, P. Schwarz, and A. Strasheim Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006 IEEE Trans. Audio Speech Language Process. 15 2007 2072 2084
-
(2007)
IEEE Trans. Audio Speech Language Process.
, vol.15
, pp. 2072-2084
-
-
Brümmer, N.1
Burget, L.2
Černocký, J.3
Glembek, O.4
Grézl, F.5
Karafiát, M.6
Leeuwen, D.7
Matějka, P.8
Schwarz, P.9
Strasheim, A.10
-
22
-
-
58349102016
-
Analysis of feature extraction and channel compensation in a GMM speaker recognition system
-
L. Burget, P. Matějka, P. Schwarz, O. Glembek, and J. Černocký Analysis of feature extraction and channel compensation in a GMM speaker recognition system IEEE Trans. Audio Speech Language Process. 15 2007 1979 1986
-
(2007)
IEEE Trans. Audio Speech Language Process.
, vol.15
, pp. 1979-1986
-
-
Burget, L.1
Matějka, P.2
Schwarz, P.3
Glembek, O.4
Černocký, J.5
-
23
-
-
33645887246
-
Support vector machines using GMM supervectors for speaker verification
-
W.M. Campbell, D.E. Sturim, and D.A. Reynolds Support vector machines using GMM supervectors for speaker verification IEEE Signal Process. Lett. 13 2006 308 311
-
(2006)
IEEE Signal Process. Lett.
, vol.13
, pp. 308-311
-
-
Campbell, W.M.1
Sturim, D.E.2
Reynolds, D.A.3
-
24
-
-
0031233424
-
Speaker recognition: A tutorial
-
J.P. Campbell Jr. Speaker recognition: a tutorial Proc. IEEE 85 1997 1437 1462
-
(1997)
Proc. IEEE
, vol.85
, pp. 1437-1462
-
-
Campbell Jr., J.P.1
-
25
-
-
84906225084
-
Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
-
Chen, L.H.; Ling, Z.H.; Song, Y.; Dai, L.R.; 2013. Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Chen, L.H.1
Ling, Z.H.2
Song, Y.3
Dai, L.R.4
-
27
-
-
84905560807
-
Voice conversion with smoothed GMM and MAP adaptation
-
Chen, Y.; Chu, M.; Chang, E.; Liu, J.; Liu, R.; 2003. Voice conversion with smoothed GMM and MAP adaptation. In: Proc. European Conference on Speech Communication and Technology (Eurospeech).
-
(2003)
Proc. European Conference on Speech Communication and Technology (Eurospeech)
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
30
-
-
84985926077
-
Segment selection in the L&H RealSpeak laboratory TTS system
-
Coorman, G.; Fackrell, J.; Rutten, P.; Coile, B.; 2000. Segment selection in the L&H RealSpeak laboratory TTS system. In: Proc. Int. Conf. on Spoken Language Processing (ICSLP), pp. 395-398.
-
(2000)
Proc. Int. Conf. on Spoken Language Processing (ICSLP)
, pp. 395-398
-
-
Coorman, G.1
Fackrell, J.2
Rutten, P.3
Coile, B.4
-
31
-
-
78049409687
-
Revisiting the security of speaker verification systems against imposture using synthetic speech
-
De Leon, P.L.; Apsingekar, V.R.; Pucher, M.; Yamagishi, J.; 2010a. Revisiting the security of speaker verification systems against imposture using synthetic speech. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2010)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
De Leon, P.L.1
Apsingekar, V.R.2
Pucher, M.3
Yamagishi, J.4
-
32
-
-
80051658143
-
Detection of synthetic speech for the problem of imposture
-
De Leon, P.L.; Hernaez, I.; Saratxaga, I.; Pucher, M.; Yamagishi, J.; 2011. Detection of synthetic speech for the problem of imposture. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2011)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
De Leon, P.L.1
Hernaez, I.2
Saratxaga, I.3
Pucher, M.4
Yamagishi, J.5
-
34
-
-
84865369980
-
Evaluation of speaker verification security and detection of HMM-based synthetic speech
-
P.L. De Leon, M. Pucher, J. Yamagishi, I. Hernaez, and I. Saratxaga Evaluation of speaker verification security and detection of HMM-based synthetic speech IEEE Trans. Audio Speech Language Process. 20 2012 2280 2290
-
(2012)
IEEE Trans. Audio Speech Language Process.
, vol.20
, pp. 2280-2290
-
-
De Leon, P.L.1
Pucher, M.2
Yamagishi, J.3
Hernaez, I.4
Saratxaga, I.5
-
35
-
-
84878402831
-
Synthetic speech discrimination using pitch pattern statistics derived from image analysis
-
De Leon, P.L.; Stewart, B.; Yamagishi, J.; 2012b. Synthetic speech discrimination using pitch pattern statistics derived from image analysis. In: Proc. Interspeech.
-
(2012)
Proc. Interspeech
-
-
De Leon, P.L.1
Stewart, B.2
Yamagishi, J.3
-
37
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet Front-end factor analysis for speaker verification IEEE Trans. Audio Speech Language Process. 19 2011 788 798
-
(2011)
IEEE Trans. Audio Speech Language Process.
, vol.19
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
40
-
-
2942623033
-
-
MD. National Institute of Standards and Technology
-
Doddington, G.; Liggett, W.; Martin, A.; Przybocki, M.; Reynolds, D.; 1998. Sheep, goats, lambs and wolves: a statistical analysis of speaker performance in the NIST 1998 speaker recognition evaluation, Gaithersburg, MD. National Institute of Standards and Technology.
-
(1998)
Sheep, Goats, Lambs and Wolves: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation, Gaithersburg
-
-
Doddington, G.1
Liggett, W.2
Martin, A.3
Przybocki, M.4
Reynolds, D.5
-
42
-
-
34547496196
-
Towards a voice conversion system based on frame selection
-
Dutoit, T.; Holzapfel, A.; Jottrand, M.; Moinet, A.; Perez, J.; Stylianou, Y.; 2007. Towards a voice conversion system based on frame selection. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2007)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Dutoit, T.1
Holzapfel, A.2
Jottrand, M.3
Moinet, A.4
Perez, J.5
Stylianou, Y.6
-
43
-
-
0015069185
-
Voice spectrograms as a function of age, voice disguise, and voice imitation
-
W. Endres, W. Bambach, and G. Flösser Voice spectrograms as a function of age, voice disguise, and voice imitation J. Acoust. Soc. Am. 49 1971 1842 1848
-
(1971)
J. Acoust. Soc. Am.
, vol.49
, pp. 1842-1848
-
-
Endres, W.1
Bambach, W.2
Flösser, G.3
-
46
-
-
84872177757
-
Parametric voice conversion based on bilinear frequency warping plus amplitude scaling
-
D. Erro, E. Navas, and I. Hernaez Parametric voice conversion based on bilinear frequency warping plus amplitude scaling IEEE Trans. Audio Speech Language Process. 21 2013 556 566
-
(2013)
IEEE Trans. Audio Speech Language Process.
, vol.21
, pp. 556-566
-
-
Erro, D.1
Navas, E.2
Hernaez, I.3
-
47
-
-
84856662834
-
Anti-spoofing: Voice conversion
-
S.Z. Li, A.K. Jain, Springer-Verlag US
-
N. Evans, F. Alegre, Z. Wu, and T. Kinnunen Anti-spoofing: voice conversion S.Z. Li, A.K. Jain, Encyclopedia of Biometrics 2014 Springer-Verlag US
-
(2014)
Encyclopedia of Biometrics
-
-
Evans, N.1
Alegre, F.2
Wu, Z.3
Kinnunen, T.4
-
48
-
-
84906263293
-
Spoofing and countermeasures for automatic speaker verification
-
Evans, N.; Kinnunen, T.; Yamagishi, J.; 2013. Spoofing and countermeasures for automatic speaker verification. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Evans, N.1
Kinnunen, T.2
Yamagishi, J.3
-
49
-
-
84905231482
-
Speaker recognition anti-spoofing
-
S. Marcel, S.Z. Li, M. Nixon, Springer
-
N. Evans, T. Kinnunen, J. Yamagishi, Z. Wu, F. Alegre, and P. De Leon Speaker recognition anti-spoofing S. Marcel, S.Z. Li, M. Nixon, Handbook of Biometric Anti-spoofing 2014 Springer
-
(2014)
Handbook of Biometric Anti-spoofing
-
-
Evans, N.1
Kinnunen, T.2
Yamagishi, J.3
Wu, Z.4
Alegre, F.5
De Leon, P.6
-
51
-
-
77956826012
-
Automatic speaker recognition as a measurement of voice imitation and conversion
-
M. Farrús, M. Wagner, D. Erro, and J. Hernando Automatic speaker recognition as a measurement of voice imitation and conversion Int. J. Speech Language Law 17 2010 119 142
-
(2010)
Int. J. Speech Language Law
, vol.17
, pp. 119-142
-
-
Farrús, M.1
Wagner, M.2
Erro, D.3
Hernando, J.4
-
53
-
-
33751542948
-
Speaker verification security improvement by means of speech watermarking
-
M. Faundez-Zanuy, M. Hagmüller, and G. Kubin Speaker verification security improvement by means of speech watermarking Speech Commun. 48 2006 1608 1619
-
(2006)
Speech Commun.
, vol.48
, pp. 1608-1619
-
-
Faundez-Zanuy, M.1
Hagmüller, M.2
Kubin, G.3
-
54
-
-
78049390307
-
A comparison of approaches for modeling prosodic features in speaker recognition
-
Ferrer, L.; Scheffer, N.; Shriberg, E.; 2010. A comparison of approaches for modeling prosodic features in speaker recognition. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2010)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Ferrer, L.1
Scheffer, N.2
Shriberg, E.3
-
56
-
-
70449713306
-
On the vulnerability of face verification systems to hill-climbing attacks
-
J. Galbally, C. McCool, J. Fierrez, S. Marcel, and J. Ortega-Garcia On the vulnerability of face verification systems to hill-climbing attacks Pattern Recogn. 43 2010 1027 1038
-
(2010)
Pattern Recogn.
, vol.43
, pp. 1027-1038
-
-
Galbally, J.1
McCool, C.2
Fierrez, J.3
Marcel, S.4
Ortega-Garcia, J.5
-
58
-
-
84865733857
-
Analysis of i-vector length normalization in speaker recognition systems
-
Garcia-Romero, D.; Espy-Wilson, C.Y.; 2011. Analysis of i-vector length normalization in speaker recognition systems. In: Proc. Interspeech.
-
(2011)
Proc. Interspeech
-
-
Garcia-Romero, D.1
Espy-Wilson, C.Y.2
-
59
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
J.L. Gauvain, and C.H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Trans. Speech Audio Process. 2 1994 291 298
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, pp. 291-298
-
-
Gauvain, J.L.1
Lee, C.H.2
-
60
-
-
29844455876
-
Pitch extraction and fundamental frequency: History and current techniques
-
Department of Computer Science, University of Regina
-
Gerhard, D.; 2003. Pitch extraction and fundamental frequency: History and current techniques. Technical Report TR-CS 2003-06, Department of Computer Science, University of Regina.
-
(2003)
Technical Report TR-CS 2003-06
-
-
Gerhard, D.1
-
62
-
-
84857498745
-
Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
-
E. Godoy, O. Rosec, and T. Chonavel Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora IEEE Trans. Audio Speech Language Process. 20 2012 1313 1323
-
(2012)
IEEE Trans. Audio Speech Language Process.
, vol.20
, pp. 1313-1323
-
-
Godoy, E.1
Rosec, O.2
Chonavel, T.3
-
64
-
-
84929624509
-
Comparison of human listeners and speaker verification systems using voice mimicry data
-
Joensuu, Finland
-
Hautamäki, R.G.; Kinnunen, T.; Hautamäki, V.; Laukkanen, A.M.; 2014. Comparison of human listeners and speaker verification systems using voice mimicry data. In: Proc. Odyssey: the Speaker and Language Recognition Workshop, Joensuu, Finland. pp. 137-144.
-
(2014)
Proc. Odyssey: The Speaker and Language Recognition Workshop
, pp. 137-144
-
-
Hautamäki, R.G.1
Kinnunen, T.2
Hautamäki, V.3
Laukkanen, A.M.4
-
65
-
-
84906213805
-
I-vectors meet imitators: On vulnerability of speaker verification systems against voice mimicry
-
Hautamäki, R.G.; Kinnunen, T.; Hautamäki, V.; Leino, T.; Laukkanen, A.M.; 2013a. I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Hautamäki, R.G.1
Kinnunen, T.2
Hautamäki, V.3
Leino, T.4
Laukkanen, A.M.5
-
66
-
-
84877743396
-
Sparse classifier fusion for speaker verification
-
V. Hautamäki, T. Kinnunen, F. Sedlák, K.A. Lee, B. Ma, and H. Li Sparse classifier fusion for speaker verification IEEE Trans. Audio Speech Language Process. 21 2013 1622 1631
-
(2013)
IEEE Trans. Audio Speech Language Process.
, vol.21
, pp. 1622-1631
-
-
Hautamäki, V.1
Kinnunen, T.2
Sedlák, F.3
Lee, K.A.4
Ma, B.5
Li, H.6
-
69
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups IEEE Signal Process. Mag. 29 2012 82 97
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
75
-
-
33947637189
-
Joint factor analysis of speaker and session variability: Theory and algorithms
-
Kenny, P.; 2006. Joint factor analysis of speaker and session variability: theory and algorithms. Technical report CRIM-06/08-14.
-
(2006)
Technical Report CRIM-06/08-14
-
-
Kenny, P.1
-
77
-
-
58349106697
-
A study of inter-speaker variability in speaker verification
-
P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel A study of inter-speaker variability in speaker verification IEEE Trans. Audio Speech Language Process. 16 2008 980 988
-
(2008)
IEEE Trans. Audio Speech Language Process.
, vol.16
, pp. 980-988
-
-
Kenny, P.1
Ouellet, P.2
Dehak, N.3
Gupta, V.4
Dumouchel, P.5
-
78
-
-
70350125882
-
An overview of text-independent speaker recognition: From features to supervectors
-
T. Kinnunen, and H. Li An overview of text-independent speaker recognition: from features to supervectors Speech Commun. 52 2010 12 40
-
(2010)
Speech Commun.
, vol.52
, pp. 12-40
-
-
Kinnunen, T.1
Li, H.2
-
79
-
-
84867600098
-
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech
-
Kinnunen, T.; Wu, Z.Z.; Lee, K.A.; Sedlak, F.; Chng, E.S.; Li, H.; 2012. Vulnerability of speaker verification systems against voice conversion spoofing attacks: the case of telephone speech. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2012)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Kinnunen, T.1
Wu, Z.Z.2
Lee, K.A.3
Sedlak, F.4
Chng, E.S.5
Li, H.6
-
80
-
-
84867211827
-
Acoustic analysis of imitated voice produced by a professional impersonator
-
Kitamura, T.; 2008. Acoustic analysis of imitated voice produced by a professional impersonator. In: Proc. Interspeech.
-
(2008)
Proc. Interspeech
-
-
Kitamura, T.1
-
81
-
-
0018986665
-
Software for a cascade/parallel formant synthesizer
-
D.H. Klatt Software for a cascade/parallel formant synthesizer J. Acoust. Soc. Am. 67 1980 971 995
-
(1980)
J. Acoust. Soc. Am.
, vol.67
, pp. 971-995
-
-
Klatt, D.H.1
-
83
-
-
84906234851
-
Voice transformation-based spoofing of text-dependent speaker verification systems
-
Kons, Z.; Aronowitz, H.; 2013. Voice transformation-based spoofing of text-dependent speaker verification systems. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Kons, Z.1
Aronowitz, H.2
-
84
-
-
84896111913
-
Alize 3.0-open source toolkit for state-of-the-art speaker recognition
-
Larcher, A.; Bonastre, J.F.; Fauve, B.; Lee, K.A.; Lévy, C.; Li, H.; Mason, J.S.; Parfait, J.Y.; ValidSoft Ltd, U.; 2013a. Alize 3.0-open source toolkit for state-of-the-art speaker recognition. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Larcher, A.1
Bonastre, J.F.2
Fauve, B.3
Lee, K.A.4
Lévy, C.5
Li, H.6
Mason, J.S.7
Parfait, J.Y.8
Validsoft Ltd, U.9
-
85
-
-
84878465724
-
RSR2015: Database for text-dependent speaker verification using multiple pass-phrases
-
Larcher, A.; Lee, K.A.; Ma, B.; Li, H.; 2012. RSR2015: database for text-dependent speaker verification using multiple pass-phrases. In: Proc. Interspeech.
-
(2012)
Proc. Interspeech
-
-
Larcher, A.1
Lee, K.A.2
Ma, B.3
Li, H.4
-
86
-
-
84890536710
-
Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances
-
Larcher, A.; Lee, K.A.; Ma, B.; Li, H.; 2013b. Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Larcher, A.1
Lee, K.A.2
Ma, B.3
Li, H.4
-
87
-
-
84897385841
-
Text-dependent speaker verification: Classifiers, databases and RSR2015
-
A. Larcher, K.A. Lee, B. Ma, and H. Li Text-dependent speaker verification: classifiers, databases and RSR2015 Speech Commun. 60 2014 56 77
-
(2014)
Speech Commun.
, vol.60
, pp. 56-77
-
-
Larcher, A.1
Lee, K.A.2
Ma, B.3
Li, H.4
-
91
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Comput. Speech Language 9 1995 171 185
-
(1995)
Comput. Speech Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
92
-
-
84906236343
-
-
Master's thesis, University of Tampere, Finland, In Finnish
-
Leskelä, J.; 2011. Changes in F0, formant frequencies and spectral slope in imitation. Master's thesis, University of Tampere, Finland, In Finnish.
-
(2011)
Changes in F0, Formant Frequencies and Spectral Slope in Imitation
-
-
Leskelä, J.1
-
93
-
-
85032751399
-
Techware: Speaker and spoken language recognition resources [best of the web]
-
H. Li, and B. Ma Techware: speaker and spoken language recognition resources [best of the web] IEEE Signal Process. Mag. 27 2010 139 142
-
(2010)
IEEE Signal Process. Mag.
, vol.27
, pp. 139-142
-
-
Li, H.1
Ma, B.2
-
94
-
-
84876676725
-
Spoken language recognition: From fundamentals to practice
-
H. Li, B. Ma, and K.A. Lee Spoken language recognition: from fundamentals to practice Proc. IEEE 101 2013 1136 1159
-
(2013)
Proc. IEEE
, vol.101
, pp. 1136-1159
-
-
Li, H.1
Ma, B.2
Lee, K.A.3
-
95
-
-
81855205043
-
Probabilistic models for inference about identity
-
P. Li, Y. Fu, U. Mohammed, J.H. Elder, and S.J. Prince Probabilistic models for inference about identity IEEE Trans. Pattern Anal. Machine Intell. 34 2012 144 157
-
(2012)
IEEE Trans. Pattern Anal. Machine Intell.
, vol.34
, pp. 144-157
-
-
Li, P.1
Fu, Y.2
Mohammed, U.3
Elder, J.H.4
Prince, S.J.5
-
97
-
-
84901237776
-
Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
-
Z.H. Ling, L. Deng, and D. Yu Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis IEEE Trans. Audio Speech Language Process. 21 2013 2129 2139
-
(2013)
IEEE Trans. Audio Speech Language Process.
, vol.21
, pp. 2129-2139
-
-
Ling, Z.H.1
Deng, L.2
Yu, D.3
-
98
-
-
67650851754
-
USTC system for Blizzard Challenge 2006: An improved HMM-based speech synthesis method
-
Ling, Z.H.; Wu, Y.J.; Wang, Y.P.; Qin, L.; Wang, R.H.; 2006. USTC system for Blizzard Challenge 2006: an improved HMM-based speech synthesis method. In: The Blizzard Challenge Workshop.
-
(2006)
The Blizzard Challenge Workshop
-
-
Ling, Z.H.1
Wu, Y.J.2
Wang, Y.P.3
Qin, L.4
Wang, R.H.5
-
99
-
-
84919943786
-
-
Blizzard Challenge workshop
-
Ling, Z.H.; Xia, X.J.; Song, Y.; Yang, C.Y.; Chen, L.H.; Dai, L.R.; 2012. The USTC system for Blizzard Challenge 2012. In: Blizzard Challenge workshop.
-
(2012)
The USTC System for Blizzard Challenge 2012
-
-
Ling, Z.H.1
Xia, X.J.2
Song, Y.3
Yang, C.Y.4
Chen, L.H.5
Dai, L.R.6
-
101
-
-
84929157442
-
Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
-
Lu, H.; King, S.; Watts, O.; 2013. Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis. In: Proc. the 8th ISCA Speech Synthesis Workshop.
-
(2013)
Proc. The 8th ISCA Speech Synthesis Workshop
-
-
Lu, H.1
King, S.2
Watts, O.3
-
102
-
-
84919943783
-
Spoofing and anti-spoofing in biometrics: Lessons learned from the tabula rasa project
-
Retrieved 26 February 2014
-
Marcel, S.; 2013. Spoofing and anti-spoofing in biometrics: Lessons learned from the tabula rasa project. Tutorial. Retrieved 26 February 2014 from < http://www.idiap.ch/marcel/professional/BTAS-2013.html >.
-
(2013)
Tutorial
-
-
Marcel, S.1
-
104
-
-
85046873967
-
The DET curve in assessment of detection task performance
-
Martin, A.; Doddington, G.; Kamm, T.; Ordowski, M.; Przybocki, M.; 1997. The DET curve in assessment of detection task performance. In: Proc. European Conference on Speech Communication and Technology (Eurospeech).
-
(1997)
Proc. European Conference on Speech Communication and Technology (Eurospeech)
-
-
Martin, A.1
Doddington, G.2
Kamm, T.3
Ordowski, M.4
Przybocki, M.5
-
106
-
-
1942512336
-
Imposture using synthetic speech against speaker verification based on spectrum and pitch
-
Masuko, T.; Tokuda, K.; Kobayashi, T.; 2000. Imposture using synthetic speech against speaker verification based on spectrum and pitch. In: Proc. Interspeech.
-
(2000)
Proc. Interspeech
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
-
107
-
-
0029725605
-
Speech synthesis using HMMs with dynamic features
-
Masuko, T.; Tokuda, K.; Kobayashi, T.; Imai, S.; 1996. Speech synthesis using HMMs with dynamic features. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(1996)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
108
-
-
0030696416
-
Voice characteristics conversion for HMM-based speech synthesis system
-
Masuko, T.; Tokuda, K.; Kobayashi, T.; Imai, S.; 1997. Voice characteristics conversion for HMM-based speech synthesis system. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(1997)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
109
-
-
33947714703
-
Effect of speech transformation on impostor acceptance
-
Matrouf, D.; Bonastre, J.F.; Fredouille, C.; 2006. Effect of speech transformation on impostor acceptance. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2006)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Matrouf, D.1
Bonastre, J.F.2
Fredouille, C.3
-
110
-
-
0029355724
-
Likelihood normalization for speaker verification using a phoneme- and speaker-independent model
-
T. Matsui, and S. Furui Likelihood normalization for speaker verification using a phoneme- and speaker-independent model Speech Commun. 17 1995 109 116
-
(1995)
Speech Commun.
, vol.17
, pp. 109-116
-
-
Matsui, T.1
Furui, S.2
-
111
-
-
0025543906
-
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
-
E. Moulines, and F. Charpentier Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones Speech Commun. 9 1990 453 467
-
(1990)
Speech Commun.
, vol.9
, pp. 453-467
-
-
Moulines, E.1
Charpentier, F.2
-
113
-
-
84919943779
-
-
Nuance
-
Nuance, 2013. Nuance VocalPassword. In: < http://www.nuance.com/landing-pages/products/voicebiometrics/vocalpassword.asp >.
-
(2013)
Nuance VocalPassword
-
-
-
114
-
-
27544482501
-
Discrimination method of synthetic speech using pitch frequency against synthetic speech falsification
-
A. Ogihara, H. Unno, and A. Shiozaki Discrimination method of synthetic speech using pitch frequency against synthetic speech falsification IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88 2005 280 286
-
(2005)
IEICE Trans. Fundam. Electron. Commun. Comput. Sci.
, vol.88
, pp. 280-286
-
-
Ogihara, A.1
Unno, H.2
Shiozaki, A.3
-
115
-
-
84919943778
-
Finding impostors in the crowd: The use of crowdsourcing to attack biometric systems
-
Bell Labs India
-
Panjwani, S.; Prakash, A.; 2014. Finding impostors in the crowd: the use of crowdsourcing to attack biometric systems. Unpublished manuscript, Bell Labs India.
-
(2014)
Unpublished Manuscript
-
-
Panjwani, S.1
Prakash, A.2
-
117
-
-
33646787422
-
Voice forgery using ALISP: Indexation in a client memory
-
Perrot, P.; Aversano, G.; Blouet, R.; Charbit, M.; Chollet, G.; 2005. Voice forgery using ALISP: indexation in a client memory. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2005)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Perrot, P.1
Aversano, G.2
Blouet, R.3
Charbit, M.4
Chollet, G.5
-
120
-
-
84905251808
-
On the training aspects of deep neural network (DNN) for parametric TTS synthesis
-
Qian, Y.; Fan, Y.; Hu, W.; Soong, F.K.; 2014. On the training aspects of deep neural network (DNN) for parametric TTS synthesis. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2014)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Qian, Y.1
Fan, Y.2
Hu, W.3
Soong, F.K.4
-
121
-
-
85008039410
-
Improved prosody generation by maximizing joint probability of state and longer units
-
Y. Qian, Z. Wu, B. Gao, and F.K. Soong Improved prosody generation by maximizing joint probability of state and longer units IEEE Trans. Audio Speech Language Process. 19 2011 1702 1710
-
(2011)
IEEE Trans. Audio Speech Language Process.
, vol.19
, pp. 1702-1710
-
-
Qian, Y.1
Wu, Z.2
Gao, B.3
Soong, F.K.4
-
123
-
-
0034809453
-
Enhancing security and privacy in biometrics-based authentication systems
-
N.K. Ratha, J.H. Connell, and R.M. Bolle Enhancing security and privacy in biometrics-based authentication systems IBM Syst. J. 40 2001 614 634
-
(2001)
IBM Syst. J.
, vol.40
, pp. 614-634
-
-
Ratha, N.K.1
Connell, J.H.2
Bolle, R.M.3
-
124
-
-
0141744710
-
The SuperSID project: Exploiting high-level information for high-accuracy speaker recognition
-
Reynolds, D.; Andrews, W.; Campbell, J.; Navratil, J.; Peskin, B.; Adami, A.; Jin, Q.; Klusacek, D.; Abramson, J.; Mihaescu, R.; Godfrey, J.; Jones, D.; Xiang, B.; 2003. The SuperSID project: exploiting high-level information for high-accuracy speaker recognition. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2003)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Reynolds, D.1
Andrews, W.2
Campbell, J.3
Navratil, J.4
Peskin, B.5
Adami, A.6
Jin, Q.7
Klusacek, D.8
Abramson, J.9
Mihaescu, R.10
Godfrey, J.11
Jones, D.12
Xiang, B.13
-
125
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
D. Reynolds, T. Quatieri, and R. Dunn Speaker verification using adapted Gaussian mixture models Digital Signal Process. 10 2000 19 41
-
(2000)
Digital Signal Process.
, vol.10
, pp. 19-41
-
-
Reynolds, D.1
Quatieri, T.2
Dunn, R.3
-
126
-
-
0029209272
-
Robust text-independent speaker identification using Gaussian mixture speaker models
-
D. Reynolds, and R. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Trans. Speech Audio Process. 3 1995 72 83
-
(1995)
IEEE Trans. Speech Audio Process.
, vol.3
, pp. 72-83
-
-
Reynolds, D.1
Rose, R.2
-
127
-
-
84919943776
-
Evaluation of initial non-ICAO countermeasures for spoofing attacks
-
7th Framework Programme of the European, grant agreement number 257289
-
Riera, A.; Soria-Frisch, A.; Acedo, J.; Hadid, A.; Alegre, F.; Evans, N.; Marcialis, G.L.; 2012. Evaluation of initial non-ICAO countermeasures for spoofing attacks. Technical Report Deliverable D4.2, Trusted biometrics under spoofing attacks (TABULA RASA), 7th Framework Programme of the European, grant agreement number 257289.
-
(2012)
Technical Report Deliverable D4.2, Trusted Biometrics under Spoofing Attacks (TABULA RASA)
-
-
Riera, A.1
Soria-Frisch, A.2
Acedo, J.3
Hadid, A.4
Alegre, F.5
Evans, N.6
Marcialis, G.L.7
-
129
-
-
84898068800
-
I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification
-
Saeidi, R.; et al.; 2013. I4U submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Saeidi, R.1
-
132
-
-
21844454996
-
Modeling prosodic feature sequences for speaker recognition
-
E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke Modeling prosodic feature sequences for speaker recognition Speech Commun. 46 2005 455 472
-
(2005)
Speech Commun.
, vol.46
, pp. 455-472
-
-
Shriberg, E.1
Ferrer, L.2
Kajarekar, S.3
Venkataraman, A.4
Stolcke, A.5
-
133
-
-
84867600823
-
Intonational speaker verification: A study on parameters and performance under noisy conditions
-
Siddiq, S.; Kinnunen, T.; Vainio, M.; Werner, S.; 2012. Intonational speaker verification: a study on parameters and performance under noisy conditions. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2012)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Siddiq, S.1
Kinnunen, T.2
Vainio, M.3
Werner, S.4
-
134
-
-
33645895387
-
Advances in channel compensation for SVM speaker recognition
-
Solomonoff, A.; Campbell, W.; Boardman, I.; 2005. Advances in channel compensation for SVM speaker recognition. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2005)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Solomonoff, A.1
Campbell, W.2
Boardman, I.3
-
135
-
-
84897378628
-
Text-dependent speaker recognition using PLDA with uncertainty propagation
-
Stafylakis, T.; Kenny, P.; Ouellet, P.; Perez, J.; Kockmann, M.; Dumouchel, P.; 2013. Text-dependent speaker recognition using PLDA with uncertainty propagation. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Stafylakis, T.1
Kenny, P.2
Ouellet, P.3
Perez, J.4
Kockmann, M.5
Dumouchel, P.6
-
139
-
-
33947623206
-
Text-independent voice conversion based on unit selection
-
Sundermann, D.; Hoge, H.; Bonafonte, A.; Ney, H.; Black, A.; Narayanan, S.; 2006. Text-independent voice conversion based on unit selection. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2006)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Sundermann, D.1
Hoge, H.2
Bonafonte, A.3
Ney, H.4
Black, A.5
Narayanan, S.6
-
141
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Language Process. 15 2007 2222 2235
-
(2007)
IEEE Trans. Audio Speech Language Process.
, vol.15
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
142
-
-
0034842552
-
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
-
Toda, T.; Saruwatari, H.; Shikano, K.; 2001. Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2001)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Toda, T.1
Saruwatari, H.2
Shikano, K.3
-
143
-
-
79958818321
-
An overview of speaker identification: Accuracy and robustness issues
-
R. Togneri, and D. Pullella An overview of speaker identification: accuracy and robustness issues IEEE Circ. Syst. Mag. 11 2011 23 61
-
(2011)
IEEE Circ. Syst. Mag.
, vol.11
, pp. 23-61
-
-
Togneri, R.1
Pullella, D.2
-
144
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inform. Syst. 90 2007 816 824
-
(2007)
IEICE Trans. Inform. Syst.
, vol.90
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
145
-
-
84867605072
-
Speaker verification performance degradation against spoofing and tampering attacks
-
Villalba, J.; Lleida, E.; 2010. Speaker verification performance degradation against spoofing and tampering attacks. In: FALA 10 workshop, pp. 131-134.
-
(2010)
FALA 10 Workshop
, pp. 131-134
-
-
Villalba, J.1
Lleida, E.2
-
146
-
-
79952940570
-
Detecting replay attacks from far-field recordings on speaker verification systems
-
C. Vielhauer, J. Dittmann, A. Drygajlo, N. Juul, M. Fairhurst, Lecture Notes in Computer Science Springer
-
J. Villalba, and E. Lleida Detecting replay attacks from far-field recordings on speaker verification systems C. Vielhauer, J. Dittmann, A. Drygajlo, N. Juul, M. Fairhurst, Biometrics and ID Management Lecture Notes in Computer Science 2011 Springer 274 285
-
(2011)
Biometrics and ID Management, Lecture Notes in Computer Science
, pp. 274-285
-
-
Villalba, J.1
Lleida, E.2
-
151
-
-
84878410960
-
Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition
-
Wu, Z.; Chng, E.S.; Li, H.; 2012a. Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition. In: Proc. Interspeech 2012.
-
(2012)
Proc. Interspeech 2012
-
-
Wu, Z.1
Chng, E.S.2
Li, H.3
-
152
-
-
84874448812
-
A study on spoofing attack in state-of-the-art speaker verification: The telephone speech case
-
Wu, Z.; Kinnunen, T.; Chng, E.S.; Li, H.; Ambikairajah, E.; 2012b. A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. In: Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC).
-
(2012)
Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC)
-
-
Wu, Z.1
Kinnunen, T.2
Chng, E.S.3
Li, H.4
Ambikairajah, E.5
-
154
-
-
84906276055
-
Exemplar-based unit selection for voice conversion utilizing temporal information
-
Wu, Z.; Virtanen, T.; Kinnunen, T.; Chng, E.S.; Li, H.; 2013a. Exemplar-based unit selection for voice conversion utilizing temporal information. In: Proc. Interspeech.
-
(2013)
Proc. Interspeech
-
-
Wu, Z.1
Virtanen, T.2
Kinnunen, T.3
Chng, E.S.4
Li, H.5
-
155
-
-
84890543945
-
Synthetic speech detection using temporal modulation feature
-
Wu, Z.; Xiao, X.; Chng, E.S.; Li, H.; 2013b. Synthetic speech detection using temporal modulation feature. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Wu, Z.1
Xiao, X.2
Chng, E.S.3
Li, H.4
-
156
-
-
79959842826
-
Text-independent F0 transformation with non-parallel data for voice conversion
-
Wu, Z.Z.; Kinnunen, T.; Chng, E.S.; Li, H.; 2010. Text-independent F0 transformation with non-parallel data for voice conversion. In: Proc. Interspeech.
-
(2010)
Proc. Interspeech
-
-
Wu, Z.Z.1
Kinnunen, T.2
Chng, E.S.3
Li, H.4
-
158
-
-
67650854725
-
Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
-
J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm IEEE Trans. Audio Speech Language Process. 17 2009 66 83
-
(2009)
IEEE Trans. Audio Speech Language Process.
, vol.17
, pp. 66-83
-
-
Yamagishi, J.1
Kobayashi, T.2
Nakano, Y.3
Ogata, K.4
Isogai, J.5
-
159
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
Yoshimura, T.; Tokuda, K.; Masuko, T.; Kobayashi, T.; Kitamura, T.; 1999. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. In: Proc. European Conference on Speech Communication and Technology (Eurospeech).
-
(1999)
Proc. European Conference on Speech Communication and Technology (Eurospeech)
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
160
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
Zen, H.; Senior, A.; Schuster, M.; 2013. Statistical parametric speech synthesis using deep neural networks. In: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP).
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
161
-
-
33846405723
-
Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
-
H. Zen, T. Toda, M. Nakamura, and K. Tokuda Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005 IEICE Trans. Inform. Syst. 2007 325 333
-
(2007)
IEICE Trans. Inform. Syst.
, pp. 325-333
-
-
Zen, H.1
Toda, T.2
Nakamura, M.3
Tokuda, K.4
-
162
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A.W. Black Statistical parametric speech synthesis Speech Commun. 51 2009 1039 1064
-
(2009)
Speech Commun.
, vol.51
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3