-
1
-
-
2942594475
-
A tutorial on text-independent speaker verification
-
F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-Garcia, and D. A. Reynolds, “A tutorial on text-independent speaker verification,” EURASIP J. Applied Signal Process., vol. 4, pp. 430-451, 2004.
-
(2004)
EURASIP J. Applied Signal Process.
, vol.4
, pp. 430-451
-
-
Bimbot, F.1
Bonastre, J.F.2
Fredouille, C.3
Gravier, G.4
Magrin-Chagnolleau, I.5
Meignier, S.6
Merlin, T.7
Ortega-Garcia, J.8
Reynolds, D.A.9
-
2
-
-
85135274466
-
On the security of HMM-based speaker verification systems against imposture using synthetic speech
-
T. Masuko, T. Hitotsumatsu, K. Tokuda, and T. Kobayashi, “On the security of HMM-based speaker verification systems against imposture using synthetic speech,” in Proc. EUROSPEECH, 1999.
-
(1999)
Proc. EUROSPEECH
-
-
Masuko, T.1
Hitotsumatsu, T.2
Tokuda, K.3
Kobayashi, T.4
-
3
-
-
0029355724
-
Likelihood normalization for speaker verification using a phoneme- and speaker-independent model
-
Aug
-
T. Matsui and S. Furui, “Likelihood normalization for speaker verification using a phoneme- and speaker-independent model,” Speech Commun., vol. 17, no. 1-2, pp. 109-116, Aug. 1995.
-
(1995)
Speech Commun
, vol.17
, Issue.1-2
, pp. 109-116
-
-
Matsui, T.1
Furui, S.2
-
4
-
-
0029725605
-
Speech synthesis using HMMs with dynamic features
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, “Speech synthesis using HMMs with dynamic features,” in Proc. ICASSP, 1996.
-
(1996)
Proc. ICASSP
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
5
-
-
85009077529
-
Imposture using synthetic speech against speaker verification based on spectrum and pitch
-
T. Masuko, K. Tokuda, and T. Kobayashi, “Imposture using synthetic speech against speaker verification based on spectrum and pitch,” in Proc. ICSLP, 2000.
-
(2000)
Proc. ICSLP
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
-
6
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, “Speaker verification using adapted Gaussian mixture models,” Dig. Sig. Process., vol. 10, pp. 19-41, 2000.
-
(2000)
Dig. Sig. Process.
, vol.10
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
7
-
-
33744969076
-
Real-time speaker identification and verification
-
Jan
-
T. Kinnunen, E. Karpov, and P. Franti, “Real-time speaker identification and verification,” IEEE Trans. Audio, Speech, and Language Process., vol. 14, no. 1, pp. 277-288, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, and Language Process.
, vol.14
, Issue.1
, pp. 277-288
-
-
Kinnunen, T.1
Karpov, E.2
Franti, P.3
-
8
-
-
65249096207
-
Combining derivative and parametric kernels for speaker verification
-
May
-
C. Longworth and M.L.F. Gales, “Combining derivative and parametric kernels for speaker verification,” IEEE Trans. Audio, Speech, and Language Process., vol. 17, no. 4, pp. 748-757, May 2009.
-
(2009)
IEEE Trans. Audio, Speech, and Language Process.
, vol.17
, Issue.4
, pp. 748-757
-
-
Longworth, C.1
Gales, M.L.F.2
-
9
-
-
67651002140
-
Statistical parametric speech synthesis
-
Nov
-
H. Zen, K. Tokuda, and A. W. Black, “Statistical parametric speech synthesis,” Speech Communication, vol. 51, no. 11, pp. 1039-1064, Nov. 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
10
-
-
85008006694
-
A robust speaker-adaptive HMM-based text-to-speech synthesis
-
Aug
-
J. Yamagishi, T. Nose, H. Zen, Z.-H. Ling, T. Toda, K. Tokuda, S. King, and S. Renals, “A robust speaker-adaptive HMM-based text-to-speech synthesis,” IEEE Trans. Audio, Speech, and Language Process., vol. 17, no. 6, pp. 1208-1230, Aug. 2009.
-
(2009)
IEEE Trans. Audio, Speech, and Language Process.
, vol.17
, Issue.6
, pp. 1208-1230
-
-
Yamagishi, J.1
Nose, T.2
Zen, H.3
Ling, Z.-H.4
Toda, T.5
Tokuda, K.6
King, S.7
Renals, S.8
-
11
-
-
84867223798
-
Robustness of HMM-based speech synthesis
-
Brisbane, Australia, Sept
-
J. Yamagishi, Z.-H. Ling, and S. King, “Robustness of HMM-based speech synthesis,” in Proc. Interspeech 2008, Brisbane, Australia, Sept. 2008, pp. 581-584.
-
(2008)
Proc. Interspeech 2008
, pp. 581-584
-
-
Yamagishi, J.1
Ling, Z.-H.2
King, S.3
-
12
-
-
70450161300
-
Thousands of voices for HMM-based speech synthesis
-
Brighton, UK, September
-
J. Yamagishi, B. Usabaev, S. King, O. Watts, J. Dines, J. Tian, R. Hu, K. Oura, K. Tokuda, R. Karhila, and M. Kurimo, “Thousands of voices for HMM-based speech synthesis,” in Proc. Interspeech 2009, Brighton, UK, September 2009, pp. 420-423.
-
(2009)
Proc. Interspeech 2009
, pp. 420-423
-
-
Yamagishi, J.1
Usabaev, B.2
King, S.3
Watts, O.4
Dines, J.5
Tian, J.6
Hu, R.7
Oura, K.8
Tokuda, K.9
Karhila, R.10
Kurimo, M.11
-
13
-
-
77953708096
-
Thousands of voices for HMM-based speech synthesis - Analysis and application of TTS systems built on various ASR corpora
-
March
-
J. Yamagishi, B. Usabaev, S. King, O. Watts, J. Dines, J. Tian, R. Hu, K. Oura, K. Tokuda, R. Karhila, and M. Kurimo, “Thousands of voices for HMM-based speech synthesis - Analysis and application of TTS systems built on various ASR corpora,” IEEE Trans. Audio, Speech, and Language Process., in press, March 2010.
-
(2010)
IEEE Trans. Audio, Speech, and Language Process.
-
-
Yamagishi, J.1
Usabaev, B.2
King, S.3
Watts, O.4
Dines, J.5
Tian, J.6
Hu, R.7
Oura, K.8
Tokuda, K.9
Karhila, R.10
Kurimo, M.11
-
14
-
-
67650819492
-
The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
-
Sept
-
J. Yamagishi, H. Zen, Y.-J. Wu, T. Toda, and K. Tokuda, “The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge,” in Proc. Blizzard Challenge 2008, Sept. 2008.
-
(2008)
Proc. Blizzard Challenge 2008
-
-
Yamagishi, J.1
Zen, H.2
Wu, Y.-J.3
Toda, T.4
Tokuda, K.5
-
16
-
-
67650790758
-
The Blizzard Challenge 2008
-
Brisbane, Australia, September
-
V. Karaiskos, S. King, R. A. J. Clark, and C. Mayo, “The Blizzard Challenge 2008,” in Proc. Blizzard Challenge Workshop, Brisbane, Australia, September 2008.
-
(2008)
Proc. Blizzard Challenge Workshop
-
-
Karaiskos, V.1
King, S.2
Clark, R.A.J.3
Mayo, C.4
-
17
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds,” Speech Communication, vol. 27, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
de Cheveigné, A.3
-
18
-
-
33846405723
-
Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
-
Jan
-
H. Zen, T. Toda, M. Nakamura, and K. Tokuda, “Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.90
, Issue.1
, pp. 325-333
-
-
Zen, H.1
Toda, T.2
Nakamura, M.3
Tokuda, K.4
-
19
-
-
44449177634
-
A hidden semi-Markov model-based speech synthesis system
-
May
-
H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “A hidden semi-Markov model-based speech synthesis system,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, May 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.90
, Issue.5
, pp. 825-834
-
-
Zen, H.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
20
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
A. Dempster, N. Laird, and D. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,” Journal of the Royal Statistical Society, Series B, vol. 39, no. 1, pp. 1-38, 1977.
-
(1977)
Journal of the Royal Statistical Society, Series B
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
21
-
-
0033906251
-
MDL-based context-dependent subword modeling for speech recognition
-
Mar
-
K. Shinoda and T. Watanabe, “MDL-based context-dependent subword modeling for speech recognition,” J. Acoust. Soc. Japan (E), vol. 21, pp. 79-86, Mar. 2000.
-
(2000)
J. Acoust. Soc. Japan (E)
, vol.21
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
22
-
-
0030362995
-
A compact model for speaker-adaptive training
-
Oct
-
T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, “A compact model for speaker-adaptive training,” in Proc. ICSLP-96, Oct. 1996, pp. 1137-1140.
-
(1996)
Proc. ICSLP-96
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
23
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M.J.F. Gales, “Maximum likelihood linear transformations for HMM-based speech recognition,” Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
24
-
-
67650854725
-
Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
-
Jan
-
J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, “Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm,” IEEE Trans. Audio, Speech, and Language Process., vol. 17, no. 1, pp. 66-83, Jan. 2009.
-
(2009)
IEEE Trans. Audio, Speech, and Language Process.
, vol.17
, Issue.1
, pp. 66-83
-
-
Yamagishi, J.1
Kobayashi, T.2
Nakano, Y.3
Ogata, K.4
Isogai, J.5
-
25
-
-
33847129573
-
Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
-
Feb
-
J. Yamagishi and T. Kobayashi, “Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.90
, Issue.2
, pp. 533-543
-
-
Yamagishi, J.1
Kobayashi, T.2
-
26
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
May
-
T. Toda and K. Tokuda, “A speech parameter generation algorithm considering global variance for HMM-based speech synthesis,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.90
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
27
-
-
0025543906
-
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
-
E. Moulines and F. Charpentier, “Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones,” Speech Communication, vol. 9, no. 5-6, pp. 453-468, 1990.
-
(1990)
Speech Communication
, vol.9
, Issue.5-6
, pp. 453-468
-
-
Moulines, E.1
Charpentier, F.2
-
28
-
-
85016140477
-
An adaptive algorithm for mel-cepstral analysis of speech
-
Mar
-
T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, “An adaptive algorithm for mel-cepstral analysis of speech,” in Proc. ICASSP-92, Mar. 1992, pp. 137-140.
-
(1992)
Proc. ICASSP-92
, pp. 137-140
-
-
Fukada, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
29
-
-
85073258179
-
Feature warping for robust speaker verification
-
J. Pelecanos and S. Sridharan, “Feature warping for robust speaker verification,” in Proc. ODYSSEY, 2001.
-
(2001)
Proc. ODYSSEY
-
-
Pelecanos, J.1
Sridharan, S.2
-
30
-
-
78049409687
-
Revisiting the security of speaker verification systems against imposture using synthetic speech
-
Dallas, USA, March
-
P. L. De Leon, V. R. Apsingekar, M. Pucher, and J. Yamagishi, “Revisiting the security of speaker verification systems against imposture using synthetic speech,” in Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, USA, March 2010.
-
(2010)
Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
De Leon, P.L.1
Apsingekar, V.R.2
Pucher, M.3
Yamagishi, J.4
-
31
-
-
0012330750
-
The design for the Wall Street Journal-based CSR corpus
-
Harriman, New York
-
D. B. Paul and J. M. Baker, “The design for the Wall Street Journal-based CSR corpus,” in Proceedings of the Workshop on Speech and Natural Language, Harriman, New York, 1992, pp. 357-362.
-
(1992)
Proceedings of the Workshop on Speech and Natural Language
, pp. 357-362
-
-
Paul, D.B.1
Baker, J.M.2
-
33
-
-
0023704929
-
Normalizations and selection of speech segments for speaker recognition scoring
-
April
-
K. P. Li and J. E. Porter, “Normalizations and selection of speech segments for speaker recognition scoring,” Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, vol. 1, pp. 595-598, April 1988.
-
(1988)
Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing
, vol.1
, pp. 595-598
-
-
Li, K.P.1
Porter, J.E.2
-
34
-
-
0033884857
-
Score normalization for text-independent speaker verification systems
-
R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, “Score normalization for text-independent speaker verification systems,” Digital Signal Processing, vol. 10, no. 1, pp. 42-54, 2000.
-
(2000)
Digital Signal Processing
, vol.10
, Issue.1
, pp. 42-54
-
-
Auckenthaler, R.1
Carey, M.2
Lloyd-Thomas, H.3
-
35
-
-
85009119461
-
A robust speaker verification system against imposture using an HMM-based speech synthesis system
-
T. Satoh, T. Masuko, T. Kobayashi, and K. Tokuda, “A robust speaker verification system against imposture using an HMM-based speech synthesis system,” in Proc. Eurospeech, 2001.
-
(2001)
Proc. Eurospeech
-
-
Satoh, T.1
Masuko, T.2
Kobayashi, T.3
Tokuda, K.4