-
1
-
-
0029725605
-
Speech synthesis from HMMs using dynamic features
-
T. Masuko, K. Tokuda, T. Kobayashi and, S. Imai, "Speech synthesis from HMMs using dynamic features, " Proceedings of ICASSP, pp. 389-392, 1996.
-
(1996)
Proceedings of ICASSP
, pp. 389-392
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
2
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " Proceedings of Eurospeech, pp. 2347-2350, 1999.
-
(1999)
Proceedings of Eurospeech
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
3
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " Proceedings of ICASSP, pp. 1315-1318, 2000.
-
(2000)
Proceedings of ICASSP
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
4
-
-
73649117102
-
Joint acoustic and language modeling for speech recognition
-
J. T. Chien and C. H. Chueh, "Joint acoustic and language modeling for speech recognition, " Speech Communication, vol. 52, Issue 3, pp. 223-235, 2010.
-
(2010)
Speech Communication
, vol.52
, Issue.3
, pp. 223-235
-
-
Chien, J.T.1
Chueh, C.H.2
-
5
-
-
79959824887
-
Improving speech synthesis of machine translation output
-
A. Parlikar, A. Black, and S. Vogel, "Improving speech synthesis of machine translation output, " Proceedings of Interspeech, pp. 194-197, 2010.
-
(2010)
Proceedings of Interspeech
, pp. 194-197
-
-
Parlikar, A.1
Black, A.2
Vogel, S.3
-
6
-
-
84861092214
-
Impacts of machine translation and speech synthesis on speechto- speech translation
-
K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda, "Impacts of machine translation and speech synthesis on speechto- speech translation, " Speech Communication, vol. 54, Issue 7, pp. 854-866, 2012.
-
(2012)
Speech Communication
, vol.54
, Issue.7
, pp. 854-866
-
-
Hashimoto, K.1
Yamagishi, J.2
Byrne, W.3
King, S.4
Tokuda, K.5
-
7
-
-
0036663562
-
Efficient integrated response generation from multiple target using weighted finite state transducers
-
I. Bulyko and M. Ostendorf, "Efficient integrated response generation from multiple target using weighted finite state transducers, " Computer Speech and Language, vol. 16, pp. 533-550, 2002.
-
(2002)
Computer Speech and Language
, vol.16
, pp. 533-550
-
-
Bulyko, I.1
Ostendorf, M.2
-
8
-
-
70450158623
-
Reranking realizations by predicted synthesis quality
-
C. Nakatsu and M. White, "Reranking realizations by predicted synthesis quality, " Proceedings of ACL, pp. 1113-1120, 2006.
-
(2006)
Proceedings of ACL
, pp. 1113-1120
-
-
Nakatsu, C.1
White, M.2
-
9
-
-
70450163425
-
Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems
-
C. Boidin, V. Rieser, L. Plas, O. Lemon, and J. Chevelu, "Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems, " Proceedings of Interspeech, pp. 2487-2490, 2009.
-
(2009)
Proceedings of Interspeech
, pp. 2487-2490
-
-
Boidin, C.1
Rieser, V.2
Plas, L.3
Lemon, O.4
Chevelu, J.5
-
10
-
-
84890493635
-
Integration of acoustic modeling and mel-cepstral analysis for HMMbased speech synthesis
-
K. Nakamura, K. Hashimoto, Y. Nankaku, and K. Tokuda, "Integration of acoustic modeling and mel-cepstral analysis for HMMbased speech synthesis, " Proceedings of ICASSP, pp. 7883-7887, 2013.
-
(2013)
Proceedings of ICASSP
, pp. 7883-7887
-
-
Nakamura, K.1
Hashimoto, K.2
Nankaku, Y.3
Tokuda, K.4
-
11
-
-
85016140477
-
An adaptive algorithm for mel-cepstral analysis of speech
-
T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech, " Proceedings of ICASSP, vol. 1, pp. 137-140, 1992.
-
(1992)
Proceedings of ICASSP
, vol.1
, pp. 137-140
-
-
Fukada, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
12
-
-
85131821539
-
Mel-generated cepstral analysis - A unified approach to speech spectral estimation
-
K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generated cepstral analysis - A unified approach to speech spectral estimation, " Proceedings of ICSLP, pp. 1043-1045, 1994.
-
(1994)
Proceedings of ICSLP
, pp. 1043-1045
-
-
Tokuda, K.1
Kobayashi, T.2
Masuko, T.3
Imai, S.4
-
13
-
-
0003323711
-
Unbiased estimator of log spectrum and its application to speech signal processing
-
S. Imai and C. Furuichi, "Unbiased estimator of log spectrum and its application to speech signal processing, " Proceedings of EURASIP, pp. 203-206, 1988.
-
(1988)
Proceedings of EURASIP
, pp. 203-206
-
-
Imai, S.1
Furuichi, C.2
-
15
-
-
0009553788
-
A statistical method for estimation of speech spectral density and formant frequencies
-
(Japanese Edition), Jan. Translation: R.W. Schafer and J.D. Markel, eds. Speech Analysis, 295-302, IEEE Press, New York, 1979
-
F. Itakura and S. Saito, "A statistical method for estimation of speech spectral density and formant frequencies, " IECE Transactions on Fundamentals (Japanese Edition), vol.J53-A, no.1, pp35- 42, Jan. 1970. Translation: R.W. Schafer and J.D. Markel, eds., Speech Analysis, pp.295-302, IEEE Press, New York, 1979.
-
(1970)
IECE Transactions on Fundamentals
, vol.J53-A
, Issue.1
, pp. 35-42
-
-
Itakura, F.1
Saito, S.2
-
16
-
-
0000306505
-
Mel log spectral approximation filter for speech synthesis
-
(Japanese Edition), Feb
-
S. Imai, K. Sumita, and C. Furuichi, "Mel log spectral approximation filter for speech synthesis, " IECE Translations on Fundamentals (Japanese Edition), vol. J66-A, pp. 122-129, Feb. 1983.
-
(1983)
IECE Translations on Fundamentals
, vol.J66-A
, pp. 122-129
-
-
Imai, S.1
Sumita, K.2
Furuichi, C.3
-
17
-
-
0002629270
-
Maximumlikelihood from incomplete data via the EM algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximumlikelihood from incomplete data via the EM algorithm, " J. Royal Statist. Soc., Ser. B, 39, pp. 1-38, 1977.
-
(1977)
J. Royal Statist. Soc., Ser. B
, vol.39
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
18
-
-
84972512635
-
Memoir on the probability of the causes of events
-
P. S. Laplace, "Memoir on the probability of the causes of events, " Statistical Science, pp. 364-378, 1986.
-
(1986)
Statistical Science
, pp. 364-378
-
-
Laplace, P.S.1
-
20
-
-
0032029288
-
Deterministic annealing EM algorithm
-
Mar
-
N. Ueda, R. Nakano, "Deterministic annealing EM algorithm, " Neural Networks, vol.11, pp.271-282, Mar. 1998.
-
(1998)
Neural Networks
, vol.11
, pp. 271-282
-
-
Ueda, N.1
Nakano, R.2
-
21
-
-
0033692729
-
Narrowband to wideband conversion of speech using GMM based transformation
-
K.-H. Park and H. S. Kim, "Narrowband to wideband conversion of speech using GMM based transformation, " Proceedings of ICASSP, vol. 3, pp. 1843-1846, 2000.
-
(2000)
Proceedings of ICASSP
, vol.3
, pp. 1843-1846
-
-
Park, K.-H.1
Kim, H.S.2
-
22
-
-
78149261566
-
Bandwidth extension of cellular phone speech based on maximum likelihood estimation with GMM
-
W. Fujitsuru, H. Sekimoto, T. Toda, H. Saruwatari, and K. Shikano, "Bandwidth extension of cellular phone speech based on maximum likelihood estimation with GMM, " Proceedings of NCSP, pp. 283-286, 2008.
-
(2008)
Proceedings of NCSP
, pp. 283-286
-
-
Fujitsuru, W.1
Sekimoto, H.2
Toda, T.3
Saruwatari, H.4
Shikano, K.5
-
23
-
-
0025475528
-
ATR Japanese speech database as a tool of speech recognition and synthesis
-
A. Kuramatsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kawabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, pp. 357-363, 1990.
-
(1990)
Speech Communication
, vol.9
, pp. 357-363
-
-
Kuramatsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kawabara, H.5
Shikano, K.6
-
25
-
-
0032678076
-
Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
-
K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling, " Proceedings of ICASSP, pp. 229-232, 1999.
-
(1999)
Proceedings of ICASSP
, pp. 229-232
-
-
Tokuda, K.1
Masuko, T.2
Miyazaki, N.3
Kobayashi, T.4
-
26
-
-
0025419316
-
Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
-
K. F. Lee, "Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, no. 4, pp. 599-609, 1990.
-
(1990)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.38
, Issue.4
, pp. 599-609
-
-
Lee, K.F.1
-
27
-
-
0002144369
-
Tree-based state tying for high accuracy acoustic modelling
-
S. Young, J. J. Odell, and P.Woodland, "Tree-based state tying for high accuracy acoustic modelling, " Proceedings of ARPA Workshop on Human Language Technology, pp. 307-312, 1994.
-
(1994)
Proceedings of ARPA Workshop on Human Language Technology
, pp. 307-312
-
-
Young, S.1
Odell, J.J.2
Woodland, P.3
-
28
-
-
85135145174
-
Acoustic modeling based on the MDL criterion for speech recognition
-
K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition, " Proceedings of Eurospeech, pp. 99-102, 1997.
-
(1997)
Proceedings of Eurospeech
, pp. 99-102
-
-
Shinoda, K.1
Watanabe, T.2
|