-
1
-
-
84865526894
-
On the impact of annotation errors on unit-selection speech synthesis
-
Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. Springer, Heidelberg
-
Matoušek, J., Tihelka, D., Šmídl, L.: On the impact of annotation errors on unit-selection speech synthesis. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 456-463. Springer, Heidelberg (2012)
-
(2012)
LNCS
, vol.7499
, pp. 456-463
-
-
Matoušek, J.1
Tihelka, D.2
Šmídl, L.3
-
2
-
-
38049115184
-
Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis
-
Matoušek, V., Mautner, P. (eds.) TSD 2007. Springer, Heidelberg
-
Matoušek, J., Romportl, J.: Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis. In: Matoušek, V., Mautner, P. (eds.) TSD 2007. LNCS (LNAI), vol. 4629, pp. 326-333. Springer, Heidelberg (2007)
-
(2007)
LNCS (LNAI)
, vol.4629
, pp. 326-333
-
-
Matoušek, J.1
Romportl, J.2
-
3
-
-
33947621608
-
Database pruning for unsupervised building of text-to-speech voices
-
Adell, J., Agüero, P.D., Bonafonte, A.: Database pruning for unsupervised building of text-to-speech voices. In: Proc. ICASSP, Toulouse, France, pp. 889-892 (2006)
-
(2006)
Proc. ICASSP, Toulouse, France
, pp. 889-892
-
-
Adell, J.1
Agüero, P.D.2
Bonafonte, A.3
-
4
-
-
85133409312
-
Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone
-
Tachibana, R., Nagano, T., Kurata, G., Nishimura, M., Babaguchi, N.: Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone. In: Proc. INTERSPEECH, Antwerp, Belgium, pp. 1917-1920 (2007)
-
(2007)
Proc. INTERSPEECH, Antwerp, Belgium
, pp. 1917-1920
-
-
Tachibana, R.1
Nagano, T.2
Kurata, G.3
Nishimura, M.4
Babaguchi, N.5
-
5
-
-
67650738938
-
A new method for mispronunciation detection using support vector machine based on pronunciation space models
-
Wei, S., Hu, G., Hu, Y., Wang, R.H.: A new method for mispronunciation detection using support vector machine based on pronunciation space models. Speech Commun. 51(10), 896-905 (2009)
-
(2009)
Speech Commun.
, vol.51
, Issue.10
, pp. 896-905
-
-
Wei, S.1
Hu, G.2
Hu, Y.3
Wang, R.H.4
-
6
-
-
85133467491
-
Impact of durational outlier removal from unit selection catalogs
-
Kominek, J., Black, A.: Impact of durational outlier removal from unit selection catalogs. In: Proc. SSW, Pittsburgh, USA, pp. 155-160 (2004)
-
(2004)
Proc. SSW, Pittsburgh, USA
, pp. 155-160
-
-
Kominek, J.1
Black, A.2
-
7
-
-
79959842894
-
Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier
-
Lu, H.,Wei, S., Dai, L.,Wang, R.H.: Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier. In: Proc. INTERSPEECH, Makuhari, Japan, pp. 162-165 (2010)
-
(2010)
Proc. INTERSPEECH, Makuhari, Japan
, pp. 162-165
-
-
Lu, H.1
Wei, S.2
Dai, L.3
Wang, R.H.4
-
8
-
-
84858985804
-
Automatic detection of unnatural word-level segments in unitselection speech synthesis
-
Wang, W.Y., Georgila, K.: Automatic detection of unnatural word-level segments in unitselection speech synthesis. In: Proc. ASRU, Hawaii, USA, pp. 289-294 (2011)
-
(2011)
Proc. ASRU, Hawaii, USA
, pp. 289-294
-
-
Wang, W.Y.1
Georgila, K.2
-
9
-
-
79959816801
-
Enhancements of Viterbi search for fast unit selection synthesis
-
Tihelka, D., Kala, J., Matoušek, J.: Enhancements of Viterbi search for fast unit selection synthesis. In: Proc. INTERSPEECH, Makuhari, Japan, pp. 174-177 (2010)
-
(2010)
Proc. INTERSPEECH, Makuhari, Japan
, pp. 174-177
-
-
Tihelka, D.1
Kala, J.2
Matoušek, J.3
-
10
-
-
0003822743
-
-
(for HTK Version 3.4). The Cambridge University, Cambridge
-
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: HTK Book (for HTK Version 3.4). The Cambridge University, Cambridge (2006)
-
(2006)
HTK Book
-
-
Young, S.1
Evermann, G.2
Gales, M.3
Hain, T.4
Kershaw, D.5
Liu, X.6
Moore, G.7
Odell, J.8
Ollason, D.9
Povey, D.10
Valtchev, V.11
Woodland, P.12
-
11
-
-
9444268028
-
Experiments with Automatic Segmentation for Czech Speech Synthesis
-
Text, Speech and Dialogue
-
Matoušek, J., Tihelka, D., Psutka, J.V.: Experiments with Automatic Segmentation for Czech Speech Synthesis. In: Matoušek, V., Mautner, P. (eds.) TSD 2003. LNCS (LNAI), vol. 2807, pp. 287-294. Springer, Heidelberg (2003) (Pubitemid 37171194)
-
(2003)
LECTURE NOTES IN COMPUTER SCIENCE
, Issue.2807
, pp. 287-294
-
-
Matousek, J.1
Tihelka, D.2
Psutka, J.3
-
12
-
-
84867215793
-
Automatic pitch-synchronous phonetic segmentation
-
Matoušek, J., Romportl, J.: Automatic pitch-synchronous phonetic segmentation. In: Proc. INTERSPEECH, Brisbane, Australia, pp. 1626-1629 (2008)
-
(2008)
Proc. INTERSPEECH, Brisbane, Australia
, pp. 1626-1629
-
-
Matoušek, J.1
Romportl, J.2
-
13
-
-
0000259511
-
Approximate statistical tests for comparing supervised classification learning algorithms
-
Dietterich, T.G.: Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput. 10, 1895-1923 (1998)
-
(1998)
Neural Comput.
, vol.10
, pp. 1895-1923
-
-
Dietterich, T.G.1
-
14
-
-
34249753618
-
Support-vector networks
-
Cortes, C., Vapnik, V.: Support-vector networks. Machine Leaming 20(3), 273-279 (1995)
-
(1995)
Machine Leaming
, vol.20
, Issue.3
, pp. 273-279
-
-
Cortes, C.1
Vapnik, V.2
-
16
-
-
84893371793
-
Prosody modelling in Czech text-to-speech synthesis
-
Romportl, J., Kala, J.: Prosody modelling in Czech text-to-speech synthesis. In: Proc. SSW, Bonn, Germany, pp. 200-205 (2007)
-
(2007)
Proc. SSW, Bonn, Germany
, pp. 200-205
-
-
Romportl, J.1
Kala, J.2
-
17
-
-
0003850397
-
-
Taylor, P., Caley, R., Black, A., King, S.: Edinburgh speech tools library: System documentation (1999), http://www.cstr.ed.ac.uk/projects/speech- tools/manual-1.2.0/
-
(1999)
Edinburgh Speech Tools Library: System Documentation
-
-
Taylor, P.1
Caley, R.2
Black, A.3
King, S.4
-
18
-
-
80555140075
-
Édouard Duchesnay: Scikit-learn: Machine learning in Python
-
Pedregosa, F., Varoquaux, G., Gramfort, A., Thirion, V.M.B., Grisel, O., Blondel, M., Prettenhofer, P.,Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perror, M.: Édouard Duchesnay: Scikit-learn: Machine learning in Python. J. Machine Learn. Res. 12, 2825-2830 (2011)
-
(2011)
J. Machine Learn. Res.
, vol.12
, pp. 2825-2830
-
-
Pedregosa, F.1
Varoquaux, G.2
Gramfort, A.3
Thirion, V.M.B.4
Grisel, O.5
Blondel, M.6
Prettenhofer, P.7
Weiss, R.8
Dubourg, V.9
Vanderplas, J.10
Passos, A.11
Cournapeau, D.12
Brucher, M.13
Perror, M.14
-
19
-
-
80051635711
-
Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak
-
Přibil, J., Přibilová, A.: Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak. In: Proc. ICASSP, Prague, Czech Republic, pp. 4720-4723 (2011)
-
(2011)
Proc. ICASSP, Prague, Czech Republic
, pp. 4720-4723
-
-
Přibil, J.1
Přibilová, A.2
-
20
-
-
65249161821
-
Using morphological information for robust language modeling in Czech ASR system
-
Ircing, P., Psutka, J., Psutka, J.V.: Using morphological information for robust language modeling in Czech ASR system. IEEE Trans. Audio Speech Lang. Process. 17, 840-847 (2009)
-
(2009)
IEEE Trans. Audio Speech Lang. Process.
, vol.17
, pp. 840-847
-
-
Ircing, P.1
Psutka, J.2
Psutka, J.V.3
-
21
-
-
84960458231
-
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive
-
Psutka, J., Švec, J., Psutka, J.V., Vaněk, J., Pražák, A., Šmídl, L., Ircing, P.: System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive. EURASIP J. Audio Speech Music Process. 10 (2011)
-
(2011)
EURASIP J. Audio Speech Music Process.
, vol.10
-
-
Psutka, J.1
Švec, J.2
Psutka, J.V.3
Vaněk, J.4
Pražák, A.5
Šmídl, L.6
Ircing, P.7
|