SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 8082 LNAI, Issue , 2013, Pages 457-464

SVM-based detection of misannotated words in read speech corpora

(2) Matoušek, Jindřich a Tihelka, Daniel a

a UNIVERSITY OF WEST BOHEMIA (Czech Republic)

Author keywords

annotation error detection; classification; read speech corpora; support vector machine

Indexed keywords

ANNOTATION ERRORS; AUTOMATIC DETECTION; DETECTION METHODS; FEATURE SETS; PRECISION AND RECALL; SPEECH CORPORA; SVM CLASSIFIERS;

CLASSIFICATION (OF INFORMATION); SPEECH; SUPPORT VECTOR MACHINES;

SPEECH RECOGNITION;

EID: 84884955838 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-40585-3_58 Document Type: Conference Paper

Times cited : (3)

References (21)

1
- 84865526894
- On the impact of annotation errors on unit-selection speech synthesis
- Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. Springer, Heidelberg
- Matoušek, J., Tihelka, D., Šmídl, L.: On the impact of annotation errors on unit-selection speech synthesis. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 456-463. Springer, Heidelberg (2012)
- (2012) LNCS , vol.7499 , pp. 456-463
- Matoušek, J.¹ Tihelka, D.² Šmídl, L.³

2
- 38049115184
- Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis
- Matoušek, V., Mautner, P. (eds.) TSD 2007. Springer, Heidelberg
- Matoušek, J., Romportl, J.: Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis. In: Matoušek, V., Mautner, P. (eds.) TSD 2007. LNCS (LNAI), vol. 4629, pp. 326-333. Springer, Heidelberg (2007)
- (2007) LNCS (LNAI) , vol.4629 , pp. 326-333
- Matoušek, J.¹ Romportl, J.²

3
- 33947621608
- Database pruning for unsupervised building of text-to-speech voices
- Adell, J., Agüero, P.D., Bonafonte, A.: Database pruning for unsupervised building of text-to-speech voices. In: Proc. ICASSP, Toulouse, France, pp. 889-892 (2006)
- (2006) Proc. ICASSP, Toulouse, France , pp. 889-892
- Adell, J.¹ Agüero, P.D.² Bonafonte, A.³

4
- 85133409312
- Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone
- Tachibana, R., Nagano, T., Kurata, G., Nishimura, M., Babaguchi, N.: Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone. In: Proc. INTERSPEECH, Antwerp, Belgium, pp. 1917-1920 (2007)
- (2007) Proc. INTERSPEECH, Antwerp, Belgium , pp. 1917-1920
- Tachibana, R.¹ Nagano, T.² Kurata, G.³ Nishimura, M.⁴ Babaguchi, N.⁵

5
- 67650738938
- A new method for mispronunciation detection using support vector machine based on pronunciation space models
- Wei, S., Hu, G., Hu, Y., Wang, R.H.: A new method for mispronunciation detection using support vector machine based on pronunciation space models. Speech Commun. 51(10), 896-905 (2009)
- (2009) Speech Commun. , vol.51 , Issue.10 , pp. 896-905
- Wei, S.¹ Hu, G.² Hu, Y.³ Wang, R.H.⁴

6
- 85133467491
- Impact of durational outlier removal from unit selection catalogs
- Kominek, J., Black, A.: Impact of durational outlier removal from unit selection catalogs. In: Proc. SSW, Pittsburgh, USA, pp. 155-160 (2004)
- (2004) Proc. SSW, Pittsburgh, USA , pp. 155-160
- Kominek, J.¹ Black, A.²

7
- 79959842894
- Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier
- Lu, H.,Wei, S., Dai, L.,Wang, R.H.: Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier. In: Proc. INTERSPEECH, Makuhari, Japan, pp. 162-165 (2010)
- (2010) Proc. INTERSPEECH, Makuhari, Japan , pp. 162-165
- Lu, H.¹ Wei, S.² Dai, L.³ Wang, R.H.⁴

8
- 84858985804
- Automatic detection of unnatural word-level segments in unitselection speech synthesis
- Wang, W.Y., Georgila, K.: Automatic detection of unnatural word-level segments in unitselection speech synthesis. In: Proc. ASRU, Hawaii, USA, pp. 289-294 (2011)
- (2011) Proc. ASRU, Hawaii, USA , pp. 289-294
- Wang, W.Y.¹ Georgila, K.²

9
- 79959816801
- Enhancements of Viterbi search for fast unit selection synthesis
- Tihelka, D., Kala, J., Matoušek, J.: Enhancements of Viterbi search for fast unit selection synthesis. In: Proc. INTERSPEECH, Makuhari, Japan, pp. 174-177 (2010)
- (2010) Proc. INTERSPEECH, Makuhari, Japan , pp. 174-177
- Tihelka, D.¹ Kala, J.² Matoušek, J.³

10
- 0003822743
- (for HTK Version 3.4). The Cambridge University, Cambridge
- Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: HTK Book (for HTK Version 3.4). The Cambridge University, Cambridge (2006)
- (2006) HTK Book
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

11
- 9444268028
- Experiments with Automatic Segmentation for Czech Speech Synthesis
- Text, Speech and Dialogue
- Matoušek, J., Tihelka, D., Psutka, J.V.: Experiments with Automatic Segmentation for Czech Speech Synthesis. In: Matoušek, V., Mautner, P. (eds.) TSD 2003. LNCS (LNAI), vol. 2807, pp. 287-294. Springer, Heidelberg (2003) (Pubitemid 37171194)
- (2003) LECTURE NOTES IN COMPUTER SCIENCE , Issue.2807 , pp. 287-294
- Matousek, J.¹ Tihelka, D.² Psutka, J.³

12
- 84867215793
- Automatic pitch-synchronous phonetic segmentation
- Matoušek, J., Romportl, J.: Automatic pitch-synchronous phonetic segmentation. In: Proc. INTERSPEECH, Brisbane, Australia, pp. 1626-1629 (2008)
- (2008) Proc. INTERSPEECH, Brisbane, Australia , pp. 1626-1629
- Matoušek, J.¹ Romportl, J.²

13
- 0000259511
- Approximate statistical tests for comparing supervised classification learning algorithms
- Dietterich, T.G.: Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput. 10, 1895-1923 (1998)
- (1998) Neural Comput. , vol.10 , pp. 1895-1923
- Dietterich, T.G.¹

14
- 34249753618
- Support-vector networks
- Cortes, C., Vapnik, V.: Support-vector networks. Machine Leaming 20(3), 273-279 (1995)
- (1995) Machine Leaming , vol.20 , Issue.3 , pp. 273-279
- Cortes, C.¹ Vapnik, V.²

15
- 84906228629
- Annotation errors detection in TTS corpora
- Matoušek, J., Tihelka, D.: Annotation errors detection in TTS corpora. In: Proc. Interspeech, Lyon, France (2013)
- Proc. Interspeech, Lyon, France (2013)
- Matoušek, J.¹ Tihelka, D.²

16
- 84893371793
- Prosody modelling in Czech text-to-speech synthesis
- Romportl, J., Kala, J.: Prosody modelling in Czech text-to-speech synthesis. In: Proc. SSW, Bonn, Germany, pp. 200-205 (2007)
- (2007) Proc. SSW, Bonn, Germany , pp. 200-205
- Romportl, J.¹ Kala, J.²

17
- 0003850397
- Taylor, P., Caley, R., Black, A., King, S.: Edinburgh speech tools library: System documentation (1999), http://www.cstr.ed.ac.uk/projects/speech- tools/manual-1.2.0/
- (1999) Edinburgh Speech Tools Library: System Documentation
- Taylor, P.¹ Caley, R.² Black, A.³ King, S.⁴

18
- 80555140075
- Édouard Duchesnay: Scikit-learn: Machine learning in Python
- Pedregosa, F., Varoquaux, G., Gramfort, A., Thirion, V.M.B., Grisel, O., Blondel, M., Prettenhofer, P.,Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perror, M.: Édouard Duchesnay: Scikit-learn: Machine learning in Python. J. Machine Learn. Res. 12, 2825-2830 (2011)
- (2011) J. Machine Learn. Res. , vol.12 , pp. 2825-2830
- Pedregosa, F.¹ Varoquaux, G.² Gramfort, A.³ Thirion, V.M.B.⁴ Grisel, O.⁵ Blondel, M.⁶ Prettenhofer, P.⁷ Weiss, R.⁸ Dubourg, V.⁹ Vanderplas, J.¹⁰ Passos, A.¹¹ Cournapeau, D.¹² Brucher, M.¹³ Perror, M.¹⁴

19
- 80051635711
- Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak
- Přibil, J., Přibilová, A.: Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak. In: Proc. ICASSP, Prague, Czech Republic, pp. 4720-4723 (2011)
- (2011) Proc. ICASSP, Prague, Czech Republic , pp. 4720-4723
- Přibil, J.¹ Přibilová, A.²

20
- 65249161821
- Using morphological information for robust language modeling in Czech ASR system
- Ircing, P., Psutka, J., Psutka, J.V.: Using morphological information for robust language modeling in Czech ASR system. IEEE Trans. Audio Speech Lang. Process. 17, 840-847 (2009)
- (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , pp. 840-847
- Ircing, P.¹ Psutka, J.² Psutka, J.V.³

21
- 84960458231
- System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive
- Psutka, J., Švec, J., Psutka, J.V., Vaněk, J., Pražák, A., Šmídl, L., Ircing, P.: System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive. EURASIP J. Audio Speech Music Process. 10 (2011)
- (2011) EURASIP J. Audio Speech Music Process. , vol.10
- Psutka, J.¹ Švec, J.² Psutka, J.V.³ Vaněk, J.⁴ Pražák, A.⁵ Šmídl, L.⁶ Ircing, P.⁷

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.