메뉴 건너뛰기




Volumn 6836 LNAI, Issue , 2011, Pages 356-363

Web text data mining for building large scale language modelling corpus

Author keywords

duplicity detection; Internet; language modelling; topic identification

Indexed keywords

DETECTION ALGORITHM; DUPLICITY DETECTION; LANGUAGE MODELLING; TEXT CORPORA; TEXT PREPROCESSING; TOPIC IDENTIFICATION; WEB TEXT DATA;

EID: 80052762570     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-23538-2_45     Document Type: Conference Paper
Times cited : (17)

References (11)
  • 1
    • 44349181213 scopus 로고    scopus 로고
    • Design of speech recognition engine
    • Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2000 Springer, Heidelberg
    • Müller, L., Psutka, J., Šmídl, L.: Design of speech recognition engine. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2000. LNCS (LNAI), vol. 1902, pp. 259-264. Springer, Heidelberg (2000)
    • (2000) LNCS (LNAI) , vol.1902 , pp. 259-264
    • Müller, L.1    Psutka, J.2    Šmídl, L.3
  • 2
    • 77958528496 scopus 로고    scopus 로고
    • Using story topics for language model adaptation
    • Seymore, K., Rosenfeld, R.: Using story topics for language model adaptation. In: Proc. Eurospeech, vol. 97, pp. 1987-1990 (1997)
    • (1997) Proc. Eurospeech , vol.97 , pp. 1987-1990
    • Seymore, K.1    Rosenfeld, R.2
  • 4
    • 38049115184 scopus 로고    scopus 로고
    • Recording and annotation of speech corpus for Czech unit selection speech synthesis
    • Matoušek, V., Mautner, P. (eds.) TSD 2007 Springer, Heidelberg
    • Matoušek, J., Romportl, J.: Recording and annotation of speech corpus for Czech unit selection speech synthesis. In: Matoušek, V., Mautner, P. (eds.) TSD 2007. LNCS (LNAI), vol. 4629, pp. 326-333. Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI) , vol.4629 , pp. 326-333
    • Matoušek, J.1    Romportl, J.2
  • 6
    • 78049302607 scopus 로고    scopus 로고
    • Online TV Captioning of Czech Parliamentary Sessions
    • Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2010 Springer, Heidelberg
    • Trmal, J., Pražák, A., Loose, Z., Psutka, J.: Online TV Captioning of Czech Parliamentary Sessions. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2010. LNCS, vol. 6231, pp. 416-422. Springer, Heidelberg (2010)
    • (2010) LNCS , vol.6231 , pp. 416-422
    • Trmal, J.1    Pražák, A.2    Loose, Z.3    Psutka, J.4
  • 7
    • 33646043795 scopus 로고    scopus 로고
    • Automatic transcription of numerals in inflectional languages
    • Matoušek, V.,Mautner, P., Pavelka, T. (eds.) TSD2005 Springer, Heidelberg
    • Zelinka, J., Kanis, J., Müller, L.: Automatic transcription of numerals in inflectional languages. In:Matoušek, V.,Mautner, P., Pavelka, T. (eds.) TSD2005. LNCS (LNAI), vol. 3658, pp. 326-333. Springer, Heidelberg (2005)
    • (2005) LNCS (LNAI) , vol.3658 , pp. 326-333
    • Zelinka, J.1    Kanis, J.2    Müller, L.3
  • 10
    • 80052732897 scopus 로고    scopus 로고
    • Automatic topic identification for large scale language modeling data filtering
    • Habernal, I., Matoušek, V. (eds.) TDS 2011 Springer, Heidelberg
    • Skorkovská, L., Ircing, P., Pražák, A., Lehečka, J.: Automatic topic identification for large scale language modeling data filtering. In: Habernal, I., Matoušek, V. (eds.) TDS 2011. LNCS(LNAI), vol. 6836, pp. 64-71. Springer, Heidelberg (2011)
    • (2011) LNCS(LNAI) , vol.6836 , pp. 64-71
    • Skorkovská, L.1    Ircing, P.2    Pražák, A.3    Lehečka, J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.