Volume 6, Issue 4, 2007, Pages

Stemming Indonesian: A confix-stripping approach

Author keywords

Indonesian; Information retrieval; Stemming

Indexed keywords

ALGORITHMS; INFORMATION RETRIEVAL SYSTEMS; TEXT PROCESSING; TRANSLATION (LANGUAGES); WORD PROCESSING

EID: 38149091838     PISSN: 15300226     EISSN: 15583430     Source Type: Journal    
DOI: 10.1145/1316457.1316459     Document Type: Article
Times cited: 135

References (53)
  • 1
    • 0030378256
    • Experiments with a stemming algorithm for Malay words
    • AHMAD, F., YUSOFF, M., AND SEMBOK, T. M. T. 1996. Experiments with a stemming algorithm for Malay words. J. Amer. Soc. Inform. Sci. 47, 12, 909-918.
    • (1996) J. Amer. Soc. Inform. Sci , vol.47 , Issue.12 , pp. 909-918
    • AHMAD, F.1    YUSOFF, M.2    SEMBOK, T.M.T.3
  • 2
    • 38149081039
    • Classification of event news documents in Indonesian language using single pass clustering algorithm
    • Teknik Elektro, Sepuluh Nopember Institute of Technology
    • ARIFIN, A. Z. AND SETIONO, A. N. 2002. Classification of event news documents in Indonesian language using single pass clustering algorithm. In Proceedings of the Seminar on Intelligent Technology and its Applications (SITIA). Teknik Elektro, Sepuluh Nopember Institute of Technology.
    • (2002) Proceedings of the Seminar on Intelligent Technology and its Applications (SITIA)
    • ARIFIN, A.Z.1    SETIONO, A.N.2
  • 4
    • 35548989201
    • A testbed for Indonesian text retrieval
    • P. Bruza, A. Moffat, and A. Turpin, Eds. University of Melbourne, Department of Computer Science, Melbourne, Australia
    • ASIAN, J., WILLIAMS, H. E., AND TAHAGHOGHI, S. 2004. A testbed for Indonesian text retrieval. In Proceedings of the 9th Australasian Document Computing Symposium (ADCS'04). P. Bruza, A. Moffat, and A. Turpin, Eds. University of Melbourne, Department of Computer Science, Melbourne, Australia, 55-58.
    • (2004) Proceedings of the 9th Australasian Document Computing Symposium (ADCS'04) , pp. 55-58
    • ASIAN, J.1    WILLIAMS, H.E.2    TAHAGHOGHI, S.3
  • 5
    • 35248834213
    • Evaluating the effectiveness of thesaurus and stemming methods in retrieving Malay translated Al-Quran documents
    • Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access, T. M. T. Sembok, H. B. Zaman, H. Chen, S. R. Urs, and S. Myaeng, Eds, Springer-Verlag
    • BAKAR, Z. A. AND RAHMAN, N. A. 2003. Evaluating the effectiveness of thesaurus and stemming methods in retrieving Malay translated Al-Quran documents. In Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access, T. M. T. Sembok, H. B. Zaman, H. Chen, S. R. Urs, and S. Myaeng, Eds. Lecture Notes in Computer Science, vol. 2911. Springer-Verlag, 653-662.
    • (2003) Lecture Notes in Computer Science , vol.2911 , pp. 653-662
    • BAKAR, Z.A.1    RAHMAN, N.A.2
  • 6
    • 0033721946
    • An evaluation of retrieval effectiveness using spelling-correction and string-similarity matching methods on Malay texts
    • BAKAR, Z. A., SEMBOK, T. M. T., AND YUSOFF, M. 2000. An evaluation of retrieval effectiveness using spelling-correction and string-similarity matching methods on Malay texts. J. Amer. Soc. Inform. Sci. Technol. 51, 8, 691-706.
    • (2000) J. Amer. Soc. Inform. Sci. Technol , vol.51 , Issue.8 , pp. 691-706
    • BAKAR, Z.A.1    SEMBOK, T.M.T.2    YUSOFF, M.3
  • 8
    • 38149033371
    • CICC, Research on Indonesian dictionary. Tech. rep. 6-CICC-MT53, Center of the International Cooperation for Computerization, Tokyo, Japan
    • CICC. 1994. Research on Indonesian dictionary. Tech. rep. 6-CICC-MT53, Center of the International Cooperation for Computerization, Tokyo, Japan.
    • (1994)
  • 9
    • 85118972638
    • Language independent NER using a maximum entropy tagger
    • W. Daelemans and M. Osborne, Eds. Association for Computational Linguistics, Edmonton, Canada
    • CURRAN, J. R. AND CLARK, S. 2003. Language independent NER using a maximum entropy tagger. In Proceedings of Conference on Natural Language Learning. W. Daelemans and M. Osborne, Eds. Association for Computational Linguistics, Edmonton, Canada, 164-167.
    • (2003) Proceedings of Conference on Natural Language Learning , pp. 164-167
    • CURRAN, J.R.1    CLARK, S.2
  • 12
    • 38149129744
    • Personal communication
    • FAHMI, I. 2004. Personal communication.
    • (2004)
    • FAHMI, I.1
  • 13
    • 0001918328
    • Stemming algorithms
    • W. Frakes and R. Baeza-Yates, Eds. Prentice-Hall, Englewood Cliffs, NJ, Chapter 8
    • FRAKES, W. 1992. Stemming algorithms. In Information Retrieval: Data Structures and Algorithms, W. Frakes and R. Baeza-Yates, Eds. Prentice-Hall, Englewood Cliffs, NJ, Chapter 8, 131-160.
    • (1992) Information Retrieval: Data Structures and Algorithms , pp. 131-160
    • FRAKES, W.1
  • 14
    • 34047136156
    • Accurate stemming of Dutch for text classification
    • GAUSTAD, T. AND BOUMA, G. 2002. Accurate stemming of Dutch for text classification. Lang. Comput 45, 1, 104-117.
    • (2002) Lang. Comput , vol.45 , Issue.1 , pp. 104-117
    • GAUSTAD, T.1    BOUMA, G.2
  • 15
    • 84976659284
    • Approximate string matching
    • HALL, P. A. V. AND DOWLING, G. R. 1980. Approximate string matching. Comput. Surv. 12, 4, 381-402.
    • (1980) Comput. Surv , vol.12 , Issue.4 , pp. 381-402
    • HALL, P.A.V.1    DOWLING, G.R.2
  • 16
    • 0002565067
    • Overview of the First TREC conference (TREC-1)
    • NIST Special Publication 500-207
    • HARMAN, D. 1992. Overview of the First TREC conference (TREC-1). In Proceedings of the Text Retrieval Conference (TREC). NIST Special Publication 500-207, 1-20.
    • (1992) Proceedings of the Text Retrieval Conference (TREC) , pp. 1-20
    • HARMAN, D.1
  • 17
    • 3843105784
    • HOLLINK, V., KAMPS, J., MONZ, C., AND DE RIJKE, M. 2004. Monolingual document retrieval for European languages. Inform. Retrieval 7, 1-2, 33-52.
  • 18
    • 0002910412
    • Stemming algorithms: A case study for detailed evaluation
    • HULL, D. A. 1996. Stemming algorithms: A case study for detailed evaluation. J. Amer. Soc. Inform. Sci. 47, 1, 70-84.
    • (1996) J. Amer. Soc. Inform. Sci , vol.47 , Issue.1 , pp. 70-84
    • HULL, D.A.1
  • 20
    • 0012435995
    • A probabilistic model of information retrieval: Development and comparative experiments
    • JONES, K. S., WALKER, S., AND ROBERTSON, S. E. 2000. A probabilistic model of information retrieval: Development and comparative experiments. Inf. Process. Manag. 36, 6, 779-808.
    • (2000) Inf. Process. Manag , vol.36 , Issue.6 , pp. 779-808
    • JONES, K.S.1    WALKER, S.2    ROBERTSON, S.E.3
  • 24
    • 0001794236
    • Development of a stemming algorithm
    • LOVINS, J. 1968. Development of a stemming algorithm. Mechanical Transl. Comput. 11, 22-31.
    • (1968) Mechanical Transl. Comput , vol.11 , pp. 22-31
    • LOVINS, J.1
  • 25
    • 3843127500
    • Character n-gram tokenization for European language text retrieval
    • MCNAMEE, P. AND MAYFIELD, J. 2004. Character n-gram tokenization for European language text retrieval. Inform. Retrieval 7, 1-2, 73-97.
    • (2004) Inform. Retrieval , vol.7 , Issue.1-2 , pp. 73-97
    • MCNAMEE, P.1    MAYFIELD, J.2
  • 26
    • 38149118555
    • Tata Bahasa Baku Bahasa Indonesia (The Standard Indonesian Grammar)
    • Republik Indonesia, Jakarta, Indonesia
    • MOELIONO, A. M. AND DARDJOWIDJOJO, S. 1988. Tata Bahasa Baku Bahasa Indonesia (The Standard Indonesian Grammar). Departemen Pendidikan dan Kebudayaan, Republik Indonesia, Jakarta, Indonesia.
    • (1988) Departemen Pendidikan dan Kebudayaan
    • MOELIONO, A.M.1    DARDJOWIDJOJO, S.2
  • 27
    • 84868701321
    • Confix-stripping: Approach to stemming algorithm for Bahasa Indonesia
    • Univ. of Indonesia, Depok, Jakarta
    • NAZIEF, B. A. A. AND ADRIANI, M. 1996. Confix-stripping: Approach to stemming algorithm for Bahasa Indonesia. Internal publication, Faculty of Computer Science, Univ. of Indonesia, Depok, Jakarta.
    • (1996) Internal publication, Faculty of Computer Science
    • NAZIEF, B.A.A.1    ADRIANI, M.2
  • 28
    • 0034274806
    • NG, C., WILKINSON, R., AND ZOBEL, J. 2000. Experiments in spoken document retrieval using phonetic n-grams. Speech Comm. (Special issue on Accessing Information in Spoken Audio) 32, 1-2, 61-77.
  • 31
    • 0030216658
    • Method for evaluation of stemming algorithms based on error counting
    • PAICE, C. D. 1996. Method for evaluation of stemming algorithms based on error counting. J. Amer. Soc. Inform. Sci. 47, 8, 632-649.
    • (1996) J. Amer. Soc. Inform. Sci , vol.47 , Issue.8 , pp. 632-649
    • PAICE, C.D.1
  • 33
    • 84989549444
    • The effectiveness of stemming for natural-language access to Slovene textual data
    • POPOVIČ, M. AND WILLETT, P. 1992. The effectiveness of stemming for natural-language access to Slovene textual data. J. Amer. Soc. Inform. Sci. 43, 5, 384-390.
    • (1992) J. Amer. Soc. Inform. Sci , vol.43 , Issue.5 , pp. 384-390
    • POPOVIČ, M.1    WILLETT, P.2
  • 34
    • 84948481845
    • An algorithm for suffix stripping
    • PORTER, M. 1980. An algorithm for suffix stripping. Program 14, 3, 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • PORTER, M.1
  • 37
    • 0342986731
    • Stemming of French words based on grammatical categories
    • January
    • SAVOY, J. 1993. Stemming of French words based on grammatical categories. J. Amer. Soc. Inform. Sci. 44, 1 (January), 1-9.
    • (1993) J. Amer. Soc. Inform. Sci , vol.44 , Issue.1 , pp. 1-9
    • SAVOY, J.1
  • 38
    • 0000958726
    • A stemming procedure and stopword list for general French corpora
    • SAVOY, J. 1999. A stemming procedure and stopword list for general French corpora. J. Amer. Soc. Inform. Sci. 50, 10, 944-952.
    • (1999) J. Amer. Soc. Inform. Sci , vol.50 , Issue.10 , pp. 944-952
    • SAVOY, J.1
  • 43
    • 0027113212
    • Approximate string-matching with q-grams and maximal matches
    • UKKONEN, E. 1992. Approximate string-matching with q-grams and maximal matches. Theor. Comput. Sci. 92, 1, 191-211.
    • (1992) Theor. Comput. Sci , vol.92 , Issue.1 , pp. 191-211
    • UKKONEN, E.1
  • 45
    • 38149045527
    • VOORHEES, E. AND HARMAN, D. 1999. Overview of the 8th Text REtrieval Conference (TREC-8). In Proceedings of the Text Retrieval Conference (TREC). E. Voorhees and D. Harman, Eds. TREC, NIST Special Publication 500-246, 1-23.
  • 46
    • 0002565067
    • Overview of the 9th TREC conference (TREC-9)
    • E. Voorhees and D. Harman, Eds. NIST Special Publication 500-249
    • VOORHEES, E. M. AND HARMAN, D. 2000. Overview of the 9th TREC conference (TREC-9). In Proceedings of the Text Retrieval Conference (TREC). E. Voorhees and D. Harman, Eds. NIST Special Publication 500-249, 1-14.
    • (2000) Proceedings of the Text Retrieval Conference (TREC) , pp. 1-14
    • VOORHEES, E.M.1    HARMAN, D.2
  • 47
    • 38149074524
    • WIDYAMARTAYA, A. 2003. Seni Menerjemahkan, 13th Ed. Kanisius, Yogyakarta, Indonesia.
  • 48
    • 17444374240
    • Searchable words on the Web
    • WILLIAMS, H. AND ZOBEL, J. 2005. Searchable words on the Web. Int. J. Digit. Libr. 5, 2, 99-105.
    • (2005) Int. J. Digit. Libr , vol.5 , Issue.2 , pp. 99-105
    • WILLIAMS, H.1    ZOBEL, J.2
  • 51
    • 0031599183
    • Corpus-based stemming using cooccurrence of word variants
    • XU, J. AND CROFT, W. B. 1998. Corpus-based stemming using cooccurrence of word variants. ACM Trans. Inform. Syst. 16, 1, 61-81.
    • (1998) ACM Trans. Inform. Syst , vol.16 , Issue.1 , pp. 61-81
    • XU, J.1    CROFT, W.B.2
  • 52
    • 0032272626
    • How reliable are the results of large-scale information retrieval experiments?
    • W. B. Croft, A. Moffat, C. J. van Rijsbergen, R. Wilkinson, and J. Zobel, Eds. ACM
    • ZOBEL, J. 1998. How reliable are the results of large-scale information retrieval experiments? In Proceedings of the ACM-SIGIR International Conference on Research and Development in Information Retrieval. W. B. Croft, A. Moffat, C. J. van Rijsbergen, R. Wilkinson, and J. Zobel, Eds. ACM, 307-314.
    • (1998) Proceedings of the ACM-SIGIR International Conference on Research and Development in Information Retrieval , pp. 307-314
    • ZOBEL, J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.