메뉴 건너뛰기




Volumn , Issue , 2006, Pages 801-809

WebKhoj: Indian language IR from multiple character encodings

Author keywords

Indian languages; Non standard encodings; Web search

Indexed keywords

FORMAL LANGUAGES; INFORMATION ANALYSIS; SEARCH ENGINES; STANDARDIZATION; WORLD WIDE WEB; ENCODING (SYMBOLS); INFORMATION RETRIEVAL; LINGUISTICS; QUERY LANGUAGES;

EID: 34250613116     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1135777.1135898     Document Type: Conference Paper
Times cited : (22)

References (15)
  • 1
    • 34250617894 scopus 로고    scopus 로고
    • J. Allan, J. Aslam, N. Belkin, G. Buckley, J. Callan, B. Croft, S. Dumais, N. Fuhr, D. Harman, D. J. Harper, D. Hiemstra, T. Hofmann, E. Hovy, W. Kraaij, J. Lafferty, V. Lavrenko, D. Lewis, L. Liddy, R. Manmatha, A. McCallum, J. Ponte, J. Prager, D. Radev, P. Resnik, S. Robertson, R. Rosenfeld, S. Roukos, M. Sanderson, R. Schwartz, A. Singhal, A. Smeaton, H. Turtle, E. Voorhees, R. Weischedel, J. Xu, and C. Zhai. Challenges in Information Retrieval and Language Modeling: Report of a Workshop held at the Center for Intelligent Information Retrieval, University of Massachusetts Amherst, September 2002. SIGIR Forum, 37(1):31-47, 2003.
    • J. Allan, J. Aslam, N. Belkin, G. Buckley, J. Callan, B. Croft, S. Dumais, N. Fuhr, D. Harman, D. J. Harper, D. Hiemstra, T. Hofmann, E. Hovy, W. Kraaij, J. Lafferty, V. Lavrenko, D. Lewis, L. Liddy, R. Manmatha, A. McCallum, J. Ponte, J. Prager, D. Radev, P. Resnik, S. Robertson, R. Rosenfeld, S. Roukos, M. Sanderson, R. Schwartz, A. Singhal, A. Smeaton, H. Turtle, E. Voorhees, R. Weischedel, J. Xu, and C. Zhai. Challenges in Information Retrieval and Language Modeling: Report of a Workshop held at the Center for Intelligent Information Retrieval, University of Massachusetts Amherst, September 2002. SIGIR Forum, 37(1):31-47, 2003.
  • 2
    • 84880240041 scopus 로고    scopus 로고
    • A. Arasu, J. Cho, H. Garcia-Molina, A. Paepcke, and S. Raghavan. Searching the Web. ACM Trans. Inter. Tech., 1(1):2-43, 2001.
    • A. Arasu, J. Cho, H. Garcia-Molina, A. Paepcke, and S. Raghavan. Searching the Web. ACM Trans. Inter. Tech., 1(1):2-43, 2001.
  • 3
    • 34250662651 scopus 로고    scopus 로고
    • G. B. 14th ed. Ethnologue: Languages of the World. SIL International, Dallas, TX, 2003.
    • G. B. 14th ed. Ethnologue: Languages of the World. SIL International, Dallas, TX, 2003.
  • 5
    • 0038283958 scopus 로고    scopus 로고
    • The Internet in India: Better times ahead?
    • G. E. Burkhart, S. E. Goodman, A. Mehta, and L. Press. The Internet in India: Better times ahead? Commun. ACM, 41(11):21-26, 1998.
    • (1998) Commun. ACM , vol.41 , Issue.11 , pp. 21-26
    • Burkhart, G.E.1    Goodman, S.E.2    Mehta, A.3    Press, L.4
  • 7
    • 10644236664 scopus 로고    scopus 로고
    • Cross Language Information Retrieval: A Research Roadmap
    • F. Gey, N. Kando, and C. Peters. Cross Language Information Retrieval: A Research Roadmap. SIGIR Forum, 36(2):72-80, 2002.
    • (2002) SIGIR Forum , vol.36 , Issue.2 , pp. 72-80
    • Gey, F.1    Kando, N.2    Peters, C.3
  • 11
    • 0036992280 scopus 로고    scopus 로고
    • Unicode for Multilingual Representation in Digital Libraries from the Indian Perspective
    • New York, NY, USA, ACM Press
    • D. P. Madalli. Unicode for Multilingual Representation in Digital Libraries from the Indian Perspective. In JCDL '02: Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries, pages 398-398, New York, NY, USA, 2002. ACM Press.
    • (2002) JCDL '02: Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries , pp. 398-398
    • Madalli, D.P.1
  • 13
    • 45549117987 scopus 로고
    • Term-weighting Approaches in Automatic Text Retrieval
    • G. Salton and C. Buckley. Term-weighting Approaches in Automatic Text Retrieval. Information Process. Management, 24(5):513-523, 1988.
    • (1988) Information Process. Management , vol.24 , Issue.5 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 15
    • 34250687526 scopus 로고    scopus 로고
    • F. Yergeau. UTF-8, a transformation format of ISO 10646. RFC Editor, United States, 2003.
    • F. Yergeau. UTF-8, a transformation format of ISO 10646. RFC Editor, United States, 2003.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.