메뉴 건너뛰기




Volumn 44, Issue 4, 2008, Pages 1517-1537

Extraction of complex index terms in non-English IR: A shallow parsing based approach

Author keywords

Finite state transducers; Information retrieval; Linguistic variation; Natural language processing; Shallow parsing

Indexed keywords

COMPUTATIONAL LINGUISTICS; INFORMATION RETRIEVAL SYSTEMS; NATURAL LANGUAGE PROCESSING SYSTEMS; PROBLEM SOLVING; SYNTACTICS; TEXT PROCESSING;

EID: 44449174059     PISSN: 03064573     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ipm.2007.12.005     Document Type: Article
Times cited : (14)

References (67)
  • 1
    • 44449142808 scopus 로고    scopus 로고
    • ACRoTermite - Terminology of telecommunications database. International Telecommunication Union. http://www.itu.int/terminology/index.html (visited on August 2007).
    • ACRoTermite - Terminology of telecommunications database. International Telecommunication Union. http://www.itu.int/terminology/index.html (visited on August 2007).
  • 2
    • 0030361955 scopus 로고    scopus 로고
    • Partial parsing via finite-state cascades
    • Abney S. Partial parsing via finite-state cascades. Natural Language Engineering 2 4 (1997) 337-344
    • (1997) Natural Language Engineering , vol.2 , Issue.4 , pp. 337-344
    • Abney, S.1
  • 3
    • 44449167480 scopus 로고    scopus 로고
    • Alonso, M., Cabrero, D., de la Clergerie, E., & Vilares, M. (1999). Tabular algorithms for TAG parsing. In Proceedings of the nineth conference of the European chapter of the ACL (EACL'99) (pp. 150-157).
    • Alonso, M., Cabrero, D., de la Clergerie, E., & Vilares, M. (1999). Tabular algorithms for TAG parsing. In Proceedings of the nineth conference of the European chapter of the ACL (EACL'99) (pp. 150-157).
  • 4
    • 44449154202 scopus 로고    scopus 로고
    • 2 system used for MUC-7. In Proceedings of the seventh message understanding conference (MUC-7).
    • 2 system used for MUC-7. In Proceedings of the seventh message understanding conference (MUC-7).
  • 7
    • 44449148684 scopus 로고    scopus 로고
    • Buckley, C. (1985). Implementation of the SMART information retrieval system. Tech. Rep., Department of Computer Science, Cornell University, Source code available at ftp://ftp.cs.cornell.edu/pub/smart (visited on August 2007).
    • Buckley, C. (1985). Implementation of the SMART information retrieval system. Tech. Rep., Department of Computer Science, Cornell University, Source code available at ftp://ftp.cs.cornell.edu/pub/smart (visited on August 2007).
  • 8
    • 44449133513 scopus 로고    scopus 로고
    • Buckley, C., Allan, J., & Salton, G. (1993). Automatic routing and ad-hoc retrieval using SMART: TREC 2. In D. K. Harman (Ed.), Proceedings of the second text retrieval conference (TREC-2) (pp. 45-56).
    • Buckley, C., Allan, J., & Salton, G. (1993). Automatic routing and ad-hoc retrieval using SMART: TREC 2. In D. K. Harman (Ed.), Proceedings of the second text retrieval conference (TREC-2) (pp. 45-56).
  • 9
    • 44449150172 scopus 로고    scopus 로고
    • Buyse, K. (2003) Generating corpora and lexicons for language specific purposes. Experiences from the ElektraVoc-II project. In Proceedings of the 36th international meeting of the societas linguistica Europaea.
    • Buyse, K. (2003) Generating corpora and lexicons for language specific purposes. Experiences from the ElektraVoc-II project. In Proceedings of the 36th international meeting of the societas linguistica Europaea.
  • 11
    • 44449116508 scopus 로고    scopus 로고
    • Carrol, J., Briscoe, T., & Sanfilippo, A. (1998). Parser evaluation: A survey and a new proposal. In Proceedings of the first international conference on language resources and evaluation (LREC 1998) (pp. 447-454).
    • Carrol, J., Briscoe, T., & Sanfilippo, A. (1998). Parser evaluation: A survey and a new proposal. In Proceedings of the first international conference on language resources and evaluation (LREC 1998) (pp. 447-454).
  • 12
    • 44449140360 scopus 로고    scopus 로고
    • CLEF. http://www.clef-campaign.org (visited on August 2007).
    • CLEF. http://www.clef-campaign.org (visited on August 2007).
  • 15
    • 44449123995 scopus 로고    scopus 로고
    • Fagan, J. L. (1987). Experiments in automatic phrase indexing for document retrieval: A comparison of syntactic and non-syntactic methods (PhD thesis). Tech. Rep. TR87-868, Cornell University, USA.
    • Fagan, J. L. (1987). Experiments in automatic phrase indexing for document retrieval: A comparison of syntactic and non-syntactic methods (PhD thesis). Tech. Rep. TR87-868, Cornell University, USA.
  • 16
    • 44449137350 scopus 로고    scopus 로고
    • Figuerola, C. G., Gómez, R., Zazo Rodríguez, A. F., & Alonso Berrocal, J. L. (2001). Stemming in Spanish: A first approach to its impact on information retrieval. In C. Peters (Ed.), Results of the CLEF 2001 cross-language system evaluation campaign, Working notes for the CLEF 2001 workshop (pp. 197-202).
    • Figuerola, C. G., Gómez, R., Zazo Rodríguez, A. F., & Alonso Berrocal, J. L. (2001). Stemming in Spanish: A first approach to its impact on information retrieval. In C. Peters (Ed.), Results of the CLEF 2001 cross-language system evaluation campaign, Working notes for the CLEF 2001 workshop (pp. 197-202).
  • 17
    • 23744486846 scopus 로고    scopus 로고
    • Term conflation methods in information retrieval: Non-linguistic and linguistic approaches
    • Galvez C., de Moya-Anegón F., and Solana V.H. Term conflation methods in information retrieval: Non-linguistic and linguistic approaches. Journal of Documentation 61 4 (2006) 520-547
    • (2006) Journal of Documentation , vol.61 , Issue.4 , pp. 520-547
    • Galvez, C.1    de Moya-Anegón, F.2    Solana, V.H.3
  • 18
    • 84867758262 scopus 로고    scopus 로고
    • Gamallo, P., Agustini, A., & Lopes, G. P. (2001). Selection restrictions acquisition from corpora. In Proceedings of the 10th Portuguese conference on artificial intelligence (EPIA'01). Lecture notes in artificial intelligence (pp. 30-43). Springer-Verlag.
    • Gamallo, P., Agustini, A., & Lopes, G. P. (2001). Selection restrictions acquisition from corpora. In Proceedings of the 10th Portuguese conference on artificial intelligence (EPIA'01). Lecture notes in artificial intelligence (pp. 30-43). Springer-Verlag.
  • 19
    • 33646404912 scopus 로고    scopus 로고
    • Clustering syntactic positions with similar semantic requirements
    • Gamallo P., Agustini A., and Lopes G.P. Clustering syntactic positions with similar semantic requirements. Journal of Computational Linguistics 31 1 (2005) 107-146
    • (2005) Journal of Computational Linguistics , vol.31 , Issue.1 , pp. 107-146
    • Gamallo, P.1    Agustini, A.2    Lopes, G.P.3
  • 20
    • 84942868234 scopus 로고    scopus 로고
    • A common solution for tokenization and part-of-speech tagging: One-pass Viterbi algorithm vs. iterative approaches
    • Sojka P., Kopeček I., and Pala K. (Eds), Springer-Verlag
    • Graña J., Alonso M.A., and Vilares M. A common solution for tokenization and part-of-speech tagging: One-pass Viterbi algorithm vs. iterative approaches. In: Sojka P., Kopeček I., and Pala K. (Eds). Text, speech and dialogue. Lecture notes in computer science Vol. 2448 (2002), Springer-Verlag 3-10
    • (2002) Text, speech and dialogue. Lecture notes in computer science , vol.2448 , pp. 3-10
    • Graña, J.1    Alonso, M.A.2    Vilares, M.3
  • 22
    • 44449131743 scopus 로고    scopus 로고
    • Graña, J., Chappelier, J.-C., & Vilares, M. (2001). Integrating external dictionaries into stochastic part-of-speech taggers. In G. Angelova, K. Bontcheva, R. Mitkov, N. Nocolov, & N. Nikolov (Eds.), Proceedings of the euroconference recent advances in natural language processing (RANLP 2001) (pp. 122-128).
    • Graña, J., Chappelier, J.-C., & Vilares, M. (2001). Integrating external dictionaries into stochastic part-of-speech taggers. In G. Angelova, K. Bontcheva, R. Mitkov, N. Nocolov, & N. Nikolov (Eds.), Proceedings of the euroconference recent advances in natural language processing (RANLP 2001) (pp. 122-128).
  • 24
    • 44449164310 scopus 로고    scopus 로고
    • Hearst, M., Pedersen, J., Pirolli, P., Schutze, H., Grefenstette, G., & Hull, D. (1996). Xerox site report: Four TREC-4 tracks. In D. K. Harman (Ed.), Proceedings of the fourth text retrieval conference (TREC-4) (pp. 97-119).
    • Hearst, M., Pedersen, J., Pirolli, P., Schutze, H., Grefenstette, G., & Hull, D. (1996). Xerox site report: Four TREC-4 tracks. In D. K. Harman (Ed.), Proceedings of the fourth text retrieval conference (TREC-4) (pp. 97-119).
  • 25
    • 0001797799 scopus 로고    scopus 로고
    • FASTUS: A cascaded finite-state transducer for extracting information from natural-language text
    • Roche E., and Schabes Y. (Eds), MIT Press
    • Hobbs J.R., Appelt D., Bear J., Israel D., Kameyama M., Stickel M., et al. FASTUS: A cascaded finite-state transducer for extracting information from natural-language text. In: Roche E., and Schabes Y. (Eds). Finite-state language processing (1997), MIT Press 383-406
    • (1997) Finite-state language processing , pp. 383-406
    • Hobbs, J.R.1    Appelt, D.2    Bear, J.3    Israel, D.4    Kameyama, M.5    Stickel, M.6
  • 28
    • 44449143825 scopus 로고    scopus 로고
    • Hull, D. A., Grefenstette, G., Schulze, B. M., Gaussier, E., Schütze, H., & Pedersen, J. O. (1997). Xerox TREC-5 site report: Routing, filtering, NLP, and Spanish tracks. In E. M. Voorhees, & D. K. Harman (Eds.), Proceedings of the fifth text retrieval conference (TREC-5) (pp. 167-180).
    • Hull, D. A., Grefenstette, G., Schulze, B. M., Gaussier, E., Schütze, H., & Pedersen, J. O. (1997). Xerox TREC-5 site report: Routing, filtering, NLP, and Spanish tracks. In E. M. Voorhees, & D. K. Harman (Eds.), Proceedings of the fifth text retrieval conference (TREC-5) (pp. 167-180).
  • 29
    • 44449143323 scopus 로고    scopus 로고
    • Husson, J. L., Viscogliosi, N., Romary, L., Descotte, S., & Campenhoudt, M. V. (2000). DHYDRO: a generic environment developed to edit and access multilingual terminological data on the Internet. In Proceedings of the second conference on maritime terminology (pp. 47-61).
    • Husson, J. L., Viscogliosi, N., Romary, L., Descotte, S., & Campenhoudt, M. V. (2000). DHYDRO: a generic environment developed to edit and access multilingual terminological data on the Internet. In Proceedings of the second conference on maritime terminology (pp. 47-61).
  • 30
    • 44449098018 scopus 로고    scopus 로고
    • Jacquemin, C. (1999). Syntagmatic and paradigmatic representations of term variation. In Proceedings of the 37th annual meeting of the ACL (ACL'99) (pp. 341-348).
    • Jacquemin, C. (1999). Syntagmatic and paradigmatic representations of term variation. In Proceedings of the 37th annual meeting of the ACL (ACL'99) (pp. 341-348).
  • 32
    • 44449097484 scopus 로고    scopus 로고
    • Jacquemin, C., & Tzoukermann, E. (1999). NLP for term variant extraction: Synergy between morphology, lexicon and syntax. In Strzalkowski (1999) (pp. 25-74).
    • Jacquemin, C., & Tzoukermann, E. (1999). NLP for term variant extraction: Synergy between morphology, lexicon and syntax. In Strzalkowski (1999) (pp. 25-74).
  • 34
    • 44449173394 scopus 로고    scopus 로고
    • Kelledy, F., & Smeaton, A.F. (1997) Automatic phrase recognition and extraction from text. In Proceedings of 19th annual BCS-IRSG colloquium on IR. Workshops in Computing. BCS.
    • Kelledy, F., & Smeaton, A.F. (1997) Automatic phrase recognition and extraction from text. In Proceedings of 19th annual BCS-IRSG colloquium on IR. Workshops in Computing. BCS.
  • 38
    • 84945190539 scopus 로고    scopus 로고
    • Comparing the effect of syntactic vs. statistical phrase indexing strategies for Dutch
    • Nicolaou C., and Stephanidis C. (Eds), Springer-Verlag
    • Kraaij W., and Pohlmann R. Comparing the effect of syntactic vs. statistical phrase indexing strategies for Dutch. In: Nicolaou C., and Stephanidis C. (Eds). Research and advanced technology for digital libraries. Lecture notes in computer science Vol. 1513 (1998), Springer-Verlag 605-614
    • (1998) Research and advanced technology for digital libraries. Lecture notes in computer science , vol.1513 , pp. 605-614
    • Kraaij, W.1    Pohlmann, R.2
  • 41
    • 44449170775 scopus 로고    scopus 로고
    • Mitra, M., Buckley, C., Singhal, A., & Cardie, C. (1997). An analysis of statistical and syntactic phrases. In Proceedings of the fifth international conference "Recherche d'information assistee par ordinateur" (RIAO-97) (pp. 200-214).
    • Mitra, M., Buckley, C., Singhal, A., & Cardie, C. (1997). An analysis of statistical and syntactic phrases. In Proceedings of the fifth international conference "Recherche d'information assistee par ordinateur" (RIAO-97) (pp. 200-214).
  • 42
    • 0035725602 scopus 로고    scopus 로고
    • Mittendorfer, M., & Winiwarter, W. (2001) A simple way of improving traditional IR methods by structuring queries. In Proceedings of the 2001 IEEE international workshop on natural language processing and knowledge engineering (NLPKE 2001).
    • Mittendorfer, M., & Winiwarter, W. (2001) A simple way of improving traditional IR methods by structuring queries. In Proceedings of the 2001 IEEE international workshop on natural language processing and knowledge engineering (NLPKE 2001).
  • 43
    • 0036722343 scopus 로고    scopus 로고
    • Exploiting syntactic analysis of queries for information retrieval
    • Mittendorfer M., and Winiwarter W. Exploiting syntactic analysis of queries for information retrieval. Data and Knowledge Engineering 42 3 (2002) 315-325
    • (2002) Data and Knowledge Engineering , vol.42 , Issue.3 , pp. 315-325
    • Mittendorfer, M.1    Winiwarter, W.2
  • 45
    • 0033652020 scopus 로고    scopus 로고
    • Narita, M., & Ogawa, Y. (2000). The use of phrases from query texts in information retrieval. In Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'00) (pp. 318-320).
    • Narita, M., & Ogawa, Y. (2000). The use of phrases from query texts in information retrieval. In Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'00) (pp. 318-320).
  • 47
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • Porter M.F. An algorithm for suffix stripping. Program 14 3 (1980) 130-137
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 48
    • 0034576844 scopus 로고    scopus 로고
    • Reynoso, G. A., March, A. D., Berra, C. M., Strobietto, R. P., Barani, M., Iubatti, M., et al. (2000). Development of the Spanish version of the systematized nomenclature of medicine: Methodology and main issues. In Proceedings of the 2000 American medical informatics association symposium (AMIA) (pp. 694-698).
    • Reynoso, G. A., March, A. D., Berra, C. M., Strobietto, R. P., Barani, M., Iubatti, M., et al. (2000). Development of the Spanish version of the systematized nomenclature of medicine: Methodology and main issues. In Proceedings of the 2000 American medical informatics association symposium (AMIA) (pp. 694-698).
  • 50
    • 44449124475 scopus 로고    scopus 로고
    • Savoy, J. (2003). Report on CLEF 2003 monolingual tracks: Fusion of probabilistic models for effective monolingual retrieval. In C. Peters, & F. Borri (Eds.), Results of the CLEF 2003 cross-language system evaluation campaign, Working Notes for the CLEF 2003 Workshop (pp. 179-188).
    • Savoy, J. (2003). Report on CLEF 2003 monolingual tracks: Fusion of probabilistic models for effective monolingual retrieval. In C. Peters, & F. Borri (Eds.), Results of the CLEF 2003 cross-language system evaluation campaign, Working Notes for the CLEF 2003 Workshop (pp. 179-188).
  • 51
    • 0043272901 scopus 로고
    • The application of morpho-syntactic language processing to effective phrase matching
    • Sheridan P., and Smeaton A.F. The application of morpho-syntactic language processing to effective phrase matching. Information Processing and Management 28 3 (1992) 349-369
    • (1992) Information Processing and Management , vol.28 , Issue.3 , pp. 349-369
    • Sheridan, P.1    Smeaton, A.F.2
  • 53
    • 44449157049 scopus 로고    scopus 로고
    • Smeaton, A. F., O'Donnell, R., & Kelledy, F. (1995). Indexing structures derived from syntax in TREC-3: System description. In NIST special publication 500-225: Overview of the third text retrieval conference (TREC 3) (pp. 55-63).
    • Smeaton, A. F., O'Donnell, R., & Kelledy, F. (1995). Indexing structures derived from syntax in TREC-3: System description. In NIST special publication 500-225: Overview of the third text retrieval conference (TREC 3) (pp. 55-63).
  • 55
    • 44449119802 scopus 로고    scopus 로고
    • Stevenson, M. (2003). Word sense disambiguation: The case for combinations of knowledge sources. Studies in computational linguistics. CSLI.
    • Stevenson, M. (2003). Word sense disambiguation: The case for combinations of knowledge sources. Studies in computational linguistics. CSLI.
  • 56
    • 1542347785 scopus 로고    scopus 로고
    • Stokoe, C., Oakes, M. P., & Tait, J. (2003). Word sense disambiguation in information retrieval revisited. In Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'03) (pp. 159-166).
    • Stokoe, C., Oakes, M. P., & Tait, J. (2003). Word sense disambiguation in information retrieval revisited. In Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'03) (pp. 159-166).
  • 57
    • 44449105419 scopus 로고    scopus 로고
    • Strzalkowski T. (Ed), Kluwer Academic Publishers
    • In: Strzalkowski T. (Ed). Natural language information retrieval. Text, speech and language technology Vol. 7 (1999), Kluwer Academic Publishers
    • (1999) Text, speech and language technology , vol.7
  • 58
    • 44449115042 scopus 로고    scopus 로고
    • Strzalkowski, T., & Perez-Carballo, J. (1994). Recent developments in natural language text retrieval. In D. K. Harman (Ed.), Proceedings of the second text retrieval conference (TREC-2) (pp. 123-136).
    • Strzalkowski, T., & Perez-Carballo, J. (1994). Recent developments in natural language text retrieval. In D. K. Harman (Ed.), Proceedings of the second text retrieval conference (TREC-2) (pp. 123-136).
  • 59
    • 0030690721 scopus 로고    scopus 로고
    • Tzoukermann, E., Klavans, J., & Jacquemin, C. (1997). Effective use of natural language processing techniques for automatic conflation of multi-word terms: The role of derivational morphology, part of speech tagging, and shallow parsing. In Proceedings of the 20th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'97) (pp. 148-155).
    • Tzoukermann, E., Klavans, J., & Jacquemin, C. (1997). Effective use of natural language processing techniques for automatic conflation of multi-word terms: The role of derivational morphology, part of speech tagging, and shallow parsing. In Proceedings of the 20th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'97) (pp. 148-155).
  • 60
    • 44449093931 scopus 로고    scopus 로고
    • VERBA - Polytechnic and Plurilingual Terminological Database. European Language Resources Association (ELRA). http://www.elra.info/ (visited on August 2007).
    • VERBA - Polytechnic and Plurilingual Terminological Database. European Language Resources Association (ELRA). http://www.elra.info/ (visited on August 2007).
  • 61
    • 33645997866 scopus 로고    scopus 로고
    • Morphological and syntactic processing for text retrieval
    • Galindo F., Takizawa M., and Traunmüller R. (Eds), Springer-Verlag
    • Vilares J., Alonso M.A., and Vilares M. Morphological and syntactic processing for text retrieval. In: Galindo F., Takizawa M., and Traunmüller R. (Eds). Database and expert systems applications. Lecture notes in computer science Vol. 3180 (2004), Springer-Verlag 371-380
    • (2004) Database and expert systems applications. Lecture notes in computer science , vol.3180 , pp. 371-380
    • Vilares, J.1    Alonso, M.A.2    Vilares, M.3
  • 63
    • 0035751909 scopus 로고    scopus 로고
    • Vilares, M., Ribadas, F. J., & Graña, J. (2001b). Approximately common patterns in shared-forests. In H. Paques, L. Liu, & D. Grossman (Eds.), Proceedings of the 2001 ACM CIKM - 10th international conference on information and knowledge management (pp. 73-80).
    • Vilares, M., Ribadas, F. J., & Graña, J. (2001b). Approximately common patterns in shared-forests. In H. Paques, L. Liu, & D. Grossman (Eds.), Proceedings of the 2001 ACM CIKM - 10th international conference on information and knowledge management (pp. 73-80).
  • 66
    • 84904117366 scopus 로고    scopus 로고
    • Voorhees, E. M. (1994). Query expansion using lexical-semantic relations. In Proceedings of the 17th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'94) (pp. 61-69).
    • Voorhees, E. M. (1994). Query expansion using lexical-semantic relations. In Proceedings of the 17th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'94) (pp. 61-69).
  • 67
    • 0030407491 scopus 로고    scopus 로고
    • Xu, J., & Croft, W. B. (1996). Query expansion using local and global document analysis. In Proceedings 19th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'96) (pp. 4-11).
    • Xu, J., & Croft, W. B. (1996). Query expansion using local and global document analysis. In Proceedings 19th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR'96) (pp. 4-11).


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.