-
1
-
-
0345376175
-
The web as a parallel corpus
-
Resnik, P., Smith, N., The web as a parallel corpus. Comput. Ling. 29 (2003), 349–380.
-
(2003)
Comput. Ling.
, vol.29
, pp. 349-380
-
-
Resnik, P.1
Smith, N.2
-
2
-
-
84944081118
-
A tool for producing structured interoperable data from product features on the web
-
Özacar, T., A tool for producing structured interoperable data from product features on the web. Inf. Syst. 56 (2016), 36–54.
-
(2016)
Inf. Syst.
, vol.56
, pp. 36-54
-
-
Özacar, T.1
-
3
-
-
84855651895
-
Quality-aware similarity assessment for entity matching in web data
-
Yerva, S.R., Miklós, Z., Aberer, K., Quality-aware similarity assessment for entity matching in web data. Inf. Syst. 37 (2012), 336–351.
-
(2012)
Inf. Syst.
, vol.37
, pp. 336-351
-
-
Yerva, S.R.1
Miklós, Z.2
Aberer, K.3
-
4
-
-
35348903881
-
Measuring semantic similarity between words using web search engines
-
Bollegala, D., Matsuo, Y., Ishizuka, M., Measuring semantic similarity between words using web search engines. Proc. of 16th international conference on World Wide Web, WWW 2007, Banff, Alberta, Canada, 2007, 757–766.
-
(2007)
Proc. of 16th international conference on World Wide Web, WWW 2007, Banff, Alberta, Canada
, pp. 757-766
-
-
Bollegala, D.1
Matsuo, Y.2
Ishizuka, M.3
-
5
-
-
65449137151
-
Detecting privacy leaks using corpus-based association rules
-
Chow, R., Golle, P., Staddon, J., Detecting privacy leaks using corpus-based association rules. Proc. of 14th Conference on Knowledge Discovery and Data Mining, Las Vegas, NV, 2008, 893–901.
-
(2008)
Proc. of 14th Conference on Knowledge Discovery and Data Mining, Las Vegas, NV
, pp. 893-901
-
-
Chow, R.1
Golle, P.2
Staddon, J.3
-
8
-
-
38349143926
-
Extracting accurate and complete results from search engines: case study windows live
-
Thelwall, M., Extracting accurate and complete results from search engines: case study windows live. J. Am. Soc. Inf. Sci. Technol. 59 (2007), 38–50.
-
(2007)
J. Am. Soc. Inf. Sci. Technol.
, vol.59
, pp. 38-50
-
-
Thelwall, M.1
-
10
-
-
84948177273
-
Mining the web for synonyms: PMI-IR versus LSA on TOEFL
-
Turney, P.D., Mining the web for synonyms: PMI-IR versus LSA on TOEFL. Proc. of 12th European Conference on Machine Learning, ECML 2001, Freiburg, Germany, 2001, 491–502.
-
(2001)
Proc. of 12th European Conference on Machine Learning, ECML 2001, Freiburg, Germany
, pp. 491-502
-
-
Turney, P.D.1
-
11
-
-
0344154400
-
Using the web to obtain frequencies for unseen bigrams
-
Keller, F., Lapata, M., Using the web to obtain frequencies for unseen bigrams. Comput. Ling. 29 (2003), 459–484.
-
(2003)
Comput. Ling.
, vol.29
, pp. 459-484
-
-
Keller, F.1
Lapata, M.2
-
12
-
-
80053271050
-
Search engine statistics beyond the n-gram: application to noun compound bracketing
-
Nakov, P., Hearst, M., Search engine statistics beyond the n-gram: application to noun compound bracketing. Proc. of Ninth Conference on Computational Natural Language Learning, Ann Arbor, Michigan, US, 2005, 17–24.
-
(2005)
Proc. of Ninth Conference on Computational Natural Language Learning, Ann Arbor, Michigan, US
, pp. 17-24
-
-
Nakov, P.1
Hearst, M.2
-
13
-
-
17644423946
-
Unsupervised named-entity extraction from the web: an experimental study
-
Etzioni, O., Cafarella, M., Downey, D., Popescu, A., Shaked, T., Soderland, S., Weld, D., Yates, A., Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165 (2005), 91–134.
-
(2005)
Artif. Intell.
, vol.165
, pp. 91-134
-
-
Etzioni, O.1
Cafarella, M.2
Downey, D.3
Popescu, A.4
Shaked, T.5
Soderland, S.6
Weld, D.7
Yates, A.8
-
14
-
-
79956128897
-
Automatic extraction of acronym definitions from the web
-
Sánchez, D., Isern, D., Automatic extraction of acronym definitions from the web. Appl. Intell. 34 (2011), 311–327.
-
(2011)
Appl. Intell.
, vol.34
, pp. 311-327
-
-
Sánchez, D.1
Isern, D.2
-
15
-
-
38649107088
-
Learning non-taxonomic relationships from web documents for domain ontology construction
-
Sánchez, D., Moreno, A., Learning non-taxonomic relationships from web documents for domain ontology construction. Data Knowl. Eng. 63 (2008), 600–623.
-
(2008)
Data Knowl. Eng.
, vol.63
, pp. 600-623
-
-
Sánchez, D.1
Moreno, A.2
-
16
-
-
40549141816
-
Pattern-based automatic taxonomy learning from the web
-
Sánchez, D., Moreno, A., Pattern-based automatic taxonomy learning from the web. AI Commun. 21 (2008), 27–48.
-
(2008)
AI Commun.
, vol.21
, pp. 27-48
-
-
Sánchez, D.1
Moreno, A.2
-
17
-
-
77951142171
-
A methodology to learn ontological attributes from the web
-
Sánchez, D., A methodology to learn ontological attributes from the web. Data Knowl. Eng. 69 (2010), 573–597.
-
(2010)
Data Knowl. Eng.
, vol.69
, pp. 573-597
-
-
Sánchez, D.1
-
18
-
-
84855907139
-
Learning relation axioms from text: an automatic web-based approach
-
Sánchez, D., Moreno, A., Vasto-Terrientes, L.D., Learning relation axioms from text: an automatic web-based approach. Expert Syst. Appl. 39 (2012), 5792–5805.
-
(2012)
Expert Syst. Appl.
, vol.39
, pp. 5792-5805
-
-
Sánchez, D.1
Moreno, A.2
Vasto-Terrientes, L.D.3
-
19
-
-
78650169343
-
Ontology-driven web-based semantic similarity
-
Sánchez, D., Batet, M., Valls, A., Gibert, K., Ontology-driven web-based semantic similarity. J. Intell. Inf. Syst. 35 (2010), 383–413.
-
(2010)
J. Intell. Inf. Syst.
, vol.35
, pp. 383-413
-
-
Sánchez, D.1
Batet, M.2
Valls, A.3
Gibert, K.4
-
20
-
-
14744280872
-
Towards the self-annotating web
-
Cimiano, P., Handschuh, S., Staab, S., Towards the self-annotating web. Proc. of 13th international conference on World Wide Web, WWW 2004, New York, USA, 2004, 462–471.
-
(2004)
Proc. of 13th international conference on World Wide Web, WWW 2004, New York, USA
, pp. 462-471
-
-
Cimiano, P.1
Handschuh, S.2
Staab, S.3
-
21
-
-
79955999805
-
Content annotation for the semantic web: an automatic web-based approach
-
Sánchez, D., Isern, D., Millán, M., Content annotation for the semantic web: an automatic web-based approach. Knowl. Inf. Syst. 27 (2011), 393–418.
-
(2011)
Knowl. Inf. Syst.
, vol.27
, pp. 393-418
-
-
Sánchez, D.1
Isern, D.2
Millán, M.3
-
22
-
-
84867842712
-
Preventing automatic user profiling in web 2.0 applications
-
Viejo, A., Sánchez, D., Castellà-Roca, J., Preventing automatic user profiling in web 2.0 applications. Knowl.-Based Syst. 36 (2012), 191–205.
-
(2012)
Knowl.-Based Syst.
, vol.36
, pp. 191-205
-
-
Viejo, A.1
Sánchez, D.2
Castellà-Roca, J.3
-
23
-
-
36549074243
-
Polyphonet: an advanced social network extraction system from the web
-
Matsuo, Y., Mori, J., Hamasaki, M., Nishimura, T., Takeda, H., Hasida, K., Ishizuka, M., Polyphonet: an advanced social network extraction system from the web. Web Semant. 5 (2007), 262–278.
-
(2007)
Web Semant.
, vol.5
, pp. 262-278
-
-
Matsuo, Y.1
Mori, J.2
Hamasaki, M.3
Nishimura, T.4
Takeda, H.5
Hasida, K.6
Ishizuka, M.7
-
24
-
-
84975078571
-
C-sanitized: a privacy model for document redaction and sanitization
-
Sánchez, D., Batet, M., C-sanitized: a privacy model for document redaction and sanitization. J. Assoc. Inf. Sci. Technol. 67 (2016), 148–163.
-
(2016)
J. Assoc. Inf. Sci. Technol.
, vol.67
, pp. 148-163
-
-
Sánchez, D.1
Batet, M.2
-
26
-
-
80053460837
-
Yahoo! learning to rank challenge overview
-
Chapelle, O., Chang, Y., Yahoo! learning to rank challenge overview. Proc. of Yahoo! Learning to Rank Challenge at ICML 2010, Haifa, Israel, 2011, 1–24.
-
(2011)
Proc. of Yahoo! Learning to Rank Challenge at ICML 2010, Haifa, Israel
, pp. 1-24
-
-
Chapelle, O.1
Chang, Y.2
-
27
-
-
84946074841
-
Evaluating the retrieval effectiveness of web search engines using a representative query sample
-
Lewandowski, D., Evaluating the retrieval effectiveness of web search engines using a representative query sample. J. Assoc. Inf. Sci. Technol. 66 (2015), 1763–1775.
-
(2015)
J. Assoc. Inf. Sci. Technol.
, vol.66
, pp. 1763-1775
-
-
Lewandowski, D.1
-
28
-
-
34548474030
-
Evaluation of web search for the information practitioner
-
Macfarlane, A., Evaluation of web search for the information practitioner. Aslib Proc. 59 (2007), 352–366.
-
(2007)
Aslib Proc.
, vol.59
, pp. 352-366
-
-
Macfarlane, A.1
-
29
-
-
79551499272
-
Performance evaluation and comparison of the five most used search engines in retrieving web resources
-
Deka, S.K., Lahkar, N., Performance evaluation and comparison of the five most used search engines in retrieving web resources. Online Inf. Rev. 34 (2010), 757–771.
-
(2010)
Online Inf. Rev.
, vol.34
, pp. 757-771
-
-
Deka, S.K.1
Lahkar, N.2
-
30
-
-
84865209921
-
Ranking, relevance judgment, and precision of information retrieval on children's queries: evaluation of Google, Yahoo!, Bing, Yahoo! Kids, and Ask Kids
-
Bilal, D., Ranking, relevance judgment, and precision of information retrieval on children's queries: evaluation of Google, Yahoo!, Bing, Yahoo! Kids, and Ask Kids. J. Am. Soc. Inf. Sci. Technol. 63 (2012), 1879–1896.
-
(2012)
J. Am. Soc. Inf. Sci. Technol.
, vol.63
, pp. 1879-1896
-
-
Bilal, D.1
-
31
-
-
77956190113
-
Search engines’ responses to several search feature selections
-
Zhang, J., Fei, W., Search engines’ responses to several search feature selections. Int. Inf. Library Rev. 42 (2010), 212–225.
-
(2010)
Int. Inf. Library Rev.
, vol.42
, pp. 212-225
-
-
Zhang, J.1
Fei, W.2
-
32
-
-
82255166998
-
A method to assess search engine results
-
Bar-Ilan, J., Levene, M., A method to assess search engine results. Online Inf. Rev. 35 (2011), 854–868.
-
(2011)
Online Inf. Rev.
, vol.35
, pp. 854-868
-
-
Bar-Ilan, J.1
Levene, M.2
-
33
-
-
0142030258
-
A taxonomy of web search
-
Broder, A., A taxonomy of web search. ACM SIGIR Forum 36 (2002), 3–10.
-
(2002)
ACM SIGIR Forum
, vol.36
, pp. 3-10
-
-
Broder, A.1
-
34
-
-
51049099404
-
Quantitative comparisons of search engine results
-
Thelwall, M., Quantitative comparisons of search engine results. J. Am. Soc. Inf. Sci. Technol. 59 (2008), 1702–1710.
-
(2008)
J. Am. Soc. Inf. Sci. Technol.
, vol.59
, pp. 1702-1710
-
-
Thelwall, M.1
-
35
-
-
68249153264
-
Investigation of the accuracy of search engine hit counts
-
Uyar, A., Investigation of the accuracy of search engine hit counts. J. Inf. Sci. 35 (2009), 469–480.
-
(2009)
J. Inf. Sci.
, vol.35
, pp. 469-480
-
-
Uyar, A.1
-
37
-
-
78649832936
-
Reliability verification of search engines’ hit counts: how to select a reliable hit count for a query
-
Springer
-
Funahashi, T., Yamana, H., Reliability verification of search engines’ hit counts: how to select a reliable hit count for a query. Current Trends in Web Engineering, 2010, Springer, 114–125.
-
(2010)
Current Trends in Web Engineering
, pp. 114-125
-
-
Funahashi, T.1
Yamana, H.2
-
38
-
-
78649865426
-
Predicting web search hit counts
-
Tian, T., Geller, J., Chun, S.A., Predicting web search hit counts. Proc. of 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), Toronto, ON, Canada, 2010, 162–166.
-
(2010)
Proc. of 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), Toronto, ON, Canada
, pp. 162-166
-
-
Tian, T.1
Geller, J.2
Chun, S.A.3
-
39
-
-
80054080083
-
A prediction model for web search hit counts using word frequencies
-
Tian, T., Chun, S.A., Geller, J., A prediction model for web search hit counts using word frequencies. J. Inf. Sci. 37 (2011), 462–475.
-
(2011)
J. Inf. Sci.
, vol.37
, pp. 462-475
-
-
Tian, T.1
Chun, S.A.2
Geller, J.3
-
40
-
-
84988826696
-
Evaluating the suitability of web search engines as proxies for knowledge discovery from the web
-
Martínez-Sanahuja, L., Sánchez, D., Evaluating the suitability of web search engines as proxies for knowledge discovery from the web. Proc. of 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems, York, UK, 2016, 169–178.
-
(2016)
Proc. of 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems, York, UK
, pp. 169-178
-
-
Martínez-Sanahuja, L.1
Sánchez, D.2
-
41
-
-
85040046501
-
-
Netmarketshare. Desktop Search Engine Market Share. March 2017. Available at
-
Netmarketshare. Desktop Search Engine Market Share. March 2017. Available at https://www.netmarketshare.com/search-engine-market-share.aspx?qprid=4&qpcustomd=0.
-
-
-
-
42
-
-
34047135006
-
Googleology is bad science
-
Kilgarriff, A., Googleology is bad science. Comput. Ling. 33 (2007), 147–151.
-
(2007)
Comput. Ling.
, vol.33
, pp. 147-151
-
-
Kilgarriff, A.1
-
43
-
-
84863873032
-
Contextual correlates of synonymy
-
Rubenstein, H., Goodenough, J., Contextual correlates of synonymy. Commun. ACM 8 (1965), 627–633.
-
(1965)
Commun. ACM
, vol.8
, pp. 627-633
-
-
Rubenstein, H.1
Goodenough, J.2
-
44
-
-
0033408730
-
Can search engines be used as tools for web-link analysis? A critical view
-
Snyder, H., Rosenbaum, H., Can search engines be used as tools for web-link analysis? A critical view. J. Doc. 55 (1999), 375–384.
-
(1999)
J. Doc.
, vol.55
, pp. 375-384
-
-
Snyder, H.1
Rosenbaum, H.2
-
45
-
-
0035612855
-
Internet search engines - fluctuations in document accessibility
-
Mettrop, W., Nieuwenhuysen, P., Internet search engines - fluctuations in document accessibility. J. Doc. 57 (2001), 623–651.
-
(2001)
J. Doc.
, vol.57
, pp. 623-651
-
-
Mettrop, W.1
Nieuwenhuysen, P.2
-
46
-
-
55149106898
-
Performance of compressed inverted list caching in search engines
-
Zhang, J., Long, X., Suel, T., Performance of compressed inverted list caching in search engines. Proc. of 17th international conference on World Wide Web, Beijing, China, 2008, 387–396.
-
(2008)
Proc. of 17th international conference on World Wide Web, Beijing, China
, pp. 387-396
-
-
Zhang, J.1
Long, X.2
Suel, T.3
-
47
-
-
85040045472
-
A difference of a factor of 70,000 between hit counts and results returned in Google.
-
In: Unpublished technical note;
-
E. Davis. A difference of a factor of 70,000 between hit counts and results returned in Google. In: Unpublished technical note; 2015.
-
(2015)
-
-
Davis, E.1
-
49
-
-
57349194555
-
ResIn: a combination of results caching and index pruning for high-performance web search engines
-
Skobeltsyn, G., Junqueira, F., Plachouras, V., Baeza-Yates, R., ResIn: a combination of results caching and index pruning for high-performance web search engines. Proc. of 31st annual international ACM SIGIR conference on Research and development in information retrieval, Singapore, Singapore, 2008, 131–138.
-
(2008)
Proc. of 31st annual international ACM SIGIR conference on Research and development in information retrieval, Singapore, Singapore
, pp. 131-138
-
-
Skobeltsyn, G.1
Junqueira, F.2
Plachouras, V.3
Baeza-Yates, R.4
-
50
-
-
33750701866
-
Methods for evaluating dynamic changes in search engine rankings: a case study
-
Bar-Ilan, J., Levene, M., Mat-Hassan, M., Methods for evaluating dynamic changes in search engine rankings: a case study. J. Doc. 62 (2006), 708–729.
-
(2006)
J. Doc.
, vol.62
, pp. 708-729
-
-
Bar-Ilan, J.1
Levene, M.2
Mat-Hassan, M.3
-
51
-
-
84963625619
-
Review on Semantic Similarity
-
3rd IGI Global
-
Batet, M., Sánchez, D., Review on Semantic Similarity. Encyclopedia of Information Science and Technology, 3rd, 2014, IGI Global, 7575–7583.
-
(2014)
Encyclopedia of Information Science and Technology
, pp. 7575-7583
-
-
Batet, M.1
Sánchez, D.2
-
52
-
-
85015963191
-
HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset
-
Lastra-Díaz, J.J., García-Serrano, A., Batet, M., Fernández, M., Chirigati, F., HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. Inf. Syst. 66 (2017), 97–118.
-
(2017)
Inf. Syst.
, vol.66
, pp. 97-118
-
-
Lastra-Díaz, J.1
García-Serrano, A.2
Batet, M.3
Fernández, M.4
Chirigati, F.5
-
53
-
-
80955140428
-
Ontology based semantic clustering
-
Batet, M., Ontology based semantic clustering. AI Commun. 24 (2011), 291–292.
-
(2011)
AI Commun.
, vol.24
, pp. 291-292
-
-
Batet, M.1
-
54
-
-
84888198960
-
Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text
-
McInnes, B.T., Pedersen, T., Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text. J. Biomed. Inf. 46 (2013), 1116–1124.
-
(2013)
J. Biomed. Inf.
, vol.46
, pp. 1116-1124
-
-
McInnes, B.T.1
Pedersen, T.2
-
55
-
-
84875634268
-
A semantic framework to protect the privacy of electronic health records with non-numerical attributes
-
Martínez, S., Sánchez, D., Valls, A., A semantic framework to protect the privacy of electronic health records with non-numerical attributes. J. Biomed. Inf. 46 (2013), 294–303.
-
(2013)
J. Biomed. Inf.
, vol.46
, pp. 294-303
-
-
Martínez, S.1
Sánchez, D.2
Valls, A.3
-
56
-
-
84897535843
-
The distributional hypothesis
-
Sahlgren, M., The distributional hypothesis. Rivista di Linguistica 20 (2008), 33–53.
-
(2008)
Rivista di Linguistica
, vol.20
, pp. 33-53
-
-
Sahlgren, M.1
-
57
-
-
84855418436
-
Normalized (Pointwise) mutual information in collocation extraction
-
Bouma, G., Normalized (Pointwise) mutual information in collocation extraction. Proc. of Biennial GSCL Conference 2009, Tübingen, Germany, 2009, 31–40.
-
(2009)
Proc. of Biennial GSCL Conference 2009, Tübingen, Germany
, pp. 31-40
-
-
Bouma, G.1
-
58
-
-
32344447157
-
Distributional measures of semantic distance: a survey
-
Mohammad, S., Hirst, G., Distributional measures of semantic distance: a survey. http://arxiv.org/abs/1203.1858, 2006.
-
(2006)
-
-
Mohammad, S.1
Hirst, G.2
-
59
-
-
78049256235
-
A relational model of semantic similarity between words using automatically extracted lexical pattern clusters from the web
-
Bollegala, D., Matsuo, Y., Ishizuka, M., A relational model of semantic similarity between words using automatically extracted lexical pattern clusters from the web. Proc. of Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Singapore, Republic of Singapore, 2009, 803–812.
-
(2009)
Proc. of Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Singapore, Republic of Singapore
, pp. 803-812
-
-
Bollegala, D.1
Matsuo, Y.2
Ishizuka, M.3
-
60
-
-
84957638868
-
Estimating search engine index size variability: a 9-year longitudinal study
-
van den Bosch, A., Bogers, T., de Kunder, M., Estimating search engine index size variability: a 9-year longitudinal study. Scientometrics 107 (2016), 839–856.
-
(2016)
Scientometrics
, vol.107
, pp. 839-856
-
-
van den Bosch, A.1
Bogers, T.2
de Kunder, M.3
-
61
-
-
34248172904
-
Measures of semantic similarity and relatedness in the biomedical domain
-
Pedersen, T., Pakhomov, S., Patwardhan, S., Chute, C., Measures of semantic similarity and relatedness in the biomedical domain. J. Biomed. Inf. 40 (2007), 288–299.
-
(2007)
J. Biomed. Inf.
, vol.40
, pp. 288-299
-
-
Pedersen, T.1
Pakhomov, S.2
Patwardhan, S.3
Chute, C.4
-
62
-
-
85017479160
-
The pitfalls of using Google ngram to study language
-
Zhang, S., The pitfalls of using Google ngram to study language. in: Wired, 2015.
-
(2015)
in: Wired
-
-
Zhang, S.1