-
1
-
-
84866793760
-
Specialized research datasets in the CiteSeerX digital library
-
S. Bhatia, C. Caragea, H.-H. Chen, J. Wu, P. Treeratpituk, Z. Wu, M. Khabsa, P. Mitra, and C. L. Giles. Specialized research datasets in the CiteSeerX digital library. In D-Lib Magazine, volume 18, 2012.
-
(2012)
D-Lib Magazine
, vol.18
-
-
Bhatia, S.1
Caragea, C.2
Chen, H.-H.3
Wu, J.4
Treeratpituk, P.5
Wu, Z.6
Khabsa, M.7
Mitra, P.8
Giles, C.L.9
-
2
-
-
33846516584
-
-
Springer-Verlag New York, Inc., Secaucus, NJ, USA
-
C. M. Bishop. Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2006.
-
(2006)
Pattern Recognition and Machine Learning (Information Science and Statistics)
-
-
Bishop, C.M.1
-
3
-
-
84899928992
-
Citeseerx: A scholarly big dataset
-
C. Caragea, J. Wu, A. Ciobanu, K. Williams, J. Fernandez-Ramirez, H.-H. Chen, Z. Wu, and C. L. Giles. Citeseerx: A scholarly big dataset. ECIR '14, pages 311-322, 2014.
-
(2014)
ECIR '14
, pp. 311-322
-
-
Caragea, C.1
Wu, J.2
Ciobanu, A.3
Williams, K.4
Fernandez-Ramirez, J.5
Chen, H.-H.6
Wu, Z.7
Giles, C.L.8
-
4
-
-
33749012764
-
Layout and content extraction for pdf documents
-
Springer
-
H. Chao and J. Fan. Layout and content extraction for pdf documents. In Document Analysis Systems VI, pages 213-224. Springer, 2004.
-
(2004)
Document Analysis Systems VI
, pp. 213-224
-
-
Chao, H.1
Fan, J.2
-
5
-
-
79960548564
-
Collabseer: A search engine for collaboration discovery
-
ACM
-
H.-H. Chen, L. Gou, X. Zhang, and C. L. Giles. Collabseer: A search engine for collaboration discovery. In Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries, pages 231-240. ACM, 2011.
-
(2011)
Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries
, pp. 231-240
-
-
Chen, H.-H.1
Gou, L.2
Zhang, X.3
Giles, C.L.4
-
6
-
-
84882271960
-
CSSeer: An expert recommendation system based on CiteseerX
-
ACM
-
H.-H. Chen, P. Treeratpituk, P. Mitra, and C. L. Giles. CSSeer: An expert recommendation system based on CiteseerX. In Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '13, pages 381-382. ACM, 2013.
-
(2013)
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '13
, pp. 381-382
-
-
Chen, H.-H.1
Treeratpituk, P.2
Mitra, P.3
Giles, C.L.4
-
7
-
-
84889610268
-
Figure metadata extraction from digital documents
-
IEEE
-
S. R. Choudhury, P. Mitra, A. Kirk, S. Szep, D. Pellegrino, S. Jones, and C. L. Giles. Figure metadata extraction from digital documents. In Proceedings of ICDAR, pages 135-139. IEEE, 2013.
-
(2013)
Proceedings of ICDAR
, pp. 135-139
-
-
Choudhury, S.R.1
Mitra, P.2
Kirk, A.3
Szep, S.4
Pellegrino, D.5
Jones, S.6
Giles, C.L.7
-
8
-
-
84959891060
-
Automatic identification of research articles from crawled documents
-
Cornelia Caragea, Jian Wu and C. L. Giles. Automatic identification of research articles from crawled documents. In Proceedings of WSDM-WSCBD, 2014.
-
(2014)
Proceedings of WSDM-WSCBD
-
-
Caragea, C.1
Wu, J.2
Giles, C.L.3
-
9
-
-
34249753618
-
Support-vector networks
-
C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20(3):273-297, 1995.
-
(1995)
Machine Learning
, vol.20
, Issue.3
, pp. 273-297
-
-
Cortes, C.1
Vapnik, V.2
-
10
-
-
36348953944
-
Flux-cim: Flexible unsupervised extraction of citation metadata
-
E. Cortez, A. S. da Silva, M. A. Goncalves, F. Mesquita, and E. S. de Moura. Flux-cim: Flexible unsupervised extraction of citation metadata. JCDL '07, pages 215-224, 2007.
-
(2007)
JCDL '07
, pp. 215-224
-
-
Cortez, E.1
Da Silva, A.S.2
Goncalves, M.A.3
Mesquita, F.4
De Moura, E.S.5
-
11
-
-
85029602093
-
Parscit: An open-source crf reference string parsing package
-
I. G. Councill, C. L. Giles, and M.-Y. Kan. Parscit: An open-source crf reference string parsing package. LREC '08, 2008.
-
(2008)
LREC '08
-
-
Councill, I.G.1
Giles, C.L.2
Kan, M.-Y.3
-
12
-
-
79960522068
-
On identifying academic homepages for digital libraries
-
New York, NY, USA, ACM
-
S. D. Gollapalli, C. L. Giles, P. Mitra, and C. Caragea. On identifying academic homepages for digital libraries. In Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries, JCDL '11, pages 123-132, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries, JCDL '11
, pp. 123-132
-
-
Gollapalli, S.D.1
Giles, C.L.2
Mitra, P.3
Caragea, C.4
-
13
-
-
84941274546
-
Automatic document metadata extraction using support vector machines
-
IEEE
-
H. Han, C. L. Giles, E. Manavoglu, H. Zha, Z. Zhang, and E. A. Fox. Automatic document metadata extraction using support vector machines. In Digital Libraries, 2003. Proceedings. 2003 Joint Conference on, pages 37-48. IEEE, 2003.
-
(2003)
Digital Libraries, 2003. Proceedings. 2003 Joint Conference on
, pp. 37-48
-
-
Han, H.1
Giles, C.L.2
Manavoglu, E.3
Zha, H.4
Zhang, Z.5
Fox, E.A.6
-
14
-
-
84871049973
-
Recommending citations: Translating papers into references
-
ACM
-
W. Huang, S. Kataria, C. Caragea, P. Mitra, C. L. Giles, and L. Rokach. Recommending citations: Translating papers into references. In Proceedings of the 21st ACM international conference on Information and knowledge management, pages 1910-1914. ACM, 2012.
-
(2012)
Proceedings of the 21st ACM International Conference on Information and Knowledge Management
, pp. 1910-1914
-
-
Huang, W.1
Kataria, S.2
Caragea, C.3
Mitra, P.4
Giles, C.L.5
Rokach, L.6
-
15
-
-
84901218198
-
The number of scholarly documents on the public web
-
M. Khabsa and C. L. Giles. The number of scholarly documents on the public web. PloS one, 9(5):e93949, 2014.
-
(2014)
PloS One
, vol.9
, Issue.5
, pp. e93949
-
-
Khabsa, M.1
Giles, C.L.2
-
17
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
J. D. Laérty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. ICML '01, pages 282-289, 2001.
-
(2001)
ICML '01
, pp. 282-289
-
-
Laérty, J.D.1
McCallum, A.2
Pereira, F.C.N.3
-
18
-
-
84882270532
-
Evaluation of header metadata extraction approaches and tools for scientific pdf documents
-
New York, NY, USA, ACM
-
M. Lipinski, K. Yao, C. Breitinger, J. Beel, and B. Gipp. Evaluation of header metadata extraction approaches and tools for scientific pdf documents. In Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '13, pages 385-386, New York, NY, USA, 2013. ACM.
-
(2013)
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '13
, pp. 385-386
-
-
Lipinski, M.1
Yao, K.2
Breitinger, C.3
Beel, J.4
Gipp, B.5
-
19
-
-
36348992621
-
TableSeer: Automatic table metadata extraction and searching in digital libraries
-
ACM
-
Y. Liu, K. Bai, P. Mitra, and C. L. Giles. TableSeer: Automatic table metadata extraction and searching in digital libraries. In Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '07, pages 91-100. ACM, 2007.
-
(2007)
Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '07
, pp. 91-100
-
-
Liu, Y.1
Bai, K.2
Mitra, P.3
Giles, C.L.4
-
20
-
-
77952067594
-
GROBID: Combining automatic bibliographic data recognition and term extraction for scholarship publications
-
Berlin, Heidelberg, Springer-Verlag
-
P. Lopez. GROBID: Combining automatic bibliographic data recognition and term extraction for scholarship publications. In Proceedings of the 13th European Conference on Research and Advanced Technology for Digital Libraries, ECDL'09, pages 473-474, Berlin, Heidelberg, 2009. Springer-Verlag.
-
(2009)
Proceedings of the 13th European Conference on Research and Advanced Technology for Digital Libraries, ECDL'09
, pp. 473-474
-
-
Lopez, P.1
-
21
-
-
67650417928
-
Automated analysis of images in documents for intelligent document search
-
X. Lu, S. Kataria, W. J. Brouwer, J. Z. Wang, P. Mitra, and C. L. Giles. Automated analysis of images in documents for intelligent document search. IJDAR, 12(2):65-81, 2009.
-
(2009)
IJDAR
, vol.12
, Issue.2
, pp. 65-81
-
-
Lu, X.1
Kataria, S.2
Brouwer, W.J.3
Wang, J.Z.4
Mitra, P.5
Giles, C.L.6
-
22
-
-
29244464687
-
Information extraction from research papers using conditional random fields
-
July
-
F. Peng and A. McCallum. Information extraction from research papers using conditional random fields. Inf. Process. Manage., 42(4):963-979, July 2006.
-
(2006)
Inf. Process. Manage.
, vol.42
, Issue.4
, pp. 963-979
-
-
Peng, F.1
McCallum, A.2
-
23
-
-
70450273106
-
Disambiguating authors in academic publications using random forests
-
P. Treeratpituk and C. L. Giles. Disambiguating authors in academic publications using random forests. JCDL '09, pages 39-48, 2009.
-
(2009)
JCDL '09
, pp. 39-48
-
-
Treeratpituk, P.1
Giles, C.L.2
-
24
-
-
84889575897
-
Automatic detection of pseudocodes in scholarly documents using machine learning
-
S. Tuarob, S. Bhatia, P. Mitra, and C. Giles. Automatic detection of pseudocodes in scholarly documents using machine learning. In Proceedings of ICDAR, 2013.
-
(2013)
Proceedings of ICDAR
-
-
Tuarob, S.1
Bhatia, S.2
Mitra, P.3
Giles, C.4
-
25
-
-
84863539696
-
Improving algorithm search using the algorithm co-citation network
-
S. Tuarob, P. Mitra, and C. L. Giles. Improving algorithm search using the algorithm co-citation network. In Proceedings of JCDL, pages 277-280, 2012.
-
(2012)
Proceedings of JCDL
, pp. 277-280
-
-
Tuarob, S.1
Mitra, P.2
Giles, C.L.3
-
26
-
-
84882283563
-
A classification scheme for algorithm citation function in scholarly works
-
S. Tuarob, P. Mitra, and C. L. Giles. A classification scheme for algorithm citation function in scholarly works. In Proceedings of JCDL, JCDL '13, pages 367-368, 2013.
-
(2013)
Proceedings of JCDL, JCDL '13
, pp. 367-368
-
-
Tuarob, S.1
Mitra, P.2
Giles, C.L.3
-
27
-
-
84901755944
-
Scholarly big data information extraction and integration in the CiteSeerX digital library
-
IEEE
-
K. Williams, J. Wu, S. R. Choudhury, M. Khabsa, and C. L. Giles. Scholarly big data information extraction and integration in the CiteSeerX digital library. In Data Engineering Workshops (ICDEW), 2014 IEEE 30th International Conference on, pages 68-73. IEEE, 2014b.
-
(2014)
Data Engineering Workshops (ICDEW), 2014 IEEE 30th International Conference on
, pp. 68-73
-
-
Williams, K.1
Wu, J.2
Choudhury, S.R.3
Khabsa, M.4
Giles, C.L.5
-
28
-
-
84968621395
-
Utility-based control feedback in a digital library search engine: Cases in CiteSeerX
-
USENIX Association
-
J. Wu, A. Ororbia, K. Williams, M. Khabsa, Z. Wu, and C. L. Giles. Utility-based control feedback in a digital library search engine: Cases in CiteSeerX. In 9th International Workshop on Feedback Computing (Feedback Computing 14). USENIX Association, 2014.
-
(2014)
9th International Workshop on Feedback Computing (Feedback Computing 14)
-
-
Wu, J.1
Ororbia, A.2
Williams, K.3
Khabsa, M.4
Wu, Z.5
Giles, C.L.6
-
29
-
-
84870493887
-
Web crawler middleware for search engine digital libraries: A case study for citeseerx
-
New York, NY, USA, ACM
-
J. Wu, P. Teregowda, M. Khabsa, S. Carman, D. Jordan, J. San Pedro Wandelmer, X. Lu, P. Mitra, and C. L. Giles. Web crawler middleware for search engine digital libraries: A case study for citeseerx. WIDM '12, pages 57-64, New York, NY, USA, 2012. ACM.
-
(2012)
WIDM '12
, pp. 57-64
-
-
Wu, J.1
Teregowda, P.2
Khabsa, M.3
Carman, S.4
Jordan, D.5
San Pedro Wandelmer, J.6
Lu, X.7
Mitra, P.8
Giles, C.L.9
-
30
-
-
84869071720
-
The evolution of a crawling strategy for an academic document search engine: Whitelists and blacklists
-
New York, NY, USA, ACM
-
J. Wu, P. Teregowda, J. P. F. Ramirez, P. Mitra, S. Zheng, and C. L. Giles. The evolution of a crawling strategy for an academic document search engine: whitelists and blacklists. In Proceedings of the 3rd Annual ACM Web Science Conference, WebSci '12, pages 340-343, New York, NY, USA, 2012. ACM.
-
(2012)
Proceedings of the 3rd Annual ACM Web Science Conference, WebSci '12
, pp. 340-343
-
-
Wu, J.1
Teregowda, P.2
Ramirez, J.P.F.3
Mitra, P.4
Zheng, S.5
Giles, C.L.6
-
31
-
-
84964608895
-
Citeseerx: Ai in a digital library search engine
-
J. Wu, K. Williams, H.-H. Chen, M. Khabsa, C. Caragea, A. Ororbia, D. Jordan, and C. L. Giles. Citeseerx: Ai in a digital library search engine. In The Twenty-Sixth Annual Conference on Innovative Applications of Artificial Intelligence, IAAI '14, 2014.
-
(2014)
The Twenty-Sixth Annual Conference on Innovative Applications of Artificial Intelligence, IAAI '14
-
-
Wu, J.1
Williams, K.2
Chen, H.-H.3
Khabsa, M.4
Caragea, C.5
Ororbia, A.6
Jordan, D.7
Giles, C.L.8
-
32
-
-
84889563700
-
Measuring term informativeness in context
-
Z. Wu and C. L. Giles. Measuring term informativeness in context. In Proceedings of NAACL-HLT 2013, page 259-269, 2013.
-
(2013)
Proceedings of NAACL-HLT 2013
, pp. 259-269
-
-
Wu, Z.1
Giles, C.L.2
-
33
-
-
84889592982
-
Can back-of-the-book indexes be automatically created?
-
Z. Wu, Z. Li, P. Mitra, and C. L. Giles. Can back-of-the-book indexes be automatically created? In Proceedings of CIKM, pages 1745-1750, 2013.
-
(2013)
Proceedings of CIKM
, pp. 1745-1750
-
-
Wu, Z.1
Li, Z.2
Mitra, P.3
Giles, C.L.4
-
35
-
-
84919397810
-
Towards building a scholarly big data platform: Challenges, lessons and opportunities
-
JCDL
-
Z. Wu, J. Wu, M. Khabsa, K. Williams, H.-H. Chen, W. Huang, S. Tuarob, S. R. Choudhury, A. Ororbia, P. Mitra, and others. Towards building a scholarly big data platform: Challenges, lessons and opportunities. In Proceedings of the International Conference on Digital Libraries 2014, volume 447, page 12. JCDL 2014.
-
(2014)
Proceedings of the International Conference on Digital Libraries 2014
, vol.447
, Issue.12
-
-
Wu, Z.1
Wu, J.2
Khabsa, M.3
Williams, K.4
Chen, H.-H.5
Huang, W.6
Tuarob, S.7
Choudhury, S.R.8
Ororbia, A.9
Mitra, P.10
|