메뉴 건너뛰기




Volumn 10, Issue 1, 2013, Pages 9-23

Confronting the digital: Or how academic history writing lost the plot

Author keywords

Digital history; Digital humanities; OCR; Referencing; Scholarship; Search; Standards

Indexed keywords


EID: 84873650358     PISSN: 14780038     EISSN: 14780046     Source Type: Journal    
DOI: 10.2752/147800413X13515292098070     Document Type: Article
Times cited : (106)

References (57)
  • 1
    • 84873673326 scopus 로고    scopus 로고
    • In a corporate blog post, on 5 August 2010 (8:26 a.m.), Leonid Taycher, a software engineer employed by Google, claimed there were 129,864,880 books in the world. See Leonid Taycher, 'Books of the World, Stand up and Be Counted! All 129,864,880 of You'
    • In a corporate blog post, on 5 August 2010 (8:26 a.m.), Leonid Taycher, a software engineer employed by Google, claimed there were 129,864,880 books in the world. See Leonid Taycher, 'Books of the World, Stand up and Be Counted! All 129,864,880 of You' (http://booksearch.blogspot.co.uk/2010/08/books-of-world-stand-up-and-be-counted.html).
  • 2
    • 84873643499 scopus 로고    scopus 로고
    • Google Books Search
    • Google Books Search (http://booksearch.blogspot.co.uk/).
  • 3
    • 84873667926 scopus 로고    scopus 로고
    • United States is likely to have a significant impact on the development of this new digital environment, but moves in both Europe and the US to create 'national digital libraries' and continuing commercial and state interest in the wider project suggest that there will be no substantial let up in the pace of digitization in the near future. For the issues associated with the Google Books Settlement, see
    • The legal mire that has overtaken the Google Books Settlement in the United States is likely to have a significant impact on the development of this new digital environment, but moves in both Europe and the US to create 'national digital libraries' and continuing commercial and state interest in the wider project suggest that there will be no substantial let up in the pace of digitization in the near future. For the issues associated with the Google Books Settlement, see.
  • 5
    • 84873661976 scopus 로고    scopus 로고
    • The relatively paucity of non-Western materials available in a digital format is particularly noteworthy, suggesting that the very process of digitization is effectively reproducing a kind of Western cultural hegemony that would not be acceptable if it was a product of selfconscious policy
    • The relatively paucity of non-Western materials available in a digital format is particularly noteworthy, suggesting that the very process of digitization is effectively reproducing a kind of Western cultural hegemony that would not be acceptable if it was a product of selfconscious policy.
  • 6
    • 84873666149 scopus 로고    scopus 로고
    • See Google Books
    • See Google Books (http://books.google.co.uk/bkshp?hl=en&tab=wp).
  • 7
    • 84873634559 scopus 로고    scopus 로고
    • Early English Books Online (EEBO)
    • Early English Books Online (EEBO) (http://eebo.chadwyck.com/home).
  • 8
    • 84873693549 scopus 로고    scopus 로고
    • Eighteenth Century Collections Online (ECCO)
    • Eighteenth Century Collections Online (ECCO) (http://gale.cengage.co.uk/product-highlights/history/eighteenth-century-collections-online.aspx).
  • 9
    • 84873646750 scopus 로고    scopus 로고
    • Digital Archive
    • The Times Digital Archive (http://gale.cengage.co.uk/times.aspx/).
    • The Times
  • 10
    • 84873643801 scopus 로고    scopus 로고
    • British Library's Newspaper Collections
    • British Library's Newspaper Collections (http://www.bl.uk/welcome/newspapers.html).
  • 11
    • 84873629329 scopus 로고    scopus 로고
    • Project Gutenberg
    • Project Gutenberg (http://www.gutenberg.org/).
  • 12
    • 84873696704 scopus 로고    scopus 로고
    • Million Book Project
    • Million Book Project (http://archive.org/details/millionbooks).
  • 13
    • 84873626844 scopus 로고    scopus 로고
    • Parliamentary Papers
    • Parliamentary Papers (http://parlipapers.chadwyck.co.uk/marketing/index.jsp).
  • 14
    • 84873696297 scopus 로고    scopus 로고
    • Nineteenth Century Censuses
    • Nineteenth Century Censuses (http://www.nationalarchives.gov.uk/records/census-records.htm).
  • 15
    • 84873695469 scopus 로고    scopus 로고
    • The Clergy of the Church of England Database
    • The Clergy of the Church of England Database (http://www.theclergydatabase.org.uk/index.html).
  • 16
    • 84873672101 scopus 로고    scopus 로고
    • Old Bailey Online
    • Old Bailey Online (http://www.oldbaileyonline.org/).
  • 17
    • 84873675078 scopus 로고    scopus 로고
    • Even the small subset of primary sources for British history from 1500 to 1900 available through the Connected Histories site, which currently (6 December 2011) excludes all books and includes only larger collections that meet a minimum standard of academic presentation, runs to over ten billion words of text and 140,000 images. It should be noted that the author has spent the last twelve years helping to create four substantial web resources: the Old Bailey Online
    • Even the small subset of primary sources for British history from 1500 to 1900 available through the Connected Histories site (http://www.connectedhistories.org/Default.aspx), which currently (6 December 2011) excludes all books and includes only larger collections that meet a minimum standard of academic presentation, runs to over ten billion words of text and 140,000 images. It should be noted that the author has spent the last twelve years helping to create four substantial web resources: the Old Bailey Online.
  • 18
    • 84873677051 scopus 로고    scopus 로고
    • London Lives, 1690-1800
    • London Lives, 1690-1800, (http://www.londonlives.org/).
  • 19
    • 84873660541 scopus 로고    scopus 로고
    • Connected Histories, and Locating London's Past, and is fully implicated in the process and project
    • Connected Histories, and Locating London's Past (http://www.locatinglondon.org/index.html) and is fully implicated in the process and project.
  • 20
    • 84873668250 scopus 로고    scopus 로고
    • This is not to denigrate the many, primarily small-scale, digitization projects led by academics, and generally directed towards addressing specific research questions. Transcribe Bentham
    • This is not to denigrate the many, primarily small-scale, digitization projects led by academics, and generally directed towards addressing specific research questions. Transcribe Bentham (http://www.ucl.ac.uk/transcribe-bentham/).
  • 21
    • 84873646871 scopus 로고    scopus 로고
    • The Newton Project
    • The Newton Project (http://www.newtonproject.sussex.ac.uk/prism.php?id=1).
  • 22
    • 84873688411 scopus 로고    scopus 로고
    • The Electronic Enlightenment, to take just three examples among many others, represent a high academic standard, created in pursuit of answers to serious academic questions. But, by their nature, and in comparison to the large commercial digitization projects, they represent a small part of a wider online landscape of historical materials
    • The Electronic Enlightenment (http://www.e-enlightenment.com/), to take just three examples among many others, represent a high academic standard, created in pursuit of answers to serious academic questions. But, by their nature, and in comparison to the large commercial digitization projects, they represent a small part of a wider online landscape of historical materials.
  • 23
    • 84873663221 scopus 로고    scopus 로고
    • Many of the librarians and archivists involved in the wider process of digitization are employed by universities or major research libraries and would consider themselves 'academics', but the widespread adoption of the term 'alt-ac' (alternative-academic) to describe the careers of the people involved reflects the distance between the professional academic discipline of history and the people most fully engaged in the wider project
    • Many of the librarians and archivists involved in the wider process of digitization are employed by universities or major research libraries and would consider themselves 'academics', but the widespread adoption of the term 'alt-ac' (alternative-academic) to describe the careers of the people involved reflects the distance between the professional academic discipline of history and the people most fully engaged in the wider project.
  • 25
    • 84906032677 scopus 로고    scopus 로고
    • New Universes or Black Holes? Does Digital Change Anything?
    • Toni Weller (ed.), Oxon. and London, forthcoming
    • Valerie Johnson and David Thomas, 'New Universes or Black Holes? Does Digital Change Anything?', in Toni Weller (ed.), History in the Digital Age (Oxon. and London, forthcoming 2012).
    • (2012) History In the Digital Age
    • Johnson, V.1    Thomas, D.2
  • 26
    • 84873663640 scopus 로고    scopus 로고
    • This excludes figures for the British Library (which were not available), and money generated for organizations such as the National Archives by participating in commercial digital projects. It should be noted that even this ten-year total is dwarfed by the amount spent each year supporting more traditional forms of humanist scholarship. 8. See
    • This excludes figures for the British Library (which were not available), and money generated for organizations such as the National Archives by participating in commercial digital projects. It should be noted that even this ten-year total is dwarfed by the amount spent each year supporting more traditional forms of humanist scholarship. 8. See.
  • 27
    • 84873633672 scopus 로고    scopus 로고
    • Funding for Sustainability: How Funders
    • June
    • Nancy L. Maron and Matthew Loy, 'Funding for Sustainability: How Funders' Practices Influence the Future of Digital Resources', June 2011, p. 9, note 3 (http://www.jisc.ac.uk/media/documents/programmes/contentalliance/sandrfundingforsustainability.pdf).
    • (2011) Practices Influence the Future of Digital Resources , Issue.3 , pp. 9
    • Maron, N.L.1    Loy, M.2
  • 28
    • 84873619143 scopus 로고    scopus 로고
    • Of the NOF projects, approximately 15 per cent were located in universities and a further 20 per cent in recognized academic institutions such as the National Archives and British Library, but projects across the programme suffered a similar rate of failure
    • Of the NOF projects, approximately 15 per cent were located in universities and a further 20 per cent in recognized academic institutions such as the National Archives and British Library, but projects across the programme suffered a similar rate of failure.
  • 29
    • 84873643131 scopus 로고    scopus 로고
    • The Je-S system went live in May 2003, and the first use of the system by the AHRB/C was recorded in June 2005. See Je-S Help
    • The Je-S system went live in May 2003, and the first use of the system by the AHRB/C was recorded in June 2005. See Je-S Help (https://je-s.rcuk.ac.uk/Handbook/index.htm#pages/JeSHelpdesk.htm).
  • 30
    • 84873656993 scopus 로고    scopus 로고
    • It is difficult to access local and subject-specific information in this area, but for a general overview of the patterns of concentration of journal publishing across the academy see
    • It is difficult to access local and subject-specific information in this area, but for a general overview of the patterns of concentration of journal publishing across the academy see.
  • 31
    • 84873671535 scopus 로고    scopus 로고
    • (in cooperation with the Oxford Internet Institute), Oxford
    • Corinne M. Flick (in cooperation with the Oxford Internet Institute), Geographies of the World's Knowledge (Oxford, 2011), http://www.oii.ox.ac.uk/publications/convoco_geographies_en.pdf, pp.14-19.
    • (2011) Geographies of the World's Knowledge , pp. 14-19
    • Flick, C.M.1
  • 32
    • 84873697979 scopus 로고    scopus 로고
    • Internationally the largest players are Elsevier, Wiley, Springer and Taylor and Francis
    • Internationally the largest players are Elsevier, Wiley, Springer and Taylor and Francis.
  • 33
    • 84873660212 scopus 로고    scopus 로고
    • The most common system used for this 'graphical mapping' is called Olive. See
    • The most common system used for this 'graphical mapping' is called Olive. See http://www.olivesoftware.com/.
  • 34
    • 77950651836 scopus 로고    scopus 로고
    • Measuring Mass Text Digitization Quality and Usefulness
    • Simon Tanner, Trevor Muñoz and Pich Hemy Ros, 'Measuring Mass Text Digitization Quality and Usefulness', D-Lib Magazine, 15(7/8) (2009), http://www.dlib.org/dlib/july09/munoz/07munoz.html.
    • (2009) D-Lib Magazine , vol.15 , Issue.7-8
    • Tanner, S.1    Muñoz, T.2    Ros, P.H.3
  • 35
    • 84873693757 scopus 로고    scopus 로고
    • The Burney Collection is being highlighted here only because of the existence of a detailed assessment. It is largely typical of other sites providing OCRd text, and better than many
    • The Burney Collection is being highlighted here only because of the existence of a detailed assessment. It is largely typical of other sites providing OCRd text, and better than many.
  • 36
    • 84873687911 scopus 로고    scopus 로고
    • Seventeenth and Eighteenth Century Burney Collection of newspapers
    • Seventeenth and Eighteenth Century Burney Collection of newspapers (http://www.bbk.ac.uk/lib/elib/databases/newspapers/burney).
  • 37
    • 63849314074 scopus 로고    scopus 로고
    • (London, England), Saturday 16 May 1778
    • Daily Advertiser (London, England), Saturday 16 May 1778.
    • Daily Advertiser
  • 38
    • 84873630818 scopus 로고    scopus 로고
    • Issue 14784, Gale Document Number: Z2000162519
    • Issue 14784, Gale Document Number: Z2000162519.
  • 39
    • 84873619550 scopus 로고    scopus 로고
    • This selection of raw OCR was constructed by stringing together snippets of text exposed as part of the search results in Connected Histories. It is impossible to link directly to this text as the system of licensing used by Gale Cengage embeds local access data within the URL. An accurate transcription of this text would read: '"It is supposed that we have actually 40 Frigates at Sea for the Protection of our Trade. The St. Domingo Fleet is safely arrived at La Rochelle with a rich Cargo
    • This selection of raw OCR was constructed by stringing together snippets of text exposed as part of the search results in Connected Histories. It is impossible to link directly to this text as the system of licensing used by Gale Cengage embeds local access data within the URL. An accurate transcription of this text would read: '"It is supposed that we have actually 40 Frigates at Sea for the Protection of our Trade. The St. Domingo Fleet is safely arrived at La Rochelle with a rich Cargo.
  • 40
    • 84873682978 scopus 로고    scopus 로고
    • Its Arrival is the more agreeable to the Merchants, as there was not a Ship of the whole Fleet insured Victualler From Cork For New-York, Dated Boston, March 30
    • Its Arrival is the more agreeable to the Merchants, as there was not a Ship of the whole Fleet insured. Extract of a Letter from Mr. Coxwood, mate of the Ulster, Victualler from Cork for New-York, dated Boston, March 30.
    • Extract of a Letter From Mr. Coxwood, Mate of the Ulster
  • 41
    • 84873644723 scopus 로고    scopus 로고
    • th of January we fell in with an American Privateer, commanded by Captain Bailey, called the Flower of the Sea, of 26 Guns and 180 Men, to which Vessel we were obliged to strike after discharging our Guns, six four Pounders. We were towed into this Place by the Privateer the 23d instant, after a very dangerous Passage, having carried away all our Masts."'
    • th of January we fell in with an American Privateer, commanded by Captain Bailey, called the Flower of the Sea, of 26 Guns and 180 Men, to which Vessel we were obliged to strike after discharging our Guns, six four Pounders. We were towed into this Place by the Privateer the 23d instant, after a very dangerous Passage, having carried away all our Masts."'
  • 42
    • 84873663197 scopus 로고    scopus 로고
    • According to Wikipedia, the statute acreage of the administrative county of Hertfordshire in 1891 was 406,932. The relevant article footnotes the census report for this date. See 'Administrative counties of England' in Wikipedia, accessed 6 December
    • According to Wikipedia, the statute acreage of the administrative county of Hertfordshire in 1891 was 406,932. The relevant article footnotes the census report for this date. See 'Administrative counties of England' in Wikipedia (http://en.wikipedia.org/wiki/Administrative_counties_of_England#Area_and_population, accessed 6 December 2011).
    • (2011)
  • 43
    • 84873620400 scopus 로고    scopus 로고
    • Interestingly, more sophisticated users who knew the digital landscape and original sources more fully might have relied on the highly academic and comprehensive edition of this material available through Histpop: The Online Historical Population Reports Project, a site that gives full access to the original census reports. But if they had done this, they would have been confronted with three different areas for Hertfordshire in 1891 - for the ancient county, the administrative county (used in Wikipedia) and the registration county
    • Interestingly, more sophisticated users who knew the digital landscape and original sources more fully might have relied on the highly academic and comprehensive edition of this material available through Histpop: The Online Historical Population Reports Project (http://www.histpop.org/ohpr/servlet/Show?page=Home), a site that gives full access to the original census reports. But if they had done this, they would have been confronted with three different areas for Hertfordshire in 1891 - for the ancient county, the administrative county (used in Wikipedia) and the registration county.
  • 46
    • 84873631758 scopus 로고    scopus 로고
    • Exclusive: How Google's Algorithm Rules the Web
    • For an accessible account of these developments, and of the evolution of the Google algorithm more generally, see, 22 February
    • For an accessible account of these developments, and of the evolution of the Google algorithm more generally, see Steven Levy, 'Exclusive: How Google's Algorithm Rules the Web', Wired, 22 February 2010 (http://www.wired.com/magazine/2010/02/ff_google_algorithm/).
    • (2010) Wired
    • Levy, S.1
  • 47
    • 84873638038 scopus 로고    scopus 로고
    • Google's description of this process makes a positive virtue of its ever hanging results: 'When you perform a search, the results are often displayed with the information: About XXXX results (X seconds). Google's calculation of the total number of search results is an estimate. We understand that a ballpark figure is valuable, and by providing an estimate rather than an exact account, we can return quality search results faster. In addition, when you click on the next page of search results, the total number of search results can change
    • Google's description of this process makes a positive virtue of its ever hanging results: 'When you perform a search, the results are often displayed with the information: About XXXX results (X seconds). Google's calculation of the total number of search results is an estimate. We understand that a ballpark figure is valuable, and by providing an estimate rather than an exact account, we can return quality search results faster. In addition, when you click on the next page of search results, the total number of search results can change.
  • 48
    • 84873637036 scopus 로고    scopus 로고
    • Google's search index is constantly changing, and your second search results page may come from a slightly different version of the index than the first page
    • Google's search index is constantly changing, and your second search results page may come from a slightly different version of the index than the first page.' (http://support.google.com/webmasters/bin/answer.py?hl=en&answer=70920).
  • 49
    • 84873629119 scopus 로고    scopus 로고
    • These searches were undertaken from my home computer located in Muswell Hill, UK, on 8 December 2011 and, as explained in the text, cannot be repeated or effectively cited
    • These searches were undertaken from my home computer located in Muswell Hill, UK, on 8 December 2011 and, as explained in the text, cannot be repeated or effectively cited.
  • 50
    • 77956894398 scopus 로고    scopus 로고
    • Google Books: A Metadata Train Wreck
    • See, 29 August
    • See Geoffrey Nunberg, 'Google Books: A Metadata Train Wreck', Language Log, 29 August 2009 (http://languagelog.ldc.upenn.edu/nll/?p=1701).
    • (2009) Language Log
    • Nunberg, G.1
  • 51
    • 84873664945 scopus 로고    scopus 로고
    • See the BISG webpage advertising the BISAC subject headings, 2011 edition, accessed December
    • See the BISG webpage advertising the BISAC subject headings, 2011 edition (http://www.bisg.org/publications/product.php?p=149, accessed December 2011).
    • (2011)
  • 52
    • 84873616692 scopus 로고
    • It is perhaps invidious to point out any single example, but it was only when the Clergy of the Church of England Database project attempted to use the detailed information contained in the Royal Historical Society's, Frederic A. Young, that the inconsistent character of many entries became apparent
    • It is perhaps invidious to point out any single example, but it was only when the Clergy of the Church of England Database project attempted to use the detailed information contained in the Royal Historical Society's Guide to the Local Administrative Units of England (Frederic A. Young, vol. 1, 1979, vol. 2, 1991) that the inconsistent character of many entries became apparent.
    • (1979) Guide to The Local Administrative Units of England , vol.2
  • 53
    • 84873692195 scopus 로고    scopus 로고
    • One looks to the Royal Historical Society, the Historical Association or the British Academy, or for a clear steer from the major academic presses and journals
    • One looks to the Royal Historical Society, the Historical Association or the British Academy, or for a clear steer from the major academic presses and journals.
  • 54
    • 6144294325 scopus 로고
    • The Sky Line "Mother Jacob's Home Remedies
    • 1 December
    • Lewis Mumford, 'The Sky Line "Mother Jacob's Home Remedies"', New Yorker, 1 December 1962, p. 148 (http://www.newyorker.com/archive/1962/12/01/1962_12_01_148_TNY_CARDS_000269697).
    • (1962) New Yorker , pp. 148
    • Mumford, L.1
  • 55
    • 84873683269 scopus 로고    scopus 로고
    • This observation was made as an aside directed at urban planners in a review of a book by Jane Jacobs. It was located following a halfremembered reference in an article in a popular newspaper I have since thrown away, and a Google search on the phrase 'questions that computers can answer'
    • This observation was made as an aside directed at urban planners in a review of a book by Jane Jacobs. It was located following a halfremembered reference in an article in a popular newspaper I have since thrown away, and a Google search on the phrase 'questions that computers can answer'.
  • 57
    • 79955425563 scopus 로고    scopus 로고
    • Quantitative Analysis of Culture Using Millions of Digitized Books
    • The Google Books Team, published online ahead of print, 16, December
    • Jean-Baptiste Michel, Yuan Kui Shen, Aviva Presser Aiden, Adrian Veres, Matthew K. Gray, William Brockman, The Google Books Team, Joseph P. Pickett, Dale Hoiberg, Dan Clancy, Peter Norvig, Jon Orwant, Steven Pinker, Martin A. Nowak and Erez Lieberman Aiden, 'Quantitative Analysis of Culture Using Millions of Digitized Books', Science (published online ahead of print, 16 December 2010, http://www.sciencemag.org/content/331/6014/176.full).
    • (2010) Science
    • Michel, J.-B.1    Shen, Y.K.2    Aiden, A.P.3    Veres, A.4    Gray, M.K.5    Brockman, W.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.