



Volume 12, Issue 1, 2006, Pages 29-36

Don't leave the data in the dark: Issues in digitizing print statistical publications

Author keywords

[No Author keywords available]

EID: 31144470797     PISSN: 10829873     EISSN: 10829873     Source Type: Journal    
DOI: 10.1045/january2006-linden     Document Type: Article
Times cited : (2)

References (17)
  • 2
    • 0003857217
    • U.S. Census Bureau, Statistical Abstracts, 〈http://www.census.gov/prod/www/abs/statab.html〉.
    • Statistical Abstracts
  • 3
    • 84858530498
    • FRASER® Federal Reserve Archival System for Economic Research, 〈http://fraser.stlouisfed.org/〉.
  • 5
    • 31144470634
    • note
    • Data users need to know a dataset's context (methodology, sampling, etc.), source, funding and authoring body, purpose, validity, accuracy, version, orientation in space and time, provenance, relationships to related variables, and potential applications.
  • 9
    • 33747381506
    • Instituto Nacional de Estadística, Geografía e Informática (INEGI), Anuarios Estadísticos de los Estados, 1994-2000. INEGI provides Excel tables from this series, back to 2002, on its web site: 〈http://www.inegi.gob.mx/inegi/〉, accessed November 30, 2005.
    • (1994) Anuarios Estadísticos de Los Estados
  • 10
    • 84858526836
    • Archive-friendly PDF in the works
    • We selected PDF/A format in anticipation of its suitability for long-term preservation. "Among federal archivists and records managers, PDF-A is viewed as one of two leading data format candidates for preserving future access to electronic records and documents....The proposed PDF-A standard specifies what should be stored in an archived file by prohibiting, for example, proprietary encryption schemes and embedded files such as executable scripts." Florence Olsen, "Archive-friendly PDF in the works," Federal Computer Week (March 15, 2004), 〈http://www.fcw.com/fcw/articles/2004/0315/news-pdf-03-15-04.asr〉. The Library of Congress provides details about PDF/A, including evaluation of sustainability, quality, and functionality factors: Sustainability of Digital Formats: Planning for Library of Congress Collections: PDF/A, PDF for Long-term Preservation, 〈http://www.digitalpreservation.gov/formats/fdd/fdd000125.shtml〉, accessed December 9, 2005.
    • (2004) Federal Computer Week
    • Olsen, F.
  • 11
    • 31144432604
    • note
    • We limited spreadsheet production to this subset for budgetary reasons. Document Solutions, Inc. (DSI) used custom zoned scanning software to process an image file for a particular page and determine the boundaries of the individual table or tables on the page. The text was processed with optical character recognition (OCR) software. DSI staff reviewed and corrected suspected OCR errors and manually corrected the layout. We also contracted for checksum macros, and the accuracy rate for alpha and numeric characters was certified to be at least 99%.
  • 12
    • 84858519715
    • DDI Version 2.1, 〈http://www.icpsr.umich.edu/DDI/users/dtd/index.html#version2.0〉.
    • DDI Version 2.1
  • 13
    • 84858531090
    • A sample DDI record is available at: 〈http://ssrs.yale.edu/egcdl/xml/Aguascalientes/2000/Aguascalientes_2000_03_11.xml〉. The same record, rendered with an XSL stylesheet, is available at: 〈http://webapp.icpsr.umich.edu/cocoon/DDI/SAMPLES/Aguascalientes_2000_03_011.xml〉.
  • 14
    • 23844444821
    • Digital preservation: Architecture and technology for trusted digital repositories
    • Ronald Jantz and Michael J. Giarlo, "Digital Preservation: Architecture and Technology for Trusted Digital Repositories," D-Lib Magazine 11 no. 6 (June 2005), 〈doi: 10.1045/june2005-jantz〉.
    • (2005) D-Lib Magazine, vol. 11, issue 6
    • Jantz, R.; Giarlo, M.J.
  • 15
    • 79956316613
    • Digital Library Federation, "Benchmark for Faithful Digital Reproductions of Monographs and Serials," Version 1 (December 2002), 〈http://purl.oclc.org/DLF/benchrepro0212〉, provides a definition and discussion of "faithful digital reproduction."
    • (2002) "Benchmark for Faithful Digital Reproductions of Monographs and Serials," Version 1
  • 17
    • 84909786590
    • Scientific information = data + meta-data
    • John McCarthy, "Scientific Information = Data + Meta-data," photocopy of draft to be published in Database Management: Proceedings of a Workshop November 1-2, 1984, held at the U.S. Navy Postgraduate School, Monterey, California (Department of Statistics Technical Report, Stanford University, 1985), 16.
    • (1984) Database Management: Proceedings of a Workshop, p. 16
    • McCarthy, J.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.