Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Digitization of Text documents Using PDF/A.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • المؤلفون: Yan Han; Xueheng Wan
  • المصدر:
    Information Technology & Libraries. Mar2018, Vol. 37 Issue 1, p52-64. 13p. 1 Color Photograph, 2 Charts, 1 Graph.
  • معلومة اضافية
    • الموضوع:
    • الموضوع:
    • نبذة مختصرة :
      The purpose of this article is to demonstrate a practical use case of PDF/A for digitization of text documents following FADGI's recommendation of using PDF/A as a preferred digitization file format. The authors demonstrate how to convert and combine TIFFs with associated metadata into a single PDF/A-2b file for a document. Using real-life examples and open source software, the authors show readers how to convert TIFF images, extract associated metadata and International Color Consortium (ICC) profiles, and validate against the newly released PDF/A validator. The generated PDF/A file is a self-contained and self-described container that accommodates all the data from digitization of textual materials, including page-level metadata and ICC profiles. Providing theoretical analysis and empirical examples, the authors show that PDF/A has many advantages over the traditionally preferred file format, TIFF/JPEG2000, for digitization of text documents. [ABSTRACT FROM AUTHOR]
    • نبذة مختصرة :
      Copyright of Information Technology & Libraries is the property of American Library Association and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)