License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/DagSemProc.10291.10
URN: urn:nbn:de:0030-drops-29012
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2010/2901/
Go to the corresponding Portal


Milic-Frayling, Natasa

Digital Object Characterization: Document Conversion and Qualiity Assurance

pdf-format:
10291.MilicFraylingNatasa.Paper.2901.pdf (0.3 MB)


Abstract

Whether we are migrating document formats to achieve interoperability or ensure long term preservation, we are faced with the issue of assessing the quality of the digital object transformation. However, comparing two digital objects is not straightforward. It raises the issue of properties that are inherent to the digital objects and those that are dependent on the environment in which the objects are created, viewed, and compared to one another. That has implications for devising methods to extract document properties, interpret observed characteristics, and apply similarity metrics. Furthermore, in order to take actions based on collected measurements, we need to define or learn the significance of individual document properties from the perspective of human perception and usage scenarios. We illustrate the complexity of these issues by presenting a method for comparing converted office documents and discussing the challenges from the technical and methodology point of view.


BibTeX - Entry

@InProceedings{milicfrayling:DagSemProc.10291.10,
  author =	{Milic-Frayling, Natasa},
  title =	{{Digital Object Characterization: Document Conversion and Qualiity Assurance}},
  booktitle =	{Automation in Digital Preservation},
  pages =	{1--8},
  series =	{Dagstuhl Seminar Proceedings (DagSemProc)},
  ISSN =	{1862-4405},
  year =	{2010},
  volume =	{10291},
  editor =	{Jean-Pierre Chanod and Milena Dobreva and Andreas Rauber and Seamus Ross},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2010/2901},
  URN =		{urn:nbn:de:0030-drops-29012},
  doi =		{10.4230/DagSemProc.10291.10},
  annote =	{Keywords: Characterization, quality assurance, format migration, file conversion}
}

Keywords: Characterization, quality assurance, format migration, file conversion
Collection: 10291 - Automation in Digital Preservation
Issue Date: 2010
Date of publication: 28.12.2010


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI