Characterisation Adrian Brown The National Archives, UK.
-
Upload
jason-corcoran -
Category
Documents
-
view
216 -
download
0
Transcript of Characterisation Adrian Brown The National Archives, UK.
![Page 1: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/1.jpg)
Characterisation
Adrian Brown
The National Archives, UK
![Page 2: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/2.jpg)
Overview
• Develop tools and services to characterise the significant properties of digital objects, to support:– Development of preservation plans– Validation of preservation actions (evaluating
change)
• The subproject considers:– Representation properties– Inherent properties
![Page 3: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/3.jpg)
Aims & Objectives
• To deliver:– Methodologies for describing significant
properties– Tools and services for automating
measurement and comparison of these properties
– Recommendations for improving the preservation characteristics of digital object types
![Page 4: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/4.jpg)
Aims & Objectives
![Page 5: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/5.jpg)
Achievements (Year 1)
• Characterisation registry
• Property description and extraction methodology and tools
• Characterisation tool framework
![Page 6: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/6.jpg)
Characterisation registry
• First iteration registry (bringing PRONOM to its next generation)
• Persistent Unique Identifier scheme for registry information
• Support for registry-driven characterisation tool framework
![Page 7: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/7.jpg)
![Page 8: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/8.jpg)
Describing and extracting characteristics
• Extensible Characterisation Description Language (XCDL)
• Extensible Characterisation Extraction Language (XCEL)
![Page 9: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/9.jpg)
Migrator
tiff
png
Extractor
tiff XCEL png XCEL
... XCEL... XCEL
Comparer
png XCDL
tiff XCDL
93%
XCDL & XCEL
![Page 10: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/10.jpg)
XCDL/XCEL tools
• Command line interface for extractor
• Preliminary specification for comparator
• GUI for extractor experiments
![Page 11: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/11.jpg)
GUI example
![Page 12: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/12.jpg)
Characterisation tool framework
• Registry-driven framework for automated deployment of tools
• Initial tools implemented:– DROID– JHOVE– Java POI (MS Office documents)– JAXP (XML validation)
![Page 13: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/13.jpg)
![Page 14: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/14.jpg)
Planned activities (Year 2)
• Final XC*L specifications
• Characterisation registry (iteration 2)
• Representation Information Registries White Paper
• XCDL extraction tool
• Characterisation tool wrapper specification
• Emerging technologies report
![Page 15: Characterisation Adrian Brown The National Archives, UK.](https://reader035.fdocument.pub/reader035/viewer/2022062404/551537d3550346a87d8b5b94/html5/thumbnails/15.jpg)
Thank you!