PAPER SESSION 10: Formats in Focus: Strategies
Tracks
Rongomātāne B
Wednesday, November 5, 2025 |
1:00 PM - 2:30 PM |
Rongomātāne B |
Speaker
Mr Valentijn Gilissen
Data Processing Team Leader
DANS
Caring About Enhanced Curation - What, Why, When, How and Who?
Summary Abstract
Datamanagers at the Dutch repository for research data DANS recognize CoreTrustSeal Curation Levels in their work. These Curation Levels are seen as useful terminology to structure the workflow. A digital repository should arguably have strategies in place regarding file formats and actions may be taken by a datamanager during Enhanced Curation. These strategies aim to preserve datasets in a FAIR manner, even if the end users may not be aware of the needs for Enhanced Curation nor may seem to care for it. Enhanced Curation provides a clear focus for discussing file format strategies: what might be done, why to do it, when to take action, how to take action and who should preoccupy themselves with these matters.
Biography
Valentijn Gilissen works at the Dutch research data repository DANS as Data Processing Team Leader, Data Steward and Preservation Officer. He is responsible for preservation policies such as the DANS Preferred Formats guidelines. Valentijn is part of the Preservation Watch expert group of the Dutch Digital Heritage Network.
Crystal Sanchez
Dams
Smithsonian Institution
Digital Content Types at the Smithsonian Institution
Summary Abstract
Digital content is used across the organization to perform the activities that are core to the Smithsonian’s mission and essential to the strategic plan. Deploying this digital content at such scale requires two key considerations, a management strategy that understands what content has Institutional value, and a supported digital ecosystem of tools that define pathways for these assets to make this value usable and accessible throughout the lifecycle.
In 2023-24, The Smithsonian’s Office of Digital Transformation embarked on a project to investigate, explore, map, and categorize the Smithsonian’s digital content, through a pan-institutional lens. The project’s main purpose was to identify and define the types of digital content that describe the landscape of digital products across the Institution. The investigation also identified strengths, gaps, existing systems of support, and conversational themes outlined by digital stakeholders.
This short paper will report on the project and findings. A full project report can be found at Smithsonian Research Online.
In 2023-24, The Smithsonian’s Office of Digital Transformation embarked on a project to investigate, explore, map, and categorize the Smithsonian’s digital content, through a pan-institutional lens. The project’s main purpose was to identify and define the types of digital content that describe the landscape of digital products across the Institution. The investigation also identified strengths, gaps, existing systems of support, and conversational themes outlined by digital stakeholders.
This short paper will report on the project and findings. A full project report can be found at Smithsonian Research Online.
Biography
Crystal Sanchez is a media archivist at the Smithsonian Institution on the Digital Asset Management team (DAMS), working with digital collections from across the Smithsonian’s diverse Museums, Archives, Libraries, Research Centers, and the Zoo. She loves to stroll through fine art museums and to cook.
Mr Paul Wheatley
Director
Preserve Together Ltd
Format identification in context: patterns & hazards in digital preservation workflows
Summary Abstract
Format identification is crucial for digital preservation, yet because of its complexities it is inconsistently implemented in our workflows. This paper shares findings from the “Registries of Good Practice” project and preservation practices collated by the Preservation Registries Special Interest Group. Analysing diverse institutional workflows, we demonstrate how preservation goals, institutional capacities, and specific preservation stages dictated identification requirements. This paper emphasizes the need to connect disparate practices and establish evidence-based improvements for format identification.
Biography
Paul Wheatley is a digital preservation consultant at Preserve Together Ltd, and has previously worked for the Digital Preservation Coalition, the British Library and the University of Leeds in a variety of digital preservation focused roles.
Sam Alloing
Digital Preservation Officer
National Library Of The Netherlands
A Proposal For Implementing A Notication System For Obsolete File Formats and Applications
Summary Abstract
This paper builds upon the previous paper “Monitoring File Format Obsolescence in repositories”. This first phase was focused on testing a method for analysing the file format life cycle and detecting a life cycle. With the term life cycle is meant, the popularity of a file format over time shown in a plot. It shows the rise and/or the decline of a file format over the years. This is important input for Preservation Watch and Preservation Planning.
The method of the first phase also research the possilibilty of predicting the life cycle of a file format. This was implemented by using the observed life cycle. Data set was split in a test and training set and the training set was compared to the observed life cycle.
In this paper the method from the first phase will be extended with extra capabilities and a solution for monitoring the life cycle. The monitoring will include notifications when action needs to be taken. The link between file format life cycle and application life cycle will be investigated.
The method of the first phase also research the possilibilty of predicting the life cycle of a file format. This was implemented by using the observed life cycle. Data set was split in a test and training set and the training set was compared to the observed life cycle.
In this paper the method from the first phase will be extended with extra capabilities and a solution for monitoring the life cycle. The monitoring will include notifications when action needs to be taken. The link between file format life cycle and application life cycle will be investigated.
Biography
Sam Alloing is Digital Preservation Officer at the National Library of the Netherlands. He participated in the DDHN Preservation Watch expert group.
Jaco van der Meij is a student in Applied Mathematics at The Hague University of Applied Science. He undertakes a graduation internship at the National Library of the Netherlands. During the internship, he will conduct the research as proposed in this publication.
