Header image

Archives in the Cloud - Exploring machine learning to transform Archives New Zealand's digital services for agencies

Tracks
IM Room: Rongomātāne C
Tuesday, November 19, 2024
2:45 PM - 3:15 PM
Breakout Room C, Rongomātāne C

Overview

Joshua Ng, Archives New Zealand


Speaker

Agenda Item Image
Joshua Ng
Digital Preservation Analyst
Archives New Zealand

Archives in the Cloud - Exploring machine learning to transform Archives New Zealand's digital services for agencies

Abstract

From February to July 2022, Archives New Zealand led a proof-of-concept project to explore machine learning (ML) and hyperscale cloud computing for auto-classifying digital public records, with a focus on surfacing information relevant to Māori. Funded by the Digital Government Partnership Innovation Fund, the project aimed to tackle challenges in managing vast digital information from public offices, much of which holds long-term archival significance under the Public Records Act 2005. Traditional systems cannot manage the massive volume of digital records, making manual sorting impractical. Archives New Zealand collaborated with stakeholders, including the Ministry of Justice, Ministry for Primary Industries, Microsoft, and AWS, to evaluate ML tools for appraising records and identifying Māori-specific content. Initial results were promising, but further model refinement is needed. Key future steps include ongoing collaboration with Māori, ensuring cultural relevance, and aligning with the Algorithm Charter. This project highlights the transformative potential of ML in digital record management, setting the stage for more efficient government archival practices and culturally inclusive records management.

Biography

Joshua Ng is currently a Digital Preservation Analyst at Archives New Zealand, specializing in digital audiovisual preservation. His primary responsibility is to ensure that all necessary processes are in place to maintain the integrity of the Government Digital Archive. He is actively learning and experimenting with artificial intelligence and machine learning technologies, with the goal of applying these in the next phases of the mass digital preservation project for at-risk audiovisual magnetic media (Utaina).
loading