Currently, there is no large-scale production usage of commercial data preservation services by the public research sector. ARCHIVER is a unique initiative in the context of the European Open Science Cloud (EOSC) that, over a three-year pre-commercial procurement process, developed innovative services for Long Term Digital Preservation of scientific datasets. Commercial suppliers performed the R&D competitively over three phases, leading to the selection of pilot services for scientific data archiving and preservation led by Arkivum and LIBNOVA. The R&D was driven by a diverse range of stakeholders, including CERN, which operates the Large Hadron Collider near Geneva, DESY (the Deutsches Elektronen-Synchrotron, in Germany), EMBL-EBI (the European Bioinformatics Institute, in Cambridge), and PIC (Port d’Informació Científica, near Barcelona), all of which contributed high-value datasets associated with the EOSC.
The work accomplished in ARCHIVER is a game-changer for long-term research data management, in both mindset and technology: what data researchers retain, how they keep intellectual control of it, and what data stewards must do to ensure its long-term value can be realised. The companies selected by ARCHIVER are promoting environmentally sustainable solutions that provide the means to analyse and reduce the carbon footprint, as an aid towards carbon neutrality.
A key component of sustainability is to ensure that the innovation developed during the project has wide exposure to potential procurers within the European research community and other business sectors. To achieve this, the project has started an onboarding process to make the resulting services available to Early Adopters. Making ARCHIVER services available through the EOSC marketplace will give researchers and procuring organisations sustainable access to these services, allowing them to trial the services, evaluate their functionality and purchase them with a clear costing model.
The ARCHIVER effort resulted in services that are immediately usable by the public research sector in Europe. This will expose the novel service offerings, relevant for at least 18 pan-European infrastructures, to the estimated 1.7 million European researchers and 70 million professionals in science and technology, across the public and private sectors, who are expected to make use of the EOSC.
The Pilot End-Phase Event guided ARCHIVER stakeholders at large through the services developed and their exploitation potential for the research community in EOSC. The event was open to policy makers, repository managers, researchers and Early Adopters wishing to learn more about the ARCHIVER model and the technical solutions implemented.
The event took place on Monday 13 June 2022 (09.00 am to 11.00 am CEST).
Event chair: Sara Pittonet, Trust-IT & ARCHIVER outreach manager
09.00 - 09.10
Welcome - Martin Gasthuber (DESY); ARCHIVER overview - João Fernandes (CERN)
09.10 - 09.30
Pilot Phase - Buyers Group use cases (CERN, DESY, EMBL-EBI, PIC)
09.30 - 09.40
Early Adopter Use Case: ECMWF Open Data - Raoult Baudouin, Principal Software Strategist/Architect (ECMWF)
09.40 - 09.50
09.50 - 10.20
Presentation from the LIBNOVA consortium
"LABDRIVE, the ultimate Research Data Management and Digital Preservation platform"
10.20 - 10.50
Presentation from the Arkivum consortium
"Arkivum Digital Archiving and Preservation Solution"
10.50 - 11.00
Closing remarks - João Fernandes (CERN)
Two Training Webinars to explore the pilot platform
To better explain the functionality of the resulting pilot platform, two public training webinars are being organised: on 22 June at 09.30 CEST and on 23 June at 14.00 CEST.
Arkivum provides a digital archiving and preservation solution built to meet the varied and challenging long-term data management use cases of the scientific research community. Our technology is able to preserve petabyte-scale datasets in the most cost-effective and environmentally sustainable way.
LABDRIVE is a Research Data Management and Digital Preservation platform that focuses on scientific datasets. It allows organizations to move away from a siloed approach, in which each series of datasets, department or unit uses multiple, disaggregated systems to keep content, to a single repository that can adapt to the particularities of each dataset, unifying all content in a single platform.