Nieuws

Review and presentations DigiShape day Large databases in the water sector

DigiShape dag26-05-22 at 13.07.14

On Friday 22 May, the DigiShape community gathered at Port of Rotterdam for a day about large data files in the water sector. The central question: how do you, as a public party, knowledge institution or market party, effectively deal with growing amounts of data from sensors, models and other data sources?

That question was looked at from several angles. The plenary presentations showed what large data files mean for operational systems at Rijkswaterstaat and KNMI. In the parallel sessions, participants delved deeper into data collection at sea, shared digital infrastructure and the use of data lakes. During lunch and at the end of the afternoon, DigiShape seedmoney was central, with new project ideas for 2026 and updates of projects from 2025.

On this page you will find the retrospective, presentations and project updates.

Plenary presentations by Rijkswaterstaat and KNMI

Floris Langeraert showed how Rijkswaterstaat works with large amounts of data in operational systems, including within RWsOS. His presentation covered various points of attention, such as data flows from multiple sources, monitoring of data quality, reprocessing of historical data and the question of how data can be made more findable and reusable.

Download presentation Floris Langeraert, Rijkswaterstaat (pdf)

Wim Som de Cerff gave an insight into data and data management at KNMI. He showed which types of large datasets KNMI works with, such as weather model data, climate model data, satellite data, observations and warnings. He also discussed the KNMI Data Platform, APIs, user questions and future developments around federated data networks and AI applications.

Download presentation Wim Som de Cerff, KNMI (pdf)

These parallel sessions with a practical approach

MIVSP: information provision at sea

In the parallel session on MIVSP, the Maritime Information Service Point, Martijn van Dijk of Rijkswaterstaat discussed data collection and information provision in the North Sea. This included sensors at sea, storage, access and the use of datasets by various users.

Martijn showed that there is always a “manual” with data, no matter how well datasets are described. Users are asked to be aware of the handling of datasets. Not everything can be compared one-to-one, because there may be differences in the data source or in the way data is processed.

Using examples, he made it clear that we often work with data quite easily and make assumptions in the process. A simple example is clicking away from a disclaimer without reading it, to go directly to the dataset.

This becomes even more relevant when an AI agent picks up data. Such an agent can assume that all data is fully described and clear, and then combine datasets that cannot simply be combined.

The invitation from the session was therefore: talk to the data supplier.

D-OSP: Multicloud Architecture and Captain Hindsight

D-OSP is about digital collaboration across organizational boundaries. The session covered topics such as trust, interoperability, shared infrastructure and the use of data without having to copy or move it all the time.

Floris Langeraert explained the findings from the desk study carried out. In addition, he presented the realization of parts of the architecture and the experiences gained in the process.

The realized infrastructure has been applied, also within the D-OSP project, in the use case Captain Hindsight. The purpose of this use case was to be able to reconstruct incidents and near misses in shipping afterwards. Fedor Baart demonstrated how the developed architecture has been used for this purpose and showed what additional applications are possible with this architecture. Read more in this interview with Fedor Baart.

With these results, the three objectives of D-OSP, the removal of the most important barriers to digital collaboration, have largely been achieved. The developed environment is still active and available. On request, participants of the session can be given temporary access to experience for themselves how the architecture works in practice.

Data lakes and Beacon

Peter Thijsse of MARIS spoke with those present about the question of what “large data files” actually are. This is not only in the size of files, for example in GBs or TBs, but also in the use. Do datasets have to cross the line before they can be used? Is an additional pipeline needed to transform the data? This all plays a role in the question of whether files are perceived as “large” in practice.

After that, MARIS shared experiences with dealing with large datasets. This included harmonization of data, metadata and formats, such as ZARR, Parquet and NetCDF. It also discussed how a smart data access service can help to better use and reuse large data files in addition to the data.

This was demonstrated with a use case with ERA5 data. In addition, a Beacon instance was placed next to this data, for use of very large datasets in Python notebooks within a C3S project for climate change attribution.

Seedmoney 2025: Project Updates

In the afternoon, the three seedmoney projects from 2025 gave an update on their progress.

PortVision

PortVision is working on visibility forecasts for safe shipping in the port of Rotterdam. The update explained how historical visibility measurements and KNMI meteorological data are used for an AI model that can predict visibility conditions in advance. Follow-up steps were also discussed, such as spatial vision predictions and further validation in practice.

KLIC at Sea

KLIC at Sea focuses on up-to-date, reliable and complete information about cables and pipelines at sea. The update discussed, among other things, differences between investigated databases, management, procedures and user needs. The progress towards a draft at the end of June was also mentioned.

Digital Dike

Digital Dike is investigating how a digital twin can contribute to dike management and the duty of care. The update discussed the plan of action, the translation of subsurface data into a 3D digital twin and the application in a proof of concept. The plan of action has been discussed with the project partners and is being finalized.

Download the Digital Dike project update (pdf)

Seedmoney 2026: New Ideas

During lunch, six ideas were pitched for the seedmoney call 2026. The pitch round offered space to share ideas, find possible partners and further refine proposals. The pitched ideas were:

The proposals will be further elaborated in the coming period and after the summer we will hear which of these projects will be awarded seed money for 2026. Are you interested in one of these ideas or would you like to think about a sequel? Please contact the contact person mentioned.

Gerelateerd nieuws

29 May 2026

On the Dutch waterways, data and models provide more and more insight into shipping movements, waiting times, energy consumption and [...]

27 May 2026

To keep the Netherlands safe, liveable and accessible in a time of climate change, the AI Impulse Programme water movement [...]

9 April 2026

If you work for a DigiShape partner and have an innovative idea for digitization of the water sector, this is [...]

9 April 2026

In the port of Rotterdam, poor visibility has a direct impact on activities. In limited visibility, for example due to [...]