CNCF-hosted Co-located Events Europe 2025 takes place on 1 April. The event is held in person at Excel London in London, England. The Sched app lets you build your schedule, but it is not a substitute for event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025 and hold an All-Access pass in order to participate in the sessions.
ML workloads require repeated access to the same data during model training. In cloud environments this repeated access can be both slow and costly, which slows model training further and leaves GPU resources idle while they wait for data to load. As datasets and training workloads become larger and more sophisticated in the era of GenAI, efficient data access is crucial to improving training speed and efficiency. In this talk, we will discuss optimized data caching for ML workloads using Apache Iceberg, Apache Arrow Flight, and Kubernetes. We will demonstrate a distributed in-memory cache of an Iceberg table, spread across a fleet of Kubernetes pods, that is used to load data more efficiently into Kubeflow training workloads.
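The abstract does not describe how the cache is implemented, so the following is only a minimal sketch of the general idea, assuming a table's data files are assigned to cache pods by rendezvous hashing and each pod keeps the files it owns in memory. It uses only the Python standard library and does not call Iceberg, Arrow Flight, or Kubernetes. Every name in it (`owner_pod`, `CachePod`, `fake_object_store_read`, the pod names, and the file paths) is hypothetical and chosen for illustration.

```python
import hashlib


def owner_pod(file_path: str, pods: list[str]) -> str:
    """Pick the pod that owns a data file using rendezvous hashing.

    Each (pod, file) pair gets a hash score; the pod with the highest
    score owns the file. This is an assumed placement scheme, not one
    taken from the talk.
    """
    def score(pod: str) -> str:
        return hashlib.sha256(f"{pod}|{file_path}".encode()).hexdigest()

    return max(pods, key=score)


class CachePod:
    """Stand-in for one cache pod: holds file contents in memory."""

    def __init__(self, name, fetch_remote):
        self.name = name
        self._fetch_remote = fetch_remote  # slow path, e.g. object storage
        self._cache = {}
        self.remote_reads = 0

    def get(self, file_path):
        # Only the first request for a file reaches remote storage.
        if file_path not in self._cache:
            self.remote_reads += 1
            self._cache[file_path] = self._fetch_remote(file_path)
        return self._cache[file_path]


def fake_object_store_read(file_path):
    # Hypothetical placeholder for an expensive cloud storage read.
    return f"rows-from-{file_path}".encode()


pod_names = ["cache-0", "cache-1", "cache-2"]
pods = {name: CachePod(name, fake_object_store_read) for name in pod_names}

# Hypothetical data files belonging to one table snapshot.
data_files = [f"data/part-{i:05d}.parquet" for i in range(12)]


def run_epoch():
    """Read every data file once, routing each request to its owner pod."""
    return [pods[owner_pod(f, pod_names)].get(f) for f in data_files]


first_epoch = run_epoch()   # cold cache: every file is fetched remotely
second_epoch = run_epoch()  # warm cache: served entirely from memory

total_remote_reads = sum(p.remote_reads for p in pods.values())
print(f"remote reads after two epochs: {total_remote_reads}")
```

In this sketch the second epoch issues no remote reads, which is the effect the abstract attributes to caching: repeated passes over the same data stop paying the cloud storage cost. A real deployment would replace the in-process `get` call with a network request to the owning pod.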