Loading…
CNCF-hosted Co-located Events Europe 2025 taking place on 1 April. This event is happening in person at Excel London in London, England.

The Sched app allows you to build your schedule, but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025, and have an All-Access pass in order to participate in the sessions.

To view the full event schedule for a specific CNCF-hosted Co-located event, you can use the right-hand navigation bar to sort and filter.

The schedule is subject to change.
Tuesday April 1, 2025 14:50 - 15:15 BST
As the demand for scalable machine learning (ML) workloads increases, efficient training in distributed environments has become crucial. This talk will delve into Kubeflow innovations that advance distributed training on Kubernetes with JAX and automate hyperparameter optimization for Large Language Models (LLMs).
JAX, known for high-performance large-scale computations, requires Kubernetes integration for efficient scaling. Additionally, hyperparameter optimization for LLMs has been manual and time-intensive, with existing tools lacking seamless Kubernetes integration.
To address these gaps, we extended Kubeflow to support distributed JAX workloads and developed a high-level API to automate LLM hyperparameter optimization. These advancements make complex, resource-intensive training more efficient. The speakers will highlight how these capabilities streamline end-to-end ML workloads, establishing Kubeflow as a powerful platform for modern AI development.
Speakers
avatar for Sandipan Panda

Sandipan Panda

Member of Technical Staff, DevZero
Sandipan enjoys collaborating with people on developing software. He is a Member of Kubernetes and Kubeflow and a CNCF Ambassador. Sandipan has been a Mentee at CNCF under the Linux Foundation Mentorship Program, where he worked on Cilium, and a Google Summer of Code Contributor at... Read More →
avatar for Hezhi Xie

Hezhi Xie

Contributor, Independent
Hezhi Xie is a master’s student in computer science at University of California, Davis, and an active contributor to the Kubeflow open-source project. During Google Summer of Code 2024, she developed a hyperparameter optimization API for Large Language Models (LLMs) in Kubeflow’s... Read More →
Tuesday April 1, 2025 14:50 - 15:15 BST
Level 1 | Hall Entrance N10 | Room E

Attendees (6)


Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link