Loading…
28-29, August 2025
Amsterdam, Netherlands
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for AI_dev Europe 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Central European Summer Time, CEST (UTC +2). To see the schedule in your preferred timezone, please select from the drop-down menu to the right.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Thursday August 28, 2025 15:40 - 16:05 CEST
Model startup latency is a persistent bottleneck for modern inference workloads, particularly when using custom kernels written in Triton that are Just In Time (JIT) compiled. In this talk, we’ll present a novel approach to speeding up model boot times by wrapping Triton kernel caches in OCI container images.
Speakers
avatar for Maryam Tahhan

Maryam Tahhan

Principal Software Engineer, Red Hat
Maryam is a Principal Engineer on the Emerging Tech team in the Office of the CTO at Red Hat. Her research is focused on Networking and Sustainability. She's contributed to and led several OpenSource projects. She has been working on AF_XDP and preparing it for cloud native use cases... Read More →
avatar for Alessandro Sangiorgi

Alessandro Sangiorgi

Software Engineer, Red Hat
Alessandro Sangiorgi is a Software Engineer in the Emerging Technologies Group within the Office of the CTO at Red Hat. He has extensive experience across Cloud, Distributed Systems, AI, and Networking products and technologies.
Thursday August 28, 2025 15:40 - 16:05 CEST
G001-002

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link