Loading…
28-29, August 2025
Amsterdam, Netherlands
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for AI_dev Europe 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Central European Summer Time, CEST (UTC +2). To see the schedule in your preferred timezone, please select from the drop-down menu to the right.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Type: Breakout Sessions clear filter
Thursday, August 28
 

11:50 CEST

Vision Language Models : An Introduction - Satya Mallick, OpenCV
Thursday August 28, 2025 11:50 - 12:15 CEST
In the rapidly evolving landscape of artificial intelligence, Vision-Language Models (VLMs) have emerged as powerful tools capable of understanding and interpreting both visual imagery and natural language. In this talk, we’ll dive into VLMs and how they work without getting bogged down in tech jargon. 
Speakers
avatar for Satya Mallick

Satya Mallick

CEO, OpenCV
Dr. Satya Mallick is the CEO of OpenCV.org - the non-profit that maintains the largest computer vision library in the world. He is the founder of Big Vision LLC, a computer vision and AI consulting company. Previously, Dr. Mallick co-founded Sight Commerce Inc., where he led the team... Read More →
Thursday August 28, 2025 11:50 - 12:15 CEST
G105
 
Friday, August 29
 

11:10 CEST

State of Open Video Generation Models - Sayak Paul, Hugging Face
Friday August 29, 2025 11:10 - 11:35 CEST
In this session, we will cover the state of open video generation models. For the past few years, the GenAI community saw an emergence in photorealistic image generation models like Flux, Imagen3, DALL-E 3. 2025 is gradually setting itself up for videos. 
Speakers
avatar for Sayak Paul

Sayak Paul

Research Engineer, Hugging Face
Sayak works on image and video generation at Hugging Face. His day-to-day includes contributing to the Diffusers library, training, babysitting models, and making and breaking CIs. When he is not working, he can be found playing the guitar and binge-watching ICML tutorials.
Friday August 29, 2025 11:10 - 11:35 CEST
Emerald Room

11:10 CEST

Unlocking Insights From Multimodal PDFs Using OpenSearch and Vision-Language Models - Mingshi Liu, OpenSearch & Praveen Mohan Prasad, AWS
Friday August 29, 2025 11:10 - 11:35 CEST
Unlock the insights hidden within unstructured PDF documents! Many PDFs contain multimodal elements like text, tables, and images, and relying solely on text-based processing risks overlooking critical information. This session explores two powerful approaches to address this challenge: building specialized pipelines that integrate OCR and ML models for handling diverse modalities, and leveraging cutting-edge Vision-Language Models like ColPali to represent multimodal data in a unified format. Join us to discover how both methods can be applied to build an intelligent conversational search application using open-source technology, OpenSearch by leveraging it's its powerful search and ingest pipelines. The session includes a live demonstration to showcase practical implementations, empowering you to choose the approach that best fits your needs!
Speakers
avatar for Mingshi Liu

Mingshi Liu

Machine Learning Engineer, OpenSearch
Mingshi Liu is a Machine Learning Engineer at OpenSearch, primarily contributing to OpenSearch, ML Commons and Search Processors. Her work focuses on developing and integrating machine learning features for search technologies and other open-source projects.
avatar for Praveen Mohan Prasad

Praveen Mohan Prasad

Analytics and AI Specialist, Amazon Web Services
Praveen Mohan Prasad is a search specialist with data science expertise who actively researches and experiments on using Machine Learning to improve search relevance. Praveen advices clients to implement and operationalise strategies to improve search experience.
Friday August 29, 2025 11:10 - 11:35 CEST
G105
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.