June 19-20, 2024
Paris, France
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for AI_dev Europe to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is displayed in CEST (Central European Summer Time), UTC/GMT +2 hours.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Wednesday, June 19 • 12:05 - 12:35
Hallucination-Free LLMs: Strategies for Monitoring and Mitigation - Wojtek Kuberski, NannyML


The talk will cover why and how to monitor LLMs deployed to production. We will focus on the state-of-the-art solutions for detecting hallucinations, split into two types:
1. Uncertainty Quantification
2. LLM self-evaluation

In the Uncertainty Quantification part, we will discuss algorithms to leverage token probabilities to estimate the quality of model responses. This includes simple accuracy estimation and more advanced methods for estimating Semantic Uncertainty or any classification metric.

In the LLM self-evaluation part, we will cover using (potentially the same) LLM to quantify the quality of the answer. We will also cover state-of-the-art algorithms such as SelfCheckGPT and LLM-eval.

You will build an intuitive understanding of the LLM monitoring methods, their strengths and weaknesses, and learn how to set up an LLM monitoring system easily.
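
As a rough illustration of the two families named in the abstract, the sketch below is not taken from the talk. It assumes you already have per-token log-probabilities for a response, plus a few resampled answers to the same prompt. The function names are hypothetical. The word-overlap score is a deliberately crude stand-in for the model-based scorers that sampling-consistency methods such as SelfCheckGPT rely on.

```python
import math
from typing import Sequence


def sequence_confidence(token_logprobs: Sequence[float]) -> float:
    """Length-normalised sequence probability: exp of the mean token log-prob.

    Values near 1.0 mean the model assigned high probability to every token;
    low values are a possible (not certain) sign of an unreliable answer.
    """
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(sum(token_logprobs) / len(token_logprobs))


def _words(text: str) -> set:
    # Lowercased whitespace tokens; a real system would use a stronger comparison.
    return set(text.lower().split())


def consistency_score(answer: str, samples: Sequence[str]) -> float:
    """Mean Jaccard word overlap between an answer and resampled answers.

    Assumption: answers that change a lot across samples are more likely
    to contain hallucinated content. Word overlap is only a placeholder metric.
    """
    if not samples:
        raise ValueError("need at least one resampled answer")
    answer_words = _words(answer)
    scores = []
    for sample in samples:
        sample_words = _words(sample)
        union = answer_words | sample_words
        scores.append(len(answer_words & sample_words) / len(union) if union else 1.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical log-probabilities, for illustration only.
    print(sequence_confidence([-0.05, -0.10, -0.02]))
    print(consistency_score(
        "Paris is the capital of France",
        ["The capital of France is Paris", "Paris is the capital of France"],
    ))
```

In a monitoring setup, scores like these would be logged per response and compared against a threshold chosen on labelled examples. Both the threshold and the choice of metric are assumptions here, not recommendations from the session.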

Speakers
Wojtek Kuberski

CTO and Co-Founder, NannyML
Wojtek Kuberski is an AI professional and entrepreneur with a master's in AI from KU Leuven. He founded Prophecy Labs, a consultancy specializing in machine learning, before assuming his current role as a co-founder and CTO of NannyML. NannyML is an OSS for ML monitoring and silent...


Wednesday June 19, 2024 12:05 - 12:35 CEST
Saint-Victor (Level 3)
  AI Quality & Security