Loading…
Attending this event?
June 19-20, 2024
Paris, France
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for AI_dev Europe to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in CEST (Central European Summer Time) UTC/GMT +2 hours. To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Thursday, June 20 • 11:20 - 12:35
Workshop: Choosing the Best Open Source LLM for Your Application - Nikolai Liubimov, HumanSignal

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

The pace of innovation for open source LLMs is exciting—new models are being released and fine-tuned on a daily basis. But when it comes to selecting the right LLM to solve real-world problems, the hype and open leaderboards can be misleading. Before you make significant investments in LLM infrastructure, building new GenAI pipelines, and fine-tuning, we'll walk you through the process of evaluating and selecting the best LLMs for your use case. Topics covered in the technical workshop will include: - How the open leaderboards work today: popular benchmarks and methodologies for NLP model evaluation - Dimensions to evaluate LLMs, and which measures are most important based on your use case - Why auto evaluators are not enough, and ultimately human supervision based on ground truth data is the best indicator of quality - How to efficiently apply human supervision to LLM evaluation: open source toolchain and process - Different approaches to curating test data for deterministic vs generative AI Attendees will walk away with actionable steps, referenced reports, and open source tools to evaluate LLMs for their business applications.

Speakers
avatar for Nikolai Liubimov

Nikolai Liubimov

CTO & Co-Founder, HumanSignal
Nikolai is the co-founder and CTO of HumanSignal, and creator of Label Studio, the most popular OSS data labeling platform with 300K+ users globally. Based on his experience deploying ML models at scale for Yandex and Huawei, he believes data quality is not only essential to success... Read More →


Thursday June 20, 2024 11:20 - 12:35 CEST
Saint-Victor (Level 3)
  AI Quality & Security
Feedback form isn't open yet.