Loading…
Attending this event?
June 19-20, 2024
Paris, France
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for AI_dev Europe to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in CEST (Central European Summer Time) UTC/GMT +2 hours. To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Thursday, June 20 • 14:15 - 14:45
ML Data Version Control and Reproducibility at Scale - Einat Orr, Treeverse

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Petabytes of unstructured data stand as the cornerstone upon which triumphant Machine Learning (ML) models are built.

One commonplace method for researchers to extract subsets of data to their local environments is by simply using the age-old copy-paste, for model training. This method allows for iterative experimentation, but it also introduces challenges with the efficiency of data management when developing machine learning models, including reproducibility constraints, inefficient data transfer, alongside limited compute power.

This is where data version control technologies can help overcome these challenges for computer vision researchers.

In this workshop we'll cover:
- How to use open source tooling to version control your data when working with data locally.
- Best practices for working with data, preventing the need to copy data locally, while enabling the training of models at scale directly on the cloud.

This will be demoed with an OSS stack:
- Langchain
-Tensorflow
- PyTorch
- Keras

You will come away with practical methods to improve your data management when developing and iterating upon Machine Learning models, built for modern computer vision research.

Speakers
EO

Einat Orr

lakeFS co-creator and Treeverse co-Founder, lakeFS; Treeverse
Einat Orr has 20+ years of experience building R&D organizations and leading the technology vision at multiple companies, the latest being Similarweb, that IPO in NYSE last May. Currently she serves as Co-founder and CEO of Treeverse, the company behind lakeFS, an open source platform... Read More →


Thursday June 20, 2024 14:15 - 14:45 CEST
Sorbonne Descartes & Lutèce (Level 5)
  AI Research & Methodologies
Feedback form isn't open yet.