June 25-27 2024 • San Francisco

The Biggest Technical AI conference in San Francisco

Where the leading companies, founders, VPs of AI & AI Engineers meet. Featuring an Expo floor showcasing 30+ companies pioneering the AI Engineering landscape, and 100+ speakers delivering talks and workshops across 9 tracks.

1500+ Founders, VPs of AI & AI Engineers
Talks from Top Engineers & Founders
Workshops & Expo Sessions
Top AI Companies in Expo

From the team behind the AI Engineer Summit

The AI Engineer World's Fair is the event to discover what's now and what's next.

Talks from Engineers Who Ship

No theoretical promissory hoopla. Just engineers and founders on the cutting edge of AI Engineering, sharing their knowledge.

Cutting-edge Expo

AI Engineering moves fast. Meet the engineers & founders behind the companies who are innovating at the edge of what's possible -- and building a better future.

Facilitated Discussions with Inspiring Peers

The hallway track is buzzing with AI Engineers & founders. Every conversation you have -- facilitated with moderators or open -- is sure to educate & inspire.

In-depth Workshops

We've curated over 20 workshops for you to choose from, serving everyone from experienced engineers just starting with AI Engineering to experienced AI engineers looking to get an edge on the competition.

Algorithmic Networking

Get matched with people we (er, our algo) thinks will provide you with value. Refine your results by adding more information to your profile.

The Nerdiest Fun You Can Have

Stimulation for your brain, inspiration for your soul, education for your business. And excellent food to power you through the excitement!

World Class


With 9 tracks and over 100 sessions, you can design the program schedule that perfectly matches your business needs. There are up to 5 simultaneous sessions running at any one time, so it's a good thing we have a GROUP discount for teams attending together!

  • Chris Lattner
  • Justine Tunney
  • Thomas Dohmke
  • Michelle Pokrass
  • Shawn Wang
  • Lin Qiao
  • Scott Wu
  • Danielle Perszyk
  • Dylan Patel
  • Daniel Han
  • Logan Kilpatrick
  • Jerry Liu
  • Jason Liu
  • Devendra Chaplot
  • Sandra Kublik
  • Vivek Muppalla
  • Cheng Lou
  • Lukas Biewald, Weights & Biases
  • Alex Albert
  • Harrison Chase
  • Shelby Heinecke
  • Noah Shpak
  • Ben Hylak, Dawn Analytics
  • Karan Goel
  • Kathleen Kenealy
  • Atty Eleti
  • Aman Sanger, Anysphere (Cursor)
  • Beyang Liu
  • Antje Barth
  • Heejin Jeong
  • Shreya Shankar, UC Berkeley
  • Aparna Dhinkaran
  • Nabila Babar
  • Ahmed Menshawy
  • Ian Webster
  • Sunny Madra
  • Nikhil Thota
  • Mark Moyou
  • Shawn Jansepar, Khan Academy
  • Ankur Goyal
  • Phoebe Klett, Normal Computing
  • Emil Eifrem
  • Atita Arora
  • Dr Bryan Bischof
  • Stephen Hood
  • Scott Stephenson
  • Daniel Whitenack, Prediction Guard
  • Thierry Moreau
  • Dominik Kundel
  • Arjun Bansal
  • Eno Reyes
  • Eugene Yan
  • Jamie Turner
  • Fryderyk Wiatrowski, Zeta Labs
  • Greg Baugues, HaiHai Labs
  • Gunjan Patel, Palo Alto Networks
  • Hamel Husain, Parlance Labs
  • Hubert Misztela
  • Ishan Anand
  • Jeronim Morina
  • Deanna Emery
  • João Moura
  • Kyle Corbitt
  • Kevin Van Gundy
  • Maxime Labonne, Liquid AI
  • Morgante Pell
  • Pankaj Gupta
  • Patrick Debois
  • Paul Henry
  • Philip Kiely
  • Phlo Young
  • Benjamin Stein
  • Charles Frye, Modal Labs
  • Rémi Louf, .txt (Outlines)
  • Emil Sedgh
  • Sumit Agarwal
  • Damien Murphy
  • Tanmai Gopal
  • Tom Redman
  • Sheila Gulati, Tola Capital
  • Vasek Mlejnsky
  • Vibhor Kumar
  • Manuel Odendahl, The Tree Center
  • Santosh Radha, Agnostiq (Covalent)
  • Chang She
  • Leo Pekelis
  • Sonia Dogra, Capital One
  • Alison Cossette
  • Peter Albert, Zeta Labs
  • Ben Perlmutter
  • Ivan Leo, 567 Labs (Instructor)
  • Quinn Slack
  • Nicolai Baldin
  • Prakul Agarwal
  • Ado Kukic
  • Alex Volkov, Weights & Biases
  • Trey Doig, Echo AI
  • Sean Hughes, ServiceNow Research
  • Apoorva Joshi
  • Fabian Valle
  • Zach Blumenfeld
  • Olmo Maldonado
  • Ben Flast
  • Pedro Torruella
  • Raza Habib
  • Benjamin Fletcher
  • Alex Malebranche
  • Dave Burnison
  • Chris Rec
  • Karthik Suresh
  • Gabriel Paunescu
  • Gabriela de Quieroz
  • Aishwarya Srinivasan
  • Cedric Vidal
  • David Smith
  • Banjo Obayomi
  • Dr. Sarah Buchner, Trunk Tools
  • Rob Cheung
  • Dimitrios Philliou
  • Ray Thai
  • Benjamin Dunphy, Software 3.0, LLC

9 Tracks of Content

Our content tracks span the keynote stage, breakout stages, small group discussion sessions, expo sponsor presentations, and in-depth workshops. With so many tracks to choose from, you can design the perfect curriculum for yourself and your engineers.

New: Our full talk schedule is now published here.

June 27: At last year's Summit, Logan Kilpatrick declared this the “Year of Multimodal AI”. Every frontier model now can consume and generate images, audio, video, code, and all other modalities in between. This opens up a brave new world of possibilities for putting even more general intelligence to work - we'll gather the best of 2024 to show off these capabilities!

June 27: Almost everybody is GPU poor, but you can make a -lot- out of the GPUs you can get. Optimize your inference and training costs and maximize your throughput! Hear from Groq, PredictionGuard and more.


There are up to 6 simultaneous things at any one time in this multitrack conference! The first conference day runs the CodeGen, Open Models, RAG, and Fortune 500 tracks concurrently, whereas the second has Multimodality, GPUs, Evals, and Agents tracks.

You can walk the World's Fair Expo (ft. 30+ booths across the AI Engineering landscape) on any of the 3 days, while Keynote and AI Leadership sessions span the 2 conference days. There are also plenty of breaks for the most important track of all: the "hallway track"!

New: Our full talk schedule is now published here (better for mobile readers).





RAG & LLM Frameworks

Evals & LLM Ops

Open Models

CodeGen & Dev Tools

GPUs & Inference


AI in the Fortune 500

AI Leadership

Note: sessions presented here are INCOMPLETE! Expo Sessions from Gold and above sponsors are TBA. Our team is hard at work uploading and confirming sessions every day. Our Speaker list is most representative of who is scheduled to speak, but not when.


We've assembled a team of top engineers who build with the technologies they're teaching every day. From core contributors, maintainers, and founders of the top AI Engineering tools & infra, you'll learn in hours what they've mastered over years. Most workshops happen on June 25th.

Turn Your Idea into an AI Application in Minutes: Quick Start with AI Templates

Gabriela de Quieroz, Director of AI - Microsoft for Startups, Microsoft

Aishwarya Srinivasan, Senior AI Advisor, Microsoft

Workshops: Building and deploying generative AI solutions can be challenging and time-consuming, especially for startups with limited resources and expertise. In this workshop, you will learn how to use AI templates and GitHub to quickly prototype and deploy generative AI applications...


Build, Evaluate and Deploy a RAG-based retail copilot with Azure AI

David Smith, Principal Cloud Advocate, Microsoft

Cedric Vidal, Principal AI Advocate, Microsoft

Workshops: Building generative AI applications for production requires a paradigm shift to LLM Ops, with new tools, platforms and processes for orchestrating end-to-end development workflows. In this session, you’ll learn to build, evaluate and deploy an enterprise copilot application end-to-end, using...


Building with Generative AI on AWS: A Hands-On Starter

Banjo Obayomi, Senior Developer Advocate, AWS

Workshops: Learn to build generative AI applications on AWS using PartyRock and Amazon Bedrock. You will gain skills in prompt engineering and using the Bedrock API. We will also explore how to 'chat with your documents' through knowledge bases, retrieval augmented...


Building Agents with a Fully Open AI Stack

Apoorva Joshi, Sr. AI Developer Advocate, MongoDB

Ben Perlmutter, Sr. Engineer, Chatbot Framework, MongoDB

Open Models: In this 2 hour workshop, we will build an AI research agent that can search for research papers, summarize them, and answer questions on topics based on past research. We will use MongoDB as the agent's memory provider and knowledge...


Low Level Technicals of LLMs

Daniel Han, CEO, Unsloth

Open Models: This workshop will be split into 3x one hour blocks: 1. How to analyze & fix LLMs - how to find and fix bugs in Gemma, Phi-3, Llama & tokenizers 2. Finetuning with Unsloth - continued pretraining, reward modelling, QLoRA & more 3....


Systematically improving your RAG

Jason Liu, Consultant (Instructor), Independent

Ivan Leo, Research Engineer, 567 Labs (Instructor)

RAG & LLM Frameworks: Evaluating and scaling Retrieval Augmented Generation (RAG) systems can be challenging due to the complexity and numerous components involved. This workshop aims to equip participants with a range of practical skills and techniques to systematically improve their RAG systems. We'll...


Convex workshop TBA

Tom Redman, Head of DX, Convex

RAG & LLM Frameworks: Details TBA


LLMs for the working programmer. Become a 10x programming centaur today!

Manuel Odendahl, Principal Engineer, The Tree Center

Agents: In this hands-on workshop, learn how Large Language Models (LLMs) can significantly improve your productivity as a software developer. Drawing from three years of experience using LLMs in every aspect of his work as a principal engineer, the presenter will...


How to add secure code interpreting in your AI app

Vasek Mlejnsky, CEO, E2B

CodeGen & Dev Tools: In this workshop, I'll show you how to add secure AI code execution that supports any LLM in your AI app using E2B. AI-powered code execution improves reasoning of LLMs and allows you to build AI-based dashboards where users can...


GitHub Copilot - The World’s Most Widely Adopted AI Developer Tool

Dave Burnison, Senior Developer Advocate, GitHub

Alex Malebranche, Global Engagement Lead, GitHub

Dimitrios Philliou, Product Manager, GitHub

CodeGen & Dev Tools: GitHub Copilot was introduced as “Your AI pair programmer” in January of 2021. Since then, we have been adding more and more capabilities to increase developer productivity and happiness. We’ve gone from simply leveraging AI to generate code to explaining...


Building & Scaling an AI Agent Swarm of low latency real time voice bots!

Damien Murphy, Senior Applied Engineer, Deepgram

Multimodality: AI Agents are becoming more powerful at a rapid pace! In this talk you will learn best practices and considerations to think about when building your AI Agent Swarm. All aspects from low latency Speech to Text, Large Language Model...


AI Music Generation: From Prompt to Production

Phlo Young, Rapper, Independent

Multimodality: The rise of AI music generation tools has opened up a world of creative possibilities for musicians and non-musicians alike. This workshop will demystify these tools, providing a hands-on introduction to their capabilities and potential. Participants will learn how to...


From model weights to API endpoint with TensorRT-LLM

Philip Kiely, Head of Developer Relations, Baseten

Pankaj Gupta, Co-Founder, Baseten

GPUs & Inference: TensorRT-LLM is the highest-performance model serving framework, but it can have a steep learning curve when you’re just getting started. We run TensorRT and TensorRT-LLM in production and have seen both the incredible performance gains it offers and the hurdles...


LLM Quality Optimization Bootcamp

Thierry Moreau, Co-Founder, OctoAI

Pedro Torruella, Staff DevRel Engineer, OctoAI

GPUs & Inference: Want to unlock the full potential of your LLM applications? In this session you'll learn essential LLM quality optimization techniques for custom behavior, reliability and accuracy. We'll pack a ton into the hour, including: Level 1: Learn prompt engineering to master...


The AI Engineer Expo

From foundation models to domain-specific products & services, the Expo is your chance to meet some of the best engineers in our industry, working on the most cutting-edge technologies that empower product teams and developers like you.

Microsoft, AWS, MongoDB, Google Cloud, neo4j, Sourcegraph, DataStax, Galileo, Neptune.ai, Lambda, OctoAI, Covalent, Hasura, Crusoe, Groq, Fireworks, Deepgram, Weights and Biases, Twilio, Airtrain, Trunk Tools, Substrate Black, Cleric, Elastic, MoveAI, Writer, Databricks, FriendliAI, Vespa, Matillion, Snyk, OpenPipe, Emergence
In numbers, there's probably going to be significantly more AI Engineers than there are ML engineers / LLM engineers. One can be quite successful in this role without ever training anything.

Andrej Karpathy, Formerly OpenAI, Tesla

Program Overview

We've designed & curated a program to provide you with maximum value, intrigue, and fun. Here's the high level overview of the main content:

Tue, June 25

Workshop Day + Evening Expo & Reception

Workshops - Exclusive to “Conference + Workshop Pass” ticketholders, choose from up to 5 different workshops available concurrently across various subjects and skill levels. Instructed by companies, founders, & engineers who are pushing the boundaries of AI Engineering.

With that much choice across subjects and skill levels, you're sure to find content that will level up your skills, career, and business.

Welcome Reception - Open to all ticketholders, the evening welcome reception takes place from 4:00 - 7:00pm in the Grand Assembly & the Expo. Mingle with other attendees & sponsors over food and drinks, and take in some sessions from our top sponsors at their booths and in the expo session salons.

June 26 - 27

Session Days

A full day of talks across 9 tracks, bookended with inspiring and revealing keynotes from the biggest and most consequential companies, founders, & engineers in the industry. Stay for the Pinecone Afterparty on June 26!

The most exciting and innovative expo of the year continues all day, with additional technical breakout sessions from our Gold and above sponsors.

Conference + Workshop Pass ticketholders also receive access to additional workshops on these days.

June 26 - 27

Leadership Track

Exclusive track for VPs & Execs. Purchase the VP Pass to get access to the exclusive leadership track, where you'll learn from & connect with highly experienced technical business leaders, along with facilitated sessions to share your knowledge in small group topical discussions with other engineering and business leaders.

Leadership Track Perks. This 300-person track comes with exclusive access to the VIP welcome reception on June 25th with speakers in the stunning View Lounge on top of the Marquis. Continue to enjoy 360-degree views of the city at the View Lounge as our buyout extends to June 26 & 27 for all-day networking, facilitated discussions, & a premium sit-down lunch.

June 24 - 28

Pre-party, Hackathon and Other SF AI Events

Come for the World's Fair, stay for the SF AI scene!

With much of the AI world flying in to SF, there are lots of meetings and side events that will be organized in the surrounding days. We recommend flying in for the whole week from Monday through Friday so as not to miss out.

If you are organizing an event around the week of June 24-28, please get in touch with us to list on our calendar for attendees to find you!

There are roughly two orders of magnitude more software engineers than there are machine learning engineers. By building good tools, we think it is possible for AI Engineers to use machine learning in the same way they can use normal software.

Ben Firshman, Replicate

Venue & Hotel

The Marriott Marquis


One of the largest hotels in the city, and conveniently located downtown near public transportation and plenty of cafes, restaurants, bars, and sights, the Marriott Marquis is an ideal choice to host the AI Engineer World's Fair.

The hotel's Yerba Buena Ballroom is the largest pillarless ballroom west of Las Vegas, and will serve as a centralized, comfortable location for keynotes, breakout sessions, the expo, networking, and food. One floor above is the Golden Gate Ballroom, serving as a dedicated space for workshops & breakout talks. Plenty of other salons and breakout rooms will host Expo Sessions and provide additional meeting space.


View Lounge: VP Pass Exclusive

Attendees who purchase the VP pass get exclusive access to the leadership track, which utilizes the top-floor View Lounge as a VIP welcome reception on June 25th in addition to daytime facilitated discussions & open networking on the 26th and 27th.


Hotel Rooms

We have a limited number of hotel rooms at the Marquis available for a negotiated rate of $299. We also have a limited number of overflow hotel rooms at Hotel Nikko, a 7-minute walk away, for $229.

Book your hotel room today:


We've carefully curated a sponsor expo & non-exhibiting sponsors who are relevant, interesting, and pushing the boundaries of the AI Engineering ecosystem. These are the companies that are building & innovating with AI — from Devtools & Infra to Vector DBs & Open Models. Learn more about each of the companies by clicking on their logo below, and meet & discuss with their founders & engineering representatives at the summit to learn how they can help you take your company, product, and internal processes to unparalleled heights.

Presenting Sponsor
In this new era of AI, Microsoft is helping organizations unlock AI innovation in every business, in every app, for everyone. Microsoft Azure delivers AI services and supercomputing infrastructure purpose-built to meet the compute-intensive needs of AI and high-performance workloads. Our full stack AI solution meets the most stringent security, reliability and compliance requirements while providing exceptional performance, flexibility and cost efficiencies. Choose Microsoft Azure for your groundbreaking AI applications.
Innovation Partner
Scale the next wave of innovation in AI by leveraging more than 25 years of pioneering AI experience from Amazon. AWS makes AI accessible to more people - from builders and data scientists to business analysts and students. With the most comprehensive set of AI services, tools, and resources, AWS brings deep expertise to over 100,000 customers to meet the demands of their business and unlock the value of their data. Security, privacy, and responsible AI have never been more critical. Customers can build and scale with AWS on a foundation of privacy, end-to-end security, and AI governance to transform at an unprecedented rate.
Platinum Sponsors
MongoDB, Google Cloud, neo4j
Gold Sponsors
Silver Sponsors
Deepgram, Weights and Biases, Twilio, Airtrain, Trunk Tools, Substrate Black, Cleric, Elastic, MoveAI, Writer, Databricks, FriendliAI, Vespa, Matillion, Snyk, OpenPipe
Hypermode, Convex, Chroma, Pinecone, BotDojo, Plumb, Gentrace, Planby, Ionic, Log10, Mozilla, Arize, Baseten, Khanmigo, Modular, Qdrant, Quotient AI, Zapier, Zeta Labs, Character AI, EyeLevel

Buy Tickets

We have now sold out of Early Bird tickets; General Admission sold out last year with a week to go!


* Expo sessions include talks, workshops, and facilitated discussions led by expo partners and organizer-curated speakers in the Expo Arena breakout rooms

Watch the livestream

Subscribe to our newsletter and get a free remote ticket to the main stage livestream. Livestream does NOT include 3 out of 4 tracks, most of the breakout, expo, and workshop sessions, but will include keynotes.

Email list also gets first notification for all future events!