Evan Fossier & Joey Pinhas - How do you know your agent works?
This talk will explore the role of evaluations (evals) in AI agent development, from full-agent assessments to micro-evals. Drawing parallels to traditional software engineering, the talk will discuss why evals are essential for building reliable AI agents that actually solve customer needs and lessons learned from shipping agents at Datadog.
- Date
- Time
- Track
- Expo Stage
- Room
- TimesCenter Expo

