Events & Webinars

Learn from experts, watch demos, and stay updated on the latest in AI quality and safety.

2025-06-03

Will Agent Evaluation via MCP Stabilize Agent Frameworks?

Discover how exposing complex AI evaluation frameworks to agents via MCP (Model Context Protocol) enables a new paradigm of controllable self-improvement.

Watch recording →

2025-03-13

Empowering AI with Cost-Effective LLM Judges

Explore the dual themes of agent evaluations and EvalOps in this comprehensive technical session on cost-effective LLM judges.

Watch recording →

2025-02-19

Agent Evals: Finally, With The Map

A comprehensive look at agent evaluation frameworks and methodologies, delivered at the AI Engineer Summit: Agents at Work!

Watch recording →

2025-02-06

10 Critical LLM Blunders - Detect and Fix with LLM Judges

Dive deep into the intricacies of LLM-based applications and learn to detect, block, and remedy the most common yet critical errors that undermine reliability.

Watch recording →

2024-12-04

TOP-10 Misconceptions about LLM Judges in Production

Debunking common misconceptions about implementing LLM judges in production environments, drawing on real-world experience.

Watch recording →

2024-11-07

EvalOps - Mastering The Game of LLM Judges

A comprehensive keynote on operational excellence in LLM evaluation and judgment systems.

Watch recording →

2024-09-18

Building Your Optimal LLM Evaluation Stack

Learn how to create a robust framework for evaluating and optimizing large language models, covering best practices, tools, and strategies for production reliability.

Watch recording →