ChatGPT 5.4 in the Real World: Benchmarks + Live AMA
Thursday, March 12
1 PM ET
OpenAI just launched ChatGPT 5.4. Zapier got early access and we put it through a new eval and our own workflows and agents to see how well it performs.
Join Zapier’s Head of AI, Anna Marie Clifton, for a live AMA where we’ll:
– Break down how we benchmark models on real business workflows
– Explain what our “pass rate” actually measures
– Demo 5.4 inside real automations (Zaps, agents, and MCP)
– Answer your questions live
Bring your questions.




What you’ll learn and see, live:
– What the Zapier Benchmarking Framework actually measures:Not just “scores,” but how AI performs end-to-end across real business workflows, from CRM to email, scheduling, docs, and more.
– Why traditional benchmarks miss the real story:Token-level tests and synthetic scoring don’t capture workflow success. Learn why pass-rate on real tasks matters more than any single metric.
– How ChatGPT 5.4 compares inside real automations: See 5.4 perform inside Zaps, MCP-powered workflows, and Zapier Agents, and what that tells you about reliability and practical performance.
– Cost-aware design patterns that actually work:See how smart triggers and orchestration keep AI usage efficient (no expensive always-on polling) and reliable.
Can’t attend live?
Register today, and we'll send you a recording after the webinar ends.
Register now