-
LLM evals are the new unit tests — and most teams are skipping them
Running an agentic evals team at CEGID has made one thing clear: shipping LLM-powered features without evals is like shipping code without tests. Here's our approach.
-
Challenges in Taking AI Agents to Production
A look at the real engineering challenges of moving LLM-powered agents from demo to production.