• Post #2: Beyond Assertions: How Do You Unit Test an AI Agent?

    A Motivational Example: John built a travel-planning AI agent that worked flawlessly during development. Three weeks after release, users suddenly began reporting problems. John’s code hadn’t changed. The foundation model had, and because his tests only checked exact outputs, nothing caught the regression. This is the modern testing problem: you’re not just testing code anymore. You’re testing behavior. Why…