artificial-intelligence – BeyondAssertions

Can Software Metrics Really Help Us Find Defects?

Teams shouldn’t ask “Will this module have a defect?”They should ask “Which modules are riskier than others?” Why This Analysis Exists? This analysis does not claim to perfectly predict defects. Instead, it evaluates whether common software metrics — lines of code (LOC), cyclomatic complexity (v(g)), essential complexity (ev(g)), design complexity (iv(g)), and effort — provide…

the_new_tester

February 10, 2026

Post #4: Observability 2.0 (Part 2): What Tracing Must Become for Agentic Systems

From Tracing Execution to Tracing Decisions In Part 1, we showed why traditional tracing breaks for agentic AI. Distributed tracing was built to explain pipelines.Agentic systems behave like decision graphs. Tracing today tells us where time went.Agentic systems require us to understand why decisions were made. That is the core shift behind Tracing 2.0. Why…

the_new_tester

January 20, 2026

Post #2: Beyond Assertions: How Do You Unit Test an AI Agent?

A Motivational Example John built a travel-planning AI agent that worked flawlessly during development.Three weeks after release, users suddenly reported: John’s code hadn’t changed.The foundation model had — and because his tests only checked exact outputs, nothing caught the regression. This is the modern testing problem: You’re not just testing code anymore.You’re testing behavior. Why…

the_new_tester

November 13, 2025