📐 AI Evaluation Beyond Metrics