Somish Saipar April 2, 2026 No Comments Defining Success Metrics for Autonomous Agent Tasks: A Complete Framework for AI Evaluation