Inverse Rubric Optimization: A testbed for agent science
Inverse Rubric Optimizationagent scienceblack-box judgepoetry generationmachine learning
Author: etherio
Date: 6/11/2026
Article Summary:
This article presents a testbed for agent science called Inverse Rubric Optimization (IRO), where an agent learns the preferences of a black-box judge model by interacting with it and submitting generated outputs.