How AI judges score a solution