A human designer doubtless won’t be capable to capture all of those considerations in a reward function on their first attempt, and, even in the event that they did manage to have an entire set of considerations in thoughts, it might be fairly troublesome to translate these conceptual preferences