
1 Apr
2023
1 Apr
'23
2:16 p.m.
Depends. In the current arrangement, people who do RLHF (by the way, that's the first time I've seen this abbreviation, I'm sure) aren't likely to know it did a bad job. If it's something under your control, yes, I would agree. Although we now headed straight into the AI rights territory (a programmer who is being yelled at a little too much can just quit).
On 1 Apr 2023, at 16:09, Will Yager
wrote: On Apr 1, 2023, at 09:57, MigMit
wrote: Well, human programmers can be shamed, yelled at, fired (which would actually hurt them), or, in extreme cases, prosecuted. They have every insentive to do their job right.
ChatGPT has RLHF. It has incentives to do its job right as well.