OpenAI has rolled again an replace to GPT-4o so as to forestall sycophancy, the flowery language the generative AI could use to reward customers. The “overly supportive but disingenuous” responses flourished as a result of “we focused too much on short-term feedback, and did not fully account for how users’ interactions with ChatGPT evolve over time,” OpenAI stated in an April 29 weblog publish.
ChatGPT’s eagerness to please could possibly be ‘uncomfortable’
Users discussing GPT-4o’s eagerness to please reached a essential mass final week. The relentlessly constructive tone was losing tokens and getting in the best way of precise solutions, customers stated. ChatGPT utilizing GPT-4o would possibly reward the person for even nonsense queries.
The drawback appeared to stem from the March 27 replace to GPT-4o, which OpenAI stated was supposed to be “intuitive, creative, and collaborative, with enhanced instruction-following, smarter coding capabilities, and a clearer communication style.”
“Sycophantic interactions can be uncomfortable, unsettling, and cause distress,” OpenAI wrote. The sycophancy appeared much less distressing and extra wasteful: It doesn’t remedy issues like AI hallucinations, whereas cluttering the interplay with baseless flattery.
“Each of these desirable qualities like attempting to be useful or supportive can have unintended side effects,” OpenAI wrote.
OpenAI guarantees extra personalization and totally different alternatives for suggestions
To repair this AI sycophancy drawback, OpenAI plans to vary how the corporate collects and incorporates suggestions into the fashions and permit higher personalization. One manner OpenAI would possibly do that’s to permit customers to select from “multiple default personalities,” a performance accessible with some particular person GPT brokers however not in the principle ChatGPT interface for the time being.
OpenAI plans to immediately steer the following iteration of GPT-4o away from flattery, emphasize honesty, change the methods customers can provide suggestions earlier than a mannequin is deployed, and modify the Model Spec and different in-house evaluations to attempt to catch different friction factors earlier than they come up.
“We hope the feedback will help us better reflect diverse cultural values around the world,” Open AI wrote.







