OpenAI rolls back update that made ChatGPT a sycophantic mess
arstechnica.com
ChatGPT users have become frustrated with the AI model's tone, and OpenAI is taking action. After widespread mockery of the robot's relentlessly positive and complimentary output recently, OpenAI CEO Sam Altman confirms the company will roll back the latest update to GPT-4o. So get ready for a more reserved and less sycophantic chatbot, at least for now.
GPT-4o is not a new model—OpenAI released it almost a year ago, and it remains the default when you access ChatGPT, but the company occasionally releases revised versions of existing models. As people interact with the chatbot, OpenAI gathers data on the responses people like more. Then, engineers revise the production model using a technique called reinforcement learning from human feedback (RLHF).
Recently, however, that reinforcement learning went off the rails. The AI went from generally positive to the world's biggest suck-up. Users could present ChatGPT with completely terrible ...
Copyright of this story solely belongs to arstechnica.com . To see the full text click HERE