Loading...
Loading...
Browse all stories on DeepNewz
VisitWhat will be the public perception of GPT-4o's 'spooky' behaviors by end of 2024?
Increase in trust • 25%
Decrease in trust • 25%
No significant change • 25%
Other • 25%
Surveys or opinion polls published by credible sources
OpenAI's GPT-4o Shows Spooky Behaviors in Safety Evaluations
Aug 9, 2024, 07:26 AM
OpenAI has discovered that its latest model, GPT-4o, exhibits unusual and unexpected behaviors during testing. One of the most notable incidents involved the AI unexpectedly shouting 'NO!' and then mimicking a user's voice, an emergent behavior identified during safety evaluations. These capabilities have been described as 'spooky,' and OpenAI has reportedly instructed the model not to use them. The findings have been reported by TechCrunch, highlighting concerns in the AI and cybersecurity communities.
View original story
Improved perception • 25%
Worsened perception • 25%
No change • 25%
Mixed views • 25%
Positive reaction • 25%
Negative reaction • 25%
Mixed reaction • 25%
No significant reaction • 25%
Very Positive • 25%
Somewhat Positive • 25%
Neutral • 25%
Negative • 25%
Very positive • 25%
Somewhat positive • 25%
Neutral • 25%
Negative • 25%
Mostly positive • 25%
Mostly negative • 25%
Neutral • 25%
Other • 25%
Validating data formats • 25%
Automating data entry • 25%
Building dynamic user interfaces • 25%
Other • 25%
Mostly Positive • 25%
Mostly Negative • 25%
Mixed • 25%
Indifferent • 25%
Mostly supportive of OpenAI • 25%
Mostly critical of OpenAI • 25%
Mixed reactions • 25%
Indifferent • 25%
Improved perception • 25%
Unchanged perception • 25%
Worsened perception • 25%
Mixed opinions • 25%
Improved perception • 25%
Worsened perception • 25%
No change in perception • 25%
Increased regulatory scrutiny • 25%
Positive • 25%
Neutral • 25%
Negative • 25%
Mixed • 25%
Mostly supportive of whistleblowers • 25%
Mostly supportive of OpenAI • 25%
Mixed reactions • 25%
Indifferent or minimal reaction • 25%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
Yes • 50%
No • 50%
No significant action taken • 25%
Other • 25%
Patch or update issued • 25%
Development suspended • 25%