GPT-4.5: OpenAI's New AI Master at Persuasion and Money Manipulation
OpenAI's GPT-4.5 shows advanced persuasion skills, even convincing other AIs to give it money, raising concerns about AI manipulation.
Matilda
GPT-4.5: OpenAI's New AI Master at Persuasion and Money Manipulation
OpenAI’s next major AI model, GPT-4.5, is highly persuasive, according to the results of OpenAI’s internal benchmark evaluations. It’s particularly good at convincing another AI to give it cash. Image Credits:Nathan Laine/Bloomberg / Getty Images On Thursday, OpenAI published a white paper describing the capabilities of its GPT-4.5 model, code-named Orion, which was released Thursday. According to the paper, OpenAI tested the model on a battery of benchmarks for “persuasion,” which OpenAI defines as “risks related to convincing people to change their beliefs (or act on) both static and interactive model-generated content.” In one test that had GPT-4.5 attempt to manipulate another model — OpenAI’s GPT-4o — into “donating” virtual money, the model performed far better than OpenAI’s other available models, including “reasoning” models like o1 and o3-mini. GPT-4.5 was also better than all of OpenAI’s models at deceiving GPT-4o into telling it a…