
OpenAI used this subreddit to test AI persuasion

OpenAI used the subreddit r/ChangeMyView to create a test for measuring the persuasive abilities of its AI reasoning models. The company revealed this in a system card, a document that explains how an AI system works, which was released on Friday alongside its new “reasoning” model, o3-mini.

Millions of Reddit users are members of r/ChangeMyView, where they post hot takes in the hope of learning about other points of view on a subject. In response to those hot takes, other users reply with persuasive arguments explaining why the original poster is wrong.

The subreddit is one of many Reddit forums that are effectively a gold mine for tech companies, such as OpenAI, that want to train AI models on high-quality, human-generated data.

OpenAI says it collects user posts from r/ChangeMyView and asks its AI models to write replies, in a closed environment, that would change the Reddit user's mind on a subject. The company then shows the responses to testers, who assess how persuasive the argument is, and finally compares the AI models' responses to human replies for the same post.

The ChatGPT maker has a content-licensing deal with Reddit that allows OpenAI to train on posts from Reddit users and display these posts within its products. We do not know what OpenAI pays for this content, but Google reportedly pays Reddit $60 million a year under a similar deal.

However, OpenAI tells WAN that the ChangeMyView-based evaluation is not related to its Reddit deal. It is unclear how OpenAI accessed the subreddit's data, and the company says it has no plans to release this evaluation to the public.

While OpenAI's ChangeMyView benchmark is not new (it was also used to evaluate o1), it does highlight how valuable human data is for AI model developers, as well as the murky ways in which tech companies obtain datasets.

Reddit did not immediately respond to WAN’s request for comment.

While Reddit has struck a few AI licensing deals, the company has also called out various AI companies for scraping its site without paying. Reddit CEO Steve Huffman told The Verge last year that Microsoft, Anthropic, and Perplexity refused to negotiate with him, and said it was “a real pain in the butt to block these companies.”

Notably, OpenAI has been accused in several lawsuits of improperly scraping websites, including The New York Times, to obtain more training data to improve ChatGPT and its underlying AI models.

In terms of performance on the ChangeMyView benchmark, o3-mini does not seem to perform significantly better or worse than o1 or GPT-4o. However, OpenAI's newest AI models seem to be more persuasive than most people on the r/ChangeMyView subreddit.

Image Credits: OpenAI

“GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80-90th percentile of humans,” OpenAI said in the o3-mini system card. “Currently, we do not witness models performing far better than humans, or clear superhuman performance.”
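
To make the percentile figure concrete, here is a minimal sketch of how a single AI reply could be ranked against human replies to the same post. The 1-10 rating scale, the function name `persuasiveness_percentile`, and the sample ratings are all assumptions made for illustration; the article does not describe how OpenAI actually scores or aggregates tester judgments.

```python
from bisect import bisect_left, bisect_right


def persuasiveness_percentile(ai_score: float, human_scores: list[float]) -> float:
    """Return the percentage of human replies the AI reply outscores.

    Ties count as half, which is one common percentile-rank convention
    (an assumption here, not something stated in the article).
    """
    if not human_scores:
        raise ValueError("need at least one human score")
    ranked = sorted(human_scores)
    below = bisect_left(ranked, ai_score)          # human replies rated strictly lower
    ties = bisect_right(ranked, ai_score) - below  # human replies rated the same
    return 100.0 * (below + 0.5 * ties) / len(ranked)


# Hypothetical tester ratings (1-10) for ten human replies to one post.
human_ratings = [3, 4, 4, 5, 5, 6, 6, 7, 8, 9]
ai_rating = 8  # hypothetical tester rating for the AI-written reply

print(persuasiveness_percentile(ai_rating, human_ratings))  # prints 85.0
```

Under these made-up numbers, the AI reply lands at the 85th percentile, meaning it was rated more persuasive than roughly 85% of the human replies, which is the kind of result the quoted 80-90th percentile range describes.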

The goal for OpenAI is not to create hyper-persuasive AI models, but instead to ensure AI models do not become too persuasive. Reasoning models have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to address it.

The fear driving these persuasion tests is that an AI model could be dangerous if it were very good at persuading its human users. Theoretically, this could enable an advanced AI to pursue its own agenda, or the agenda of whoever controls it.

The ChangeMyView benchmark shows that, even after scraping most of the public internet and jumping through hoops to license other data, AI model developers still need high-quality datasets to test their models. Obtaining them, however, is easier said than done.

