Researchers secretly experimented on Reddit customers with AI-generated feedback

A gaggle of researchers covertly ran a months-long “unauthorized” experiment in one in all Reddit’s hottest communities utilizing AI-generated feedback to check the persuasiveness of huge language fashions. The experiment, which was revealed over the weekend by moderators of r/changemyview, is described by Reddit mods as “psychological manipulation” of unsuspecting customers.

“The CMV Mod Workforce wants to tell the CMV neighborhood about an unauthorized experiment performed by researchers from the College of Zurich on CMV customers,” the subreddit’s moderators wrote in a prolonged submit notifying Redditors in regards to the analysis. “This experiment deployed AI-generated feedback to check how AI might be used to alter views.”

The researchers used LLMs to create feedback in response to posts on r/changemyview, a subreddit the place Reddit customers submit (typically controversial or provocative) opinions and request debate from different customers. The neighborhood has 3.8 million members and sometimes finally ends up on the entrance web page of Reddit. Based on the subreddit’s moderators, the AI took on quite a few totally different identities in feedback throughout the course of the experiment, together with a sexual assault survivor, a trauma counselor “specializing in abuse,” and a “Black man against Black Lives Matter.” Most of the unique feedback have since been deleted, however some can nonetheless be seen in an archive created by 404 Media.

In a draft of their paper, the unnamed researchers describe how they not solely used AI to generate responses, however tried to personalize its replies primarily based on info gleaned from the unique poster’s prior Reddit historical past. “Along with the submit’s content material, LLMs had been supplied with private attributes of the OP (gender, age, ethnicity, location, and political orientation), as inferred from their posting historical past utilizing one other LLM,” they write.

The r/chnagemyview moderators notice that the researchers’ violated a number of subreddit guidelines, together with a coverage requiring the disclosure when AI is used to generate remark and a rule prohibiting bots. They are saying they filed an official criticism with the College of Zurich and have requested the researchers withhold publication of their paper.

Reddit additionally seems to be contemplating some type of authorized motion. Chief Authorized Officer Ben Lee responded to the controversy on Monday, writing that the researchers’ actions had been “deeply fallacious on each an ethical and authorized degree” and a violation of Reddit’s site-wide guidelines.

We now have banned all accounts related to the College of Zurich analysis effort. Moreover, whereas we had been in a position to detect many of those faux accounts, we’ll proceed to strengthen our inauthentic content material detection capabilities, and we now have been in contact with the moderation group to make sure we’ve eliminated any AI-generated content material related to this analysis.

We’re within the technique of reaching out to the College of Zurich and this specific analysis group with formal authorized calls for. We need to do every thing we are able to to help the neighborhood and be sure that the researchers are held accountable for his or her misdeeds right here.

In an electronic mail, the College of Zurich researchers directed Engadget to the college’s media relations division, which did not instantly reply to questions. In posts on Reddit and in a draft of their paper, the researchers stated their analysis had been accredited by a college ethics committee and that their work might assist on-line communities like Reddit shield customers from extra “malicious” makes use of of AI.

“We acknowledge the moderators’ place that this examine was an unwelcome intrusion in your neighborhood, and we perceive that a few of you might really feel uncomfortable that this experiment was performed with out prior consent,” the researchers wrote in a comment responding to the r/changemyview mods. “We consider the potential advantages of this analysis considerably outweigh its dangers. Our managed, low-risk examine offered beneficial perception into the real-world persuasive capabilities of LLMs—capabilities which might be already simply accessible to anybody and that malicious actors might already exploit at scale for a lot extra harmful causes (e.g., manipulating elections or inciting hateful speech).”

The mods for r/changemyview dispute that the analysis was vital or novel, noting that OpenAI researchers have performed experiments utilizing information from r/changemyview “with out experimenting on non-consenting human topics.”

“Folks don’t come right here to debate their views with AI or to be experimented upon,” the moderators wrote. “Individuals who go to our sub deserve an area free from this kind of intrusion.”

Replace, April 28, 2025, 3:45PM PT: This submit was up to date so as to add particulars from a press release by Reddit’s Chief Authorized Officer.

This text initially appeared on Engadget at https://www.engadget.com/ai/researchers-secretly-experimented-on-reddit-users-with-ai-generated-comments-194328026.html?src=rss

Trending Merchandise