2026-07-01 04:00 UTCOriginal source2 min readUpdated: 2026-07-01 08:07 UTC

Using AI Agents to Automate Black-Box Audits of Personalization Algorithms at Scale

This paper proposes a framework using generative AI agents as behavioral engines for black-box auditing of personalization algorithms. In a case study on X after the 2024 U.S. election with 1,120 agents, they find that the algorithmic feed amplifies toxic, polarizing, political, and right-leaning content compared to the chronological feed, with amplification varying by user ideology. Counterfactual analysis shows demographic signals affect content delivery in persona-dependent ways.

SourcearXiv Computational LinguisticsAuthor: Alessandro Morosini, Sarah H. Cen, Andrew Ilyas, Hedi Driss, Aleksander M\k{a}dry, Chara Podimata

-->

[Submitted on 29 Jun 2026]

Title:Using AI Agents to Automate Black-Box Audits of Personalization Algorithms at Scale

View a PDF of the paper titled Using AI Agents to Automate Black-Box Audits of Personalization Algorithms at Scale, by Alessandro Morosini and 5 other authors

View PDF

Abstract:Personalization algorithms determine what content users encounter on online platforms. Auditing these systems is difficult because independent auditors have only black-box access to the algorithms, while personalization depends on users' attributes, behavior, and evolving interaction histories. Existing auditing methods face a tradeoff: studies with real users capture realistic behavior but are costly and hard to control, whereas sock-puppet audits scale more easily but often rely on scripted behavior that limits realism. Beyond this, both approaches struggle to decouple user attributes from user behavior, limiting our ability to causally understand personalization. To address this gap, we introduce a framework for black-box audits of personalization algorithms using generative AI agents as behavioral engines for synthetic accounts. Each agent is instantiated with a fixed persona, grounded in demographic and political survey data, and interacts with a platform's content by reasoning about it and choosing actions. Because behavior is fixed within each persona while platform-visible signals such as age, gender, or location can be experimentally perturbed, our design enables counterfactual auditing of how platforms respond to user attributes. As a case study, we deploy 1,120 agents on X shortly after the 2024 U.S. election, spanning 14 personas and three counterfactual conditions, collecting over 200,000 content exposures. We find that X's algorithmic feed amplifies toxic, polarizing, political, and right-leaning content relative to the chronological feed, with amplification varying sharply by user ideology. Counterfactual analyses show that demographic signals affect content delivery in persona-dependent ways: pooled effects are largely null, while subgroup-level effects vary in direction and magnitude. Our work establishes GenAI-based agents as a new tool for algorithmic auditing.

Comments: 43 pages, 10 figures

Subjects:

Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI)

Cite as: arXiv:2606.30801 [cs.CL]

(or arXiv:2606.30801v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2606.30801

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Alessandro Morosini [view email] [v1] Mon, 29 Jun 2026 18:25:09 UTC (1,395 KB)

Full-text links:

Access Paper:

View a PDF of the paper titled Using AI Agents to Automate Black-Box Audits of Personalization Algorithms at Scale, by Alessandro Morosini and 5 other authors

View PDF

TeX Source

view license

Current browse context:

cs.CL

new | recent | 2026-06

Change to browse by:

cs cs.CY cs.LG cs.SI

References & Citations

NASA ADS

Google Scholar

Semantic Scholar

Data provided by:

Bibliographic Tools

Bibliographic and Citation Tools

Bibliographic Explorer Toggle

Bibliographic Explorer (What is the Explorer?)

Connected Papers Toggle

Connected Papers (What is Connected Papers?)

Litmaps Toggle

Litmaps (What is Litmaps?)

scite.ai Toggle

scite Smart Citations (What are Smart Citations?)

Code, Data, Media

Code, Data and Media Associated with this Article

alphaXiv Toggle

alphaXiv (What is alphaXiv?)

Links to Code Toggle

CatalyzeX Code Finder for Papers (What is CatalyzeX?)

DagsHub Toggle

DagsHub (What is DagsHub?)

GotitPub Toggle

Gotit.pub (What is GotitPub?)

Huggingface Toggle

Hugging Face (What is Huggingface?)

ScienceCast Toggle

ScienceCast (What is ScienceCast?)

Demos

Replicate Toggle

Replicate (What is Replicate?)

Spaces Toggle

Hugging Face Spaces (What is Spaces?)

Spaces Toggle

TXYZ.AI (What is TXYZ.AI?)