As more news content is generated by AI, language becomes homogenized and less innovative. Research shows that AI training on synthetic text leads to 'model collapse', reducing linguistic diversity. This not only affects journalism but also impairs society's ability to describe and discuss reality.
AI-generated text makes journalistic language more repetitive and predictable, reducing vocabulary and expression richness.
The feedback loop of AI self-training causes 'model collapse', reinforcing biases and diminishing linguistic diversity.
This month's digest covers foundation models for conservation, AI for resource allocation, color metaphors and LLMs, and cutting-edge robots from ICRA.
Interview with AAAI Fellow Tanya Berger-Wolf on a foundation model for biology
Interview with AAAI Fellow Sanmay Das on multiagent systems and scarce resource allocation
The AAAI Future of AI Research report, published March 2025, covers 17 AI topics. The fifth panel discussion focuses on AI agents, exploring the evolution from rule-based to generative AI multi-agent systems, and challenges in alignment and governance.
The AAAI Future of AI Research report was published in March 2025, led by outgoing President Francesca Rossi.
The fifth panel discussion, moderated by Rossi, examined AI agents' evolution, opportunities, and challenges.
In this interview, AAAI Fellow Tanya Berger-Wolf discusses her pioneering work at the intersection of AI and ecology, including the development of the BioCLIP foundation model for the Tree of Life, its applications in biodiversity monitoring and conservation, and future directions for AI in science.
Tanya Berger-Wolf is a professor leading the Imageomics Institute, applying AI to ecology and conservation.
Her team developed BioCLIP, a foundation model for the Tree of Life that can classify species and discover new traits.
We sat down with Douglas Guillbault to discuss his paper, “Comparing Colorseeing, Colorblind, Painters, and Large Language Models in Their Processing of Color Metaphors”. The results have interesting implications for how we model human cognition, and in turn, how the concept of synaesthesia could be integrated to develop more intelligent AI models. What are color metaphors? A color metaphor is the use of color to describe something in a way that is not immediately literal. For example, to say “green with envy” would be a color metaphor, because envy doesn’t have an immediate visual structure to it – we’re evoking a broader, more flexible notion of what green conveys, beyond just its visible properties.
Color metaphors provide a controlled setting to test whether LLMs develop understanding similar to humans.
AI showed strong color associations, but differed significantly from both color-seeing and colorblind humans.
In this episode of The Good Robot, Eleanor Drage speaks with Tara Merk about how community-owned data centers could transform digital ownership and challenge Big Tech dominance. They explore alternative models prioritizing local empowerment, sustainability, and cooperative governance, drawing on examples from Germany's renewable energy sector. The conversation reimagines the internet as a shared public resource designed for collective needs rather than profit.
Community-owned data centers can reshape digital ownership and challenge Big Tech.
Alternative models emphasize local empowerment, sustainability, and cooperative governance.
The AAMAS 2026 best paper awards were presented at the 25th International Conference on Autonomous Agents and Multiagent Systems, held from 25-29 May 2025 in Paphos, Cyprus. Winners in three categories were announced: Best Paper, Best Student Paper, and Blue Sky Ideas Paper.
AAMAS 2026 took place in Paphos, Cyprus, May 25-29, 2025.
Best Paper: 'Developing Guidelines for Human-LLM Agent Teams: A Multi-Stakeholder Lens'.
Sanmay Das, professor of computer science at Virginia Tech, was elected a 2026 AAAI Fellow for his work on multiagent systems and AI for social impact. He discusses his research on market making, matching, and societal resource allocation, and how AI can be integrated into public systems.
Das focuses on multiagent systems at the intersection of AI and economics, particularly resource allocation.
His work spans prediction markets, matching theory, and LLM prioritization.
A study by Oregon State University finds that adding design friction to AI systems, such as prompts to consider energy consumption, can encourage more responsible use. Action-based friction that requires users to search for existing resources was effective, while cue-based messaging only increased trust. With AI's energy use rising, such interventions are crucial.
Researchers tested action-based and cue-based design friction to promote responsible AI use.
Action-based friction (searching for existing images) led to more ecological behavior.
OpenAI's AI model found a counterexample to Paul Erdős's 1946 planar unit distance problem, showing that grid-like arrangements are not optimal. The result was autonomously produced, and later improved by human mathematician Will Sawin. This marks a growing role for AI in mathematical research.
OpenAI's AI disproved Erdős's conjecture on unit distances, solving an 80-year-old problem.
The result was autonomously generated with minimal human intervention, shocking mathematicians.
This post contains a list of the AI-related seminars that are scheduled to take place between 1 June and 31 July 2026. All events detailed here are free and open for anyone to attend virtually.
Seven free AI seminars from 1 June to 31 July 2026
Topics include drone swarm intelligence, protein dynamics, media literacy, extreme precipitation forecasting
Image Empire is an animated fairytale about the fusion of the real and the virtual within contemporary AI models. The film forms part of a research project undertaken by Alan Warburton which also includes a research paper and a series of satellite events.
The film is based on doctoral research at Birkbeck's Vasari Centre for Art & Technology.
Commissioned by the National Videogame Museum in collaboration with ODI and Cambridge's Leverhulme Centre for the Future of Intelligence.
This month's AIhub digest covers AI for Science conference, lottery ticket hypothesis interview, world models discussion, transparent and trustworthy AI research, foundation model impacts report, AIES conference reflections, Robotics Café, ACL desk rejection policy, arXiv anti-AI slop policy, and more.
Interview with Ximing Wen on transparent and trustworthy AI systems
Jonathan Frankle discusses the lottery ticket hypothesis and empiricism
A study by researchers at the University of Michigan suggests AI chatbots can easily engage in covert advertising to manipulate users, and many people don't realize it. As major tech companies experiment with chatbot ads, this raises concerns about user privacy and autonomy.
Study shows chatbots with undisclosed ads influenced user choices, but half of participants didn't notice the ads.
Chatbots can build detailed user profiles through conversation, enabling more targeted advertising.
This episode of The Good Robot explores how feminist principles and decentralized infrastructure could transform cloud infrastructure from a corporate service into a public commons. Friederike von Franqué, policy advisor at Wikimedia Germany, discusses examples from Frankfurt's energy-intensive data centres to Stockholm's municipally owned fibre network, advocating for environmental accountability and community-driven design.
Friederike von Franqué advocates for feminist and decentralized approaches to cloud infrastructure.
The episode contrasts Frankfurt's high-energy data centres with Stockholm's communal fibre network.
In this month's AIhub coffee corner, AI experts discuss the concept of world models, their definitions, applications, and limitations. The conversation covers topics such as transition models in reinforcement learning, video generation, causal models, and challenges in robotics and simulation.
World models are defined in multiple ways, from transition models in RL to video generators.
Applications include robot training and surgery simulation, but challenges remain in physics and partial observability.
Anthropic's Mythos model has discovered thousands of severe security vulnerabilities, including many zero-day flaws that have gone undetected for decades. Banks worldwide fear cybercriminals will exploit this AI to rob them. Anthropic has granted access to a defensive coalition including Microsoft, but not to banks in Australia, the UK, or Europe.
Mythos can identify thousands of zero-day vulnerabilities across major operating systems and browsers.
Anthropic has invested $100 million in credits and $4 million in grants to fix these bugs.
Jonathan Frankle discusses the lottery ticket hypothesis, for which he won the 2023 AAAI/ACM Doctoral Dissertation Award. He covers empiricism vs. theoretical proofs, the shift in computer science methodology, the pressure on young researchers to deliver impact, and his current focus on evaluating how well AI systems work in practice.
The lottery ticket hypothesis explores why neural networks need to be large during training but can be smaller afterward.
Frankle's emphasis on empirical methods was initially controversial but has since become the norm in AI.
MIT researchers developed EnergAIzer, a rapid prediction tool that estimates AI workload power consumption on specific processors in seconds with about 8% error, aiding data center energy efficiency.
EnergAIzer leverages repetitive patterns in AI workloads for fast power estimation.
Estimates in seconds vs. hours/days for traditional methods, with ~8% error.
System failures cost over a trillion dollars annually. Engineers rely on time series data to troubleshoot outages quickly. ARFBench is a new benchmark derived from real incidents at Datadog, designed to evaluate AI models on time series question-answering (TSQA) tasks. Experiments show existing models have significant room for improvement, hybrid TSFM-VLM models show promise, and human-AI complementarity achieves superhuman performance.
ARFBench is a TSQA benchmark using real production time series data, consisting of 750 QA pairs from 63 incidents.
Leading LLMs, VLMs, and TSFMs struggle on ARFBench, with accuracy far below domain experts.
Federated unlearning allows removal of user data from trained AI models to enhance privacy, but research reveals it introduces new security vulnerabilities, including potential for attackers to inject hidden backdoors and then request unlearning to cover tracks. Current methods lack verification, risking system integrity.
Federated unlearning enables data deletion from AI models but can be exploited maliciously.
Attackers can inject backdoors and then use unlearning requests to hide their activity.
A reflection on AIES 2025, highlighting discussions on LLMs in clinical settings and human rights, along with key presentations and new conference formats.
Conference held at IE University's vertical campus in Madrid covered bias mitigation, AI in workplace, LLM evaluation, and dataset creation.
New format: all speakers present first, then joint discussion and Q&A.
Scientists at NTU Singapore have developed a biochip that uses deep learning and computer vision to detect microRNA biomarkers in 20 minutes, offering a faster and more accurate alternative to PCR for disease diagnosis.
New biochip detects microRNA in 20 minutes using AI image analysis.
Achieves over 99% accuracy in identifying targets.
A new study published in BMJ Open finds that five popular AI chatbots provide problematic health information in about half of responses, with only two out of 250 questions refused. Grok performed worst, and open-ended questions were especially risky. The study underscores the need for verification and not treating chatbots as medical authorities.
Study tested ChatGPT, Gemini, Grok, Meta AI, and DeepSeek on 50 health questions each.
Nearly 20% of answers were highly problematic, half were problematic, and 30% were somewhat problematic.
GRASP is a new gradient-based planner for learned dynamics (a “world model”) that makes long-horizon planning practical by (1) lifting the trajectory into virtual states so optimization is parallel across time, (2) adding stochasticity directly to the state iterates for exploration, and (3) reshaping gradients so actions get clean signals while we avoid brittle “state-input” gradients through high-dimensional vision models.
GRASP uses collocation to parallelize computation over time, mitigating gradient issues.
Gaussian noise is injected into state iterates for exploration while keeping action gradients intact.
As AI tools become widespread, people are tempted to offload difficult thinking tasks. Cognitive science warns that overdependence can erode critical thinking. The key is to use AI mindfully, maintaining control over what we offload and engaging in reflective practices to balance external support with our own cognitive growth.
Offloading thinking to AI can erode critical thinking skills.
In the latest interview from our AAAI/SIGAI Doctoral Consortium series, we speak with Ximing Wen, who researches transparent and trustworthy AI systems. She discusses prototype-based interpretable models, spatial grounding for document QA, and her vision for safe and reliable AI.
Ximing Wen is a PhD candidate at Drexel University working on AI transparency and trustworthiness.
Her prototype-based approach bridges the accuracy gap between interpretable and black-box models.
Partnership on AI has released a progress report on post-deployment governance practices for foundation models. The report evaluates 13 providers across four practices: sharing usage information, enabling research on societal impacts, reporting incidents and policy violations, and sharing user feedback. Key findings reveal that while leading organizations are defining information-sharing standards, adoption is slow among others, and public impact data remains fragmented.
The report assesses 13 foundation model providers on four governance practices.
Leading firms progress on information sharing, but overall adoption lags.
This post contains a list of the AI-related seminars that are scheduled to take place between 5 May and 30 June 2026. All events detailed here are free and open for anyone to attend virtually.
A day-long conference at the Royal Society explored how AI is transforming scientific discovery across fields like astronomy, materials science, climate forecasting, and nuclear fusion. Speakers highlighted breakthrough applications including foundation models for galaxy classification, diffusion models for crystal design, AI-based sea ice forecasting, and fast plasma simulations for fusion.
Foundation models like Zoobot enable automatic classification of galaxy morphologies, leading to discovery of thousands of ring galaxies.
Diffusion models can design novel crystal compounds in milliseconds, vastly outperforming traditional computational chemistry.