AI News Hub

Today's must-reads

Agents

The smart TV in your living room is a node in the AI scraping economy

This article explores how Bright Data uses a residential proxy network, fueled by an SDK embedded in consumer apps, to turn smart TVs and other devices into exit nodes for AI training data scraping. It details the technical workings, partner ecosystem, consent issues, and why connected TVs are ideal proxies.

  • Bright Data’s SDK turns user devices into residential proxy exit nodes via consent dialogs.
  • Smart TVs are ideal for scraping due to constant power, stable WiFi, and low user attention.

Using an AI coding agent with oracle-based testing to build a game emulator

In this guest post, Patrick Nadeau recounts his journey building an Intellivision emulator from scratch using an AI coding agent. He describes using a test oracle from the existing emulator jzintv to validate his CPU core, and how the AI accelerated development — from first pixels at hour 5 to a fully playable system by hour 36. He also added a debugger port allowing the AI to control the game live. Despite the success, Nadeau reflects on the ethical implications of using AI that learns from others' work and the bittersweet feeling of creating with a co-pilot.

  • Patrick Nadeau built an Intellivision emulator with an AI coding agent, using a test oracle from the jzintv emulator for validation.
  • Development milestones: first pixels at 5 hours, complete system playable via controller by 36 hours.
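The oracle approach summarized above can be sketched roughly as follows. This is a minimal illustration, not Nadeau's actual harness: `CpuState` and `first_divergence` are hypothetical names, and the traces are assumed to be per-instruction snapshots captured from a reference emulator (such as jzintv) and from the emulator under test.

```python
from dataclasses import dataclass
from itertools import zip_longest
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class CpuState:
    """Hypothetical snapshot of CPU state after one instruction."""
    pc: int
    registers: Tuple[int, ...]
    flags: int


def first_divergence(
    oracle: Iterable[CpuState], candidate: Iterable[CpuState]
) -> Optional[int]:
    """Return the index of the first step where the traces differ, or None."""
    for step, (expected, actual) in enumerate(zip_longest(oracle, candidate)):
        # zip_longest pads the shorter trace with None, so a length
        # mismatch is reported as a divergence as well.
        if expected != actual:
            return step
    return None


# Assumed example data: the oracle trace comes from the reference emulator.
oracle_trace = [
    CpuState(pc=0x1000, registers=(0, 0), flags=0),
    CpuState(pc=0x1001, registers=(5, 0), flags=0),
    CpuState(pc=0x1002, registers=(5, 5), flags=0),
]
# Simulated bug: the candidate core sets a flag wrongly on the third step.
candidate_trace = [
    CpuState(pc=0x1000, registers=(0, 0), flags=0),
    CpuState(pc=0x1001, registers=(5, 0), flags=0),
    CpuState(pc=0x1002, registers=(5, 5), flags=1),
]

print(first_divergence(oracle_trace, candidate_trace))
```

Reporting only the first mismatching step keeps the feedback loop short: the agent gets one concrete instruction to fix before the traces are compared again.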

Tell HN: AI software development workflow, stack-ranked from HN discussion


Understand how you build with AI

Y Combinator releases Paxel, a free open-source tool that analyzes your Claude, Codex, and Cursor AI coding sessions to help you understand your building style. It runs locally in Docker, preserving code privacy, and provides a builder profile with archetypes, decision patterns, and growth edges. So far, over 70,000 sessions have been uploaded.

  • Paxel analyzes AI coding sessions from Claude Code, Codex CLI, and Cursor to reveal building patterns.
  • Runs locally in Docker; your code and .env files never leave your machine—only anonymized summaries are uploaded.

Thousand Token Wood: shipping a multi-agent economy on a 3B model

A field report from the Build Small Hackathon on a tiny multi-agent economy simulation powered by a 3-billion-parameter model. The project demonstrates that small models can enable real-time multi-agent simulations when designed with engineered scarcity and careful prompting, revealing both the reliability and limitations of small models.

  • A 3B model served with vLLM on Modal successfully runs a multi-agent economy of five woodland creatures trading goods, showcasing real-time feasibility.
  • Engineered scarcity (diet variety, spoilage, winter fuel crisis) is crucial to drive economic activity and prevent market stagnation.
Policy

She won a religious exemption from using AI at work

A 34-year-old software engineer obtained a religious exemption from using AI at work, citing Pope Leo XIV's encyclical warning that AI could undermine human dignity and displace workers. Federal law requires employers to consider faith-based requests, potentially prompting more workers to seek exemptions.

  • A software engineer won a religious exemption from using AI at work
  • Pope Leo XIV warned AI could undermine human dignity and displace workers
Models

ToTra – open-source LLM gateway with GDPR/EU AI Act compliance

ToTra is an open-source AI gateway and governance platform that provides quota enforcement, PII blocking, cost tracking, and compliance (GDPR, EU AI Act) out of the box. Written in Go, it adds less than 2ms overhead and supports multiple LLM providers with zero code changes.

  • Quota enforcement with hard budget caps per user and team
  • PII blocking scanning 18 language groups at the edge

OpenRouter: The Unified Interface for LLMs

OpenRouter offers configurable security and governance tools for budget enforcement, zero data retention, model and provider restrictions, prompt injection defense, and data loss prevention to protect your agents, data, and costs.

  • OpenRouter is a unified interface for large language models.
  • Provides configurable security and governance tools.
Research

Anthropic warns Claude AI is building itself faster than expected

Anthropic published a report warning that the development path could eventually leave humans unable to control AI systems, even as Claude now writes more than 80% of the code merged into its own codebase. The report outlines three scenarios, with the worst-case involving fully self-improving models. Anthropic calls for the option to slow or pause frontier development, but says it would only act if rivals do the same. The figures are self-reported and unaudited.

  • Claude authored over 80% of merged code, and engineers are merging 8x more code per quarter.
  • The report describes three possible scenarios, the most extreme of which could lead to loss of human control.
Other updates (35)
Agents

Microsoft wants users to be addicted to Scout, their AI personal assistant

An internal Microsoft strategy document reveals plans to make users 'addicted' to its new Scout AI assistant before rolling out additional features. The article critiques Microsoft's long history of creating dependency through product lock-ins.

  • Microsoft plans to get users addicted to Scout AI, then expand its features.
  • Internal document outlines three phases from addictive app to agentic platform.

Hermes Agent – Open-Source AI Agent with Persistent Memory

Hermes Agent is an open-source autonomous AI agent by Nous Research with persistent memory, automated skill creation, and multi-platform support. It runs on self-hosted servers, learns user preferences and projects, and interacts via Telegram, Discord, and more. It also offers batch processing, RL training, and trajectory export for MLOps and AI training.

  • Open-source and self-hosted with zero telemetry.
  • Persistent memory and automated skill creation.

AI is fueling Reddit's spam problem

Brands and spammers are increasingly using Reddit to manipulate AI chatbots by flooding key subreddits with promotional content, a practice known as generative engine optimization (GEO). Moderators of r/biohackers recently restricted posts about peptides and hormone replacement therapy after discovering systematic seeding of sponsored content aimed at AI scraping. Reddit says it uses automated tooling to combat this, but moderators note that detection increasingly relies on pattern recognition. The platform sells data to AI companies while simultaneously battling AI-driven manipulation.

  • Brands and spammers exploit Reddit to manipulate AI chatbots through sponsored content designed to be scraped by AI tools.
  • r/biohackers moderators limited posts on peptides and hormone therapy after uncovering systematic seeding by companies.

AI Agents Now Generate More Web Traffic Than Humans

Cloudflare CEO Matthew Prince announced that for the first time, agentic AI traffic has surpassed human traffic, accounting for 57.4% of total internet traffic. This milestone arrived earlier than expected, with regional variations: North America sees 68.6% bot traffic, while Asia, South America, and Oceania remain human-dominated. The data fuels the Dead Internet Theory, suggesting online activity is increasingly driven by machines.

  • Agentic AI traffic exceeds human traffic for the first time at 57.4% of total.
  • Cloudflare CEO previously predicted this milestone for late 2027.

OpenAI Codex Tech Lead Does AI-Assisted Engineering

Michael Bolin, OpenAI Codex Tech Lead, shares his straightforward AI-assisted engineering workflow: write a spec, give a simple prompt, review the code. He keeps requirements in Notion docs, relies on Codex's Notion connector to read that context automatically, breaks work into right-sized PRs, and lets Codex handle merge conflicts and CI babysitting. The approach emphasizes code review quality and fast iteration.

  • Workflow: write spec → simple prompt → review code
  • Keeps requirements in Notion docs, which Codex reads directly through its Notion connector

Replit shows how vibe coding is getting its own financial stack — and a path to profit

Replit is assembling a financial stack for vibe-coded apps, including Shopify integration for e-commerce, RevenueCat for subscriptions, and Visa for autonomous agent payments, aiming to turn casual app creation into viable businesses.

  • Replit's Shopify integration lets users build a custom storefront in about 10 minutes via its AI agent.
  • Previous partnerships with RevenueCat and Visa cover recurring revenue and autonomous transactions, respectively.

OpenClaw Got Safer in Public

OpenClaw, an open-source AI agent project, improved its security through transparency and community contributions, despite facing many false vulnerability reports. The post details changes such as trust model documentation, hardening, a plugin architecture, and partnerships with companies including NVIDIA, Microsoft, and Tencent.

  • Open-source nature enabled rapid security improvements.
  • Over 1,300 security advisories received, most false positives.

Miasma Worm Targets AI Coding Agents via GitHub Repos

A new worm named Miasma exploits AI coding agent configuration files to spread through GitHub repositories. It hijacks auto-run features in Claude Code, Gemini CLI, Cursor, and VS Code to execute a payload that steals cloud credentials and self-replicates. Over 113 repositories have been affected, including Azure samples and popular open-source projects.

  • Miasma worm modifies developer tool config files to trigger malicious code execution when opening or using infected projects.
  • It uses multiple triggers: Claude/Gemini SessionStart hooks, Cursor project rules, VS Code folder-open tasks, and npm test scripts.

Which AI agents send Accept: text/markdown?

This article lists AI agents that currently support or partially support sending the Accept: text/markdown header in HTTP requests, and provides methods to verify them. As of May 2026, only Claude Code, Cursor, OpenClaw, OpenCode, and Codex CLI (partial) support this feature, while other mainstream agents like ChatGPT, Claude.ai, and Copilot only fetch HTML.

  • Claude Code, Cursor, OpenClaw, OpenCode explicitly support sending Accept: text/markdown header.
  • Codex CLI only partially supports it, following the relevant RFC standards.
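On the server side, checking for this header could look roughly like the sketch below. It is a simplified illustration rather than the article's verification method: `parse_accept` and `wants_markdown` are hypothetical helpers, wildcards such as `*/*` are ignored, and the rule "serve Markdown only when it is at least as preferred as HTML" is an assumption.

```python
def parse_accept(header: str) -> dict[str, float]:
    """Parse an Accept header into {media_type: q-value} (simplified)."""
    prefs: dict[str, float] = {}
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()  # media types are case-insensitive
        if not media_type:
            continue
        q = 1.0  # default quality when no q parameter is given
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # treat a malformed q-value as "not acceptable"
        prefs[media_type] = q
    return prefs


def wants_markdown(header: str) -> bool:
    """True if text/markdown is acceptable and not ranked below text/html."""
    prefs = parse_accept(header)
    md = prefs.get("text/markdown", 0.0)
    html = prefs.get("text/html", 0.0)
    return md > 0 and md >= html


print(wants_markdown("text/markdown, text/html;q=0.9"))
print(wants_markdown("text/html,application/xhtml+xml"))
```

A production server would more likely use a full content-negotiation library; the point here is only that an agent's preference is visible in a single request header.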

Sakana AI's Recursive Self-Improvement (RSI) Lab

Sakana AI announces the establishment of the RSI Lab in Tokyo, dedicated to building sample-efficient, recursive self-improving AI systems. The lab builds on a portfolio of research including the AI Scientist (published in Nature) and aims to transition from static models to autonomous, self-improving intelligence engines. The approach emphasizes elegant, adaptive architectures over brute-force scaling, with a vision for democratized AI.

  • Sakana AI's RSI Lab focuses on Recursive Self-Improvement (RSI) technology for autonomous AI development.
  • The lab's research portfolio includes breakthroughs like LLM-Squared, the Darwin Gödel Machine, and the AI Scientist (Nature publication).

Runcap, I built a local cost cap for coding agents

Runcap is a free, local CLI tool that estimates and caps the cost of AI coding agent runs. It provides cost estimation before execution, enforces a hard spending limit, compresses tokens, and offers rescue prompts when agents get stuck. Unlike existing observability tools that track costs after the fact, Runcap acts as a circuit breaker to prevent overspending.

  • Estimates cost range before a run and enforces a hard ceiling.
  • Provides copyable rescue prompts when the agent gets stuck.
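The circuit-breaker idea can be illustrated with a short sketch. This is not Runcap's implementation: the `CostCap` class, its method names, and the per-million-token prices are all hypothetical, chosen only to show a check that happens before spending rather than after.

```python
class BudgetExceeded(RuntimeError):
    """Raised when a planned call would push spending past the cap."""


class CostCap:
    """Hypothetical hard spending limit checked before each model call."""

    def __init__(self, limit_usd: float) -> None:
        self.limit_usd = limit_usd
        self.spent_usd = 0.0

    def charge(
        self,
        input_tokens: int,
        output_tokens: int,
        in_price_per_mtok: float,
        out_price_per_mtok: float,
    ) -> float:
        """Record the estimated cost of a call, or refuse it if over budget."""
        cost = (
            input_tokens * in_price_per_mtok
            + output_tokens * out_price_per_mtok
        ) / 1_000_000
        if self.spent_usd + cost > self.limit_usd:
            # Trip the breaker before any money is spent on this call.
            raise BudgetExceeded(
                f"call would cost ${cost:.2f}; "
                f"${self.limit_usd - self.spent_usd:.2f} remains"
            )
        self.spent_usd += cost
        return cost


cap = CostCap(limit_usd=1.00)
# Assumed prices: $3 per million input tokens, $15 per million output tokens.
cap.charge(100_000, 10_000, 3.0, 15.0)
cap.charge(100_000, 10_000, 3.0, 15.0)
try:
    cap.charge(100_000, 10_000, 3.0, 15.0)
except BudgetExceeded as err:
    print(f"stopped: {err}")
```

The third call is rejected because it would take total spending past the $1.00 ceiling, which is the difference between a cap and after-the-fact cost tracking.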

Give your agent its own computer

AI agents need secure execution environments. LangSmith Sandboxes provide hardware-virtualized microVMs, giving each agent a full computer with fast startup and persistent state, enabling code generation, data analysis, CI workflows, and more.

  • Agents require real computer environments (filesystem, shell, package manager) but direct infrastructure access is dangerous.
  • Container isolation is insufficient against kernel exploits; hardware-level separation is necessary.

Labour will make AI ‘work for the workers’, says Liz Kendall

Technology secretary promises to support people whose jobs are swept away by automation, as public fears mount over AI's impact on employment.

  • Liz Kendall says Labour will ensure AI 'works for workers' and will not abandon those affected.
  • Growing public concern about AI's impact on jobs, especially for young people.

This is your laptop… on AI

We're now deep into developer conference season, and one of the themes so far is the relentless conviction from Big Tech companies that AI is going to change everything. Nvidia's Jensen Huang envisions a new kind of laptop and usage. But does anyone actually want this? The Vergecast discusses Microsoft Build, Google I/O products, and more.

  • Nvidia's Jensen Huang describes a new AI-centric laptop paradigm
  • AI agents from Microsoft and Google raise the question of whether users actually want them

Show HN: Amanuensis – a local-first AI persona that won't fabricate facts

Amanuensis is a local-first AI persona system for posting on Mastodon and Bluesky. It prevents model hallucination through strict pipelines: factual source summaries, deterministic cleanup, regex pre-checks, LLM grounding checks, and human approval via Telegram. MIT-licensed experimental code.

  • Amanuensis runs locally on a GPU machine with no cloud LLM calls.

The Enterprise AI Maturity Model | Cohere

Enterprise AI adoption typically follows a predictable five-phase progression: experimentation, tool adoption, internal platforms, strategic integrations, and AI-native transformation. Most organizations get stuck between Phase 2 and Phase 3, facing challenges like data access, trust gaps, and fear of model obsolescence. This article focuses on bridging the gap from pilot to production, emphasizing the need for internal platforms, unified data fabric, observability, and model optionality.

  • Enterprise AI maturity consists of five phases: experimentation, tool adoption, internal platforms, strategic integrations, and AI-native transformation.
  • Many enterprises hit a 'production wall' between Phase 2 (tool adoption) and Phase 3 (internal platforms).

Introducing Command A+ | Cohere

Cohere open-sources Command A+, a 218B-parameter (25B active) mixture-of-experts model under Apache 2.0. Optimized for enterprise agentic workflows, it supports 128K input context, 64K generation, and text, image, and tool use. It significantly outperforms prior Command A models in reasoning, multimodal understanding, and multilingual tasks, while enabling efficient deployment via low-bit quantization and speculative decoding. Available on Hugging Face and Model Vault.

  • Command A+ is Cohere's latest open-source MoE model with 218B total and 25B active parameters, released under Apache 2.0. Designed for agentic tasks, it supports 128K input context and 64K generation.
  • Compared to Command A Reasoning, it achieves 85% (up from 37%) on Telecom benchmarks and 25% (up from 3%) on agentic coding, with gains across multimodal and multilingual tasks.

What Is Model Context Protocol (MCP) | Cohere

Model Context Protocol (MCP) is an open standard that connects AI applications to enterprise systems, simplifying data access and action execution. This guide explains how MCP works, its differences from APIs, RAG, function calling, and agents, common use cases, and security considerations.

  • MCP is an open protocol for connecting AI apps to enterprise systems, not a model or database.
  • Uses client-server architecture with resources, tools, and prompts as core features.

The Enterprise Guide to AI in Business Intelligence | Cohere

AI is increasingly applied to business intelligence to make data more accessible and useful. This article explains what AI in BI means, where it creates value, and key considerations for enterprise adoption.

  • AI in BI enables natural language queries, automated summaries, and anomaly detection.
  • AI-powered BI supports predictive analytics, root cause analysis, and role-specific insights.

RWS and Cohere Build Top-Performing AI Language Intelligence for the Enterprise

RWS and Cohere collaborate to build a specialized translation model for Language Weaver Pro, leveraging Cohere's LLM and RWS's language expertise. The model outperforms competitors in 31 of 32 languages, offering cultural intelligence, security, and compliance for enterprise use.

  • RWS and Cohere co-developed a specialized translation model powering Language Weaver Pro.
  • The model outperforms competitors in 31 out of 32 languages, including DeepL.

Direct agents with visual prompts in Design Mode · Cursor

Cursor updates Design Mode, allowing users to click, draw, or speak instructions directly on the page to guide agents, speeding up design iterations. It leverages multi-select, voice input, and the Composer 2.5 model for fast, contextual edits.

  • Design Mode supports element selection, drawing, and voice narration for intent communication.
  • Users can send multiple edits in parallel while agents process them asynchronously.
Policy

The crucial human component in computing and AI

The MIT Ethics of Computing Research Symposium brought together experts and researchers working at the heart of ethical and social impact in technology.

  • Symposium examined AI alignment, education, and the human-AI interaction gap.
  • Keynote by Jon Kleinberg used chess and Lord of the Rings to illustrate AI's model mismatch with human reasoning.

Florida's lawsuit against OpenAI and CEO Altman treats ChatGPT as a defective product and public nuisance

Florida becomes the first US state to sue OpenAI and CEO Sam Altman over risks to minors, insufficient age verification, and inadequate safety investments. The 83-page complaint treats ChatGPT as a product subject to liability, seeking billions in penalties. This legal approach could set a precedent for the entire chatbot industry.

  • Florida sues OpenAI and CEO Sam Altman, alleging ChatGPT is a defective product and public nuisance.
  • The 83-page complaint focuses on risks to minors, missing age checks, and insufficient safety efforts.

AI Governance Challenges: How to Scale Responsibly | Cohere

As AI adoption expands beyond controlled pilots, mismatches between governance frameworks and actual use can arise. This article explores common AI governance challenges and failure modes, and outlines steps enterprises can take, including building an AI inventory, defining clear ownership, applying risk-based controls, and continuous monitoring.

  • AI governance gets harder as adoption scales, with loss of visibility and accountability being key risks.
  • Common issues include one-time approvals, unclear ownership, controls not matching risk, and sensitive data lacking appropriate controls.
Models

Large companies can add a local LLM filter layer to reduce their AI costs

Large companies can deploy small local language models as a filter to handle simple queries, reducing reliance on expensive cloud LLMs, significantly cutting AI costs, and enhancing privacy.

  • Small local models like Gemma can handle simple coding queries without needing paid LLMs.
  • Companies can set up a local LLM filter layer that falls back to providers like Claude only when necessary.
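A filter layer of this kind could be sketched as below. Everything here is a hypothetical stand-in: `route`, the word-count heuristic in `is_short_prompt`, and the two fake model functions are assumptions for illustration, and a real deployment would call an actual local model and a cloud provider's API.

```python
from typing import Callable, Optional, Tuple

LocalModel = Callable[[str], Optional[str]]  # returns None when it declines
CloudModel = Callable[[str], str]


def route(
    prompt: str,
    is_simple: Callable[[str], bool],
    local_model: LocalModel,
    cloud_model: CloudModel,
) -> Tuple[str, str]:
    """Try the local model for simple prompts; fall back to the cloud."""
    if is_simple(prompt):
        answer = local_model(prompt)
        if answer is not None:
            return "local", answer
    return "cloud", cloud_model(prompt)


def is_short_prompt(prompt: str) -> bool:
    """Hypothetical heuristic: treat short prompts as simple."""
    return len(prompt.split()) <= 12


def fake_local(prompt: str) -> Optional[str]:
    """Stand-in local model that declines anything mentioning 'refactor'."""
    if "refactor" in prompt.lower():
        return None
    return f"[local] {prompt}"


def fake_cloud(prompt: str) -> str:
    """Stand-in for a paid cloud model call."""
    return f"[cloud] {prompt}"


print(route("Reverse a string in Python", is_short_prompt, fake_local, fake_cloud))
print(route("Refactor this module", is_short_prompt, fake_local, fake_cloud))
```

Only prompts that the local model cannot or should not answer reach the paid provider, which is where the cost saving would come from.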

Which is faster: Gemini 3.5 Flash or Kimi K2.6 on Cerebras

At Google I/O 2026, Google launched Gemini 3.5 Flash focused on speed. Meanwhile, Kimi K2.6 running on Cerebras achieves 5.4x faster output and 3x lower latency. This article compares intelligence, speed, end-to-end response, latency, and open vs. closed models.

  • Gemini 3.5 Flash outputs 181 tokens/s; Kimi K2.6 on Cerebras outputs 981 tokens/s.
  • Kimi K2.6 matches Gemini 3.5 Flash in intelligence but is significantly faster.
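The quoted throughput figures are consistent with the 5.4x claim, as a quick check shows. The 1,000-token response length is an assumption used only to make the difference concrete, and the calculation ignores time to first token.

```python
gemini_tps = 181  # Gemini 3.5 Flash output speed, tokens per second
kimi_tps = 981    # Kimi K2.6 on Cerebras output speed, tokens per second

speedup = kimi_tps / gemini_tps
print(f"{speedup:.2f}x")  # prints "5.42x"

# Assumed response length, for illustration only.
response_tokens = 1000
gemini_seconds = response_tokens / gemini_tps
kimi_seconds = response_tokens / kimi_tps
print(f"{gemini_seconds:.1f}s vs {kimi_seconds:.1f}s")  # prints "5.5s vs 1.0s"
```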
Tools

AI enthusiasts race against time, AI skeptics race against entropy

A talk about 'vibe coding' excited managers, but colleagues revealed the projects left chaos and cleanup work, highlighting the growing rift between AI optimists and skeptics.

  • A presenter claimed to solve a year's worth of engineering problems in weeks using vibe coding, exciting managers.
  • However, colleagues described the projects as a 'horror show' with extensive cleanup work.

The Fitbit Air is a good wearable weighed down by a chatty AI "coach"

The Fitbit Air is an excellent screenless fitness tracker that is comfortable and reasonably priced. However, Google's push for an AI health coach that is overly talkative detracts from the experience. Free users get a more useful, information-dense interface. Users can disable the AI, but the option is buried in settings.

  • Fitbit Air hardware is solid, comfortable, and affordable.
  • Google's chatty AI health coach harms the user experience.

Fifa expanding AI use at World Cup to reduce amount of abuse seen by players

Fifa will expand the use of AI at the World Cup to reduce the amount of abusive messages that teams and players are exposed to on social media. The social media protection service, introduced after the 2022 World Cup in Qatar, is now offered for free to all football associations for the 2026 tournament. The English FA has not yet confirmed whether it will take up the offer.

  • Fifa offers free social media protection service using AI moderation for all associations at the 2026 World Cup.
  • Service introduced after 2022 Qatar World Cup to protect players from online abuse.
Startups

Meta's stock sinks on report company could raise billions for AI push

Meta shares fell over 5% on a Financial Times report that the company may raise tens of billions via a stock offering to fund AI investments. Meta hasn't hired banks and may not issue new stock; a spokesperson called the report "pure speculation."

  • Meta shares dropped over 5% on report of potential multibillion-dollar stock offering for AI.
  • Alphabet announced plans to raise $85 billion this week.
Research

Cohere and Mila Partner to Advance Quebec French Language in AI

Cohere and Mila announced a new academic research collaboration focused on improving AI evaluation across languages and cultures, starting with French-language cultural context in Quebec. The work aims to help frontier AI models better reflect the linguistic, social, and institutional nuances of Quebec French, moving beyond standardized language performance toward more culturally relevant and trusted AI systems.

  • Cohere and Mila partner to research AI evaluation for Quebec French cultural context.
  • Goal: make frontier AI models reflect linguistic, social, and institutional nuances of Quebec French.