AI News HubLIVE

Source Mix

  • Hacker News AI11
  • The Decoder11
  • The Verge AI4
  • ZDNet AI3
  • Artificial Intelligence News2
  • arXiv Computational Linguistics2
  • Last Week in AI2
  • O'Reilly AI & ML Radar2

Topic Mix

  • Agents31
  • Research14
  • Models12
  • Chips11
  • Policy11
  • Robotics6
  • Tools5
  • Startups1

Timeline

  • 2026-05-2813
  • 2026-05-279
  • 2026-05-257
  • 2026-05-267
  • 2026-05-246
  • 2026-05-224
  • 2026-05-234

Latest Updates

Google Cloud responds to AI-accelerated cyberattacks with a platform that aims to close security gaps in minutes

Google Cloud has unveiled "AI Threat Defense," a platform designed to automatically find, assess, and patch security flaws in enterprise systems. The company bundles technologies it partly acquired through acquisitions.

  • Google Cloud launches AI Threat Defense platform to combat AI-driven cyberattacks.
  • The platform automatically discovers, assesses, and patches security vulnerabilities.
In-site article

Google Pay preps for AI agents with Universal Commerce Protocol

Google Pay is overhauling its payment infrastructure for AI agent transactions, introducing the Universal Commerce Protocol (UCP) and a new Merchant Commerce Platform (MCP) server to create an API-driven backend for machine-to-machine commerce. The updates include dynamic callbacks, expanded WebView support, and cross-device biometric authentication to address security challenges. This signals a shift towards a machine-driven economy where enterprises must adapt their digital presence for AI agents.

  • Google Pay introduces Universal Commerce Protocol (UCP) to standardize AI agent payments.
  • New Merchant Commerce Platform (MCP) server acts as intermediary, aggregating transaction data.
In-site article

Google launches a tiny board that runs Gemma 3 locally

Google unveiled the new Coral Board at Google I/O - a compact single-board computer for on-device AI. It runs Gemma 3 270M locally and features a RISC-V based NPU.

  • Coral Board is a compact SBC for on-device AI, targeting headphones, AR glasses, and smartwatches
  • It features a RISC-V based Coral NPU and a Synaptics Astra SL2619 chip
In-site article

AGI timelines shift with whichever lab is dominant

A new analysis shows that top AI forecasters adjust their AGI timelines based on which lab is currently leading the field, with predictions swinging from earlier to later and back again as the dominant lab changes from ChatGPT to xAI/Meta/Gemini to Anthropic.

  • Predictions for when most cognitive labor will be automated (AGI) fluctuate significantly based on which AI lab is currently dominant.
  • From 2023-2025, most researchers moved AGI timelines earlier; from 2025-2026, they moved them later; in early 2026, under Anthropic's rapid progress, they moved earlier again.
In-site article

To Become a Better Designer with AI, Become a Digital Hoarder

The article argues that to create unique and tasteful designs with AI, designers must curate a library of visual references (digital hoarding) to develop taste and codify it for AI models. It highlights Google's new Gemini Omni model as a move towards multi-modal reasoning, and stresses that text-only inputs lead to generic 'AI slop'. By collecting and analyzing visual inspirations, designers can steer AI outputs away from mediocrity and towards originality.

  • Google's Gemini Omni model signals a shift towards multi-modal AI that can reason across text, image, audio, and video.
  • Relying solely on text prompts results in generic, 'slop' designs; visual references are essential for unique aesthetics.
In-site article

I'm an iPhone user, but Gemini with Android Auto beats Siri in the car any day - here's why

As an iPhone owner, I primarily use Siri through CarPlay when I'm driving. Apple's voice assistant can handle basic tasks, but since my Toyota Camry supports Android Auto, I wanted to see how Google Gemini would fare. With Gemini, you can send emails, get restaurant info, play games, and more. Here's how to set it up and my experience.

  • The author, an iPhone user, finds Gemini with Android Auto superior to Siri in the car.
  • Gemini handles a wide range of tasks from basic commands to complex interactions.
In-site article

Mistral rebrands LeChat as Vibe, betting its chatbot's future is as a full-blown work agent

Mistral AI is renaming its chatbot Le Chat to Vibe and bundling chat, coding agents and a new Work Mode under one brand. The Work Mode docks onto Google Workspace, Outlook, Slack or GitHub and processes tasks such as emails, reports or pull requests independently. The Pro tariff has been reduced from €17.99 to €14.99, although Mistral has not specified any concrete usage limits. The company is thus positioning itself more directly against the agent-based offerings from OpenAI, Google and Anthropic.

  • Mistral AI rebrands Le Chat as Vibe, integrating chat, coding agents, and a new Work Mode.
  • Work Mode connects to Google Workspace, Outlook, Slack, or GitHub to autonomously handle tasks.
In-site article

Your AI Agent Already Forgot Half of What You Told It

This article is the seventh in a series on agentic engineering and AI-driven development, focusing on context management in AI sessions. The author shares a personal experience with Gemini forgetting earlier notes, introduces the concept of context compaction, and provides four practical techniques: split discovery from documentation, use handoff documents, give acceptance criteria rather than procedures, and use spec documents as bridges. These techniques apply to both developers and regular users, helping reduce frustration caused by AI forgetting.

  • AI assistants can 'forget' earlier information in long conversations due to context window limits, a phenomenon called context compaction.
  • Four practical techniques: split discovery from documentation, use handoff documents, give acceptance criteria, and use spec documents as bridges.
In-site article

Money Printer Pro – Open-source AI content generator

Money Printer Pro is an open-source AI content generator powered by Google Gemini and VEO 3.1, enabling photorealistic images and cinematic videos with identity preservation. It features 7 visual engines, autopilot batch generation, AI quality scoring, and a publish guard. Users pay Google directly with no markup or subscription.

  • Generates photorealistic images and 8-second cinematic videos with consistent identity across outputs.
  • Integrates 7 visual engines for lighting, shadow, motion, weather, outfit, scene validation, and context orchestration.
In-site article

Superpowers: An Agentic Skills Framework for AI Coding Workflows

Superpowers is a complete software development methodology for coding agents, built on composable skills and initial instructions. It emphasizes test-driven development, design-first approach, and subagent-driven iteration, supporting multiple coding assistants like Claude Code, Codex CLI, and Gemini CLI.

  • Superpowers provides a skills library including TDD, systematic debugging, collaboration planning, enabling agents to work autonomously for hours.
  • The workflow starts with brainstorming specifications, followed by design approval, implementation plan generation, and subagent-driven execution with two-stage review.
In-site article

A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System

This tutorial builds a complete pgvector playground in Google Colab, covering installation, embedding creation, HNSW indexing, semantic search, filtered search, distance metric comparisons, half-precision storage, binary quantization, sparse vector search, hybrid retrieval, and vector aggregation. All using open-source tools without external API keys.

  • Set up PostgreSQL with pgvector extension in Google Colab from scratch.
  • Generate embeddings with SentenceTransformers and build HNSW indexes for efficient search.
In-site article

Former Google and Apple Researchers Launch a Startup to Build AI's Missing Feed

A group of former researchers from Google DeepMind, Apple, OpenAI, and Meta have launched a startup called Trajectory, aiming to help companies continuously improve their AI products by training on real-world user interactions. The company has raised a $15 million seed round at a $115 million valuation, led by Conviction. Trajectory's platform enables continuous learning for AI models, updating them based on real-world failures. It currently works with AI-native companies like Clay and Harvey, and plans to expand to Fortune 500 companies.

  • Trajectory is founded by ex-Google DeepMind, Apple, OpenAI, and Meta researchers to enable continuous learning for AI.
  • The startup raised $15M seed funding at $115M valuation, with investors including Jeff Dean and Fei-Fei Li.
In-site article

Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models

Researchers identify a Stability-Expressivity Gap in spoken language models when using synthetic data for low-resource languages, and propose two self-alignment frameworks (DGSA and TDSC) that recover prosodic variability and outperform commercial systems like ElevenLabs and Gemini Pro, enabling zero-shot voice cloning for Lao.

  • Spoken Language Models (SLMs) for low-resource languages suffer from a trade-off between phonetic accuracy and prosodic expressivity when trained on synthetic data.
  • The proposed Disentanglement-Guided Self-Alignment (DGSA) recovers expressivity by separating prosody and timbre.
In-site article

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ranks third on Arena's text-to-image leaderboard, on par with Google's Nano Banana 2 but still behind OpenAI's Image-2. The model shows clear gains over its predecessor, especially in rendering text inside images and commercial visuals.

  • MAI-Image-2.5 ranks third on Arena leaderboard, tied with Google's Nano Banana 2
  • Improvements in text rendering and commercial visuals
In-site article

This AI-free Google alternative is surging in popularity - how to try it for yourself

DuckDuckGo, an AI-free search alternative, is seeing a surge in users due to Google's AI Overviews. This article explains how to use DuckDuckGo without AI for private searching and browsing.

  • DuckDuckGo installs surged after Google I/O 2026, with iOS app peaking at 69.9% growth.
  • DuckDuckGo offers both AI-free search and AI chat options, giving users choice.
In-site article

With Google’s debut, the most important AI agent feature is now the most boring one

Google, Anthropic, and AWS all launched managed AI agent runtimes within six weeks, signaling that agent infrastructure has become table stakes. The real differentiator is shifting to data location, cost, and portability.

  • Google, Anthropic, and AWS shipped nearly identical managed agent runtimes within six weeks.
  • The managed runtime is no longer a competitive differentiator; it's a baseline expectation.
In-site article

Google folds Display Ads into AI-first Demand Gen platform

Google is folding Display Ads into its AI-powered Demand Gen platform, marking the end of a long-standing digital advertising model. The transition requires marketers to move from manual campaign controls to AI-driven automation, changing how campaigns are created, measured, and optimized.

  • Google integrates Display Ads into its AI-first Demand Gen platform, phasing out traditional GDN model.
  • Advertisers provide creative assets and business goals, while Google's AI automates ad formats, placements, and audience targeting.
In-site article

Agent Skills: Making AI Coding Agents Follow Good Engineering Practices

AI coding agents default to the shortest path to 'done,' skipping specs, tests, and reviews that senior engineers know are essential. Addy Osmani's Agent Skills project builds senior-engineer scaffolding for agents, using workflows instead of prose. It includes 20 skills across six SDLC phases, incorporating Google engineering practices. Key principles: process over prose, anti-rationalization tables, nonnegotiable verification, progressive disclosure, and scope discipline. The article also covers three usage modes and patterns to steal even without installing.

  • AI coding agents take the shortest path to complete tasks, ignoring specifications, tests, and reviews—the same failure mode senior engineers learn to avoid.
  • Agent Skills uses workflow Markdown files to guide agents, each with steps, checkpoints, and exit criteria.
In-site article

Last Week in AI #341 - Musk loses to OpenAI, Google's IO updates, OpenAI solves Erdős

This week's top AI news includes Elon Musk losing his $150 billion lawsuit against OpenAI, Google unveiling major AI updates at I/O 2026, OpenAI's AI solving an 80-year-old math problem, the Take It Down Act enforcement, and SpaceX planning to acquire coding startup Cursor after its IPO.

  • Elon Musk's $150B lawsuit against OpenAI dismissed; OpenAI prepares for IPO.
  • Google I/O 2026 introduces Gemini 3.5 Flash, Gemini Spark AI agent, Gemini Omni, and more.
In-site article

Crew44: Turn coding agents into specialist teams

Crew44 is a local-first, open-source tool that organizes multiple AI coding agents (like Claude Code, Codex, Gemini, Cursor) into coordinated specialist teams. Free, no account required, MIT licensed, with memory and compounding skills.

  • Crew44 unifies multiple AI coding agents into a single local workspace for team collaboration.
  • Users create specialist roles (e.g., Cofounder, Engineer, Product Lead) and bind each to the best runtime/model.
In-site article

The AI Agent Harness: The Glue That Turns LLMs into Digital Workers

AI models have plateaued on raw intelligence, and the next gains come from what you build around them. The AI agent harness provides tools, memory, and human-in-the-loop capabilities to transform LLMs into useful digital assistants. Companies like Google, LangChain, OpenAI, and Anthropic offer different solutions.

  • AI intelligence gains are plateauing; agent harnesses are the new frontier.
  • Agent harnesses add tools, memory, and human oversight to LLMs.
In-site article

AI Weekly Issue #496: Anthropic's Pentagon model is now everyone's model

Anthropic released its formerly classified Mythos model to the public, collapsing the gap between sovereign and developer AI. DeepMind's Demis Hassabis moved AGI timeline to 2029. Critical vulnerabilities in Starlette impacted millions of AI agents, and a coordinated takedown dismantled the Glassworm botnet. BNP Paribas partnered with Mistral for sovereign AI security, while China restricted travel for top AI engineers at Alibaba and DeepSeek. Corporate AI spending and layoffs made headlines: Uber burned its full-year AI budget by April, ClickUp restructured with a 3:1 AI-to-human ratio, and Sam Altman reversed his white-collar apocalypse prediction. However, MIT Technology Review data showed AI-exposed roles have lower unemployment.

  • Anthropic releases Mythos, previously limited to government contractors, now available via standard API.
  • DeepMind CEO Hassabis advances AGI timeline to 2029, citing AlphaProof Nexus solving nine Erdős problems cheaply.
In-site article

Some ideas for what comes next, May 2026

2026 continues to accelerate AI progress with open models lagging in agentic capabilities, Google's Gemini not yet competitive with Claude Code/Codex, American open models rising, a fierce competition between Anthropic and OpenAI, and power structures asserting control.

  • Open models are 5-6 months behind in agentic capabilities, likely extending to 12+ months.
  • Google's Gemini lacks a clear competitor to Claude Code and Codex.
In-site article

Sundar Pichai on AI, the future of search, and what’s happening to the web

In a Decoder interview after Google I/O, CEO Sundar Pichai discusses Google's AI-first pivot, the restructuring of DeepMind, the controversial AI Overviews in Search, the 'Google Zero' phenomenon, and his thoughts on AGI.

  • Google merged Brain and DeepMind into Google DeepMind and centralized AI infrastructure.
  • Search is evolving with AI Overviews and the Gemini Spark agent platform.
In-site article

Google Cloud COO says AI security belongs in the boardroom, not just the server room

Google Cloud COO Francis de Souza urges companies to integrate security into their AI strategy from day one, emphasizing that AI security is a boardroom issue, not just a technical one.

  • Google Cloud COO calls for security to be built into AI strategy from the start
  • AI security needs attention and resources at the board level
In-site article

The Sequence Knowledge #866: Three Text Diffusion Models You Need To Know About

Text diffusion models challenge the autoregressive paradigm by generating text through iterative denoising, treating generation as editing rather than typing. Three key systems define the field: LLaDA (proof of scaling), Mercury (commercial speed advantage), and Gemini Diffusion (frontier validation), representing the three phases of a new architecture class: scientific proof, industrial deployment, and frontier validation.

  • Text diffusion models generate text by iterative refinement from noise, using bidirectional context.
  • LLaDA proved diffusion can scale to a large language model.
In-site article

AI Claims 9 Erdős Problems: Google DeepMind’s AlphaProof Nexus Solves Decades-Old Math Puzzles

Google DeepMind's AlphaProof Nexus, powered by Gemini 3.1 Pro and the Lean theorem prover, has cracked 9 open problems from the Erdős list, including one unsolved for 56 years. It also proved 44 OEIS conjectures, solved a 15-year-old algebraic geometry problem, and improved a convex optimization bound — all at a cost of a few hundred dollars per problem.

  • AlphaProof Nexus solved 9 Erdős problems, 44 OEIS conjectures, and a 15-year-old algebraic geometry problem.
  • The system uses a loop of LLM (Gemini 3.1 Pro) and Lean compiler feedback, with four increasingly sophisticated agent variants.
In-site article

LWiAI Podcast #246 - Gemini 3.5 + Omni, Musk Loses, OpenAI vs Erdős

Google unveils Gemini 3.5 and Gemini Spark agent, plus Gemini Omni multimodal video generation; Elon Musk loses OpenAI lawsuit on statute of limitations; Anthropic agrees to $30B funding at $900B valuation; AI solves 80-year-old Erdős geometry problem.

  • Google launches Gemini 3.5 and always-on agent Gemini Spark with MCP tool support.
  • Gemini Omni converts images, audio, and text into video.
In-site article

ContextVault – Local-First AI Conversation Recorder for ChatGPT, Claude, Gemini

ContextVault is a browser extension that captures AI conversations in real-time across major LLM platforms like ChatGPT, Claude, and Gemini, storing them locally in IndexedDB. It allows one-click export as Markdown or ZIP, ensuring your data never leaves your device. Free, open source, no accounts or backend required.

  • Real-time capture across 7 LLM platforms including ChatGPT, Claude, and Gemini.
  • All data stored locally in IndexedDB, no cloud sync or third-party access.
In-site article

Google Deepmind's AlphaProof Nexus solves decades-old math problems for a few hundred dollars

Google Deepmind's AlphaProof Nexus has autonomously solved nine open Erdős problems, including two that stumped mathematicians for 56 years, for just a few hundred dollars per problem in inference costs. Unlike OpenAI's natural-language approach, the system uses the Lean compiler to verify every proof step automatically. Still, the overall success rate sits at just 2.5 percent.

  • AlphaProof Nexus autonomously solved nine open Erdős problems, including two that had remained unsolved for 56 years.
  • Each problem cost only a few hundred dollars in inference costs.
In-site article

Show HN: HTML Deployer – AI Code to Website Publisher

HTML Deployer is a Chrome extension that extracts AI-generated HTML from ChatGPT, Claude, and Gemini, allowing users to preview, download ZIP, or publish directly to Netlify, GitHub, FTP, or self-hosted servers. It's designed for developers, founders, marketers, agencies, and beginners.

  • Extract HTML from ChatGPT, Claude, and Gemini.
  • Preview, export ZIP, or publish directly to cloud, FTP, or self-hosted.
In-site article

I saw the future of Android Auto, and now Google has me dreading my own car

Google's upcoming Android Auto update introduces a redesigned interface with Material 3 Expressive, custom widgets, immersive navigation, and deeper Gemini integration. The author's demo left him impressed and anticipating the update later this year.

  • New Android Auto interface features Material 3 Expressive design with three-panel layout and custom widgets.
  • Google Maps gets immersive navigation with detailed 3D buildings and terrain.
In-site article

Google Antigravity 2.0: The Full Developer Guide (I/O 2026)

Google didn’t just ship an update at I/O 2026. They redrew the map. Google Antigravity 2.0 is a full platform pivot from AI-assisted coding to multi-agent orchestration as the core development model.

  • Antigravity 2.0 is a completely rebuilt platform centered on multi-agent orchestration, not just an IDE refresh.
  • New features include a standalone desktop app, a Go-based CLI, an SDK, and managed agents via the Gemini API.
In-site article

AI models often give the right answers but point to the wrong sources

Leading AI models like GPT and Gemini routinely cite text passages in document analyses that don't actually support their answers. Even when the answer is right, the cited evidence is often wrong. Researchers at Peking University call this "attribution hallucination," a risk for regulated fields like law and medicine. Their new CiteVQA benchmark is the first to test for it systematically.

  • AI models often cite irrelevant text passages to support answers
  • Even accurate answers can be backed by wrong evidence ('attribution hallucination')
In-site article

Can AI Guess What You Know? Performance Comparison of Large Language Models for Human Domain Knowledge Estimation From Communication Logs

This study evaluates seven LLMs (including Gemini, Claude, and GPT families) on inferring individual domain knowledge from long-term Slack logs. Using 27,188 messages from 43 users, zero-shot estimates were compared with self-reported skill ratings from 27 participants. Gemini 2.5 Flash achieved the lowest error (MAE 21.13%), while GPT models showed larger discrepancies. Accuracy depends weakly on message volume, highlighting limits and the need for privacy-aware deployments and richer knowledge representations.

  • Employees often struggle to identify expertise, causing productivity loss
  • Gemini 2.5 Flash achieved lowest MAE of 21.13% in zero-shot inference
In-site article

Show HN: Live AI music sequencing agent

Pretzel is an experimental live AI music agent that lets everyone chat with the same AI and hear synchronized music in real time. Built during Google IO hackathon, it uses a Rust agent harness called Talon for easy self-hosting.

  • Pretzel is a web-synchronized music sequencer controlled by an AI agent.
  • All users interact with the same AI agent and hear the same music simultaneously.
In-site article

Deepmind's Hassabis sees humanity "in the foothills of the singularity" while LeCun says current AI isn't intelligent

Yann LeCun says current AI systems aren't genuinely intelligent. Demis Hassabis thinks humanity is already "standing in the foothills of the singularity." And Gemini co-lead Oriol Vinyals splits the difference: today's models would've looked like AGI seven years ago, but they still can't learn from experience or produce real breakthroughs.

  • Yann LeCun asserts current AI systems lack genuine intelligence.
  • Demis Hassabis believes humanity is at the early stage of the singularity.
In-site article

The Sequence Radar #865: Last Week in AI: Karpathy, Google, Colossus, and the Coming IPO Wave

The last three weeks marked a phase transition in AI: Google unveiled Gemini Omni and an agent-first platform; Andrej Karpathy joined Anthropic to accelerate pretraining; Anthropic secured a $45B compute lease from xAI's Colossus; Cerebras IPO surged to a ~$95B market cap; and SpaceX, OpenAI, and Anthropic are planning to go public within six months, collectively worth trillions. Research highlights include HRM-Text efficient pretraining, AI reviewer evaluation, NVIDIA's unified AR-diffusion model, and more.

  • Google I/O introduced Gemini Omni, Gemini 3.5 Flash, Antigravity agent platform, and TPU 8i for a vertically integrated agent pipeline.
  • Andrej Karpathy joined Anthropic to lead a team using Claude to accelerate pretraining, signaling a practical self-improvement flywheel.
In-site article

Why you shouldn't leave model selection on default in Copilot, Gemini and other AI tools

When analyzing data, Microsoft Copilot invents country differences where none exist. Mathematician Adam Kucharski fed the tool identical datasets with different country labels, and Copilot delivered detailed stereotypes instead of accurate results. Thinking models catch the trick, but only if users know when to reach for them.

  • Microsoft Copilot invented stereotypes when given identical datasets with different country labels.
  • Thinking models can catch the trick but require user awareness.
In-site article

OpenAI and Nvidia Are Using Google's SynthID to Watermark AI Content

Google's SynthID watermarking system for AI content is being adopted by OpenAI, Nvidia, ElevenLabs, and Kakao, marking a shift toward a shared industry standard for detection of AI-generated media.

  • SynthID embeds watermarks directly into pixels and audio waveforms, making them harder to remove than metadata.
  • OpenAI, Nvidia, ElevenLabs, and Kakao are now using SynthID for their image, video, and voice generation tools.
In-site article

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Researchers from UMD, Google, Meta, and other institutions use AutoTTS to let a coding agent independently discover control algorithms for AI reasoning. The algorithm it found cuts compute by about 70 percent compared to standard self-consistency while matching its accuracy. The whole search cost $40 and took 160 minutes.

  • AutoTTS uses an offline simulation environment to let a coding agent autonomously explore test-time scaling algorithms without human-written rules.
  • The discovered algorithm achieves higher accuracy per compute on math benchmarks than established methods like self-consistency.
In-site article

Google CEO Admits Coding Lag, Discusses AI Strategy and Public Concerns

In a New York Times podcast, Sundar Pichai acknowledged that Google's Gemini is behind in coding, especially for complex long-horizon tasks. He discussed the biggest search redesign in 25 years, public anxiety about AI, and the path to AGI, emphasizing that Google is investing heavily but must move thoughtfully.

  • Pichai admitted Gemini lags in coding agents and instruction following.
  • Google is rolling out the largest search overhaul in 25 years but will not abruptly switch to AI Mode.
In-site article

Google’s new anything-to-anything AI model is wild

Google has released the Omni family of generative models, which can take any input (photo, video, text) and produce any output. The author deepfakes a stuffed deer and themselves to test video generation, finding improved quality and consistency over Veo but still with AI glitches and high credit costs. Deepfake videos can be convincing enough to fool close observers, raising concerns about misuse.

  • Google's Omni models aim to transform any input into any output, initially focusing on video.
  • Omni Flash offers better character consistency than Veo but still produces artifacts and inconsistencies.
In-site article

Google CEO Pichai now calls links a "part" of search, redefining the web's role in its own product

Google CEO Sundar Pichai now calls links and sources a "part" of search, when in reality, they're its foundation. The wording is deliberate: new features keep users inside the Google ecosystem, and the company is shifting from traffic distributor to AI publisher whose source selection is becoming a question of editorial power.

  • Google CEO redefines links as a 'part' of search, downplaying their foundational role.
  • New features aim to keep users within Google's ecosystem.
In-site article

Strengthening Singapore's AI Future: A New National Partnership

Google DeepMind announces a new national partnership with Singapore to apply frontier AI in healthcare, education, sustainability, and more, aiming to responsibly deploy AI for economic growth and public benefit, with an estimated S$3.3 billion in economic value by 2040.

  • Partnership with Singapore Government and multiple organizations to advance AI in public sector, business, and workforce. Focus on healthcare, scientific discovery, and education.
  • Initiatives include AI co-clinician research, pandemic preparedness, a running assistant for blind athletes, and Gemini for Education in schools.
In-site article

[AINews] All Model Labs are now Agent Labs

Ahead of OpenAI's likely IPO filing, Greg Brockman signals a shift from pure models to agent products. DeepSeek makes 75% price cut permanent, MCP protocol becomes stateless, Google launches 24/7 AI agent, and Anthropic finds over 10,000 critical vulnerabilities. Agentification is the new normal.

  • Greg Brockman says model alone is no longer the product; harness+agent+workflow is key
  • DeepSeek V4 Pro permanently discounted 75%, slashing inference costs
In-site article

Did Google’s AI agents really build an operating system for $916?

Google claimed its AI agents built an entire operating system with a single prompt and about $900 in API costs, but this analysis highlights multiple issues: the prompt was actually thousands of lines long, the scaffold may be overfitted, and critical details like code, logs, and methodology are missing. The article underscores the need for independent evaluation and proposes norms for 'open-world evaluations'.

  • Google claims AI agents built an OS for $916, but the single prompt was actually thousands of lines
  • Unresolved issues include potential overfitting, code copying, and lack of transparency
In-site article

Catch up on the Dialogues stage at Google I/O 2026.

A recap of the 2026 I/O Dialogues, where leaders discuss the future of AI, quantum computing, robotics and creativity.

  • Google CEO Sundar Pichai sat down with Matt Berman to unpack I/O announcements.
  • Teams discussed AI agents, quantum-AI intersection, scientific problem-solving, and robotics.
In-site article

Elon, stop trying to make Grok happen

A Reuters report reveals that Elon Musk's AI chatbot Grok is underperforming and rarely used by the US government, appearing in only three out of 400+ AI vendor mentions. Despite Musk's grand claims, Grok lags behind rivals like OpenAI, Google, and Anthropic in quality and adoption, raising questions about its role in SpaceX's massive IPO valuation.

  • Grok appeared in only 3 of 400+ US government AI use cases, mostly for basic tasks.
  • Government sources and public rankings indicate Grok is inferior to competitors.
In-site article

Company Directory