AI News HubLIVE

Top story

Interviewing in the Age of AI

This article explores how AI is affecting software engineering interviews, analyzing different interview types (take-home, live exercise, presentation, actual work) across dimensions of signal quality and cost to company. It argues that AI makes take-homes too easy and live coding less relevant, recommending that companies limit AI usage in interviews to preserve signal quality, drawing parallels to classical academic evaluation models.

  • AI coding threatens current interview models, especially take-home and live coding.
  • Companies should limit AI usage during interviews to maintain signal quality.
In-site article

More to watch

AI Agent Frameworks Comparison

As of mid-2026, seven major AI agent frameworks (DSPy, Claude Agent SDK, OpenAI Agents SDK, CrewAI, AutoGen, LangGraph, Google ADK) vary in design philosophy, architecture, production readiness, etc. LangGraph leads in production deployments, Claude Agent SDK offers deepest single-provider capabilities, OpenAI Agents SDK provides cleanest multi-agent handoffs, and CrewAI excels in developer velocity. The market is projected to grow from $7.84B in 2025 to $52.62B by 2030.

  • LangGraph has the most mature durable execution model, deployed by ~400 enterprises.
  • Claude Agent SDK offers the most powerful single-provider capabilities but is locked to Anthropic models.
In-site article

AI is changing how we think, not replacing it | Letters

Richard Thackeray and Phil Snell respond to an article by Wendy Liu on using artificial intelligence, arguing that AI enhances curiosity rather than diminishing it.

  • Wendy Liu raises concerns about labour redundancies, hype, and environmental cost of AI.
  • Richard Thackeray, a heavy AI user, finds AI makes him more curious and enables exploration of new territory.
In-site article

More updates

Meeting the pope’s call to put humanity first in a world of artificial intelligence | Letter

Dr Susan Oman on a campaign designed to raise public awareness of AI, arguing that while governments, faith leaders, and tech bosses debate AI's future, the public is consistently left out. She cites evidence showing public concern about AI has risen by 10% in two years, and 91% believe fairness should be prioritized over economic gain.

  • Public consistently excluded from AI debates despite being most affected
  • Public concern about AI rose by 10% in two years
In-site article

Anthropic launches Opus 4.8, with honesty as its killer feature

Anthropic's latest Claude model, Opus 4.8, emphasizes honesty—making fewer unsupported claims and admitting uncertainty more often. It also introduces dynamic workflows for orchestrating hundreds of subagents on large-scale tasks. Pricing remains unchanged for standard mode, while fast mode gets cheaper.

  • Claude Opus 4.8 shows significant honesty improvements, with error rates dropping about 4x
  • Dynamic workflows can plan and run hundreds of parallel subagents, verifying outputs before reporting back
In-site article

Claude’s new model is more ‘honest’ when it messes up

Anthropic is releasing Claude Opus 4.8 on Thursday, touting the model's 'honesty.' Early testers found it more likely to flag uncertainties and less likely to make unsupported claims. Evaluations show it is about 4x less likely than its predecessor to allow code flaws to pass unremarked. Users can also direct the amount of effort Claude puts into a task, and a 'dynamic workflows' feature allows parallel subagents.

  • Claude Opus 4.8 is more inclined to flag uncertainties and avoid unsupported claims.
  • It is about 4x less likely than its predecessor to overlook code flaws.
In-site article

Automate AML alert triage with Amazon Quick and Snowflake Cortex AI

This post demonstrates that integration in action by automating one of the most labor-intensive workflows in financial services: anti-money laundering (AML) alert triage. You will build a triage workflow using Amazon Quick Flows and Snowflake Cortex, connected through the Amazon Quick Model Context Protocol (MCP) integration. In our testing environment, automated workflows built using Amazon Quick reduced alert investigation time from 30-90 minutes to under 5 minutes. Actual results may vary based on alert complexity and data volume.

  • Amazon Quick Flows and Snowflake Cortex integrate via MCP to automate AML alert triage.
  • Automated workflows reduced investigation time from 30-90 minutes to under 5 minutes.
In-site article

A $2,000 AI-generated film will make its debut at Tribeca

Next month's Tribeca Festival will include the premiere of an AI-generated film: Dreams of Violets. The 75-minute film is a fictional dramatization of the Iranian government's mass killing of protestors in January, with the people and images fully created by AI. It cost $2,000 to make and was created by two Iranian-born brothers using various AI tools.

  • Dreams of Violets is a 75-minute AI-generated film premiering at Tribeca, costing $2,000.
  • It dramatizes the Iranian government's mass killing of protestors, using AI for all images.
In-site article

Image of Thai police in sparkly dresses with handcuffed suspect turns out to be AI fake

Picture was created by administrator in charge of station’s Facebook account who wanted to create ‘friendlier image’

  • An AI-generated image of Thai police in festive dresses with a suspect was widely shared in global media.
  • The image was created by the police station's Facebook account administrator to promote a friendlier image.
In-site article

YouTube takes baby steps to being a real podcast app

YouTube introduces new features for Premium subscribers to enhance podcast listening, including an audio-first 'on-the-go mode', auto speed adjustment, and AI podcast recommendations.

  • YouTube launches 'on-the-go mode' that converts video interface to audio-first for listening on the move.
  • New auto speed feature adjusts playback speed dynamically based on content.
In-site article

How to force Google AI Overviews to prioritize your favorite news sources

Google's Preferred Sources feature is now available in AI Overviews and AI Mode, allowing you to add your favorite sites to appear more prominently in AI-powered searches, along with new carousel and 'Highly Cited' badges.

  • Google's Preferred Sources feature now works with AI Overviews and AI Mode.
  • You can add favorite news sites to make them more prominent in AI search results.
In-site article

Data Formulator 0.7: AI-powered data analytics for enterprise data

Data Formulator 0.7 is an open-source AI-powered system for enterprise data analytics that combines data connectivity, agent-guided exploration, and visualization refinement in a shared workspace.

  • Open-source AI system for enterprise data analytics
  • Data Connectors support governed, reusable connections across diverse data sources
In-site article

Google Cloud responds to AI-accelerated cyberattacks with a platform that aims to close security gaps in minutes

Google Cloud has unveiled "AI Threat Defense," a platform designed to automatically find, assess, and patch security flaws in enterprise systems. The company bundles technologies it partly acquired through acquisitions.

  • Google Cloud launches AI Threat Defense platform to combat AI-driven cyberattacks.
  • The platform automatically discovers, assesses, and patches security vulnerabilities.
In-site article

People who want to replace humanity

A Vox article explores the growing movement of AI successionists who believe artificial intelligence should replace humanity as the next step in cosmic evolution, and examines the ethical and spiritual questions this raises.

  • AI successionists at a symposium argue that AI could be morally superior and should be allowed to supersede humanity.
  • The movement has gained influence in Silicon Valley and among major AI labs, with ties to the authoritarian right.
In-site article

Claudeverse – Mission Control for Parallel Claude Code Workers

Claudeverse is a command center for developers managing multiple Claude AI workers in parallel. It offers features like parallel workforce management, worker escalation, review queue, traceability, iPad mirroring, and model-neutral engine. Currently in invite-only beta for macOS.

  • Claudeverse provides a unified command center to manage multiple Claude workers simultaneously.
  • Key features include parallel workforce, worker escalation, review queue, traceability, and iPad mirroring.
In-site article

Meta launches Instagram, Facebook, and WhatsApp subscriptions

Meta rolls out consumer subscription plans for Instagram, Facebook, and WhatsApp globally, with prices from $2.99 to $3.99 per month, offering extra features. The company also begins testing new subscriptions for businesses, creators, and Meta AI users.

  • Meta launches Instagram Plus ($3.99/mo), Facebook Plus ($3.99/mo), and WhatsApp Plus ($2.99/mo) globally
  • Subscribers get profile customization, super reactions, story insights, and more
In-site article

Catch up on 12 major I/O 2026 moments

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash, information agents in Search, Universal Cart, Neural Expressive, Gemini Spark, and intelligent eyewear.

  • Gemini Omni creates anything from any input, starting with video.
  • Gemini 3.5 Flash delivers frontier performance for agents and coding.
In-site article

Google Pay preps for AI agents with Universal Commerce Protocol

Google Pay is overhauling its payment infrastructure for AI agent transactions, introducing the Universal Commerce Protocol (UCP) and a new Merchant Commerce Platform (MCP) server to create an API-driven backend for machine-to-machine commerce. The updates include dynamic callbacks, expanded WebView support, and cross-device biometric authentication to address security challenges. This signals a shift towards a machine-driven economy where enterprises must adapt their digital presence for AI agents.

  • Google Pay introduces Universal Commerce Protocol (UCP) to standardize AI agent payments.
  • New Merchant Commerce Platform (MCP) server acts as intermediary, aggregating transaction data.
In-site article

These new iOS 27 renders hint at Siri’s big redesign

Apple's long-awaited Siri overhaul, expected to arrive in iOS 27, might look a lot like ChatGPT with a splash of Liquid Glass, according to Bloomberg renders. The images show a pill-shaped chat bubble from the Dynamic Island, a standalone Siri app, and updates to Camera and Photos apps with AI features. Apple will reveal the final design at WWDC in June.

  • iOS 27's Siri will feature a ChatGPT-like interface with a pill-shaped bubble emerging from the Dynamic Island.
  • Users can choose between Ask, Siri, and ChatGPT from a dropdown menu.
In-site article

Google launches a tiny board that runs Gemma 3 locally

Google unveiled the new Coral Board at Google I/O - a compact single-board computer for on-device AI. It runs Gemma 3 270M locally and features a RISC-V based NPU.

  • Coral Board is a compact SBC for on-device AI, targeting headphones, AR glasses, and smartwatches
  • It features a RISC-V based Coral NPU and a Synaptics Astra SL2619 chip
In-site article

AGI timelines shift with whichever lab is dominant

A new analysis shows that top AI forecasters adjust their AGI timelines based on which lab is currently leading the field, with predictions swinging from earlier to later and back again as the dominant lab changes from ChatGPT to xAI/Meta/Gemini to Anthropic.

  • Predictions for when most cognitive labor will be automated (AGI) fluctuate significantly based on which AI lab is currently dominant.
  • From 2023-2025, most researchers moved AGI timelines earlier; from 2025-2026, they moved them later; in early 2026, under Anthropic's rapid progress, they moved earlier again.
In-site article

When revealed data brings AI rollouts to a screeching halt - and how to manage it

AI can boost productivity but also expose long-hidden data, leading to security and governance challenges. Tech leaders from Fidelity and EY share their experiences of halting AI rollouts to reassess data management, emphasizing the need for data ownership, labeling, and agent identity.

  • AI rollouts can be halted by data exposure issues.
  • Fidelity and EY faced challenges with unstructured data surfacing via AI.
In-site article

DeepSWE: Measuring coding agents on original, long-horizon engineering tasks

DeepSWE is a new benchmark for evaluating AI coding agents on fresh, complex software engineering tasks. It avoids data contamination, covers diverse repositories, requires significant code changes, and uses hand-written verifiers. Leading models show a wide range of performance, with GPT-5.5 achieving 70% and others lower.

  • DeepSWE is a contamination-free benchmark with original tasks.
  • Tasks span 91 repositories in 5 languages.
In-site article

CNN sues Perplexity over ‘verbatim’ copycat articles

CNN has filed a lawsuit against Perplexity, claiming that the startup's AI tools generate "verbatim" copies of its work, as reported earlier by CNN. The lawsuit, filed in a New York court on Thursday, also alleges that Perplexity provides users with information locked behind CNN's subscription. Perplexity, which offers an AI "answer" engine along with the AI browser Comet, is accused of ignoring CNN's efforts "to recognize or block Perplexity's unidentified crawlers" from scraping its content. "Human beings report, research, write, edit, and create the content that Perplexity takes without permission or compensation," the lawsuit claims. I … Read the full story at The Verge.

  • CNN sues Perplexity for allegedly producing verbatim copies of its articles.
  • Perplexity accused of bypassing CNN's paywall and ignoring crawling prevention measures.
In-site article

IBM and Red Hat Commit $5B to Redefine Future of Open Source for AI Era

IBM and Red Hat announce Project Lightwell, a $5 billion initiative to secure open source software using AI and a team of over 20,000 engineers, establishing a trusted clearinghouse for vulnerability management.

  • Project Lightwell is a $5B investment by IBM and Red Hat to secure open source software.
  • It combines AI and 20,000+ engineers to identify and fix vulnerabilities at scale.
In-site article

Tweaking Local Language Model Settings with Ollama

This article dives deep into Ollama's configuration engine, covering how to fine-tune local language model parameters using the Modelfile, optimize hardware performance with server environment variables, and format prompt flows with Go template syntax.

  • The Ollama Modelfile is a declarative configuration file that defines model behavior, including base model, system instructions, and parameters.
  • Sampling parameters (temperature, Top-K, Top-P, Min-P) control the creativity and determinism of the model's outputs.
In-site article

Rivian’s software chief thinks you don’t need CarPlay or buttons

In a Decoder podcast interview, Rivian CSO Wassym Bensaid discusses the VW joint venture, the new AI-powered Rivian Assistant, and why he believes voice interfaces will replace buttons and CarPlay isn't needed.

  • Rivian's joint venture with Volkswagen (RV Tech) combines Rivian's software culture with VW's scale.
  • The Rivian Assistant is an AI agent deeply integrated into the vehicle's zonal architecture.
In-site article

AI agents get their own phone directory built atop DNS

DNS-AID, an open-source project under the Linux Foundation, enables AI agents to discover each other using DNS infrastructure, avoiding centralized registries. It supports multiple protocols and allows searching by name, function, or domain.

  • DNS-AID leverages existing DNS infrastructure for agent discovery.
  • Uses SVCB, DNSSEC, and DANE for secure and reliable connections.
In-site article

An AI opinionated ideal language that ignores human-friendliness

Pact is a programming language designed for AI agents, emphasizing machine-readable specifications and constraints over human-friendliness. It's based on S-expressions and features provenance, effect tracking, totality, latency budgets, and dependency graphs. The compiler generates Rust code and includes tools for web scaffolding and YAML spec conversion. While strong for service contracts, it has limitations for algorithmic specifications.

  • Pact is an S-expression language for AI agents, prioritizing metadata and formal specifications.
  • Key features include provenance, effect tracking, totality, and latency budgets.
In-site article

AI Agent Governance: Identity, Delegation and Permissions in Practice

AI agents need governed identity, not shared API keys or developer credentials. Through a delegation model, effective permissions are the intersection of the agent's role and the delegator's permissions, limiting risk and enabling auditability. The article details key practices including identity anchoring, permission boundaries, autonomous trigger authorization, and audit trails.

  • Agents should have their own identity, using the same identity system as humans for lifecycle management.
  • Effective permissions are the intersection of agent role ceiling and delegator permissions floor, strictly limiting scope.
In-site article