AI News HubLIVE

Live updates

Building AI agents for business support using Amazon Bedrock AgentCore

In this post, we share how the AWS Generative AI Innovation Center (GenAIIC) collaborated with Works Human Intelligence (WHI) to build two AI agents using Amazon Bedrock AgentCore. We discuss the challenges encountered and the solutions that reduced costs by up to 97% while improving operational efficiency.

  • AI agents automate routine HR tasks such as commuting allowance approval and browser operations.
  • Migration to AgentCore and Strand Agents architecture reduced costs by up to 97%.
In-site article

From data overload to actionable insights: How Verizon Connect scaled agentic AI to 100,000 users

Verizon Connect built an agentic AI solution on AWS to transform overwhelming fleet data into clear, actionable insights for 100,000 users daily. The architecture uses serverless anomaly detection, Strands Agents for dynamic reasoning, and Amazon Nova Lite to cut input token costs by 70%. This post covers architectural decisions, implementation challenges, and measurable results.

  • Agentic AI processes 500 million daily data points from 1.2 million vehicles to serve 100,000 users.
  • Serverless statistical models handle anomaly detection, avoiding LLM pitfalls with raw tabular data.
In-site article

How AWS SMGS uses an AI-powered conversational assistant to transform business management with Amazon Bedrock AgentCore

AWS SMGS built NarrateAI using Amazon Bedrock AgentCore to deliver business intelligence at scale. The solution features a two-layer architecture separating batch narrative generation from real-time interaction, specialized AI agents for routing and validation, and key engineering patterns for production deployment, enabling natural language queries, row-level security, and role-tailored experiences.

  • NarrateAI uses a two-layer architecture (batch processing + real-time interaction) to overcome latency and data fragmentation in traditional BI.
  • Amazon Bedrock AgentCore enables multi-agent orchestration for natural language queries and context-aware responses.
In-site article

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ranks third on Arena's text-to-image leaderboard, on par with Google's Nano Banana 2 but still behind OpenAI's Image-2. The model shows clear gains over its predecessor, especially in rendering text inside images and commercial visuals.

  • MAI-Image-2.5 ranks third on Arena leaderboard, tied with Google's Nano Banana 2
  • Improvements in text rendering and commercial visuals
In-site article

This AI-free Google alternative is surging in popularity - how to try it for yourself

DuckDuckGo, an AI-free search alternative, is seeing a surge in users due to Google's AI Overviews. This article explains how to use DuckDuckGo without AI for private searching and browsing.

  • DuckDuckGo installs surged after Google I/O 2026, with iOS app peaking at 69.9% growth.
  • DuckDuckGo offers both AI-free search and AI chat options, giving users choice.
In-site article

Powering agentic AI sales strategy with Amazon Bedrock AgentCore

AWS Sales built Field Advisor on Amazon Bedrock AgentCore to orchestrate over 20 domain-specific agents, reducing cognitive load for sales reps and improving efficiency. The solution saved up to 2 hours per week per rep and reduced latency by 41%.

  • Field Advisor orchestrates 20+ specialized agents with a single conversational interface.
  • Human-in-the-loop workflows ensure data accuracy and accountability.
In-site article

Robinhood lets AI agents trade shares and make credit card purchases for customers

Robinhood now lets customers connect AI agents like Anthropic's Claude to a separate investment account via MCP. The agents can autonomously trade stocks and make credit card purchases. US regulator FINRA has flagged such agents as a new risk area, warning about unchecked decisions. Robinhood also admits the product isn't for everyone.

  • Robinhood enables AI agents such as Claude to be connected to investment accounts via MCP.
  • AI agents can autonomously trade stocks and initiate credit card purchases.
In-site article

“Tokenmaxxing is real, expensive & it’s spreading”: New tools emerge to stop AI budgets from exploding

Tokenmaxxing, the unrestrained use of AI tokens, is causing enterprise budget blowouts. Uber’s CTO recently admitted to overspending on Anthropic’s Claude Code. Lanai’s new Token Tuner helps companies map token consumption to workflows and outcomes, encouraging a shift from tokenmaxxing to outcomemaxxing.

  • Tokenmaxxing is causing AI budget overruns at Uber and other companies.
  • Lanai's Token Tuner tracks token usage against workflows and outcomes, providing efficiency scores and model recommendations.
In-site article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

Artificial Analysis and IBM launch ITBench-AA, a benchmark for agentic enterprise IT tasks focusing on Site Reliability Engineering. Frontier models score below 50%, with Claude Opus 4.7 leading at 47%. The benchmark evaluates models on Kubernetes incident response, requiring diagnosis from logs and traces.

  • Claude Opus 4.7 leads at 47%, with GPT-5.5 at 46% and Qwen3.7 Max at 42%.
  • All frontier models score below 50%, making ITBench-AA one of the least saturated agentic benchmarks.
In-site article

NVIDIA Releases Polar, a Token-Faithful Rollout Framework for GRPO Training Across Codex, Claude Code, and Qwen Code

NVIDIA researchers have introduced Polar, a rollout framework that trains language agents using reinforcement learning without modifying their agent harnesses. Polar places a model API proxy between the harness and the inference server, capturing token-level interactions and reconstructing trainer-ready trajectories. Using GRPO on a Qwen3.5-4B base model, Polar improves SWE-Bench Verified pass@1 by 22.6 points under the Codex harness, 4.8 points under Claude Code, and 6.2 points under Pi. The framework is registered as a NeMo Gym environment and released under the ProRL Agent Server repository.

  • Polar enables RL training on any agent harness via a model API proxy without modifying the harness code
  • Achieves up to 22.6 point improvement on SWE-Bench Verified using GRPO on Qwen3.5-4B across four coding harnesses
In-site article

I found an easy way to automatically keep AI out of my search results - and it works in nearly every browser

Tired of AI results in your search? This article explains how to add a custom search engine to exclude AI results, with step-by-step instructions for Chrome, Firefox, Safari, and other browsers.

  • Add a custom search engine with the URL https://www.google.com/search?q=%s&udm=14 to remove AI results.
  • Works in Firefox, Chrome, and most browsers; Safari requires a free extension.
In-site article

YouTube will try to automatically flag AI videos starting this month

YouTube is tightening its AI labeling rules. Labels for photorealistic or heavily AI-altered content will now show up in more visible spots, below the player for long videos and as an overlay on Shorts. Starting May 2026, an automatic detection system will flag AI-generated content even if creators don't disclose it. Recommendations and monetization won't be affected.

  • YouTube tightens AI labeling with more visible labels for altered content.
  • From May 2026, automatic detection will flag AI content even if not disclosed by creators.
In-site article

Get a Good Return on Your AI Investments

O'Reilly's Infrastructure & Ops superstream explored the infrastructure needs, costs, and security challenges of AI workloads. DORA's report shows AI increases code delivery by about 10% but reduces stability, adding verification costs. Experts emphasize platform engineering, governance, and cognitive debt, recommending investment in internal platforms to ensure production readiness for AI applications.

  • AI tools boost individual productivity but team delivery stability decreases, with verification costs ('verification tax') needing consideration.
  • Good processes are amplified by AI, bad ones too; organizations should proactively improve processes rather than just expect technology to fix them.
In-site article

I think Anthropic and OpenAI have found product-market fit

The article argues that Anthropic and OpenAI have achieved product-market fit by shifting enterprise customers to API-based pricing and capitalizing on coding agent products. This inflection point, which began with model improvements in November 2025, accelerated in April 2026 with new model releases and pricing changes.

  • Both Anthropic and OpenAI have moved enterprise plans to API token pricing, with coding agents like Claude Code and Codex driving significant usage and revenue.
  • April 2026 saw new frontier models with higher API prices and enterprise customers locked into those rates via contract renewals.
In-site article

AI Factories: The New Infrastructure of Intelligence

AI factories are a new class of infrastructure that convert energy into tokens—the unit of production for reasoning models, agents, and intelligent systems. As agentic AI scales, performance per watt and cost per token become the critical economics. This article explores how AI factories work, their full-stack optimization, and how NVIDIA's latest hardware drives efficiency.

  • AI factories convert energy into tokens, serving as the 'power plants' of the AI age.
  • Agentic AI creates deeper, more complex inference workloads requiring real-time orchestration.
In-site article

Extending Human Intelligence Through AI

Modern AI systems are powerful not because they replicate human intelligence, but because they extend structures already present in human cognition and language. This perspective explains AI's capabilities and limitations, and reframes AI safety as a system-level challenge requiring engineering and governance, not fear of rogue AI.

  • AI systems extend human intelligence by modeling sedimented structures of understanding in language, not by replicating human minds.
  • Hallucinations and the compositionality gap arise from AI's lack of lived engagement with the world that anchors meaning and truth.
In-site article

AI companies' feud accidentally boosts obscure politician

The battle between OpenAI and Anthropic over AI regulation has inadvertently elevated New York assemblyman Alex Bores, who wrote early AI legislation. Despite millions spent by a super PAC to attack him, Bores has gained name recognition and now leads in the primary race.

  • OpenAI and Anthropic are spending millions attacking each other in NY-12 primary, but the real winner is Alex Bores.
  • Bores wrote one of the first AI regulatory laws, making him a target.
In-site article

AI is an arms race, and the US wants $9 billion in Nvidia superchips to keep up

The government has secretly requested $9 billion for Nvidia GB10 superchips to help the CIA and NSA keep up with leading AI firms like Anthropic and OpenAI. The funding requires congressional approval, while $800 million has been repurposed for cloud compute. The article covers chip specs, costs, and the escalating AI hardware race.

  • The US government secretly requested $9 billion for Nvidia GB10 superchips to help the CIA and NSA keep pace with big AI players.
  • Each GB10 chip consumes only 140W but delivers 1 petaflop of FP4 performance, enabling fine-tuning of 70-billion-parameter models.
In-site article

How Lyft Built a Self-Serve AI Agent Platform with LangGraph and LangSmith

Lyft used LangGraph and LangSmith to build a self-serve AI agent platform for customer support, cutting agent development from months to weeks. The platform empowers non-technical domain experts to build agents via prompts and configuration, with a router-based multi-agent architecture and robust evaluation pipeline.

  • Lyft moved agent development closer to domain experts by letting ops teams, VoC leads, and product managers define agents through prompts and configuration.
  • A router-based multi-agent architecture with LangGraph routes rider and driver requests across specialized subagents with safety checks and state management.
In-site article

What the Pope Got Wrong

Pope Leo XIV's AI encyclical Magnifica Humanitas correctly identifies issues like algorithmic bias, water use, and data sovereignty, but fails to address AGI and catastrophic risks, offers no concrete solutions to mass unemployment, and is criticized as outdated and disappointing.

  • Pope Leo XIV's AI encyclical Magnifica Humanitas is criticized as outdated and failing to address key issues of the AI era.
  • The encyclical mentions algorithmic bias, water use, and data sovereignty but lacks discussion of AGI and catastrophic risks.
In-site article

With Google’s debut, the most important AI agent feature is now the most boring one

Google, Anthropic, and AWS all launched managed AI agent runtimes within six weeks, signaling that agent infrastructure has become table stakes. The real differentiator is shifting to data location, cost, and portability.

  • Google, Anthropic, and AWS shipped nearly identical managed agent runtimes within six weeks.
  • The managed runtime is no longer a competitive differentiator; it's a baseline expectation.
In-site article

Nvidia Signals $150B Spend in Taiwan

Speaking at a launch event for Nvidia’s upcoming Taiwan headquarters, CEO Jensen Huang deemed the country the “epicenter” of the AI revolution.

  • Nvidia CEO Jensen Huang calls Taiwan the epicenter of AI revolution
  • Nvidia plans to invest approximately $150 billion in Taiwan
In-site article

How the lakebase architecture stays resilient to cloud failures

As agent workloads strain cloud infrastructure, Databricks' lakebase architecture ensures reliability through stateless Postgres compute, zone-redundant storage, control plane separation, cell-based isolation, and rigorous chaos testing. With tens of millions of database starts daily, the design prioritizes resilience from the ground up.

  • Agents create databases 4x faster than humans, driving millions of daily database starts.
  • Stateless compute and zone-redundant storage enable instant failover without hot standbys.
In-site article

Why the future of AI is on-premises - business advice from Dell Tech World 2026

With rising costs, sovereignty requirements, and agent adoption, Dell's latest conference focused on how enterprises can transition AI workloads to a hybrid infrastructure.

  • Dell Tech World 2026 emphasized practical AI execution, particularly building on-premises AI capabilities.
  • Soaring cloud LLM costs drive enterprises to move AI workloads to on-premises compute.
In-site article

Robinhood will let your AI agent trade stocks and make (or lose) lots of money

Robinhood is opening its trading platform to AI agents. Users can create a separate account for an AI agent, fund it, and let the agent buy and sell stocks. The company promotes it as a way to automate investment decisions, but warns of significant risks, including total loss of investment. Additionally, Robinhood Gold Card users can link an AI agent to a virtual credit card for automated purchases.

  • Robinhood launches AI agent trading with dedicated accounts and funding.
  • Company warns of high risk, including potential total loss of investment.
In-site article

AI-Writing Scandals Are Getting Confusing

Steven Rosenbaum's book 'The Future of Truth' contains fake quotes, which he blames on AI. A wave of literary AI scandals this week, including a Nobel laureate and Commonwealth prize controversy, highlights the blurry line between acceptable and unacceptable AI use in writing.

  • Steven Rosenbaum blames ChatGPT for errors in his book but acknowledges he failed to verify AI-generated content.
  • Multiple scandals in one week: Nobel winner misunderstood, author accused of using AI for prize-winning story.
In-site article

Show HN: Mneme HQ – repo-native architectural rules for AI coding agents

Mneme HQ provides architectural governance for AI-assisted development by enforcing constraints before code generation, preventing architectural drift and reducing review overhead. It integrates directly into the AI coding agent workflow, blocking banned frameworks, cross-boundary calls, and superseded decisions before they reach the PR queue.

  • Enforces architectural rules before AI agents generate code, stopping violations at the source
  • Works with major AI coding assistants and agent frameworks
In-site article

Buffer API

One API to publish across every social platform.

  • Single API for multiple social platforms
  • Simplifies social media management
In-site article

Pandas GroupBy Explained With Examples

This tutorial covers Pandas GroupBy operations with a retail sales dataset, including basic aggregation, multiple aggregations, named aggregations, multi-column grouping, sorting, count vs size, transform, filter, apply, and date grouping.

  • GroupBy allows grouping rows by one or more categories for efficient aggregation.
  • Use agg() for multiple functions, named aggregations for clarity, and as_index=False for DataFrame output.
In-site article