AI News HubLIVE

Live updates

Identifying and Understanding Human Values in Text: A Tailorable LLM-based Architecture

This paper introduces an LLM-based architecture to detect and quantify the intensity of human values in text. The architecture comprises three coordinated modules that can adapt to various value theories, and experiments on the ValueEval dataset show good detection performance.

  • Proposes a modular LLM architecture for identifying human values in text, avoiding dependence on specific value theories or complex prompt engineering.
  • Three modules: generate structured value specifications, label texts using them, and assign graded support or resistance based on rhetorical and semantic evidence.
In-site article

Show HN: The Two Pillars – A conceptual framework for post-AI software work

A paper argues that with generative AI dissolving the human capacity to write correct code as the binding constraint, software work reorganizes around two pillars: Mixer Mode (humans operating multiple judgment axes continuously like a sound engineer) and Meta-Software (software that observes, validates, and governs other software). The two pillars are inseparable, drawing a parallel to the historical transition from artisanal to mass production.

  • The production of code is ceasing to be the dominant problem in software organizations due to generative AI.
  • Mixer Mode describes a new human role where practitioners continuously operate multiple judgment axes.
In-site article

Your Future job will be to keep AI on task

Noah Smith argues that as AI becomes more capable, humans will shift from technical work to ensuring AI alignment—keeping AI focused on human goals. He draws parallels to 'Office Space' and warns about the rise of AI-generated 'slop'.

  • Humans will be needed to maintain AI alignment, ensuring AI stays on task.
  • The author compares future human roles to the 'Lumbergh' manager from Office Space.
In-site article

Safescript – A Language for AI Era

Safescript is a programming language for AI agents that proves safety properties statically before execution, eliminating the need for sandboxes or VMs. It compiles to a static DAG, enabling full visibility into data flow and host calls, with zero overhead and zero cold starts.

  • Statically enforces security without runtime sandboxing.
  • Compiles to a static DAG that traces all data flows and hosts.
In-site article

AIPass – Persistent agent workspace with identity, memory, and email

AIPass is a CLI-native scaffold that adds persistent memory, identity, and coordination to AI agents. Agents share a filesystem, use JSON files for memory, require no cloud or extra API keys. The project includes 13 core agents for multi-agent collaboration, task dispatching, quality audits, and real-time monitoring.

  • AIPass provides a CLI-native framework for persistent memory, identity, and coordination of AI agents.
  • All agents share a local filesystem with JSON file storage, no cloud dependency.
In-site article

Language Modeling Materializes a World Model of Protein Biology [pdf]

This paper presents a world model of protein biology realized through language modeling, demonstrating how large-scale language models can understand and predict protein structure and function.

  • Language models can capture complex patterns in protein sequences
  • The model excels in protein structure prediction and function annotation
In-site article

Illinois Lawmakers Just Passed America's Strongest AI Safety Bill

Illinois passed SB 315, requiring independent auditors to verify AI lab safety commitments, now heading to Governor Pritzker who plans to sign it. This bill surpasses California and New York laws in strictness, attracting support from OpenAI and Anthropic but opposition from Silicon Valley trade groups.

  • SB 315 mandates independent auditing of AI safety practices.
  • It is the strongest state-level AI safety law in the U.S.
In-site article

AI Cheats [pdf]

A PDF report on AI cheating, but the content cannot be directly parsed.

  • Cannot extract text from PDF
  • Report likely from METR organization
In-site article

Sakana AI Proposes DiffusionBlocks: a Block-wise Training Framework That Converts Residual Networks into Independently Trainable Denoising Modules

Researchers from Sakana AI and the University of Tokyo propose DiffusionBlocks, which trains transformer-based networks one block at a time, reducing training memory by a factor of B (where B is the number of blocks) while maintaining performance across diverse architectures. The method interprets residual connections as Euler steps of reverse diffusion, enabling a principled local objective via score matching.

  • DiffusionBlocks partitions networks into B independently trainable blocks, reducing memory by B×.​
  • It leverages the connection between residual networks and diffusion models to provide a theoretically grounded local training objective.​
In-site article

I dug deeper into my Oura Ring data using this free app - here's what I found

Simple Wearable Report turns Oura data into a lab-style report. The free tool provides an option to upload to chatbots, allowing further AI analysis. Here's how I've been using it.

  • Simple Wearable Report transforms Oura Ring data into scannable reports for sharing with doctors or uploading to AI chatbots.
  • Compared to Oura's built-in AI advisor, third-party chatbots like Gemini provide more detailed, quantitative analysis.
In-site article

Robinhood Will Let Agents Trade -- It Could Be a Trend

Given that the stock trading app operates in a highly regulated industry, the company’s move to use agents could prompt other finance firms to take a bold step and do the same.

  • Robinhood will allow AI agents to trade on its platform
  • This move is groundbreaking in a highly regulated industry
In-site article

The Authorization Paradox: Who Has the Keys to Your AI? [video]

This article explores the authorization paradox in AI systems, questioning who truly holds control over AI. Presented as a video, it discusses security and privacy implications.

  • Authorization issues in AI are increasingly critical
  • Who holds the 'keys' to AI is a central question
In-site article

Apple Presents Latest Research at CVPR 2026

Apple is showcasing new research at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026 in Denver, June 3-7. The company is sponsoring the conference and presenting work on video generation, multimodal understanding, image compression, and more.

  • Apple will present multiple research papers at CVPR 2026, including STARFlow-V, AToken, and Velox.
  • Scheduled activities include keynote talks, invited talks, poster sessions, and booth presentations.
In-site article

OpenAI’s Frontier Governance Framework

Explore OpenAI’s Frontier Governance Framework and how our AI safety, security, and risk practices align with emerging EU and California regulations.

  • OpenAI released its Frontier Governance Framework for AI safety and alignment.
  • The framework aligns with upcoming EU and California regulations.
In-site article

Show HN: Liiists, a Markdown-first, iOS and CLI list app

Liiists is a markdown-first list app that works on terminal, iOS, and through AI agents via an MCP server, all reading and writing the same plain-text .md files. It offers a CLI, native iOS app with Share Extension and Siri, and an MCP server for AI integration. No account needed, no lock-in, and supports iCloud sync or any folder including Obsidian vault.

  • Works across terminal, iOS, and AI agents using the same markdown files
  • CLI written in Go with no dependencies
In-site article

sqlite AGENTS.md

SQLite has added an AGENTS.md file to clarify its policy on AI-generated contributions: it does not accept pull requests without prior agreement, and does not accept agentic code at all, though it welcomes bug reports with reproducible test cases. The forum has been flooded with AI-generated bugs, leading to a separate bug forum.

  • SQLite added AGENTS.md to define AI contribution policy
  • Pull requests require prior agreement and legal paperwork
In-site article

Building the Future of Accessible Tech: Inside Uvilox AI

Uvilox AI bridges the communication gap with real-time sign language interpretation, emergency response, and accessible calling — powered by next-generation vision AI. With sub-80ms latency, 97.4% accuracy, support for 200+ sign variants, and military-grade security, it is now open for beta access.

  • Real-time sign language recognition with <80ms latency and 97.4% accuracy.
  • Supports over 200 ASL and BSL signs, works in low-light conditions.
In-site article

NeuralAgent 2.5: Personal AI Assistant Now with Voice Mode, Watch & Learn, and Parallel Agents

NeuralAgent 2.5 introduces Voice Mode, Watch & Learn, and Parallel Agents, allowing the AI to listen, speak, and perform multiple tasks simultaneously. Users can control their entire computer via natural language without touching the keyboard or mouse. The update also improves workflows, @ mentions, and memory.

  • Voice Mode enables two-way conversation; users speak commands and the AI responds and executes tasks.
  • Watch & Learn lets users demonstrate a task once, and the AI saves it as a repeatable workflow.
In-site article

Fixing agent failures in production: Interrupt 2026 recap | LangChain Newsletter

Recapping two days of Interrupt 2026 — LangSmith Engine, Sandboxes GA, LangChain Labs, and 23 talks from teams at LinkedIn, Rippling, Cisco, and more. Now on demand.

  • LangSmith Engine automates failure analysis from production traces.
  • LangSmith Sandboxes reaches General Availability for secure agent execution.
In-site article

Reliable LLM Inference at Scale

At Databricks, we’ve built a unique inference platform that serves every frontier model, from open source to proprietary, powering some of the largest agentic applications. Serving over 120T tokens per month, we tackle challenges of reliability and latency through abstractions like model units for capacity management, cost-aware load balancing and autoscaling that save over 80% GPU costs, and runtime reliability mechanisms including black-box health checks that detect silent failures. Profiling multimodal bottlenecks unlocked 3x throughput gains.

  • Databricks' inference platform serves frontier models including open source and proprietary, handling 120T tokens/month.
  • Model units provide a VM-like abstraction for capacity management, enabling cost-aware routing and scaling.
In-site article

Snowflake Commits $6B to AWS as It Pushes Deeper into AI

Snowflake has committed $6 billion over five years to Amazon Web Services for Graviton compute and AI infrastructure, marking its largest cloud spend commitment. The deal covers AWS's ARM-based Graviton processors and GPU-accelerated EC2 instances for AI training and inference. Snowflake will also expand to 10 new AWS regions and leverage cost-efficient Graviton instances for its data warehousing business to free up resources for AI workloads.

  • Snowflake commits $6 billion over five years to AWS for Graviton and GPU compute.
  • The deal supports AI model training and inference using AWS instances.
In-site article

Building AI agents for business support using Amazon Bedrock AgentCore

In this post, we share how the AWS Generative AI Innovation Center (GenAIIC) collaborated with Works Human Intelligence (WHI) to build two AI agents using Amazon Bedrock AgentCore. We discuss the challenges encountered and the solutions that reduced costs by up to 97% while improving operational efficiency.

  • AI agents automate routine HR tasks such as commuting allowance approval and browser operations.
  • Migration to AgentCore and Strand Agents architecture reduced costs by up to 97%.
In-site article

From data overload to actionable insights: How Verizon Connect scaled agentic AI to 100,000 users

Verizon Connect built an agentic AI solution on AWS to transform overwhelming fleet data into clear, actionable insights for 100,000 users daily. The architecture uses serverless anomaly detection, Strands Agents for dynamic reasoning, and Amazon Nova Lite to cut input token costs by 70%. This post covers architectural decisions, implementation challenges, and measurable results.

  • Agentic AI processes 500 million daily data points from 1.2 million vehicles to serve 100,000 users.
  • Serverless statistical models handle anomaly detection, avoiding LLM pitfalls with raw tabular data.
In-site article

How AWS SMGS uses an AI-powered conversational assistant to transform business management with Amazon Bedrock AgentCore

AWS SMGS built NarrateAI using Amazon Bedrock AgentCore to deliver business intelligence at scale. The solution features a two-layer architecture separating batch narrative generation from real-time interaction, specialized AI agents for routing and validation, and key engineering patterns for production deployment, enabling natural language queries, row-level security, and role-tailored experiences.

  • NarrateAI uses a two-layer architecture (batch processing + real-time interaction) to overcome latency and data fragmentation in traditional BI.
  • Amazon Bedrock AgentCore enables multi-agent orchestration for natural language queries and context-aware responses.
In-site article

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ranks third on Arena's text-to-image leaderboard, on par with Google's Nano Banana 2 but still behind OpenAI's Image-2. The model shows clear gains over its predecessor, especially in rendering text inside images and commercial visuals.

  • MAI-Image-2.5 ranks third on Arena leaderboard, tied with Google's Nano Banana 2
  • Improvements in text rendering and commercial visuals
In-site article

This AI-free Google alternative is surging in popularity - how to try it for yourself

DuckDuckGo, an AI-free search alternative, is seeing a surge in users due to Google's AI Overviews. This article explains how to use DuckDuckGo without AI for private searching and browsing.

  • DuckDuckGo installs surged after Google I/O 2026, with iOS app peaking at 69.9% growth.
  • DuckDuckGo offers both AI-free search and AI chat options, giving users choice.
In-site article

Powering agentic AI sales strategy with Amazon Bedrock AgentCore

AWS Sales built Field Advisor on Amazon Bedrock AgentCore to orchestrate over 20 domain-specific agents, reducing cognitive load for sales reps and improving efficiency. The solution saved up to 2 hours per week per rep and reduced latency by 41%.

  • Field Advisor orchestrates 20+ specialized agents with a single conversational interface.
  • Human-in-the-loop workflows ensure data accuracy and accountability.
In-site article

Robinhood lets AI agents trade shares and make credit card purchases for customers

Robinhood now lets customers connect AI agents like Anthropic's Claude to a separate investment account via MCP. The agents can autonomously trade stocks and make credit card purchases. US regulator FINRA has flagged such agents as a new risk area, warning about unchecked decisions. Robinhood also admits the product isn't for everyone.

  • Robinhood enables AI agents such as Claude to be connected to investment accounts via MCP.
  • AI agents can autonomously trade stocks and initiate credit card purchases.
In-site article

“Tokenmaxxing is real, expensive & it’s spreading”: New tools emerge to stop AI budgets from exploding

Tokenmaxxing, the unrestrained use of AI tokens, is causing enterprise budget blowouts. Uber’s CTO recently admitted to overspending on Anthropic’s Claude Code. Lanai’s new Token Tuner helps companies map token consumption to workflows and outcomes, encouraging a shift from tokenmaxxing to outcomemaxxing.

  • Tokenmaxxing is causing AI budget overruns at Uber and other companies.
  • Lanai's Token Tuner tracks token usage against workflows and outcomes, providing efficiency scores and model recommendations.
In-site article