AI News Hub (LIVE)
Public articles: 211 | Collected articles: 220 | Trust: 78 | Refresh: 30 min
Health: Auto-paused | Source type: Media | Full-text rights: In-site rewrite | Last ingested: 2026-06-13 | ID: the-decoder | Status: Not enabled

Media source; summary-only unless authorization is obtained.

Latest public articles

Microsoft CEO Satya Nadella admits he's a token-maxer, too: "It's addictive"

Microsoft CEO Satya Nadella warns against "token-maxing": using the most powerful AI models for every problem. He says frontier models shouldn't be wasted on everyday tasks, and the marginal productivity gain must justify the token cost. Yet he admits, "I'm like a token-maxer too. So it is addictive."

  • Nadella warns against token-maxing, the overuse of powerful AI models for simple tasks.
  • He advocates using frontier models only for complex problems and matching cost to benefit.

Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin

Google Research's Gemini-SQL2 turns natural language into executable SQL queries. Built on Gemini 3.1 Pro, it tops the BIRD benchmark at 80.04 percent accuracy, well ahead of OpenAI and Anthropic. Google says the technology could improve natural language features across its data services.

  • Gemini-SQL2 translates natural language into SQL queries using Gemini 3.1 Pro.
  • Achieves 80.04% accuracy on the BIRD benchmark, surpassing OpenAI and Anthropic.

Microsoft's SkillOpt boosts GPT-5.5 by using nothing but a trained Markdown file

Microsoft and three Chinese universities have developed SkillOpt, a method that optimizes instruction documents for AI agents using principles from traditional model training. A simple Markdown file is enough to boost GPT-5.5 by about 23 points on procedural tasks, and the same file transfers across models and agent environments like Codex and Claude Code.

  • SkillOpt treats skill documents as trainable external state for frozen target models, using a separate optimizer model to propose limited edits accepted only if they improve validation performance.
  • On GPT-5.5, average gains of ~23 points across six benchmarks, with biggest improvements on tasks requiring strict formatting and tool use.
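The accept-only-if-validation-improves loop described for SkillOpt can be sketched roughly as follows. This is a toy illustration, not Microsoft's implementation: `optimize_skill_doc`, `toy_propose`, and `toy_validate` are hypothetical names, the "optimizer model" is replaced by a fixed list of candidate lines, and the validation score is a simple rule check rather than a benchmark run against a frozen model.

```python
from typing import Callable, List


def optimize_skill_doc(
    doc: str,
    propose_edits: Callable[[str], List[str]],
    validate: Callable[[str], float],
    rounds: int = 5,
) -> str:
    """Greedy loop: keep a proposed edit only if the validation score improves."""
    best_doc, best_score = doc, validate(doc)
    for _ in range(rounds):
        improved = False
        # Candidates are proposed once per round from the current best document.
        for candidate in propose_edits(best_doc):
            score = validate(candidate)
            if score > best_score:  # strict improvement required
                best_doc, best_score = candidate, score
                improved = True
        if not improved:
            break  # no candidate helped; stop early
    return best_doc


# Hypothetical stand-ins for the optimizer model and the validation benchmark.
REQUIRED = ("Return JSON only.", "Call the search tool before answering.")
CANDIDATE_LINES = REQUIRED + ("Be creative.",)


def toy_propose(doc: str) -> List[str]:
    """Propose appending each candidate line that is not already in the doc."""
    return [doc + "\n" + line for line in CANDIDATE_LINES if line not in doc]


def toy_validate(doc: str) -> float:
    """Fraction of required rules present in the doc (0.0 to 1.0)."""
    return sum(rule in doc for rule in REQUIRED) / len(REQUIRED)


result = optimize_skill_doc("# Skill", toy_propose, toy_validate)
print(result)
```

In this toy setup the line that does not raise the validation score is never accepted, which mirrors the "limited edits accepted only if they improve validation performance" idea while the target model itself stays untouched.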

Claude Fable 5 outpaces GPT-5.5 by 13 points on FrontierMath's toughest problems

Anthropic's Claude Fable 5 hits 88 percent accuracy on the hardest FrontierMath tier, a massive jump from Opus 4.5, which sat below 10 percent in early 2026. OpenAI's GPT-5.5 reaches about 75 percent on the same tier. The pace of improvement in AI math keeps accelerating.

  • Claude Fable 5 achieves 88% on FrontierMath's hardest problems
  • That's a leap from Opus 4.5, which scored below 10% in early 2026

Meta shifts from "tokenmaxxing" to token managing as internal AI costs reportedly hit billions

An internal memo to 6,000 employees reveals Meta is heading toward billions in AI costs from internal use alone. Starting in 2027, budgets, allocations, and a central dashboard called "AI Gateway" will govern token consumption. CTO Andrew Bosworth put it bluntly: "All motion is not progress and token usage alone is not a measure of impact of any kind."

  • Meta's internal AI costs are expected to reach billions of dollars
  • Token management via AI Gateway dashboard to begin in 2027

Moonshot's open model Kimi K2.7 Code undercuts GPT-5.5 and Claude by up to 12x on price per token

Moonshot AI has released Kimi K2.7 Code, an open-weights model with one trillion parameters built for programming. It still trails GPT-5.5 and Claude Opus 4.8 in coding benchmarks but costs a fraction of the price. So the key question isn't whether it's the best model, but whether the extra runs you get for the same budget make up for the gap in quality.

  • Kimi K2.7 Code is an open-weights model with 1 trillion parameters for programming.
  • It lags behind GPT-5.5 and Claude Opus 4.8 in coding benchmarks.
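The budget question raised in the summary can be made concrete with a rough calculation. The sketch below assumes independent attempts and a reliable way to pick out a successful run (for example, a test suite); the per-run costs and single-run success rates are hypothetical placeholders, with only the 12x price ratio taken from the headline.

```python
def runs_for_budget(budget_units: int, cost_per_run_units: int) -> int:
    """Number of whole runs that fit in the budget (integer cost units)."""
    return budget_units // cost_per_run_units


def pass_at_n(p_single: float, n: int) -> float:
    """Chance that at least one of n independent attempts succeeds."""
    return 1.0 - (1.0 - p_single) ** n


BUDGET = 12  # hypothetical budget, in arbitrary cost units

# Hypothetical figures: the pricier model costs 12 units per run and succeeds
# 70% of the time; the cheaper one costs 1 unit per run and succeeds 30%.
expensive = pass_at_n(0.70, runs_for_budget(BUDGET, 12))
cheap = pass_at_n(0.30, runs_for_budget(BUDGET, 1))

print(f"expensive model, 1 run:   {expensive:.3f}")
print(f"cheaper model, 12 runs:   {cheap:.3f}")
```

Under these made-up numbers the cheaper model comes out ahead, but the result depends entirely on the assumed success rates and on whether failed attempts can be detected and discarded cheaply.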

US government forces Anthropic to disable Claude Fable 5 and Mythos 5 for all customers worldwide

The US government has ordered Anthropic to shut down global access to Fable 5 and Mythos 5, citing alleged jailbreak risks. Anthropic is complying but pushing back publicly, saying the vulnerabilities are minor and also exist in competing models like GPT-5.5. The pushback is an ironic turn after the company spent months hyping the cybersecurity risks of its own Mythos class. Anthropic warns the move could set a precedent that halts all frontier deployments.

  • US government orders Anthropic to disable Fable 5 and Mythos 5 worldwide over jailbreak concerns.
  • Anthropic argues the vulnerabilities are minor and also present in rivals like GPT-5.5.

Over half of Americans fear losing both their jobs and their independent thinking to AI, survey finds

Anthropic surveyed nearly 52,000 Americans about their hopes and fears around AI. Sixty-four percent fear job losses, and 56 percent worry about losing the ability to think for themselves. Daily AI users are far less concerned. Still, most people reject AI in their own workplace, even for tasks they think it can handle.

  • 64% fear job loss, 56% fear losing independent thought
  • Daily AI users show less concern

OpenAI kicks off the AI price wars with flexible rate-limit resets for its Codex coding agent

OpenAI now lets Codex users bank their rate-limit resets and trigger them manually instead of watching them expire on a fixed schedule. If you hit your usage cap mid-session, you can cash in a saved reset right away instead of waiting. Users on the Go, Plus, Pro, and Business plans each get one free reset to start. Plus and Pro users can also invite friends to unlock extra resets.

  • Codex users can now store rate-limit resets and use them on demand.
  • Go, Plus, Pro, and Business plan users each receive one free reset.

Anthropic's Claude Fable 5 costs twice as much for 5.7 percent more performance

Claude Fable 5 tops the Artificial Analysis Intelligence Index with 64.9 points and sets records in five of ten benchmarks. But the gain over Opus 4.8 is just 5.7 percent at double the token price. Safety filters with fallback routing push costs even higher.

  • Claude Fable 5 scores 64.9 on the Artificial Analysis Intelligence Index, setting records in five of ten benchmarks.
  • The model offers only a 5.7% performance improvement over Opus 4.8 at double the token price.
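The price-performance claim reduces to simple arithmetic. The sketch below assumes the 5.7 percent figure is a relative gain in index score and that "double the token price" means a 2x price multiplier. It ignores the extra cost from safety filters and fallback routing mentioned in the summary, as well as any difference in tokens consumed per task. The function name `relative_value` is illustrative.

```python
def relative_value(perf_gain: float, price_multiplier: float) -> float:
    """Score per unit of price for the new model, relative to a baseline of 1.0."""
    return (1.0 + perf_gain) / price_multiplier


# Figures from the summary: +5.7% score at 2x token price, index score 64.9.
fable_vs_opus = relative_value(perf_gain=0.057, price_multiplier=2.0)

# Baseline score implied by a 5.7% relative gain (an inference, not a reported number).
implied_opus_score = 64.9 / (1.0 + 0.057)

print(f"score per price unit vs. baseline: {fable_vs_opus:.4f}")
print(f"implied baseline index score:      {implied_opus_score:.1f}")
```

On these assumptions, each unit of token spend buys roughly half as many index points as it would on the baseline model.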

Google files first joint lawsuit with FBI over Chinese AI scam network, OpenAI blocks PRC influence clusters

Within days of each other, Google and OpenAI separately exposed operations allegedly originating in China that use AI for fraud and covert influence campaigns. Both target US infrastructure and political debates.

  • Google and FBI jointly sue Chinese cybercrime network for using Gemini AI to defraud Americans.
  • OpenAI bans two ChatGPT clusters linked to China for manipulating US tech policy debates.

Mistral AI seeks 3 billion euros to fund its European AI push

French AI startup Mistral AI is negotiating a new funding round of around 3 billion euros at a valuation of approximately 20 billion euros.

  • Mistral AI negotiating 3 billion euro funding round
  • Valuation around 20 billion euros

The AI industry's platform trap is starting to look a lot like Microsoft's

Anthropic is throttling its new Mythos model for certain tasks while building apps that directly compete with its largest customers. Customers, partners, and investors are pushing back.

  • Anthropic throttles Mythos model for certain tasks
  • Anthropic builds apps competing with its largest customers

OpenAI buys Ona to push Codex toward long-running, autonomous coding tasks

OpenAI is acquiring Ona, formerly Gitpod, a startup founded in Kiel, Germany in 2020 that specializes in AI agents and secure cloud development environments for software development.

  • OpenAI acquires Ona (formerly Gitpod), a German startup founded in 2020.
  • Ona focuses on AI agents and secure cloud development environments.

Jeff Bezos' AI startup Prometheus closes $12 billion round at a $41 billion valuation

Jeff Bezos' AI startup Prometheus has closed a $12 billion funding round at a $41 billion valuation. The company launched just last November with $6.2 billion in seed funding. It has not announced any products yet; Bezos says sharing details would be "premature."

  • Prometheus raises $12 billion at $41 billion valuation
  • Founded last November with $6.2 billion seed funding

OpenAI vs. Anthropic: A price war over API tokens is brewing

OpenAI is considering cutting API token prices to win customers from Anthropic, according to the Wall Street Journal, signaling a potential price war in the AI industry.

  • OpenAI is considering lowering token prices to attract Anthropic's customers
  • The move could trigger a broader price war in AI APIs

Dario Amodei's new essay reads like a Cold War playbook for the AI age

Anthropic publishes a sweeping essay and two policy frameworks. The company calls for binding audits of frontier models and paints a picture of AI as a strategic weapon wielded by nation-states.

  • Amodei uses a Lord of the Rings analogy to argue the political system is too slow to react to AI risks.
  • Anthropic calls for mandatory third-party audits of frontier models and government authority to block risky models.

Google's new open model DiffusionGemma generates text from noise instead of word by word

Google released DiffusionGemma, a 26-billion-parameter model that generates text via diffusion. It reaches 1,000 tokens per second on an H100 GPU, four times faster than autoregressive models, but with lower output quality. It's currently experimental.

  • 26-billion-parameter diffusion model for text generation
  • Reaches 1,000 tokens/sec on a single H100 GPU

OpenAI's IPO slips as Altman tells staff to expect a public offering "within the next year"

Sam Altman told employees he expects an OpenAI IPO "within the next year," but a delay to 2027 is possible. He frames it as caution around self-improving AI, though Anthropic's stronger growth numbers and imminent IPO may be the real reason to wait.

  • Altman expects an OpenAI IPO within the next year, though it could slip to 2027
  • He cites caution over self-improving AI

SpaceX wants to put data centers in orbit, and Musk says it's no big deal

SpaceX wants to launch data centers into space, and Elon Musk is pitching it as a near-trivial engineering problem ahead of the company's IPO. A first AI satellite would match the output of a single Nvidia GB300 rack. But Google's own research suggests real AI training would require about 10,000 tightly coupled satellites.

  • SpaceX plans to put data centers in orbit; Musk calls it a trivial engineering challenge.
  • First AI satellite would equal a single Nvidia GB300 rack's performance.

Landmark German ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers

A German regional court has ruled that Google is directly liable for the content of its AI search overviews. According to the court, previous limited liability protections for search engine operators don't apply to AI overviews. In this case, Google's AI had falsely linked two publishers to fraud and made claims that didn't appear in any of the linked sources. The ruling could set a precedent for AI-generated content liability worldwide.

  • German court rules Google directly liable for AI overview content
  • Limited liability protections for search engines do not apply to AI-generated answers

Beijing's $295 billion AI buildout would require 80 percent domestic chips, locking out US suppliers

China plans to invest roughly $295 billion in a nationwide AI data center network over the next five years, with at least 80% of technology from domestic suppliers like Huawei. Meanwhile, Taiwan is considering criminalizing AI chip smuggling to China for the first time.

  • China plans $295B investment in AI data centers over five years
  • At least 80% of chips and tech to come from domestic suppliers like Huawei

Apple Intelligence gets a second shot with help from Google and Nvidia

At WWDC 2026, Apple showed off a rebuilt version of Siri. The assistant runs on foundation models developed with Google. For complex queries, it taps Nvidia GPUs.

  • Apple unveiled a rebuilt Siri at WWDC 2026.
  • The assistant leverages foundation models co-developed with Google.

OpenAI now says "entirely automating everything is not the future we want"

OpenAI steps back from fully autonomous AI by 2028, now advocating human-machine collaboration. Altman and Pachocki propose an international body to potentially slow frontier AI development.

  • OpenAI abandons 2028 full autonomy goal, pivots to human-AI tandem.
  • CEO Altman and scientist Pachocki call for international oversight.

OpenAI says going public is "a complicated set of tradeoffs" and is unsure about the timing

OpenAI has confidentially filed an S-1 registration with the SEC, taking the first formal step toward an IPO. There's no set timeline, and the company calls it "a complicated set of tradeoffs." Rival Anthropic recently filed its own IPO paperwork, which likely adds to the pressure.

  • OpenAI confidentially filed S-1 with SEC for IPO.
  • No set timeline; company cites complex tradeoffs.

Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators

Microsoft Research introduces Lens, a 3.8B parameter text-to-image model that rivals much larger models by training on 800M detailed captions generated by GPT-4.1. It requires a fraction of the compute. Lens-Turbo generates images in under a second. Open source under MIT.

  • Lens uses 800M detailed captions from GPT-4.1 instead of vague web alt-text, boosting training efficiency.
  • With only 3.8B parameters, Lens matches or outperforms models many times its size on benchmarks.

Intel gets a second life as Google and Nvidia explore it as a TSMC backup for AI chips

Google has ordered more than three million AI chips from Intel for 2028. Nvidia is testing Intel's manufacturing tech for its upcoming Feynman architecture. Both moves come as TSMC can't keep up with AI chip demand. Intel's long-struggling foundry division is getting a rare second chance.

  • Google orders over 3 million AI chips from Intel for 2028 delivery.
  • Nvidia tests Intel's manufacturing process for its Feynman architecture.

Most companies are flying blind on AI spending

Only 26% of companies have full visibility into their AI costs, a KPMG survey finds.

  • KPMG survey: only 26% of companies have full visibility into AI spending
  • Token-based billing creates unpredictability for finance departments
