Empirical is an AI memory infrastructure that provides a unified personal memory layer for all AI tools, ensuring consistent context and preferences across platforms.
Empirical acts as a memory layer for AI, syncing user data and preferences across tools.
Aims to solve the 'memory silo' problem between AI applications.
Ornith-1.0 is a family of open-source agentic coding models post-trained on Gemma 4 and Qwen 3.5, using reinforcement learning to jointly optimize scaffold and solution rollouts. Available in 9B, 35B MoE, and 397B MoE sizes, it achieves state-of-the-art results on coding benchmarks like Terminal-Bench, SWE-Bench, NL2Repo, and OpenClaw. MIT licensed, supports OpenAI-compatible API and tool calling.
Ornith-1.0 offers 9B (dense), 35B (MoE), and 397B (MoE) variants, achieving best-in-class performance among open-source models on multiple coding benchmarks.
Its self-improving RL framework jointly trains search scaffold and solution generation, enhancing search trajectory quality.
Anthropic’s Claude models in Microsoft Foundry — hosted on Microsoft Azure and running on NVIDIA GB300 Blackwell Ultra GPUs — are now generally available, giving Azure-native enterprises a powerful new way to build autonomous and domain-specific AI agents.
Anthropic Claude models are now generally available on Azure via Microsoft Foundry with NVIDIA GB300 GPU acceleration.
The integration enables enterprises to build more powerful agentic AI systems with autonomous sub-agents for advanced tasks.
Dynamic subagents let AI agents orchestrate work at scale using code instead of tool calls. Learn how programmatic orchestration in Deep Agents guarantees coverage, handles fan-out, and unlocks reliable multi-step, complex agent pipelines with common orchestration patterns and live traces.
Dynamic subagents replace tool-call-based subagent invocation with programmatic orchestration, improving reliability at scale. They allow models to write code (loops, branches, concurrency) to manage subagents.
Key benefits include deterministic coverage (no skipped items) and reliable complex orchestration for multi-phase pipelines, fan-out + synthesis, and conditional branching.
The article argues that observability will not evolve into a single universal AI agent but will instead consist of thousands of specialized agents built by individual teams, emphasizing the importance of context, openness, and shared investigation artifacts.
Observability's future is not a universal SRE agent but a multitude of team-specific agents.
Agents will increase the breadth of investigations, straining data systems.
This edition of The Download covers the pitfalls of using metrics to quantify life, AI systems to prevent elephant-human conflicts in India, and other tech stories including AI models, chip restrictions, and more.
Metrics can obscure what's truly important and redefine our sense of value.
AI-powered warning systems in India aim to reduce deadly elephant-human clashes.
PR Focus AI Pro is a Chrome extension that leverages a BYOK architecture to achieve zero server cost. It provides AI-powered risk scoring, summaries, and draft reviews for GitHub Pull Requests, all processed locally without a backend.
Local AI processing: Users bring their own API key (OpenAI, Groq, etc.), and code/keys never leave the local browser.
AI triage: 0–100 risk score based on CI status, PR age, and code scope; summaries generated from actual diffs.
DeepReinforce releases Ornith-1.0, an open-weights (MIT) model series based on Gemma 4 and Qwen 3.5, achieving state-of-the-art performance on coding benchmarks among open-source models of comparable size. The author tests the 35B MoE variant with LM Studio and Pi, finding it proficient at handling multiple tool calls for agentic coding tasks.
Open-weights (MIT) model from DeepReinforce
Built on Gemma 4 and Qwen 3.5 with variants from 9B to 397B
An interview with AI educator Harper Carroll covers fine-tuning vs. prompting, whether to learn coding in 2025, and what the AI field gets wrong in public communication. Carroll argues that AI is a medium where the outcome depends on user input, demonstrates fine-tuning to replicate her writing style, and emphasizes intuition as a key human advantage. The article also explores AI-assisted writing workflows and the importance of raising ambitions rather than fearing job displacement.
Fine-tuning shifts model output distribution while prompting only bends surface behavior.
Learning to code remains important, but focus should shift to system understanding over syntax.
Candidly built a state-aware conversational agent harness that uses an Input-Output Hidden Markov Model (IO-HMM) to infer user engagement states in real time from conversation traces, enabling targeted response policies that reduce disengagement. The system identifies four states—Engaged, Detailed, Guided, and Disengaging—and cuts disengaging turns from 23% to 11%.
Candidly uses an IO-HMM to model user states from lightweight trace features, achieving 0.90 AUC for outcome prediction.
Four engagement states emerge: Engaged (53%), Detailed (7%), Guided (17%), Disengaging (23%), with resolution rates from 78% to 30%.
Katra is an open-source, self-hosted memory system that provides AI agents with human-like cognitive memory, including episodic recall, semantic search, knowledge graphs, and temporal analysis. It integrates with any MCP-compatible agent (e.g., OpenClaw, Claude Code) via 35 specialized tools. Inspired by Star Trek's Vulcan mind meld (katra), it aims to achieve emergent behaviors through multi-layered memory architecture and sleep consolidation.
Multi-layered memory: episodic, semantic, working memory, knowledge graph, and temporal querying.
Compatible with any MCP agent, offering 35 specialized tools.
Google's Richard Seroter explains the full-stack approach to AI, how Google integrates hardware, models, and platforms, and offers three ways to start building.
Full-stack AI means an integrated system covering infrastructure, models, orchestration, and interfaces.
Google's decade-long investment in TPUs and models enables reliability and competitive pricing.
A new proposal would ban the sale of Americans' health and location information to data brokers, including information people reveal to AI chatbots like ChatGPT or Claude. The bill expands a 2022 version and covers data entered into AI systems, with enforcement by the FTC and private rights of action.
The updated Health and Location Data Protection Act prohibits companies from selling health and location data, including AI inputs, to brokers.
AI labs are expanding into health products, raising privacy concerns as users share sensitive data.
LlamaIndex unveils LlamaParse Index with a Retrieval Harness that gives AI agents filesystem-style tools for document traversal, plus visual preservation, managed infra, and observability.
LlamaParse Index introduces a Retrieval Harness with four filesystem primitives: Hybrid Retrieve, List Files, File Grep, and File Read.
Visual Layout Preservation captures page screenshots to resolve layout-dependent content.