AI News HubLIVE

Source Mix

  • Hacker News AI20
  • Microsoft Research Blog6
  • The Decoder5
  • MarkTechPost4
  • GitHub AI & ML3
  • The Verge AI3
  • ZDNet AI2
  • AI Business1

Topic Mix

  • Agents37
  • Policy22
  • Research18
  • Models9
  • Chips6
  • Tools3
  • Startups2

Timeline

  • 2026-05-138
  • 2026-05-147
  • 2026-05-215
  • 2026-05-124
  • 2026-05-164
  • 2026-05-153
  • 2026-05-183
  • 2026-05-243

Latest Updates

Data Formulator 0.7: AI-powered data analytics for enterprise data

Data Formulator 0.7 is an open-source AI-powered system for enterprise data analytics that combines data connectivity, agent-guided exploration, and visualization refinement in a shared workspace.

  • Open-source AI system for enterprise data analytics
  • Data Connectors support governed, reusable connections across diverse data sources
In-site article

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ranks third on Arena's text-to-image leaderboard, on par with Google's Nano Banana 2 but still behind OpenAI's Image-2. The model shows clear gains over its predecessor, especially in rendering text inside images and commercial visuals.

  • MAI-Image-2.5 ranks third on Arena leaderboard, tied with Google's Nano Banana 2
  • Improvements in text rendering and commercial visuals
In-site article

Extending Human Intelligence Through AI

Modern AI systems are powerful not because they replicate human intelligence, but because they extend structures already present in human cognition and language. This perspective explains AI's capabilities and limitations, and reframes AI safety as a system-level challenge requiring engineering and governance, not fear of rogue AI.

  • AI systems extend human intelligence by modeling sedimented structures of understanding in language, not by replicating human minds.
  • Hallucinations and the compositionality gap arise from AI's lack of lived engagement with the world that anchors meaning and truth.
In-site article

theta: a humble approach to harness agnostic configuration

theta is a Rust CLI that manages agent configurations by reading a theta.toml file, resolving, locking, materializing, and casting them to any supported harness (e.g., Claude Code, Codex CLI, GitHub Copilot, Cursor). It works like a package manager for agent harness resources. Installation is straightforward, and it supports adding rules, tools, skills, and subagents, with validation and casting commands. The project is heavily inspired by uv and is the canonical implementation of the theta-spec.

  • theta is a Rust CLI for managing agent configurations
  • Supports multiple harnesses: Claude Code, Codex CLI, GitHub Copilot, Cursor, and more
In-site article

Microsoft Copilot Cowork Exfiltrates Files

A vulnerability in Microsoft Copilot Cowork allows attackers to exfiltrate OneDrive files through prompt injection and external images in automatically sent emails.

  • Copilot Cowork agents can send emails to user's inbox without approval
  • External images in emails can trigger network requests, leaking data
In-site article

Global AI Diffusion: Q1 2026 Trends and Insights [pdf]

This PDF report from Microsoft Research analyzes global AI diffusion trends for Q1 2026, offering key insights and data. The full content is available in the original PDF document.

  • Report from Microsoft Research on Q1 2026 AI diffusion
  • Includes trend analysis and key insights
In-site article

AI Weekly Issue #495: Musk, Zuckerberg killed Trump's AI safety order in three phone calls

Over the weekend: Musk, Zuckerberg, and Sacks killed Trump's draft AI safety executive order in three Wednesday-night phone calls. Anthropic closed a $30B+ round the same Saturday — while Microsoft quietly cancelled its internal Claude Code pilot after token billing ate the entire annual AI budget, redirecting developers to Copilot. CISA logged 15,000 attacks on a same-week Drupal SQL flaw. The first cross-registry supply chain attack — TrapDoor — hit npm, PyPI, and Crates.io at once, using .cursorrules and CLAUDE.md config files as the carrier. And the White House personally overrode the Pentagon to keep Claude inside the NSA.

  • Musk, Zuckerberg, and Sacks killed Trump's AI safety executive order in three phone calls before it went public
  • Anthropic closed $30B+ round while Microsoft cancelled Claude Code pilot due to token costs consuming entire AI budget
In-site article

AI Stock Is the Ultimate Set-It-and-Forget-It Buy for Long-Term Investors

Microsoft is a key AI player with its OpenAI investment and growing cloud AI business, which achieved an annual revenue run rate of over $37 billion. Despite a recent 12% decline, the stock is a strong long-term buy due to deep integration with corporate customers and AI integration opportunities. At 25x forward earnings, it offers an attractive entry point.

  • Microsoft's AI cloud business annual revenue run rate exceeded $37 billion, up 123%.
  • AI is not a threat but an opportunity to enhance Microsoft's software.
In-site article

Why you shouldn't leave model selection on default in Copilot, Gemini and other AI tools

When analyzing data, Microsoft Copilot invents country differences where none exist. Mathematician Adam Kucharski fed the tool identical datasets with different country labels, and Copilot delivered detailed stereotypes instead of accurate results. Thinking models catch the trick, but only if users know when to reach for them.

  • Microsoft Copilot invented stereotypes when given identical datasets with different country labels.
  • Thinking models can catch the trick but require user awareness.
In-site article

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

Microsoft Research introduces Webwright, a terminal-native browser agent framework that replaces click-trace web automation with reusable Playwright scripts. Using a single agent loop across three modules and roughly 1,000 lines of code, Webwright powered by GPT-5.4 reaches 60.1% on the long-horizon Odysseys benchmark and 86.7% on Online-Mind2Web — the highest AutoEval score among open-sourced harness recipes.

  • Webwright uses a terminal loop where the agent writes and runs Playwright code instead of predicting one browser action at a time.
  • GPT-5.4 reached 86.7% on Online-Mind2Web (100-step budget) and 60.1% on Odysseys — 26.6 points above the base GPT-5.4 score of 33.5%.
In-site article

This rugged Windows tablet handles mud and rain - but didn't impress with the basics

The Getac G140 is a rugged Windows 11 Pro tablet designed for harsh environments like fire/rescue, automotive, and utility work. It features an AMD Ryzen AI processor, hot-swappable batteries, and extensive port options. While extremely durable and capable of running Microsoft Copilot+ AI tools, it underperforms in benchmarks, is heavy at 3.95 pounds, and has a dim screen at 1000 nits. The device is ideal for niche industrial use but expensive (up to $4,000) and bulky.

  • The Getac G140 is a rugged Windows 11 Pro tablet with MIL-STD-810H and IP66 certifications, designed for harsh environments.
  • Features an AMD Ryzen AI processor, up to 64GB RAM, 2TB SSD, and hot-swappable batteries.
In-site article

Microsoft reports AI is more expensive than paying human employees

Microsoft is canceling most Claude Code licenses due to high costs, shifting engineers to GitHub Copilot. Uber burned through its 2026 AI budget in four months. Experts note that while token prices drop, consumption surges, making AI more expensive than human labor in some cases.

  • Microsoft cancels Claude Code licenses, pushes GitHub Copilot CLI.
  • Uber exhausted 2026 AI coding budget in four months.
In-site article

Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Outperform OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web

Microsoft Research released Fara1.5, a family of browser computer-use agents in 4B, 9B, and 27B sizes. Fara1.5-27B scores 72% on Online-Mind2Web, outperforming OpenAI Operator and Gemini 2.5 Computer Use. The release also includes FaraGen1.5, a synthetic data pipeline that trains agents on gated domains.

  • Fara1.5 is a family of browser computer-use agents from Microsoft Research in 4B, 9B, and 27B parameters, built on Qwen3.5.
  • Fara1.5-27B achieves 72% on Online-Mind2Web, surpassing OpenAI Operator (58.3%) and Gemini 2.5 Computer Use (57.3%).
In-site article

How CopilotKit Is Redefining the Agentic AI Stack in 2026

CopilotKit's 2026 releases include three tools: AG-UI protocol, AIMock testing suite, and Pathfinder knowledge server, addressing interaction, testing, and knowledge retrieval gaps for production-grade agentic AI. Adopted by major cloud providers and Fortune 500 companies.

  • AG-UI protocol fills the missing agent-user interaction layer, enabling real-time streaming, dynamic UI, and human-in-the-loop, backed by Google, Microsoft, and others.
  • AIMock mocks the entire agent call chain including 11 LLM providers, with record-replay, drift detection, and chaos testing.
In-site article

Why the world’s banks are so worried about Anthropic’s latest AI model

Anthropic's Mythos model has discovered thousands of severe security vulnerabilities, including many zero-day flaws that have gone undetected for decades. Banks worldwide fear cybercriminals will exploit this AI to rob them. Anthropic has granted access to a defensive coalition including Microsoft, but not to banks in Australia, the UK, or Europe.

  • Mythos can identify thousands of zero-day vulnerabilities across major operating systems and browsers.
  • Anthropic has invested $100 million in credits and $4 million in grants to fix these bugs.
In-site article

MagenticLite, MagenticBrain, Fara1.5: An agentic experience optimized for small models

Microsoft Research releases MagenticLite, an agentic application designed for small models, along with MagenticBrain orchestrator and Fara1.5 computer-use model. The system works across browser and local file system, achieving state-of-the-art results on web navigation tasks while keeping data on-device.

  • MagenticLite is a next-gen agentic app that operates across browser and local file system, optimized for small models.
  • Powered by MagenticBrain (14B orchestrator) and Fara1.5 (4B-27B computer-use model) working together seamlessly.
In-site article

Vega: Zero-knowledge proofs for digital identity in the age of AI

Vega is a new zero-knowledge proof system from Microsoft Research that enables users to prove facts from government-issued credentials without revealing the credential itself. It achieves under 100ms proving time on commodity devices using folding schemes, and is designed for real-world digital identity formats like mobile driver's licenses and the EU Digital Identity Wallet.

  • Vega turns full credentials into a single zero-knowledge proof, sharing only what's needed.
  • Zero-knowledge proofs generated in under 100ms on commodity devices with no trusted setup.
In-site article

Best Enterprise Level Agentic AI Platforms for 2026

In 2026, enterprise agentic AI has moved from pilots to production. This guide ranks the top 10 platforms — Salesforce Agentforce, Microsoft Copilot Studio, ServiceNow, LangGraph, and more — with verified pricing, real adoption data, and honest constraints to help enterprise teams make the right platform decision.

  • Salesforce Agentforce leads for CRM-native workflows with $800M ARR and 29,000 deals. But value narrows outside Salesforce ecosystem.
  • Microsoft Copilot Studio has highest volume: 160,000 organizations, 400,000+ agents. Best for Microsoft 365 enterprises.
In-site article

How Coding Harnesses Are Used: An Introspection

Tamarillo analyzed ~400K public GitHub repositories containing configuration files for AI coding assistants (harnesses) like Cursor, Copilot, and Claude. The study covers market share, adoption dynamics, configuration surface anatomy, multi-harness co-occurrence, and repo demographics by stars, language, and owner type. It reflects configuration intentions and is a lower bound on actual usage.

  • Approximately 400K public GitHub repos with AI coding harness configs were analyzed.
  • Covers market share, adoption trends, configuration patterns, and multi-harness usage.
In-site article

Microsoft Backs Open Agentic AI Ecosystem with New Linux Releases, Governance Tools, and AAIF Push

Microsoft announced at Open Source Summit North America 2026 the public preview of Azure Linux 4.0 and general availability of Azure Container Linux, alongside contributions to the Agentic AI Foundation (AAIF) and open governance tools for agentic systems.

  • Microsoft introduces Azure Linux 4.0 public preview and Azure Container Linux GA for cloud-native and AI workloads.
  • The company pushes for open agentic AI standards through the Agentic AI Foundation (AAIF).
In-site article

Take your local GitHub sessions anywhere

Remote control for GitHub Copilot CLI sessions is now generally available on github.com and GitHub Mobile. Developers can start a session in VS Code or the CLI, then monitor and adjust it from another device. Features include real-time monitoring, mid-flight instruction changes, permission approvals, and a seamless cross-device workflow, with privacy by default.

  • Remote control for GitHub Copilot CLI sessions is now GA on github.com and GitHub Mobile.
  • Support for remote control in VS Code and JetBrains IDE enables multi-surface workflows.
In-site article

LLM Tracing with MLflow AI Gateway

MLflow AI Gateway automatically logs LLM call traces, helping developers debug agentic apps and coding assistants. The article covers usage, integration with LiteLLM, support for Copilot CLI, and discusses tracing as a scaling problem.

  • MLflow AI Gateway captures LLM call traces without code changes.
  • Supports local testing via LiteLLM with Ollama as provider.
In-site article

Microsoft starts canceling Claude Code licenses

Microsoft is abruptly canceling licenses for Anthropic's Claude Code in favor of its own Copilot CLI, revealing strategic prioritization of internal tools over partner products despite close ties with OpenAI. The shift may cause developer friction and highlights the tension between best-of-breed tools and corporate control over AI infrastructure.

  • Microsoft cancels Claude Code licenses, pushing developers to Copilot CLI.
  • The move underscores Microsoft's preference for internal AI coding assistants over partner tools.
In-site article

Show HN: Strava for AI coding – analytics on your Copilot/Claude/Codex usage

Microsoft's open-source AI Engineer Coach analyzes AI coding assistant logs locally, providing practice scores, anti-pattern detection, and output metrics to help developers improve coding efficiency.

  • Microsoft open-source tool for analyzing AI coding assistant usage
  • Fully local analysis with privacy protection
In-site article

GitHub takes aim at Claude Code and Codex with its new Copilot app

GitHub launches a standalone desktop app for Copilot, designed to manage coding agents, issues, pull requests, and development sessions from a single interface, directly competing with Anthropic's Claude Code and OpenAI's Codex. Built on Copilot CLI, the app offers a unified inbox, side-by-side diff reviews, session history, and multi-agent support. Currently in public preview.

  • GitHub introduces a standalone Copilot desktop app that integrates coding agents, issue tracking, pull requests, and session management.
  • The app directly competes with Claude Code and Codex, leveraging GitHub's existing developer infrastructure.
In-site article

Show HN: LightningTrack – Issue tracker built for AI-assisted development

LightningTrack is an issue tracker designed for AI-assisted development. It turns any issue into a context-rich prompt for Copilot, Cursor, or Claude with one click, supports email-to-issue conversion, sprints, custom fields, reports, and an agent-friendly API.

  • One-click conversion of issues into AI context prompts
  • Email forwarding automatically creates tracked issues
In-site article

Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability

Microsoft Research clarifies the scope of its paper on AI delegation, noting that while models show fidelity degradation in long-horizon tasks, production systems mitigate these effects, and the benchmark is a diagnostic tool for future improvement.

  • The DELEGATE-52 benchmark evaluates semantic fidelity loss in long-horizon delegated workflows.
  • State-of-the-art models show 19-34% degradation over 20 iterations, but Python workflows degrade less than 1%.
In-site article

Show HN: One Markdown File to Set Up Claude, Codex, Cursor and Copilot

A single ~7,600-line Markdown bootstrap file (AI_PROJECT_SETUP.md) automates the setup of AI coding assistants including Claude Code, ChatGPT Codex CLI, Cursor, and GitHub Copilot. It generates 13-section rules, safety hooks, session resume, dual-write memory, bilingual GitHub files, and more. Just download the file, tell your AI to read and execute it, and within 1-3 minutes you have a complete, gitignored AI tooling configuration.

  • Single source of truth Markdown file works with Claude, Codex, Cursor, and Copilot.
  • Auto-generates 13-section rules, 5 safety hooks, 3-tier session saving, dual-write memory, and bilingual (EN/KO) GitHub standard files.
In-site article

[AINews] Everything is Conductor

A relatively quiet day in AI news highlights a smaller trend: the convergence of coding agent form factors around Conductor's pioneering approach. Key stories include GitHub's new Copilot App mimicking Conductor, OpenAI's Codex mobile launch, LangChain's agent infrastructure updates (SmithDB, Engine, Labs), Anthropic's Claude Code restrictions backlash, Figure's 24/7 autonomous sorting livestream, and notable research releases on diffusion LMs, time-series forecasting, and mechanistic interpretability.

  • GitHub launches Copilot App with an agent-first UX similar to Conductor; YC CEO Garry Tan publicly endorses Conductor as superior.
  • OpenAI integrates Codex into ChatGPT mobile, enabling remote task initiation, review, and execution.
In-site article

Show HN: JDS – a Copilot skill suite for structuring AI coding behavior

JDS is a GitHub Copilot CLI plugin that enforces structured workflows (design before code, tests before implementation, evidence-based verification), transforming AI coding assistants from autocomplete tools into disciplined software engineers. It features a multi-stage pipeline (think → plan → execute → verify → finish) with both flexible and rigid skills for different task types.

  • Enforces design-before-code, test-before-implementation, and evidence-based completion verification.
  • Workflow includes bootstrap, think, plan, execute, verify, and finish stages.
In-site article

Microsoft starts canceling Claude Code licenses

Microsoft is planning to remove most Claude Code licenses by June 30, pushing developers to use GitHub Copilot CLI instead. The decision is driven by financial and strategic reasons, despite Claude Code's popularity. Anthropic models remain accessible via Copilot CLI.

  • Microsoft will cancel most Claude Code licenses and push developers to GitHub Copilot CLI.
  • The cutoff is June 30, aligning with the end of Microsoft's fiscal year.
In-site article

Conductor: Deterministic orchestration for multi-agent AI workflows

Conductor is an open-source CLI from Microsoft that defines multi-agent workflows in YAML with deterministic routing instead of LLM-based dynamic orchestration, reducing cost and latency. It supports mixed models, parallel execution, human gates, script steps, and a web dashboard, ideal for structured workflows like code review and research synthesis.

  • Deterministic orchestration: YAML-defined workflow topology with zero token consumption for routing, reducing cost and unpredictability.
  • Mixed models: each agent can specify a different model and provider, such as Claude or GPT.
In-site article

Microsoft pits more than 100 AI agents against each other to find Windows vulnerabilities

Microsoft built MDASH, a system that pits more than 100 specialized AI agents against each other to find software vulnerabilities. On Patch Tuesday alone, the system uncovered 16 security flaws in Windows, four of them critical. Microsoft isn't saying which AI models power the system.

  • Microsoft created MDASH, a system using over 100 AI agents in adversarial roles to discover vulnerabilities.
  • The system found 16 Windows security flaws on Patch Tuesday, including 4 critical ones.
In-site article

Whimsical Strategies Break AI Agents

AI agents are vulnerable to 'whimsical' adversarial strategies that appear absurd to humans but reliably succeed. Microsoft researchers generated 30K such strategies from diverse Wikipedia articles, demonstrating that even frontier models like GPT-5 can be manipulated in negotiation environments. These out-of-distribution attacks exploit blind spots in safety training that focuses on human-perceptible threats.

  • Whimsical strategies that seem absurd to humans break AI agents.
  • Generated from diverse Wikipedia seeds (e.g., activation functions, Aboriginal history).
In-site article

Microsoft's Edge Copilot can now read all your open tabs at once and write for you on LinkedIn

Microsoft is upgrading Edge's Copilot AI chatbot so it can read all open tabs at once, compare products, and summarize articles. New additions include long-term memory, a tool that turns tabs into AI podcasts, and a quiz mode.

  • Edge Copilot can now read all open tabs simultaneously for enhanced context.
  • It can compare products, summarize articles, and write LinkedIn posts.
In-site article

Microsoft’s Edge Copilot update uses AI to pull information from across your tabs

Microsoft Edge is adding a new feature that lets Copilot AI gather information from all open tabs. You can ask questions, compare products, and summarize articles. Microsoft is retiring Copilot Mode and folding its agentic capabilities into 'Browse with Copilot'. Other updates include a 'Study and Learn' mode, tabs-to-podcasts, AI writing assistant, browsing history access, long-term memory, a redesigned new tab page with Journeys, and mobile screen sharing.

  • Copilot can now collect information from all open tabs for queries, comparisons, and summaries.
  • Microsoft retires Copilot Mode, integrates agentic features into 'Browse with Copilot'.
In-site article

Everything Claude Code: performance optimization system for AI agent harnesses

Everything Claude Code is a comprehensive performance optimization system for AI agent harnesses, originally an Anthropic hackathon winner. It provides a complete system including skills, instincts, memory optimization, continuous learning, security scanning, and research-first development. The system supports multiple harnesses such as Claude Code, Codex, Cursor, OpenCode, Gemini, and GitHub Copilot, evolving over 10+ months of intensive daily use building real products. The latest v2.0.0-rc.1 introduces a dashboard GUI, operator workflows, and the ECC 2.0 alpha.

  • Everything Claude Code is a comprehensive performance optimization system for AI agent harnesses.
  • It includes skills, instincts, memory optimization, continuous learning, security scanning, and supports multiple AI agent harnesses.
In-site article

GridSFM: A new, small foundation model for the electric grid

Microsoft releases a lightweight foundation model that can predict AC optimal power flow in milliseconds, boosting efficiency and unlocking cost savings in grid analysis.

  • GridSFM predicts AC optimal power flow in milliseconds, targeting up to $20B/year in congestion losses and 3.4 TWh of renewable curtailment.
  • Provides full AC system states for direct visibility into congestion, stability, and system health.
In-site article

Microsoft’s new multi-model agentic security system tops industry benchmark

Microsoft announced its new multi-model agentic security system (MDASH) has achieved top performance in an industry benchmark. The system, which coordinates over 100 specialized AI agents across frontier and distilled models, discovered 16 new vulnerabilities in the Windows networking and authentication stack, including four critical remote code execution flaws.

  • Microsoft's MDASH system uses over 100 AI agents to discover 16 new Windows vulnerabilities
  • Includes four critical remote code execution flaws in kernel TCP/IP stack and IKEv2 service
In-site article

Microsoft doesn’t want any of this

In the Musk v. Altman trial, Microsoft appears reluctant and detached. Their opening statement was essentially an ad for Microsoft products, implying the trial's absurdity. Despite being an early major investor in OpenAI, Microsoft was notably absent from key decisions. CEO Nadella called the OpenAI board drama 'amateur city,' and Microsoft lawyers repeatedly showed the company had no involvement in controversial events.

  • Microsoft's opening statement was an ad for its products, showing reluctance
  • Microsoft was a major early investor but not a key decision-maker
In-site article

RCE in VSCode Copilot Chat

Researchers discovered a prompt injection vulnerability in VSCode Copilot's agent mode, allowing attackers to bypass user confirmation by exploiting a TOCTOU flaw in the applyPatch tool, leading to arbitrary file write and remote code execution via overwriting .git/config or shell configuration files.

  • Copilot agent mode vulnerable to prompt injection, automatically processes malicious GitHub Issue content.
  • TOCTOU vulnerability in applyPatch tool: file rename operation bypasses destination path checks.
In-site article

Boycott unethical AI companies – and do it now

In light of recent events, consumers are urged to wield their power against AI giants by replacing ChatGPT with Claude and avoiding Microsoft Copilot.

  • Recent events prompt boycott of unethical AI companies
  • Replace ChatGPT with Claude
In-site article

Show HN: One memory layer across every MCP-compatible AI tool

SubVault is an MCP server that gives AI tools like Claude, Cursor, and Copilot persistent memory by extracting structured knowledge from conversations. It remembers decisions, facts, people, and project context across sessions, scoring information by authority, recency, and relevance. Free during early access, setup takes 30 seconds.

  • SubVault provides persistent memory across MCP-compatible AI tools, capturing structured knowledge from conversations.
  • It scores and ranks information by authority, recency, and relevance, delivering only the most relevant context.
In-site article

RoBrain

RoBrain is an open-source shared memory for teams using AI agents. It automatically captures every decision and the alternatives ruled out, across all developers' sessions, and flags contradictions. Teams revisit past decisions with original rationale instead of re-litigating from scratch. Works with Claude Code, Cursor, and Copilot.

  • Shared AI memory captures decisions, reasons, and rejected options.
  • Works across Claude Code, Cursor, and Copilot.
In-site article

Microsoft ousts its Israel chief following reports that Azure quietly powered military AI targeting in Gaza

Microsoft Israel's top executive Alon Haimovich stepped down amid an internal investigation into the subsidiary's work with Israel's defense ministry. Reports indicate Azure cloud services were used for mass surveillance and AI-powered target selection in Gaza, leading to legal and transparency concerns.

  • Microsoft Israel's general manager Alon Haimovich resigns after internal probe.
  • Azure storage and AI services were used by Unit 8200 for surveillance and targeting in Gaza.
In-site article

Dungeons & Desktops: Building a procedurally generated roguelike with GitHub Copilot CLI

Learn how one Hubber used GitHub Copilot CLI to build an extension that turns any codebase into a unique, roguelike dungeon.

  • GitHub Dungeons generates a roguelike dungeon from your codebase using BSP, seeded by your latest commit.
  • Copilot CLI's /delegate command lets developers describe features in plain English and get a pull request back.
In-site article

Cplt: Run AI coding agents or a plain shell inside a kernel-level sandbox

Cplt is a sandbox wrapper for AI coding agents that provides kernel-level filesystem and environment isolation on macOS and Linux. It protects secrets by restricting access to credentials, SSH keys, cloud configs, and blocking lifecycle scripts. Supports Copilot, OpenCode, and other agents with fine-grained control over files, network, and environment variables.

  • Kernel-level sandbox for AI coding agents using Apple Seatbelt (macOS) or Landlock+seccomp (Linux).
  • Blocks access to .env, SSH keys, cloud credentials, git hooks, and other sensitive paths by default.
In-site article

Company Directory