MCP AI News

MCP updates

Kimi K3 on vLLM: Up to 370 Tokens/sec

2026-07-27 15:44 UTC

vLLM announces day-0 support for Kimi K3, a 2.8-trillion-parameter Mixture-of-Experts model with 1M-token context. Achieves up to 370 tok/s with DSpark speculative decoding. Features hybrid prefix caching, tool calling, and optimizations for production deployment.

Kimi K3 is a 2.8T parameter MoE model with 16 of 896 experts active per token, supporting 1M token context.
vLLM serves Kimi K3 at up to 370 tok/s using DSpark speculative decoding, a 3.14x speedup over baseline.

Show HN: Maginary.ai gets seedance2 and GPT-images2 support

2026-07-26 20:56 UTC

Maginary.ai, an image/video generator with a Midjourney-like prompt syntax and over 40 underlying models, gets a major update integrating seedance2 and GPT-images2, with future plans for MCP and x402.

Maginary.ai is a multimedia generation platform using multiple underlying models.
The latest update adds support for seedance2 and GPT-images2 models.

MCP’s biggest update removes the machinery many servers were built around

2026-07-26 16:00 UTC

The Model Context Protocol (MCP) receives its largest update since launch, removing session state and initialization handshake to simplify remote server operations. The release candidate is frozen, final spec due July 28. Deprecations include core features like Sampling, with migration directions provided.

MCP's update eliminates session affinity by making requests stateless, reducing operational complexity.
Capabilities and protocol version are now carried per-call via _meta, enabling caching and routing.

MCP Code Review Server – AI code review in your editor

2026-07-26 14:13 UTC

MCP Code Review Server is an open-source tool that integrates AI-powered code review into MCP-compatible editors like Claude Code and Cursor. It supports reviewing code snippets, diffs, and files, using OWASP Top 10 scanning, N+1 query detection, and race condition analysis to provide structured output with severity ratings. Easy installation via uvx or pip.

MCP server brings AI code review directly into editor workflows
Supports reviewing code, diffs, and local files for security, performance, and style issues

Show HN: Integrate any CLI agent into any terminal

2026-07-26 06:06 UTC

A developer built Terminai, a transparent wrapper that lets you summon AI coding agents in your terminal on demand, without switching contexts or changing terminal emulators. Written in Rust, it solves TUI rendering issues by forking ratatui and rat-salsa, and provides an MCP server and CLI tool for AI interaction.

Solves the pain point of using AI in the terminal without copy-pasting or running a full agent preemptively
Built on mprocs, integrates with any AI agent via MCP server and CLI tool

Show HN: Axtary – Content Authorization for AI Agents

2026-07-26 01:02 UTC

Axtary checks the exact diff, message, query, or tool payload before a connector executes. Routine actions follow policy; higher-risk actions require approval of that exact payload. The system binds approvals to payload hashes to prevent tampering.

Checks exact payload before connector execution
Routine actions follow policy; high-risk actions require human approval of exact payload

KDnuggets Weekly Roundup: Week of July 20, 2026

2026-07-25 13:00 UTC

This weekly roundup highlights top articles on agentic AI, including MCP servers, free courses, model evaluation, and more.

Editor's Pick: Top 5 MCP Servers for high-performance agentic development.
Kaggle and Google offer a free 5-day Agentic AI course.

Quil – a reboot-proof terminal multiplexer that resumes AI sessions

2026-07-25 12:27 UTC

Quil is a terminal multiplexer designed for AI-native development. It persists workspaces across reboots, resumes AI sessions by ID, and lets AI assistants drive the terminal over MCP. It features typed panes, command palette, built-in integrations with tools like Claude Code and lazygit, and a one-line install.

Survives full reboots, restoring all tabs, panes, and AI conversations.
AI assistants can control the terminal via MCP protocol.

Show HN: Give your AI ears – live meeting transcripts over MCP on macOS

2026-07-25 11:27 UTC

VoxAI is a macOS app that provides local transcription and a live AI copilot for conversations. It records, transcribes, labels speakers, and streams the conversation to your AI for real-time advice and note-taking. Free trial for the first ten sessions, then a one-time purchase.

Local transcription with live speaker labels
AI copilot provides real-time advice and action items

Show HN: TS Compiler Knowledge Graph reducing AI tokens about 90%

2026-07-25 08:13 UTC

@ttsc/graph is an MCP server that gives AI agents a compiler-built code graph of a TypeScript codebase, allowing them to answer code questions without reading source files, thereby reducing token costs by about 90%.

Leverages the TypeScript compiler to build an exact graph of declarations and relationships, avoiding text-guessing.
Agents answer code questions with one or a few graph calls, never opening files.

Building Self-Evolving AI Agents with OpenSpace Using Skills, MCP, Lineage, and Low-Cost Reuse

2026-07-25 07:54 UTC

Discover how to create self-evolving AI agents using the OpenSpace framework. This tutorial guides you through the entire workflow—from environment setup and custom skill creation to MCP integration and using SQLite to manage agent lineage—empowering you to build more efficient, reusable agent systems.

OpenSpace enables self-improving AI agents through skill evolution and lineage management.
Tutorial covers environment setup, custom SKILL.md creation, MCP server launch, and more.

Show HN: Argus – An AI QA engineer: give it a URL and it tests your app

2026-07-24 16:42 UTC

Argus is an AI QA engineer that takes plain English test descriptions, navigates your app like a human, and reports issues with video evidence. No selectors needed, so it withstands UI changes. It can also be called by coding agents via MCP to verify their work.

Argus accepts natural language test steps and executes them autonomously.
Operates without fragile CSS selectors, adapting to UI changes.

5 Key Concepts Behind Agentic AI Every Engineer Must Understand

2026-07-24 12:25 UTC

This article breaks down the five essential engineering concepts that make agentic AI systems work in production: tool use via MCP, memory and context engineering, planning and reasoning loops, multi-agent orchestration, and evaluation with guardrails. It explains why many agents fail to reach production and how to build robust systems.

Tool use standardized by the Model Context Protocol (MCP) allows agents to interact with external services without custom integrations.
Memory is an architectural component separate from the context window, with tools like Mem0 and Zep enabling targeted retrieval.

Show HN: Frontier model pricing became a rip-off, so I built an open-source CLI

2026-07-24 10:33 UTC

Kolega Code is a local-first terminal coding agent with multi-agent orchestration (Gigacode) for broad tasks like large audits, migrations, and parallel checks. It supports model routing, plan/build modes, web search, MCP servers, and is open source under Apache 2.0.

Kolega Code is an open-source, local-first terminal coding agent designed for multi-agent collaboration.
Its Gigacode feature enables parallel execution of sub-agents for efficient handling of large codebases.

Open Source Tax Engine outperforming GPT sol and Fable 5

2026-07-24 02:32 UTC

OpenTax is a deterministic tax engine for AI agents, achieving 96% accuracy on TaxCalcBench, surpassing GPT and Claude Fable. It integrates via MCP, provides verifiable proofs, and ensures precise tax compliance.

OpenTax is the only open-source deterministic tax engine, with every calculation traceable to specific statutes.
Achieves 96% on TaxCalcBench, the highest score ever recorded.

Alexa Plus is getting an AI update to handle more complicated instructions

2026-07-23 21:15 UTC

Amazon updates Alexa Plus with deeper smart home integrations using new AI toolkit and MCP protocol, enabling voice commands for complex device tasks like setting washing machine cycles.

Alexa Plus update enables connections with smart home brands like Bosch, Whirlpool, iRobot, etc.
Users can give complex voice commands, e.g., washing machine setting based on tag instructions.

Show HN: Setoku – Self-hosted knowledge server for AI agents

2026-07-23 17:12 UTC

Setoku is an open-source, self-hosted MCP knowledge server that gives AI agents read-only access to company data, remembers metric definitions and gotchas, and enables building and sharing dashboards. It runs on a cheap VPS, requires no model inference costs, and emphasizes security with human approval for knowledge updates.

Self-hosted MCP server for AI to query company data with context understanding.
Provides read-only query, context tools, and app publishing with human-in-the-loop for knowledge changes.

Building trade assistant: How Jefferies optimized front office trading operations with AI

2026-07-23 16:42 UTC

Jefferies built an agentic AI trade assistant using Strands Agents, Amazon Bedrock, and MCP tools to enable traders to query data via natural language, reducing reliance on IT and accelerating insights.

Jefferies deployed an AI trade assistant integrating Strands Agents, Amazon Bedrock, and MCP tools.
Traders can ask natural language questions and receive SQL-generated insights and visualizations in real time.

Show HN: Mwe-MCP – self-hosted memory for AI agents that knows who may know what

2026-07-23 12:38 UTC

Mwe-MCP is a self-hosted, wiki-based memory engine for AI agents, offering per-fact access control, attribution, validity windows, and nightly self-organization. It enables multiple agents to share a governed memory while preserving privacy and accuracy.

Wiki-like memory stored as Markdown pages, browsable via built-in dashboard.
Each fact has owner, sender, reader permissions, and validity time window.

7 Best Claude Code Alternatives for CLI Agentic Coding

2026-07-23 12:00 UTC

Discover seven cheaper, faster Claude Code alternatives for CLI agentic coding, with open-source tools, local models, MCP support, and better context control.

OpenCode: open-source, multi-model, flexible workflows
Pi: lightweight, extensible, 15+ model providers

Show HN: Ours.network – give your AI agents a direct line to each other

2026-07-23 07:01 UTC

Ours.network introduces ours-mcp, a tool that enables AI agents to communicate directly without human intervention. It simplifies setup with an installable MCP server that allows agents to connect via one-time invites, bypassing the need for manual copy-paste. Features include end-to-end encryption, a blind relay for privacy, and full human control over connections. The tool is in early alpha, source-available, and designed for agent-to-agent communication across different runtimes like Claude Code and Codex.

Ours-mcp eliminates the need for humans to relay messages between AI agents by establishing direct lines.
Setup is quick: install the MCP server, generate an invite, and connect agents in about two minutes.

How good is your AI Gateway?

2026-07-23 05:07 UTC

This article evaluates three AI gateways—Highflame, Bifrost, and LiteLLM—across three critical moments: first token latency, peak concurrency, and tool calls. Highflame outperforms with negligible added latency, 100% success under 5,000 concurrent conversations, and efficient MCP proxying.

Highflame adds only 2ms to first token latency at 100 concurrent chats.
Bifrost buffers responses, causing 1.3s first token delay.

Show HN: AgentNest, self-hosted sandboxes for AI agents

2026-07-23 01:54 UTC

AgentNest is an open-source runtime for executing AI agent code in secure, disposable sandboxes. It supports Python, shell commands, files, packages, browsers, GPUs, and Git, with fine-grained network policies, stateful sessions, and forkable state. Self-hosted and extensible, it integrates with LangChain, MCP, and more.

Self-hosted sandbox with secure defaults and egress allowlisting
Stateful Python sessions and forkable sandboxes for agent workflows

Local agent first AI search optimization tooling

2026-07-22 23:26 UTC

Canonry is an open-source, self-hostable AI Engine Optimization (AEO) platform that helps websites track citations across Gemini, ChatGPT, Claude, Perplexity, and local LLMs. It offers CLI, dashboard, MCP adapter, and built-in agent for tracking keywords, technical audits, ad management, and more. Initial setup takes 5 minutes.

Open-source and self-hostable with CLI and UI
Tracks citations across multiple AI engines

Show HN: Focus on approving agent actions and managing team MCP access

2026-07-22 21:03 UTC

TrustLoopGuard is an open-source control boundary for production AI agents that checks proposed actions before they execute, returning permit, deny, require approval, or defer decisions with receipts.

Prevents agents from executing actions without authorization by checking at runtime.
Returns explicit decisions (permit, deny, require_approval, defer) with reasons.

CoreBase: Governed AI Agents for Your Product, on Your Customers' Data

2026-07-22 16:35 UTC

CoreBase has a new look. It offers a governed infrastructure layer for building and deploying AI agents with built-in connectors, permissions, audit trails, and cost controls, enabling trusted AI agents for your product.

CoreBase provides a governed infrastructure layer for AI agents.
Includes connectors, permissions, audit trails, and cost controls.

Show HN: Turn narrated screen recordings into data for AI agents (local, MIT)

2026-07-22 13:02 UTC

talkthrough-mcp is a local-first MCP server that processes narrated screen recordings into structured data for AI agents. It provides timestamped transcripts, scene-change keyframes, OCR, speaker diarization, and wall-clock anchoring, all running locally without cloud dependencies. The server integrates with various MCP clients and includes pre-built workflows for triaging recordings, extracting specs, and generating backlogs.

Local-first MCP server for narrated screen recordings; no cloud or LLM inside.
Provides tools for transcription, keyframes, OCR, speaker diarization, and wall-clock mapping.

Why R&D Data Belongs in the Lakehouse - and Why Agents Need It There

2026-07-21 19:45 UTC

At cellcentric, a joint venture of Daimler Truck and Volvo Group, the Data Hub built on Databricks serves as a governed context layer for data and AI, unifying scattered R&D data from sources like IoT, SAP, and MES. By making documentation a first-class quality metric and exposing context via MCP, it accelerates investigations from weeks to days and enables governed agent access.

Data Hub is a governed context layer providing a unified UI for employees and an MCP server for agents. Documentation coverage is a first-class quality metric. Agent access is governed through Unity Catalog and identity forwarding, ensuring no bypass of permissions.

How Apollo Uses Deep Agents and LangSmith for GTM AI

2026-07-21 18:27 UTC

Apollo uses Deep Agents and LangSmith to power an AI Assistant that handles prospecting, enrichment, outreach, analytics, and MCP integrations.

Apollo rebuilt its AI Assistant from a supervisor-based architecture to a skill-based one using Deep Agents, improving flexibility and efficiency.
The new architecture reduced development cycle by ~80-85% and significantly decreased confirmation prompts for users.

Remote OpenClaw

2026-07-21 14:57 UTC

Remote OpenClaw offers over 13,000 MCP servers, skills, and plugins designed to enhance AI coding agents, providing a comprehensive ecosystem for developers.

Over 13,000 MCP servers, skills, and plugins available
Specifically tailored for AI coding agents

Show HN: Rowset – An open-source back end for AI agents

2026-07-21 14:24 UTC

Rowset is a private MCP and REST backend for structured datasets that trusted AI agents can create, inspect, update, export, and share. It provides a stable programmatic interface for agents, avoiding browser automation.

Rowset offers MCP and REST APIs for AI agents to manage datasets
Features include row CRUD, projects, column types, exports, and public previews

The classic Java RSS reader won't run in 2026, so I rebuilt it to the web

2026-07-21 12:18 UTC

A developer rebuilt the abandoned Java desktop RSS reader RSSOwl for the web using AI (Claude Code) and Vaadin 25. Most of the UI transferred quickly, but the AI produced incorrect APIs due to outdated training data. With the help of an MCP server for current docs and manual verification against the original, a multi-user reader emerged, though some features (pluggable menus, embedded browser) were impossible to port.

RSSOwl is a classic Eclipse desktop RSS reader, but its 32-bit binary won't run on a 2026 Mac.
The developer used Claude AI and Vaadin 25 to rebuild the core three-pane interface in hours.

ANSI escape injection in MCP servers: Hidden from humans, visible to AI

2026-07-21 07:01 UTC

ANSI escape sequences can be used to hide instructions from human reviewers while remaining visible to AI agents, enabling injection attacks. This article covers two attack variants (direct-fetch and stored AESI) and how DAST can automatically detect them.

ANSI escape sequences are invisible in terminals but read byte-by-byte by language models, creating an attack surface.
Direct-fetch AESI injects hidden instructions via malicious URLs; stored AESI persists in storage and triggers on later reads.

Storybook: AI MCP

2026-07-21 06:14 UTC

Storybook introduces AI integration using MCP tools, enabling AI agents to generate UI from existing components with automated test feedback via Storybook Test to ensure quality and consistency.

Storybook provides structured UI context and test feedback for AI agents, promoting component reuse and reducing hallucinations.
Agents write stories to document component states and edge cases, making changes explicit.

Cairn|personal finance your own AI can read over MCP, read-only by construction

2026-07-21 01:15 UTC

Cairn is a personal finance AI that accesses financial data read-only via the MCP protocol, ensuring security.

Cairn uses MCP protocol for read-only access to personal finance data
System designed as read-only to prevent data modification

Show HN: Agent Search Engine – an independent index of 247 AI agents

2026-07-21 00:46 UTC

A live technical index ranking AI agents, MCP servers, frameworks, and infrastructure by maintained adoption. Explore 247 verified records across 11 system classes.

An independent, unbiased index of 247 AI agents across 11 system classes.
Ranked by maintained adoption, not sponsorship.

Show HN: Vidmoat – a video editing pipeline any AI agent can operate

2026-07-21 00:09 UTC

Vidmoat is an AI-powered video editor that lets you edit by prompt. It features auto-cut, AI captions, text-to-edit, one-click shorts, and an MCP server that enables any AI agent (Claude, Cursor, etc.) to drive the entire editing pipeline end-to-end.

AI-first video editor operated via prompts or AI agents
MCP server integration for external agents (Claude, Cursor, etc.)

AI's most important protocol is getting a little bit easier to use

2026-07-20 23:56 UTC

The Model Context Protocol (MCP) is receiving a significant update that simplifies how AI models connect to external data sources and services, as explained by startup Arcade.

MCP is a foundational protocol for AI interoperability
The new version has been in spec since May and launches next week

At SIGGRAPH, NVIDIA Advances Graphics and Simulation With Agentic and Physical AI

2026-07-20 15:00 UTC

NVIDIA unveils major advancements at SIGGRAPH 2026, including MCP-connected AI agents for creative tools, a synthetic video detector NIM microservice, and the open-source Cosmos 3 Edge world model for edge physical AI. These innovations span content creation, media verification, and robotics.

Creative apps like Adobe, Affinity, and Blender adopt MCP to enable AI agents for automated tasks.
New Synthetic Video Detector NIM microservice analyzes video frame-by-frame with up to 92% accuracy.

Venv-manager, a Python venv runtime for Humans and AI agents

2026-07-20 14:23 UTC

Venv-manager is a Go-based CLI tool for managing Python virtual environments, designed for both humans and AI agents. It offers a file watcher that auto-installs missing imports, an MCP server for agentic control, sandboxed ephemeral execution, and comprehensive venv lifecycle commands, addressing common venv sprawl issues.

Single static binary written in Go, requires only python3 (optional uv for speed).
File watcher automatically detects and installs missing imports as files change.

Show HN: Building a product for humans and AI agents

2026-07-20 14:15 UTC

The article details the journey of building Competitor Tracker, a tool designed for both humans and AI agents to track competitors. It discusses how AI shifts the bottleneck from development to go-to-market, making building easier but selling harder. The author shares the backstory of failed attempts, the eventual collaboration with a team, and the decision to build a product that is API-first, with MCP and webhook support, catering to both humans and agents. The product sends weekly digests and offers a noir-themed interface with a dog mascot.

AI shifts product development bottleneck from building to marketing and selling.
Competitor Tracker is an API-first product for tracking competitors, usable by humans and AI agents.

Show HN: Give your AI agent a personality (and a voice) without external APIs

2026-07-20 13:01 UTC

mcp-speak is an open-source tool that gives AI agents a voice and personality through speech integration and customizable personality profiles. It supports multiple AI clients and requires no external APIs for voice interaction.

mcp-speak is an MCP server that adds voice and personality to AI agents without external APIs.
Offers multiple personality profiles like Sarcastic Senior, Eager Intern, etc., altering agent speech behavior.

Automating first-pass customer support with Claude Code and MCP

2026-07-20 12:02 UTC

A founder automated his support inbox using Claude Code and MCP servers: the agent classifies, investigates, and drafts replies, but never sends them. The post details the setup, key rules (no guessing, drafts only), and real-world results over 48 hours.

Claude Code runs daily via a scheduled job, using four MCP servers to pull support threads, error logs, and email drafts.
Two critical rules: every claim in a draft must be verified this session, and the agent is never allowed to send emails—only draft.

Top 5 MCP Servers for High-Performance Agentic Development

2026-07-20 12:00 UTC

This article highlights five MCP servers that genuinely enhance AI agent capabilities, chosen for their impact rather than star counts. They include GitHub MCP, Playwright MCP, Context7, Serena, and the Official Reference Servers, with insights on integrating them for a powerful agentic setup.

MCP has become the USB-C for agent tooling, standardizing integrations.
GitHub MCP server enables agents to manage repositories, issues, PRs, and Actions via natural language.

Show HN: Hail.so – Open-source phone, SMS and email for agents and humans, v0.15

2026-07-20 10:55 UTC

Hail.so is an open-source (AGPLv3) universal communication platform for AI agents, offering phone calls, SMS, and email. It is outbound-first with inbound support, self-hostable via Docker Compose, and integrates multiple STT/TTS providers. Version 0.15 includes CLI, Python SDK, MCP server, and OpenAPI spec.

Hail.so enables AI agents to make phone calls, send SMS, and emails, with outbound priority and inbound support.
Self-hostable via Docker Compose; integrates Twilio, Telnyx, AWS SES, and LiveKit Cloud.

Show HN: Bothread – multiple AI coding agents talk, share one repo, no collisions

2026-07-19 19:09 UTC

Bothread is a free, open-source local coordination hub that lets multiple MCP-compatible AI coding agents collaborate on the same codebase, preventing file collisions via exclusive claims, and providing a live human-supervision interface with real-time messaging, git diffs, task boards, and approval gates. No API keys or cloud required.

Enables multiple AI agents (Claude Code, Cursor, Antigravity, etc.) to work together on one codebase with collision prevention.
Includes human controls: live activity trail, approval gates, task board, per-agent git diffs, and file hand-offs.

Show HN: CallBro – Granola, but Powered by Codex, Claude Code, or Your Local LLM

2026-07-19 13:19 UTC

CallBro is a free, private meeting notes app that transcribes calls locally on your device. It uses AI (Codex, Claude Code, or local LLM) to generate summaries and action items, ensuring all data stays on your device with no cloud upload. It integrates with tools via MCP and works across platforms (macOS, Linux, Windows).

Transcribes calls locally with no cloud upload.
Uses AI (Codex, Claude Code, or local LLM) for summaries and action items.

Top 10 GitHub Repositories Trending in July 2026 (AI, ML & GenAI Edition)

2026-07-19 10:38 UTC

A roundup of the top 10 trending AI GitHub repositories in July 2026, highlighting a shift from new models to agent tooling, MCP servers, and practical AI applications. Projects like Strix (AI pentesting), Grok Build (coding agent), Vibe-Trading (quant), and Colibri (local LLM inference) reflect the focus on infrastructure and real-world utility.

The July 2026 GitHub trending list is dominated by AI agent tooling and infrastructure, not new models.
Top repos include Strix (AI security), Grok Build (coding agent), Vibe-Trading (quant), codebase-memory-mcp (code understanding), and Colibri (local MoE inference).

AI for Bug Bounty with VulneraMCP

2026-07-18 21:20 UTC

This article introduces VulneraMCP, an AI-enhanced security testing platform built on ZAP. By integrating machine learning through the Model Context Protocol (MCP), it achieves adaptive vulnerability detection and fully automated workflows. The system uses ZAP's REST API for core scanning, dynamically generates payloads based on training data from HackTheBox, PortSwigger Academy, and real-world bug bounty reports, and significantly improves detection accuracy. Author Telmon Maluleka details the architecture, components, workflow, and results.

VulneraMCP combines ZAP's scanning engine with AI learning for advanced bug hunting
Architecture includes ZAP integration layer, MCP proxy layer, learning engine, and database

Talon – a self-hosted harness for long-lived AI agents

2026-07-18 16:24 UTC

Talon is a multi-platform, self-hosted AI agent framework supporting Telegram, Discord, Microsoft Teams, terminal, and a cross-platform desktop/mobile app. It offers pluggable backends (Claude Agent SDK, Kilo, OpenCode, Codex, OpenAI Agents) and full MCP tool access, with background agents, goal management, skill system, event bus, and hot-reloadable plugins. The architecture is clean, with frontend and backend independent, making it highly extensible.

Supports multiple frontends (Telegram, Discord, Teams, terminal, desktop/mobile) and backends (Claude, Kilo, OpenCode, Codex, OpenAI Agents) with rich MCP tools.
Features background agents (heartbeat, dream), persistent goals, skill system (SKILL.md), and triggers for proactive task advancement.

MCP

Related topics