Weaviate Blog AI News Source

Public articles 14Collected articles 17Trust 82Refresh 120 min

Health HealthySource type OfficialFull-text rights Official full textLast ingested 2026-06-25ID weaviate-blogStatus Enabled

Official vector database and AI infrastructure feed; confirm reuse terms before full body display.

Latest public articles

Weaviate 1.38 Release

2026-06-25 00:00 UTC

This release brings the HFresh disk-based vector index and the built-in MCP Server to general availability, rebuilds cluster-wide async replication to run from a single scheduler (on by default), and adds two previews: the Boost API and Nested Object Filtering.

HFresh disk-based vector index is now GA, suitable for streaming workloads
MCP Server GA enables LLMs and AI agents to interact directly with Weaviate

Import & Vectorize Data with Weaviate at Scale

2026-06-18 00:00 UTC

Most vector database prototypes fail at ingest. This guide covers server-side batching, error handling, blobHash data type, and multimodal ingestion for Weaviate, with code examples and production-ready patterns.

Use server-side batching to auto-tune batch size and avoid manual tuning
Deterministic UUIDs make retries idempotent, preventing duplicate work and costs

Weaviate Cloud is now free to start

2026-06-17 00:00 UTC

Weaviate Cloud now offers free tiers across its entire product suite, including the managed database, Query Agent, and Engram, with no credit card required and no time limit, enabling users to build prototypes and use them indefinitely.

Weaviate Cloud now offers free tiers across all products.
Free tiers include the managed Weaviate database, Query Agent, and Engram.

Engram is now Generally Available

2026-06-03 00:00 UTC

Weaviate announces the general availability of Engram, a managed memory and context service for agentic applications. It addresses long-context degradation, messy raw data, and multi-agent context fragmentation through asynchronous pipelines, templates, and built-in scopes, helping agents compound value over time.

Engram is a managed memory and context service for agents, now GA.
It solves three critical failure modes: long-context degradation, messy raw data, and multi-agent context fragmentation.

Build a Coding Assistant with Weaviate MCP: RAG over Code & Docs

2026-05-21 00:00 UTC

Use Weaviate's built-in MCP server to give Claude Code, Cursor, and VS Code hybrid search over your codebase and docs. No glue code.

LLMs have limited context and no knowledge of private code; naive prompting leads to high costs and stale context.
Weaviate integrates MCP server inside the database, exposing hybrid search (BM25+vector) and other tools at /v1/mcp.

Your LLM Is Only as Good as What It Retrieves

2026-05-06 00:00 UTC

A researcher argues retrieval quality is the most critical factor in RAG systems, outweighing model size or prompt design. Poor retrieval leads to undetectable hallucinations. The article identifies five common failure modes and offers practical tips for improving retrieval, including hybrid search, cross-encoder re-ranking, and continuous evaluation.

Retrieval quality is the primary determinant of RAG output reliability.
Five retrieval failure modes: retrieval drift, context truncation, stale index poisoning, low-relevance top-k retrieval, and inter-agent miscommunication.

Weaviate 1.37 Release

2026-04-23 00:00 UTC

This release introduces the built-in MCP Server, Extensible Tokenizers, Diversity Search (MMR), and Query Profiling as previews, along with Incremental Backups, Gemini audio support for multi2vec-google, and the new BlobHash property type.

Built-in MCP Server preview enables native integration with AI agents and IDEs via the Model Context Protocol.
Extensible Tokenizers preview adds accent folding, custom stopword presets, and a tokenize endpoint for observability.

Engram: Memory by Weaviate

2026-04-21 00:00 UTC

A deep dive into Engram, our managed memory service for agents which is simple to get started but adaptable to any use case.

Engram is a managed memory service for agentic applications, built on Weaviate.
It uses async pipelines to extract, reconcile, and persist memories.

Weaviate Shared Cloud now generally available on AWS

2026-04-15 00:00 UTC

Weaviate Shared Cloud is now generally available on AWS in US East and Europe, providing teams with a fully managed, AI-native database on the provider and region that works best for them.

Weaviate Shared Cloud is now generally available on AWS in US East (N. Virginia) and Europe (Frankfurt).
Fully managed clusters with automatic upgrades, granular RBAC, immutable backups, and SOC 2/ISO 27001 certifications.

Oh Memories, Where'd You Go

2026-04-02 00:00 UTC

Two weeks of dogfooding Engram, Weaviate's memory product, in daily Claude Code sessions. This surfaced where a dedicated memory product adds value, and the specific mechanics that prevent integration with coding assistants from working well.

Claude defaults to MEMORY.md because it's always loaded with zero latency, making external tools unnecessary without explicit triggers.
Engram adds value by structuring memory around topics, excelling in decision archaeology but requiring deterministic triggers to be used.

Multimodal Embeddings and RAG: A Practical Guide

2026-04-01 00:00 UTC

Multimodal embeddings allow AI systems to search and reason across text, images, audio, and video in their native formats. This blog covers the key intuitions behind how this all works and walks through three practical implementations using Weaviate and Gemini.

Multimodal embeddings map different modalities into a shared semantic space, enabling cross-modal retrieval.
Using native embeddings instead of bridge approaches avoids information loss, such as audio tone or PDF layout.

Your Code is Your Schema: Weaviate Managed C# Client

2026-03-31 00:00 UTC

The Weaviate Managed .NET Client brings Entity Framework Core-like experience to C# developers, enabling attribute-driven schema, type-safe queries, and automatic migrations for vector databases.

Define collection schemas with C# attributes, no string configs
Type-safe LINQ-style queries for vector and hybrid search

Securing Enterprise AI with Weaviate

2026-03-19 00:00 UTC

A comprehensive guide to securing Weaviate enterprise deployments using OIDC, RBAC, multi-tenant isolation, audit logging, and network security, illustrated through the fictional MedVector Health case study.

OIDC integration delegates authentication to existing identity providers, eliminating shared API keys.
Role-based access control (RBAC) provides granular, collection-level and tenant-level permissions.

Building A Legal RAG App in 36 Hours

2026-02-26 00:00 UTC

Learn how we built a production-ready, end-to-end RAG application in just 36 hours using the Query Agent and the new Weaviate Agent Skills library. The post explains the architecture, comparison with naive RAG, and step-by-step instructions.

Agentic search surpasses naive RAG by adding a reasoning layer, crucial for legal queries.
The architecture uses multimodal embedding with Muvera compression and three collection schemas.

Weaviate Blog