This release brings the HFresh disk-based vector index and the built-in MCP Server to general availability, rebuilds cluster-wide async replication to run from a single scheduler (on by default), and adds two previews: the Boost API and Nested Object Filtering.
HFresh disk-based vector index is now GA, suitable for streaming workloads
MCP Server GA enables LLMs and AI agents to interact directly with Weaviate
Most vector database prototypes fail at ingest. This guide covers server-side batching, error handling, blobHash data type, and multimodal ingestion for Weaviate, with code examples and production-ready patterns.
Use server-side batching to auto-tune batch size and avoid manual tuning
Deterministic UUIDs make retries idempotent, preventing duplicate work and costs
Weaviate Cloud now offers free tiers across its entire product suite, including the managed database, Query Agent, and Engram, with no credit card required and no time limit, enabling users to build prototypes and use them indefinitely.
Weaviate Cloud now offers free tiers across all products.
Free tiers include the managed Weaviate database, Query Agent, and Engram.
Weaviate announces the general availability of Engram, a managed memory and context service for agentic applications. It addresses long-context degradation, messy raw data, and multi-agent context fragmentation through asynchronous pipelines, templates, and built-in scopes, helping agents compound value over time.
Engram is a managed memory and context service for agents, now GA.
It solves three critical failure modes: long-context degradation, messy raw data, and multi-agent context fragmentation.
A researcher argues retrieval quality is the most critical factor in RAG systems, outweighing model size or prompt design. Poor retrieval leads to undetectable hallucinations. The article identifies five common failure modes and offers practical tips for improving retrieval, including hybrid search, cross-encoder re-ranking, and continuous evaluation.
Retrieval quality is the primary determinant of RAG output reliability.
Five retrieval failure modes: retrieval drift, context truncation, stale index poisoning, low-relevance top-k retrieval, and inter-agent miscommunication.
This release introduces the built-in MCP Server, Extensible Tokenizers, Diversity Search (MMR), and Query Profiling as previews, along with Incremental Backups, Gemini audio support for multi2vec-google, and the new BlobHash property type.
Built-in MCP Server preview enables native integration with AI agents and IDEs via the Model Context Protocol.
Extensible Tokenizers preview adds accent folding, custom stopword presets, and a tokenize endpoint for observability.
Weaviate Shared Cloud is now generally available on AWS in US East and Europe, providing teams with a fully managed, AI-native database on the provider and region that works best for them.
Weaviate Shared Cloud is now generally available on AWS in US East (N. Virginia) and Europe (Frankfurt).
Fully managed clusters with automatic upgrades, granular RBAC, immutable backups, and SOC 2/ISO 27001 certifications.
Two weeks of dogfooding Engram, Weaviate's memory product, in daily Claude Code sessions. This surfaced where a dedicated memory product adds value, and the specific mechanics that prevent integration with coding assistants from working well.
Claude defaults to MEMORY.md because it's always loaded with zero latency, making external tools unnecessary without explicit triggers.
Engram adds value by structuring memory around topics, excelling in decision archaeology but requiring deterministic triggers to be used.
Multimodal embeddings allow AI systems to search and reason across text, images, audio, and video in their native formats. This blog covers the key intuitions behind how this all works and walks through three practical implementations using Weaviate and Gemini.
Multimodal embeddings map different modalities into a shared semantic space, enabling cross-modal retrieval.
Using native embeddings instead of bridge approaches avoids information loss, such as audio tone or PDF layout.
A comprehensive guide to securing Weaviate enterprise deployments using OIDC, RBAC, multi-tenant isolation, audit logging, and network security, illustrated through the fictional MedVector Health case study.
OIDC integration delegates authentication to existing identity providers, eliminating shared API keys.
Role-based access control (RBAC) provides granular, collection-level and tenant-level permissions.
Learn how we built a production-ready, end-to-end RAG application in just 36 hours using the Query Agent and the new Weaviate Agent Skills library. The post explains the architecture, comparison with naive RAG, and step-by-step instructions.
Agentic search surpasses naive RAG by adding a reasoning layer, crucial for legal queries.
The architecture uses multimodal embedding with Muvera compression and three collection schemas.