AI News HubLIVE
Public articles 14Collected articles 17Trust 82Refresh 120 min
Health HealthySource type OfficialFull-text rights Official full textLast ingested 2026-06-25ID weaviate-blogStatus Enabled

Official vector database and AI infrastructure feed; confirm reuse terms before full body display.

Latest public articles

Weaviate 1.38 Release

This release brings the HFresh disk-based vector index and the built-in MCP Server to general availability, rebuilds cluster-wide async replication to run from a single scheduler (on by default), and adds two previews: the Boost API and Nested Object Filtering.

  • HFresh disk-based vector index is now GA, suitable for streaming workloads
  • MCP Server GA enables LLMs and AI agents to interact directly with Weaviate
In-site article

Import & Vectorize Data with Weaviate at Scale

Most vector database prototypes fail at ingest. This guide covers server-side batching, error handling, blobHash data type, and multimodal ingestion for Weaviate, with code examples and production-ready patterns.

  • Use server-side batching to auto-tune batch size and avoid manual tuning
  • Deterministic UUIDs make retries idempotent, preventing duplicate work and costs
In-site article

Weaviate Cloud is now free to start

Weaviate Cloud now offers free tiers across its entire product suite, including the managed database, Query Agent, and Engram, with no credit card required and no time limit, enabling users to build prototypes and use them indefinitely.

  • Weaviate Cloud now offers free tiers across all products.
  • Free tiers include the managed Weaviate database, Query Agent, and Engram.
In-site article

Engram is now Generally Available

Weaviate announces the general availability of Engram, a managed memory and context service for agentic applications. It addresses long-context degradation, messy raw data, and multi-agent context fragmentation through asynchronous pipelines, templates, and built-in scopes, helping agents compound value over time.

  • Engram is a managed memory and context service for agents, now GA.
  • It solves three critical failure modes: long-context degradation, messy raw data, and multi-agent context fragmentation.
In-site article

Build a Coding Assistant with Weaviate MCP: RAG over Code & Docs

Use Weaviate's built-in MCP server to give Claude Code, Cursor, and VS Code hybrid search over your codebase and docs. No glue code.

  • LLMs have limited context and no knowledge of private code; naive prompting leads to high costs and stale context.
  • Weaviate integrates MCP server inside the database, exposing hybrid search (BM25+vector) and other tools at /v1/mcp.
In-site article

Your LLM Is Only as Good as What It Retrieves

A researcher argues retrieval quality is the most critical factor in RAG systems, outweighing model size or prompt design. Poor retrieval leads to undetectable hallucinations. The article identifies five common failure modes and offers practical tips for improving retrieval, including hybrid search, cross-encoder re-ranking, and continuous evaluation.

  • Retrieval quality is the primary determinant of RAG output reliability.
  • Five retrieval failure modes: retrieval drift, context truncation, stale index poisoning, low-relevance top-k retrieval, and inter-agent miscommunication.
In-site article

Weaviate 1.37 Release

This release introduces the built-in MCP Server, Extensible Tokenizers, Diversity Search (MMR), and Query Profiling as previews, along with Incremental Backups, Gemini audio support for multi2vec-google, and the new BlobHash property type.

  • Built-in MCP Server preview enables native integration with AI agents and IDEs via the Model Context Protocol.
  • Extensible Tokenizers preview adds accent folding, custom stopword presets, and a tokenize endpoint for observability.
In-site article

Engram: Memory by Weaviate

A deep dive into Engram, our managed memory service for agents which is simple to get started but adaptable to any use case.

  • Engram is a managed memory service for agentic applications, built on Weaviate.
  • It uses async pipelines to extract, reconcile, and persist memories.
In-site article

Weaviate Shared Cloud now generally available on AWS

Weaviate Shared Cloud is now generally available on AWS in US East and Europe, providing teams with a fully managed, AI-native database on the provider and region that works best for them.

  • Weaviate Shared Cloud is now generally available on AWS in US East (N. Virginia) and Europe (Frankfurt).
  • Fully managed clusters with automatic upgrades, granular RBAC, immutable backups, and SOC 2/ISO 27001 certifications.
In-site article

Oh Memories, Where'd You Go

Two weeks of dogfooding Engram, Weaviate's memory product, in daily Claude Code sessions. This surfaced where a dedicated memory product adds value, and the specific mechanics that prevent integration with coding assistants from working well.

  • Claude defaults to MEMORY.md because it's always loaded with zero latency, making external tools unnecessary without explicit triggers.
  • Engram adds value by structuring memory around topics, excelling in decision archaeology but requiring deterministic triggers to be used.
In-site article

Multimodal Embeddings and RAG: A Practical Guide

Multimodal embeddings allow AI systems to search and reason across text, images, audio, and video in their native formats. This blog covers the key intuitions behind how this all works and walks through three practical implementations using Weaviate and Gemini.

  • Multimodal embeddings map different modalities into a shared semantic space, enabling cross-modal retrieval.
  • Using native embeddings instead of bridge approaches avoids information loss, such as audio tone or PDF layout.
In-site article

Your Code is Your Schema: Weaviate Managed C# Client

The Weaviate Managed .NET Client brings Entity Framework Core-like experience to C# developers, enabling attribute-driven schema, type-safe queries, and automatic migrations for vector databases.

  • Define collection schemas with C# attributes, no string configs
  • Type-safe LINQ-style queries for vector and hybrid search
In-site article

Securing Enterprise AI with Weaviate

A comprehensive guide to securing Weaviate enterprise deployments using OIDC, RBAC, multi-tenant isolation, audit logging, and network security, illustrated through the fictional MedVector Health case study.

  • OIDC integration delegates authentication to existing identity providers, eliminating shared API keys.
  • Role-based access control (RBAC) provides granular, collection-level and tenant-level permissions.
In-site article

Building A Legal RAG App in 36 Hours

Learn how we built a production-ready, end-to-end RAG application in just 36 hours using the Query Agent and the new Weaviate Agent Skills library. The post explains the architecture, comparison with naive RAG, and step-by-step instructions.

  • Agentic search surpasses naive RAG by adding a reasoning layer, crucial for legal queries.
  • The architecture uses multimodal embedding with Muvera compression and three collection schemas.
In-site article

All sources