AI News HubLIVE
Public articles 10Collected articles 21Trust 82Refresh 120 min
Health HealthySource type OfficialFull-text rights Official full textLast ingested 2026-06-26ID qdrant-blogStatus Enabled

Official vector database and AI infrastructure feed; confirm reuse terms before full body display.

Latest public articles

Predicting Weak Retrieval Without an LLM

Most retrieval systems use a single pipeline for all queries, which either under-serves hard queries or wastes compute on easy ones. This article presents cheap signals—like score spread and retriever agreement—to detect weak retrieval without an LLM, enabling selective escalation only when needed.

  • Weak retrieval occurs when the needed evidence is missing from the top-k results, even if recall is high deeper down.
  • Cheap signals such as dense variance (spread), agreement between dense and sparse retrievers, and top-score height can predict weak retrieval with AUC up to 0.76.
In-site article

TurboQuant in Qdrant

Qdrant 1.18 ships TurboQuant, a new rotation-based vector quantization method from Google Research, with extensions for production embeddings. It offers 4-bit, 2-bit, 1.5-bit, and 1-bit options, outperforming or matching Scalar Quantization (SQ) and Binary Quantization (BQ) in compression and recall. The article explains the algorithm, Qdrant's enhancements (length renormalization and per-coordinate calibration), and benchmark results.

  • TurboQuant is a new rotation-based vector quantization algorithm that matches SQ recall at 4x compression and outperforms BQ by 9-24 percentage points at 2-bit and 1-bit modes.
  • Qdrant enhances TurboQuant with length renormalization and per-coordinate calibration (anisotropy compensation) to handle real-world embeddings.
In-site article

Fine-Tuning Sparse Embeddings for E-Commerce Search | Part 2: Training SPLADE on Modal

This is Part 2 of a 5-part series on fine-tuning sparse embeddings for e-commerce search. It covers training a SPLADE model on Modal's serverless GPUs using the Amazon ESCI dataset, including data loading, product text formatting, Modal setup, model creation, training function, SpladeLoss, YAML configuration, parallel hyperparameter sweeps, and a pitfall to avoid.

  • Use Amazon ESCI dataset with Exact and Substitute pairs as positives for training SPLADE.
  • Product text formatting is critical for sparse embeddings; use bracket notation for brands and pipe separators for sections.
In-site article

Fine-Tuning Sparse Embeddings for E-Commerce Search | Part 1: Why Sparse Embeddings Beat BM25

This article is Part 1 of a series on fine-tuning sparse embeddings for e-commerce search. It explains why dense embeddings fail in product search due to blurred exact matches, and how sparse embeddings preserve critical details. It introduces the SPLADE model, query expansion, and Qdrant's native sparse vector support. The fine-tuned system achieves a 29% improvement over BM25 on the Amazon ESCI dataset.

  • Dense embeddings blur exact matches, causing irrelevant results in e-commerce search.
  • Sparse embeddings use a large vocabulary space to preserve term importance weights for precise matching and interpretability.
In-site article

Distance-based data exploration

This article explores how Qdrant's Distance Matrix API enables efficient data exploration through dimensionality reduction, clustering, and graph-based visualization, helping to uncover hidden structures in large unstructured datasets.

  • Qdrant 1.12's Distance Matrix API efficiently computes pairwise distances, leveraging pre-indexed data.
  • UMAP dimensionality reduction can directly use the precomputed distance matrix for 2D visualization.
In-site article

Qdrant Summer of Code 2024 - ONNX Cross Encoders in Python

In this article, Huong (Celine) Hoang shares her experience integrating ONNX cross-encoders into the FastEmbed library during Qdrant's Summer of Code 2024. The project enables re-ranking search results using relevance scores, enhancing context-aware search applications. Key challenges included building a new input-output scheme, tokenization, model loading, and testing. The functionality is available in FastEmbed 0.4.0.

  • Cross-encoders have been integrated into FastEmbed for re-ranking tasks.
  • The project used ONNX models to avoid heavy dependencies like PyTorch.
In-site article

What is a Vector Database?

An introduction to vector databases, covering how they represent unstructured data as vectors, their architecture (collections, distance metrics, storage), core operations (indexing, searching, updating, deleting), advanced features (dense/sparse vectors, hybrid search, quantization, sharding, replication, multitenancy, security), and practical use cases.

  • Vector databases store and search unstructured data as high-dimensional vectors, enabling similarity search.
  • Key components: unique ID, dimensions (numerical representation), and payload (metadata).
In-site article

What is Vector Quantization?

Vector quantization compresses high-dimensional vectors to reduce memory usage and speed up search operations in large datasets. This article covers scalar, binary, and product quantization methods, along with techniques like oversampling, rescoring, and io_uring to balance accuracy and performance.

  • Vector quantization reduces memory footprint and search latency, crucial for scaling to millions of vectors.
  • Scalar quantization (float32 to int8) offers 75% memory reduction with minimal accuracy loss.
In-site article

Qdrant 1.7.0 has just landed!

Qdrant 1.7.0 introduces native support for sparse vectors, enabling keyword-based search and hybrid search; a new Discovery API for more precise vector search, including discovery search and context search; user-defined sharding for flexible data distribution; snapshot-based shard transfer for efficient cluster scaling; and various performance improvements.

  • Native sparse vector support enables hybrid keyword-semantic search
  • Discovery API offers discovery search and context search modes
In-site article

On Unstructured Data, Vector Databases, New AI Age, and Our Seed Round.

Qdrant announces $7.5M seed funding led by Unusual Ventures. The article discusses the importance of vector databases in the AI age, the explosive growth of unstructured data, and Qdrant's progress as an open-source vector similarity search solution.

  • Qdrant raised $7.5M seed funding from Unusual Ventures and others.
  • Vector databases are foundational to the new AI stack for handling unstructured data.
In-site article

All sources