返回技能列表

qdrant-hybrid-search

qdrant
更新于 5 days ago
158
18
158
在 GitHub 上查看
文档处理wordpowerpointai

关于

This skill explains how to implement hybrid search in Qdrant, which combines keyword and semantic search by running parallel searches using the `prefetch` feature in the Query API. It guides developers on merging results from sparse and dense vectors to address issues like missing keyword matches. Use it when setting up combined search methods or handling multiple search representations.

快速安装

Claude Code

推荐
主要方式
npx skills add qdrant/skills -a claude-code
插件命令备选方式
/plugin add https://github.com/qdrant/skills
Git 克隆备选方式
git clone https://github.com/qdrant/skills.git ~/.claude/skills/qdrant-hybrid-search

在 Claude Code 中复制并粘贴此命令以安装该技能

技能文档

Hybrid Search in Qdrant

Hybrid search means running two or more different searches in parallel and combining their results into one.

In Qdrant this is powered by the Query API via prefetch: each prefetch runs exactly one type of search independently, and the outer query combines results from parallel prefetches.
Prefetches can be nested and searches can be multi-stage, all pipeline happening in one request through Query API. See Universal Query API for examples.

Identify the user's problem and pick building blocks:

  • What can go into one prefetch, e.g. power one search, in Search Types
  • How to combine results of these searches (RRF, DBSF, FormulaQuery, reranking) in Combining Searches

Based on what you've picked, test your approach:

  1. Configure Qdrant collection with named vectors, where each named vector usually corresponds to one representation (different embedding models or different vector types) of a data point.
  2. Construct a hybrid search request with Query API from your building blocks. You can search independently among one type of vectors, with prefetch + using, like shown in examples in Hybrid Queries documentation.
  3. Evaluate hybrid search quality on real user data and provide user with improvements and tradeoffs (speed/resources).

How Isolated Are Parallel Searches?

Use when: different tenants share one collection and you need to understand hybrid search isolation guarantees.

If user wants to isolate/share hybrid search pipelines between tenants, consider that:

  • Indexes (sparse, payload and dense) and IDF modifier for sparse vectors are computed independently per shard, not per tenant.
  • Prefetch runs independently per shard to retrieve #limit results, so for collection-level prefetches if collection has several shards, Qdrant will always prefetch under the hood #limit * #shard results. Final results are merged based on scores.
  • In nested prefetches (deeper than 1 level), methods described in "Combining Searches" might be done on a shard level first, then per-shards results once again will be merged based on scores.

What NOT to Do

  • Choose a hybrid search pattern based on "vibes" without any hybrid search quality evaluation in-place.
  • Create too many named vectors without a need. An unfilled named vector might take as much resources as a filled one.

GitHub 仓库

qdrant/skills
路径: skills/qdrant-search-quality/search-strategies/hybrid-search
0
agent-skillsai-agentsclaude-codecodexcursorembeddings

相关推荐技能

release-standards

文档处理

这个Skill为开发者提供了语义化版本规范和变更日志格式标准。它能在准备软件发布时快速指导版本号更新和变更日志撰写,包含版本号递增规则、预发布标识符等关键信息。适用于需要遵循规范发布流程的开发场景。

查看技能

commit-standards

文档处理

这个Skill帮助开发者遵循Conventional Commits规范格式化Git提交信息。它提供了标准格式模板和常用提交类型的中英文对照表(如feat/新增、fix/修正等),适用于编写提交、执行git commit或审查提交历史的场景。通过确保提交信息的规范性和一致性,它能提升团队协作效率和版本历史可读性。

查看技能

huggingface-tokenizers

文档处理

HuggingFace Tokenizers 提供了基于 Rust 的高性能分词工具,支持 BPE、WordPiece 和 Unigram 算法,能在一分钟内处理 1GB 文本。它适用于需要快速分词或训练自定义词汇表的场景,并能无缝集成到 transformers 库中。开发者可以借助它进行对齐跟踪、填充截断等操作,满足从研究到生产的全流程需求。

查看技能

nano-pdf

文档处理

nano-pdf 让开发者能用自然语言指令直接编辑PDF文件,无需手动操作复杂工具。它通过命令行快速修改指定页面内容,如修正拼写错误或更新标题,适合处理日常文档微调。使用前请注意核对页码和输出结果,确保修改准确无误。

查看技能