FastEmbed by Qdrant

FastEmbed 来自 Qdrant 是一个轻量级、快速的 Python 库，用于嵌入生成。

量化模型权重

ONNX 运行时，无需 PyTorch 依赖

优先考虑 CPU 设计

大数据集编码的数据并行性。

依赖

要在 LangChain 中使用 FastEmbed，请安装 fastembed Python 包。

%pip install --upgrade --quiet  fastembed

导入

<!--IMPORTS:[{"imported": "FastEmbedEmbeddings", "source": "langchain_community.embeddings.fastembed", "docs": "https://python.langchain.com/api_reference/community/embeddings/langchain_community.embeddings.fastembed.FastEmbedEmbeddings.html", "title": "FastEmbed by Qdrant"}]-->
from langchain_community.embeddings.fastembed import FastEmbedEmbeddings

实例化 FastEmbed

参数

model_name: str (默认: "BAAI/bge-small-en-v1.5")

要使用的 FastEmbedding 模型名称。您可以在这里找到支持的模型列表。

max_length: int (默认: 512)

最大令牌数。对于大于 512 的值，行为未知。

cache_dir: Optional[str] (默认: None)

缓存目录的路径。默认为父目录中的 local_cache。

threads: Optional[int] (默认: None)

单个onnxruntime会话可以使用的线程数。

doc_embed_type: Literal["default", "passage"] (默认: "default")

"default": 使用FastEmbed的默认嵌入方法。

"passage": 在嵌入之前用"passage"前缀文本。

batch_size: int (默认: 256)

编码的批量大小。更高的值将使用更多内存，但速度更快。

parallel: Optional[int] (默认: None)

如果 >1，将使用数据并行编码，推荐用于大型数据集的离线编码。如果 0，使用所有可用核心。如果 None，则不使用数据并行处理，而是使用默认的onnxruntime线程。

embeddings = FastEmbedEmbeddings()

使用

生成文档嵌入

document_embeddings = embeddings.embed_documents(
    ["This is a document", "This is some other document"]
)

生成查询嵌入

query_embeddings = embeddings.embed_query("This is a query")

FastEmbed by Qdrant

依赖

导入

实例化 FastEmbed

参数

使用

生成文档嵌入

生成查询嵌入

相关

Was this page helpful?

You can also leave detailed feedback on GitHub.

依赖​

导入​

实例化 FastEmbed​

参数​

使用​

生成文档嵌入​

生成查询嵌入​

相关​

Was this page helpful?

You can also leave detailed feedback on GitHub.

依赖

导入

实例化 FastEmbed

参数

使用

生成文档嵌入

生成查询嵌入

相关