Huggingface 端点

Hugging Face Hub 是一个拥有超过 12 万个模型、2 万个数据集和 5 万个演示应用（Spaces）的平台，所有资源均为开源和公开可用，提供一个在线平台，方便人们协作并共同构建机器学习。

Hugging Face Hub 还提供各种端点以构建机器学习应用。本示例展示了如何连接到不同的端点类型。

特别是，文本生成推理由文本生成推理提供支持：一个定制构建的 Rust、Python 和 gRPC 服务器，用于快速的文本生成推理。

<!--IMPORTS:[{"imported": "HuggingFaceEndpoint", "source": "langchain_huggingface", "docs": "https://python.langchain.com/api_reference/huggingface/llms/langchain_huggingface.llms.huggingface_endpoint.HuggingFaceEndpoint.html", "title": "Huggingface Endpoints"}]-->
from langchain_huggingface import HuggingFaceEndpoint

安装与设置

要使用此功能，您应该安装 huggingface_hub python 包。

%pip install --upgrade --quiet huggingface_hub

# get a token: https://huggingface.co/docs/api-inference/quicktour#get-your-api-token

from getpass import getpass

HUGGINGFACEHUB_API_TOKEN = getpass()

import os

os.environ["HUGGINGFACEHUB_API_TOKEN"] = HUGGINGFACEHUB_API_TOKEN

准备示例

<!--IMPORTS:[{"imported": "HuggingFaceEndpoint", "source": "langchain_huggingface", "docs": "https://python.langchain.com/api_reference/huggingface/llms/langchain_huggingface.llms.huggingface_endpoint.HuggingFaceEndpoint.html", "title": "Huggingface Endpoints"}]-->
from langchain_huggingface import HuggingFaceEndpoint

<!--IMPORTS:[{"imported": "LLMChain", "source": "langchain.chains", "docs": "https://python.langchain.com/api_reference/langchain/chains/langchain.chains.llm.LLMChain.html", "title": "Huggingface Endpoints"}, {"imported": "PromptTemplate", "source": "langchain_core.prompts", "docs": "https://python.langchain.com/api_reference/core/prompts/langchain_core.prompts.prompt.PromptTemplate.html", "title": "Huggingface Endpoints"}]-->
from langchain.chains import LLMChain
from langchain_core.prompts import PromptTemplate

question = "Who won the FIFA World Cup in the year 1994? "

template = """Question: {question}

Answer: Let's think step by step."""

prompt = PromptTemplate.from_template(template)

示例

这是一个示例，展示如何访问免费的无服务器端点 API 的 HuggingFaceEndpoint 集成。

repo_id = "mistralai/Mistral-7B-Instruct-v0.2"

llm = HuggingFaceEndpoint(
    repo_id=repo_id,
    max_length=128,
    temperature=0.5,
    huggingfacehub_api_token=HUGGINGFACEHUB_API_TOKEN,
)
llm_chain = prompt | llm
print(llm_chain.invoke({"question": question}))

专用端点

免费的无服务器 API 让您能够快速实现解决方案并进行迭代，但对于重负载使用情况可能会有限制，因为负载与其他请求共享。

对于企业工作负载，最佳选择是使用推理端点 - 专用。这提供了对完全托管基础设施的访问，提供更多的灵活性和速度。这些资源提供持续的支持和正常运行时间保证，以及自动扩展等选项。

# Set the url to your Inference Endpoint below
your_endpoint_url = "https://fayjubiy2xqn36z0.us-east-1.aws.endpoints.huggingface.cloud"

llm = HuggingFaceEndpoint(
    endpoint_url=f"{your_endpoint_url}",
    max_new_tokens=512,
    top_k=10,
    top_p=0.95,
    typical_p=0.95,
    temperature=0.01,
    repetition_penalty=1.03,
)
llm("What did foo say about bar?")

流式处理

<!--IMPORTS:[{"imported": "StreamingStdOutCallbackHandler", "source": "langchain_core.callbacks", "docs": "https://python.langchain.com/api_reference/core/callbacks/langchain_core.callbacks.streaming_stdout.StreamingStdOutCallbackHandler.html", "title": "Huggingface Endpoints"}, {"imported": "HuggingFaceEndpoint", "source": "langchain_huggingface", "docs": "https://python.langchain.com/api_reference/huggingface/llms/langchain_huggingface.llms.huggingface_endpoint.HuggingFaceEndpoint.html", "title": "Huggingface Endpoints"}]-->
from langchain_core.callbacks import StreamingStdOutCallbackHandler
from langchain_huggingface import HuggingFaceEndpoint

llm = HuggingFaceEndpoint(
    endpoint_url=f"{your_endpoint_url}",
    max_new_tokens=512,
    top_k=10,
    top_p=0.95,
    typical_p=0.95,
    temperature=0.01,
    repetition_penalty=1.03,
    streaming=True,
)
llm("What did foo say about bar?", callbacks=[StreamingStdOutCallbackHandler()])

同样的 HuggingFaceEndpoint 类可以与本地 HuggingFace TGI 实例一起使用，以提供大型语言模型。有关各种硬件（GPU、TPU、Gaudi...）支持的详细信息，请查看 TGI 仓库。

Huggingface 端点

安装与设置

准备示例

示例

专用端点

流式处理

相关

Was this page helpful?

You can also leave detailed feedback on GitHub.

安装与设置​

准备示例​

示例​

专用端点​

流式处理​

相关​

Was this page helpful?

You can also leave detailed feedback on GitHub.

安装与设置

准备示例

示例

专用端点

流式处理

相关