Skip to main content

合并文档加载器

合并从一组指定文档加载器返回的文档。

<!--IMPORTS:[{"imported": "WebBaseLoader", "source": "langchain_community.document_loaders", "docs": "https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.web_base.WebBaseLoader.html", "title": "Merge Documents Loader"}]-->
from langchain_community.document_loaders import WebBaseLoader

loader_web = WebBaseLoader(
"https://github.com/basecamp/handbook/blob/master/37signals-is-you.md"
)
<!--IMPORTS:[{"imported": "PyPDFLoader", "source": "langchain_community.document_loaders", "docs": "https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.pdf.PyPDFLoader.html", "title": "Merge Documents Loader"}]-->
from langchain_community.document_loaders import PyPDFLoader

loader_pdf = PyPDFLoader("../MachineLearning-Lecture01.pdf")
<!--IMPORTS:[{"imported": "MergedDataLoader", "source": "langchain_community.document_loaders.merge", "docs": "https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.merge.MergedDataLoader.html", "title": "Merge Documents Loader"}]-->
from langchain_community.document_loaders.merge import MergedDataLoader

loader_all = MergedDataLoader(loaders=[loader_web, loader_pdf])
docs_all = loader_all.load()
len(docs_all)
23

相关


Was this page helpful?


You can also leave detailed feedback on GitHub.

扫我,入群扫我,找书