-
Notifications
You must be signed in to change notification settings - Fork 15k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sitemap loader throws error TypeError: _request() got an unexpected keyword argument 'verify', many docs refer to wrong links for sitemap as well. #6691
Comments
Answer generated by a 🤖 AnswerIt seems like the issue is caused by the 'verify' keyword argument being passed to the from langchain.document_loaders.sitemap import SitemapLoader
class CustomSitemapLoader(SitemapLoader):
def __init__(self, *args, requests_kwargs=None, **kwargs):
super().__init__(*args, **kwargs)
self.requests_kwargs = requests_kwargs or {}
def _request(self, method, url, **kwargs):
kwargs.update(self.requests_kwargs)
return super()._request(method, url, **kwargs)
sitemap_loader = CustomSitemapLoader(web_path="https://langchain.readthedocs.io/sitemap.xml", requests_kwargs={"verify": False})
docs = sitemap_loader.load() This custom implementation of Sources
This response is meant to be useful, save you time, and share context. It is not meant to be a precise solution, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant. |
Have you managed to solve this? |
Nope, i didn't think the bot advice to create a custom class is fully appropriate when dealing with an official notebook example. I,'m hoping for something more 'official' from the team. |
@hwchase17 is this being solved? @support |
Just tried it out and can confirm it doesn't work. |
Hi Guys @weasley0-0 and @juanvecino seems like this work with this. just change 1 line in this file - /document_loaders/web_base.py change this code async with session.get( |
Any updates in this bug? |
) **Description:** While working on the Docusaurus site loader langchain-ai#9138, I noticed some outdated docs and tests for the Sitemap Loader. **Issue:** This is tangentially related to langchain-ai#6691 in reference to doc links. I plan on digging in to a few of these issue when I find time next.
Hi, @luca-git! I'm Dosu, and I'm helping the LangChain team manage their backlog. I wanted to let you know that we are marking this issue as stale. From what I understand, the issue you reported is related to the sitemap loader in the langchain library throwing a Before we close this issue, we wanted to check with you if it is still relevant to the latest version of the LangChain repository. If it is, please let us know by commenting on this issue. Otherwise, feel free to close the issue yourself, or it will be automatically closed in 7 days. Thank you for your understanding and patience. Let us know if you have any further questions or concerns. |
) **Description:** While working on the Docusaurus site loader langchain-ai#9138, I noticed some outdated docs and tests for the Sitemap Loader. **Issue:** This is tangentially related to langchain-ai#6691 in reference to doc links. I plan on digging in to a few of these issue when I find time next.
System Info
windows 11 python 3.9.16 langchain 0.0.212
Who can help?
Code from https://python.langchain.com/docs/modules/data_connection/document_loaders/integrations/sitemap
throws:
The text was updated successfully, but these errors were encountered: