Commit
Fix Html2TextTransformer for shallow copy (#14197)
Hi,

There is some unintended behavior in `Html2TextTransformer`. The current code **directly modifies the original documents that are passed as arguments to the function**, so the caller's input variables are changed along with the returned value.

**This PR resolves that, and I added unit test code as well.**

Reference: [Shallow vs Deep Copying of Python Objects](https://realpython.com/copying-python-objects/)

Thanks!☺️
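The behavior described above can be illustrated with a small, self-contained sketch. It does not use the LangChain code: `Doc`, `strip_tags`, `transform_in_place`, and `transform_with_copy` are hypothetical names invented for this example, and the tag-stripping regex is only a toy stand-in for the real HTML-to-text conversion. The point is the difference between mutating the caller's objects and working on copies.

```python
import copy
import re
from dataclasses import dataclass, field


@dataclass
class Doc:
    """Hypothetical stand-in for a document object (not the LangChain class)."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def strip_tags(html: str) -> str:
    """Toy HTML-to-text step, for illustration only."""
    return re.sub(r"<[^>]+>", "", html)


def transform_in_place(docs):
    # Problematic pattern: the returned list holds the same Doc instances
    # as the input, so the caller's documents are overwritten.
    for d in docs:
        d.page_content = strip_tags(d.page_content)
    return docs


def transform_with_copy(docs):
    # Safer pattern: deep-copy each document before changing it,
    # so the caller's inputs keep their original content.
    new_docs = []
    for d in docs:
        new_d = copy.deepcopy(d)
        new_d.page_content = strip_tags(new_d.page_content)
        new_docs.append(new_d)
    return new_docs
```

A unit test for this kind of fix would typically assert that the input documents still hold their original HTML after the transform has run, and that the returned documents are distinct objects.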