# URL Loader

This notebook demonstrates how to load HTML documents from a list of URLs into the `Document` format using three loaders: Unstructured, Selenium, and Playwright.

## Unstructured URL Loader

You need to install the `unstructured` library:

In [None]:
!pip install -U unstructured

In [1]:
from langchain_community.document_loaders import UnstructuredURLLoader

In [2]:
urls = [
    'https://www.understandingwar.org/backgrounder/russian-offensive-campaign-assessment-february-8-2023',
    'https://www.understandingwar.org/backgrounder/russian-offensive-campaign-assessment-february-9-2023',
]

loader = UnstructuredURLLoader(urls=urls)

In [3]:
data = loader.load()
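
A quick way to check what came back is to summarize each loaded document. The sketch below assumes that `loader.load()` returns a list of objects exposing a `page_content` string and a `metadata` dict with a `source` key. The `Document` dataclass, the `summarize` helper, and the sample URLs are hypothetical stand-ins so the example runs without network access; with real results you would pass `data` instead of `sample`.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    """Stand-in for the loader's Document type (assumed shape, not the real class)."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def summarize(docs, width=60):
    """Return one (source, character count, preview) tuple per document."""
    rows = []
    for doc in docs:
        source = doc.metadata.get("source", "<unknown>")
        # Collapse whitespace so the preview fits on one line.
        preview = " ".join(doc.page_content.split())[:width]
        rows.append((source, len(doc.page_content), preview))
    return rows


# Hypothetical sample standing in for `data = loader.load()`.
sample = [
    Document("Line one.\n\nLine two.", {"source": "https://example.com/a"}),
    Document("", {"source": "https://example.com/b"}),
]

for source, n_chars, preview in summarize(sample):
    print(f"{source}: {n_chars} chars | {preview}")
```

A document with zero characters, like the second sample entry, usually means the page could not be fetched or rendered and is worth retrying with one of the browser-based loaders.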

## Selenium URL Loader

To use the `SeleniumURLLoader`, you need to install `selenium` and `unstructured`.

In [None]:
!pip install -U selenium unstructured

In [None]:
from langchain_community.document_loaders import SeleniumURLLoader

In [None]:
urls = [
    'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
    'https://goo.gl/maps/NDSHwePEyaHMFGwh8',
]

loader = SeleniumURLLoader(urls=urls)

In [None]:
data = loader.load()

## Playwright URL Loader

To use the `PlaywrightURLLoader`, you need to install `playwright` and `unstructured`, plus the Chromium browser that Playwright drives.

First, install the required libraries and the Playwright Chromium browser:
```bash
pip install -U playwright unstructured
playwright install
```

Now, you can import the `PlaywrightURLLoader` and use it as follows:

In [None]:
from langchain_community.document_loaders import PlaywrightURLLoader

In [None]:
urls = [
    'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
    'https://goo.gl/maps/NDSHwePEyaHMFGwh8',
]

loader = PlaywrightURLLoader(urls=urls, remove_selectors=["header", "footer"])

In [None]:
data = loader.load()
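
The `remove_selectors` argument drops matching elements from the page before its text is extracted, which keeps navigation bars and footers out of the resulting documents. The sketch below illustrates that idea with the standard library only. It is not the loader's implementation: it matches plain tag names rather than full CSS selectors, and `TagStripper`, `text_without`, and the sample page are hypothetical names made up for this example.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Collect text, skipping everything nested inside the given tag names."""

    def __init__(self, skip_tags):
        super().__init__()
        self.skip_tags = set(skip_tags)
        self.depth = 0  # greater than 0 while inside a skipped element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.skip_tags:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in self.skip_tags and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that sits outside skipped elements.
        if not self.depth and data.strip():
            self.chunks.append(data.strip())


def text_without(html, skip_tags):
    """Return the page text with the listed elements removed."""
    parser = TagStripper(skip_tags)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


# Hypothetical page used only for illustration.
page = (
    "<html><body>"
    "<header><nav>Home | About</nav></header>"
    "<main><h1>Title</h1><p>Body text.</p></main>"
    "<footer>Copyright Example</footer>"
    "</body></html>"
)

print(text_without(page, ["header", "footer"]))
```

Passing an empty list to `text_without` keeps the header and footer text, which makes it easy to compare the two outputs and decide which elements are worth removing.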