GoTo returns None for certain sites (never the first page)

Hi! I have a spider that uses playwright with a proxy.
**NOTE: the spider works as it should when the proxy is not needed and the proxy works, as the first page is correctly scraped.**

This is what happens:
- first page is scraped, I see that the `************* RESPONSE *************` log, so `parse_item` is hit once
- links are extracted and `set_playwright_true` is called (the list of links is logged)
- errors are raised: `'NoneType' object has no attribute 'all_headers'`

![image](https://user-images.githubusercontent.com/90040975/182645849-1d56c0d7-5f41-4d1e-98c7-30a88bb66f9c.png)

It seems similar to https://github.com/scrapy-plugins/scrapy-playwright/issues/10 and https://github.com/scrapy-plugins/scrapy-playwright/issues/102 and I saw that a fix has been merged with https://github.com/scrapy-plugins/scrapy-playwright/pull/113 . 

When will the fix be released to the next version? Will this fix this or it will just prevent the error from being risen?
Any idea why using the proxy is causing such exception?

```python
class PlaywrightSpiderWithProxy(CrawlSpider):
    name = "client-side-site"
    handle_httpstatus_list = [301, 302, 401, 403, 404, 408, 429, 500, 503]
    exclude_patterns: List[str] = []

    playwright_meta = {
        "playwright": True,
        "playwright_page_goto_kwargs": {"wait_until": "networkidle"},
    }

    custom_settings = {
        "TWISTED_REACTOR": "twisted.internet.asyncioreactor.AsyncioSelectorReactor",
        "DOWNLOAD_HANDLERS": {
            "http": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
            "https": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
        },
        "PLAYWRIGHT_LAUNCH_OPTIONS": {
            "proxy": {
                "server": "http://192.0.0.1:12345",
                "username": "username",
                "password": "password",
            },
        },
    }

    def __init__(self, **kwargs: Any):
        # ...
        self.rules = (
            Rule(
                LinkExtractor(allow=allow_path),
                callback=self.parse_item,
                process_request=self.set_playwright_true,
                follow=True,
            ),
        )
        # ...
        super().__init__(**kwargs)

    def start_requests(self) -> Iterator[Request]:
        yield Request(self.start_urls[0], meta=self.playwright_meta)

    def set_playwright_true(self, request: Request, response: Response):
        self.log("%s => %s " % (response.url, request.url), logging.INFO)
        request.meta.update(self.playwright_meta)
        return request

    def parse_start_url(self, response: Response) -> Dict[str, Any]:
        return self.parse_item(response)

    def parse_item(self, response: Response) -> Dict[str, Any]:
        self.log("************* RESPONSE *************", logging.INFO)
        return {
          #  ...
        }
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GoTo returns None for certain sites (never the first page) #115

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

GoTo returns None for certain sites (never the first page) #115

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions