
Setting an IP proxy for a Python crawler (urllib/requests modules) #1

lznism opened this issue Jan 12, 2018 · 0 comments

Setting a proxy with the urllib module

If we crawl the same site frequently from a single IP, the site is very likely to ban that IP. A common countermeasure is to set a proxy IP:

from urllib import request

# Address of the proxy server (placeholder from the original post)
proxy = 'http://39.134.93.12:80'
# ProxyHandler maps a URL scheme to the proxy that traffic should go through
proxy_support = request.ProxyHandler({'http': proxy})
# Build an opener that uses the handler, then install it globally
opener = request.build_opener(proxy_support)
request.install_opener(opener)
# All subsequent urlopen() calls now go through the proxy
result = request.urlopen('http://baidu.com')

First we construct a ProxyHandler, then use that handler to build the opener that will open pages, and finally install the opener into request.
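Note that install_opener changes every subsequent urlopen() call in the process. If you only want the proxy for some requests, you can call the opener directly instead of installing it. A minimal sketch, using the same placeholder proxy address as above:

from urllib import request

proxy_support = request.ProxyHandler({'http': 'http://39.134.93.12:80'})
opener = request.build_opener(proxy_support)

# Use the opener directly; urlopen() calls elsewhere stay un-proxied
result = opener.open('http://baidu.com')
print(result.getcode())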

Using a proxy with the requests module

Setting a proxy with this module is very easy:

import requests

# Map each URL scheme to the proxy it should use
# (addresses are placeholders from the original post)
proxies = {
    'http': 'http://10.10.1.10:3128',
    'https': 'http://10.10.1.10:1080'
}
# icanhazip.com echoes back the IP the request arrived from,
# so the response should show the proxy's IP, not ours
r = requests.get('http://icanhazip.com', proxies=proxies)
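Free proxies in particular die often, so in practice it helps to set a timeout and catch proxy failures. A minimal sketch with the same placeholder addresses:

import requests

proxies = {
    'http': 'http://10.10.1.10:3128',
    'https': 'http://10.10.1.10:1080'
}

try:
    r = requests.get('http://icanhazip.com', proxies=proxies, timeout=5)
    # Print the IP the target site saw, to confirm the proxy is in use
    print(r.text.strip())
except (requests.exceptions.ProxyError, requests.exceptions.Timeout) as e:
    print('proxy failed:', e)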