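I wrote a Python crawler to scrape data from Weibo. It used to run fine; I left it running on a server for several days without any problems.
Yesterday it suddenly started failing, and scraping now raises the error shown below.
Search results on Baidu say this is a certificate problem, so I tried requests.get(..., verify=False), but it still fails after an unpredictable number of requests.
I then switched to the urllib library, but that also fails after an unpredictable number of requests, with this error: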
urllib.error.URLError: <urlopen error [SSL: BAD_SIGNATURE] bad signature (_ssl.c:1124)>
I'm completely out of ideas. Any help would be greatly appreciated!
The URL and headers I'm requesting are as follows:
url='https://weibo.cn/search/mblog?hideSearchFrame=&keyword=%E7%99%BD%E5%9F%8E%E5%B8%82&advancedfilter=1&starttime=20190118-27&endtime=20190118-4&sort=time&atten=1&page=1'
header= {
'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:61.0) Gecko/20100101 Firefox/61.0',
'cookie': 'SCF=Agtcpqc0msuvQaUJzhtwRo0FR_6nbHLFO_Zx0qk8-nOgJWTRV9u_bj9EXBEWlB6CcmJC8yjOqHDZGUDFkDR5ZwU.; _T_WM=ebe83238120630d95680dabb1d053e59; SUB=_2A25NIi5JDeRhGeFL6lcS8SnIyTiIHXVu7LIBrDV6PUJbkdAKLU-kkW1NQnjl50xkYhJ0SXpsibK52cCEGYaYCm7a; SSOLoginState=1613127193'}
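Since the error only shows up after an unpredictable number of requests, one possible stopgap is to retry the failed request with a growing delay. Below is a minimal sketch using only the standard library. The function name fetch_with_retry and its parameters (retries, backoff, opener, sleep) are hypothetical and not part of my actual crawler. It only works around the symptom and does not explain why the BAD_SIGNATURE handshake failure happens.

```python
import ssl
import time
import urllib.error
import urllib.request


def fetch_with_retry(url, headers, retries=3, backoff=1.0,
                     opener=urllib.request.urlopen, sleep=time.sleep):
    """Fetch url and return the response body as bytes.

    Retries when the request fails with URLError or SSLError, waiting
    backoff * 2**attempt seconds between attempts. opener and sleep are
    injectable so the logic can be tested without network access.
    """
    if retries < 1:
        raise ValueError("retries must be at least 1")
    request = urllib.request.Request(url, headers=headers)
    last_error = None
    for attempt in range(retries):
        try:
            with opener(request, timeout=10) as response:
                return response.read()
        except (urllib.error.URLError, ssl.SSLError) as exc:
            last_error = exc
            # Do not sleep after the final failed attempt.
            if attempt < retries - 1:
                sleep(backoff * (2 ** attempt))
    # Every attempt failed: re-raise the last error seen.
    raise last_error
```

With the url and header variables above, it would be called as fetch_with_retry(url, header, retries=5, backoff=2.0).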