Returns 200, but no full text #12
Comments
@yilu1015 Is the ciphertext parameter correct? |
import requests
headers = {
"User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:76.0) Gecko/20100101 Firefox/76.0",
"Cookie": "HM4hUBT0dDOn80S=_wkz59snzaPmdO69oJWw7RwKvOLLkQX0DikwmBGlkQPmxpSSOx12K0bMQsZbsAnM; HM4hUBT0dDOn80T=4Cy7lu21LZTzgJixkyHmmEWZbRc8ka8p5n4j3VjY4QGDG1SvaYh_7s905F2vIqvUGdeqkJsKzN4nn207l2ZD5vCAYFgItnHeaHE9BgfeeFdxdrkPoybXDL1RJ7ZP_5WTlOs5R7awSBB_ft9xbGTXkYY4Yk3Cg4H5_iirToB6gyrJi67k95Ce8R.uGobThrdX2fAuiZF2ME1Wi9uIefdYS9UEajx44DAw2oi3R7X6o7XKmuyrMkU7h1DSW3I5XrUYu3wrrpNRSiTZoFndIDsOuiA9iKs2RnTnS3.v9Gi34m_msrGtVPkMlqjZxrXHzsjfKtO7; SESSION=ff5520e2-66e0-4bce-998e-02062e95b414"
}
res = requests.post(url="http://wenshu.court.gov.cn/website/parse/rest.q4w", data={
"docId": "83451b69d9ff46b6af96abeb00d51326",
"ciphertext": "110010+1000110+1100100+110100+1001001+1001101+1001010+1100001+1100100+1000100+1110111+110011+1001011+110010+1001100+1110101+110101+1110110+1101000+1101011+110000+1010010+1101101+1001111+110010+110000+110010+110000+110000+110111+110000+110110+1110100+110110+1101111+110010+1101001+1110011+1010011+110010+110100+1101110+1001000+1000011+1000100+1110110+1110111+1110001+110111+1000110+1001110+1110110+110100+1000001+111101+111101",
"cfg": "com.lawyee.judge.dc.parse.dto.SearchDataDsoDTO@docInfoSearch",
"__RequestVerificationToken": "SnhEAA5fkrhLG4Yqhv6ySDvi"
},
headers = headers)
print(res.text)
It works in my test, no problem.
@yilu1015 The returned result is as follows: {"code":1,"description":null,"secretKey":"YuNfjorc70mO1Cllf6Isxf2B","result":"","success":true} |
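For anyone checking the ciphertext value by eye: each `+`-separated token in the request above parses as a binary number in the printable ASCII range, so the string can be decoded back to plain characters. A minimal sketch follows; the helper names `decode_ciphertext` and `encode_ciphertext` are made up for illustration and are not part of the site or of this repository. Decoding the full value from the request gives a 24-character prefix, then the digits 20200706, then a tail ending in `==`. That looks like a key, a date, and Base64 data, but this reading is a guess that the thread does not confirm.

```python
def decode_ciphertext(ciphertext: str) -> str:
    """Turn '110010+1000110+...' into the characters the tokens encode."""
    return "".join(chr(int(token, 2)) for token in ciphertext.split("+"))


def encode_ciphertext(plain: str) -> str:
    """Inverse of decode_ciphertext: one binary code point per character."""
    return "+".join(format(ord(ch), "b") for ch in plain)


if __name__ == "__main__":
    # First four tokens of the ciphertext value posted above.
    sample = "110010+1000110+1100100+110100"
    print(decode_ciphertext(sample))
```

Comparing the decoded text of a working request with a failing one is a quick way to see whether the parameter is malformed.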
It should be fine. I used it to fetch the entry information successfully. The only thing that comes out is
|
@nciefeiniu Is there a method for setting the parameters? I looked at #4 at the time and assumed they were not needed. |
I also get 200 but no full text. Has the original poster solved this? |
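Sorry, I have been busy with other projects for the past two weeks and have not looked into it closely yet. Are you fetching the full text through the app version or the web version? Feel free to refer to #13. |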
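Already solved. There is one more problem, though: PyQt5 can get the cookie, but fetching it a second time in a row makes the process exit immediately with "Process finished with exit code -1073741819 (0xC0000005)". Have you run into this too? |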
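A side note on the exit code: the two numbers in that message are the same value, once read as a signed 32-bit integer and once as unsigned hex. 0xC0000005 is the Windows status code for an access violation, which points to a crash in native code rather than a Python exception. What triggers it in the PyQt5 cookie step is not established in this thread. The helper name `to_unsigned32` below is made up for illustration.

```python
def to_unsigned32(exit_code: int) -> str:
    """Render a signed 32-bit process exit code as an 8-digit hex string."""
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    print(to_unsigned32(-1073741819))
```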
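Oh? Where was the problem? I read the expert's answer and assumed it was a cookie issue. Since that seemed to require setting up pyppeteer + asyncio, I have not tried it yet. So in the end it was a problem with the request settings? Thanks for the pointers! |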
The cookie obtained through pyqt is fine. My problem was an extra comma in the queryCondition field of the form data. |
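Thanks for the tip. Below is the request data for my POST call, which looks fine to me: How did you set up your form data? As for the pyqt exit problem, I see the same thing. I am still testing full-text retrieval, so for now I just restart the Jupyter kernel. I am also waiting for someone more experienced to say how to handle it in real use.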
|
The only thing I can see is that the "c" of cookie in your headers is not capitalized, while on the website it is. |
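@yilu1015 Bro, about this problem of yours... I had some time today, so I took a look. Doing it this way still scrapes the data. You cannot get the detail data because the data sent with your request is wrong! data = {
'docId': '199a3ed2137846f1bf17ac1d01116358' # note the capitalization of this docId key
} I stared at it for a long time without spotting the mistake either. One packet capture showed it right away. |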
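Since a wrongly cased key is hard to spot by eye, a small check can flag it before the request is sent. This is only a sketch: the expected key list is copied from the request posted earlier in this thread, and `find_miscased_keys` is a made-up helper name, not something the site or this repository provides.

```python
# Key names as they appear in the request posted earlier in this thread.
EXPECTED_KEYS = ("docId", "ciphertext", "cfg", "__RequestVerificationToken")


def find_miscased_keys(data: dict, expected=EXPECTED_KEYS) -> dict:
    """Map each wrongly cased key in `data` to the spelling it should have."""
    by_lower = {name.lower(): name for name in expected}
    fixes = {}
    for key in data:
        wanted = by_lower.get(key.lower())
        if wanted is not None and wanted != key:
            fixes[key] = wanted
    return fixes


if __name__ == "__main__":
    print(find_miscased_keys({"docid": "199a3ed2137846f1bf17ac1d01116358"}))
```

An empty dict means every recognized key is spelled exactly as expected. Keys that are not in the list at all are ignored.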
I'd like to ask a beginner's question. I used your method to fetch the visit count shown on the Wenshu home page: res = requests.post(url="http://wenshu.court.gov.cn/website/parse/rest.q4w", data={ Here cfg and header01 were both taken from the XHR request, but I never get any data back, only a 405 error. I am clearly using POST, yet the error message is: Request method 'GET' not supported. This is the result I get when I run it: HTTP Status 405 – Method Not Allowed. Type: Status Report. Message: Request method 'GET' not supported. Description: The method received in the request-line is known by the origin server but not supported by the target resource. Apache Tomcat/8.0.53 |
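One possible explanation, which is an assumption and not something confirmed in this thread: requests follows redirects by default, and when a POST is answered with 301, 302 or 303 it re-sends the request as GET. A 405 complaining about GET after a `requests.post` call is consistent with that. The sketch below inspects a response's `history` to see whether a redirect happened. The helper names are made up, and stand-in objects are used so it runs offline.

```python
from types import SimpleNamespace


def redirect_chain(response) -> list:
    """List (status_code, url) for each redirect hop, then the final response."""
    hops = [(r.status_code, r.url) for r in response.history]
    hops.append((response.status_code, response.url))
    return hops


def post_was_redirected(response) -> bool:
    """True if any hop had a status on which requests switches POST to GET."""
    return any(r.status_code in (301, 302, 303) for r in response.history)


if __name__ == "__main__":
    # Stand-ins for requests.Response objects; the URLs are placeholders.
    first = SimpleNamespace(status_code=302, url="http://example.invalid/a")
    final = SimpleNamespace(status_code=405, url="http://example.invalid/b",
                            history=[first])
    print(redirect_chain(final), post_was_redirected(final))
```

With a real call, passing `allow_redirects=False` to `requests.post` returns the redirect response itself, so its status code and `Location` header can be checked directly.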
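I successfully used your method to obtain the three cookies, then built the request headers and data.
As follows:
In the end it returns 200, but with no full text. What could be going on here?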
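Because the HTTP status is 200 even when nothing useful comes back, the body has to be checked as well. The sketch below assumes the JSON shape quoted earlier in this thread (`success` and `result` fields) and treats an empty `result` as "no full text". `has_full_text` is a made-up helper name, and the field meanings are inferred from that one sample, not from any documentation.

```python
import json


def has_full_text(body: str) -> bool:
    """True only if the JSON body reports success and carries a non-empty result."""
    try:
        payload = json.loads(body)
    except ValueError:
        return False  # not JSON at all, for example an HTML page
    if not isinstance(payload, dict):
        return False
    return bool(payload.get("success")) and bool(payload.get("result"))


if __name__ == "__main__":
    sample = ('{"code":1,"description":null,'
              '"secretKey":"YuNfjorc70mO1Cllf6Isxf2B","result":"","success":true}')
    print(has_full_text(sample))
```

Calling `has_full_text(res.text)` after each request separates a truly successful response from a 200 with an empty payload.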