Found new URL patterns again #28

Closed
zhbzhbzhbz opened this issue Jul 16, 2020 · 1 comment
Labels
enhancement New feature or request

Comments

@zhbzhbzhbz

zhbzhbzhbz commented Jul 16, 2020

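1) https://www.lanzoux.com/iRGL1em4prc : the URL contains uppercase letters, so the character class for the trailing part needs A-Z added.
2) (My own test) https://hb-az.lanzous.com/izijcenly5c : the third-level domain contains a hyphen (-). Note: I changed my custom domain myself to check, and it does follow the usual URL rules: only digits, letters, and hyphens are allowed.
3) One more thing that could be improved:
Take a long string like this: https://habo.qq.com/file/showdetail?pk=ADcGYl1sB24IOVs%2BU2M界面截图:协yi:安卓转PC功能就简单说下吧关闭真锁:拖进去的格式必须是账号-本帖隐藏的内容https://www.lanzoux.com/iRGL1em4prc
If the third-level domain part of the regex (e.g. www) uses the current .*? instead of [a-zA-Z0-9-]*?, the whole long string above is matched rather than only the URL at the end, even in non-greedy mode.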
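
Point 3 can be reproduced with a minimal sketch. A regex search returns the leftmost match, and a non-greedy quantifier only limits how far a match extends, not where it begins, so `.*?` still starts at the first `https://` and stretches until it reaches `lanzou`. The names `LOOSE`, `STRICT`, and `text` below are illustrative only and are not taken from the project's code.

```python
import re

# Subdomain written as ".*?" (current pattern) vs. a restricted class (proposed).
LOOSE = re.compile(r'https?://.*?\.?lanzou[six].com/.+')
STRICT = re.compile(r'https?://[a-zA-Z0-9-]*?\.?lanzou[six].com/.+')

# The long string quoted in the issue: a habo.qq.com link, some post text,
# then the actual share link at the very end.
text = ('https://habo.qq.com/file/showdetail?pk=ADcGYl1sB24IOVs%2BU2M'
        '界面截图:协yi:安卓转PC功能就简单说下吧关闭真锁:拖进去的格式必须是账号-本帖隐藏的内容'
        'https://www.lanzoux.com/iRGL1em4prc')

loose_hit = LOOSE.search(text).group()
strict_hit = STRICT.search(text).group()

print(loose_hit == text)  # the loose pattern swallows the entire string
print(strict_hit)         # the strict pattern finds only the trailing share link
```

With the restricted class, the attempt starting at the first `https://` fails because `habo.qq.com` is never followed by `lanzou`, so the search moves on to the real share link.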

Given these three points, I suggest changing base_pat to https?://[a-zA-Z0-9-]*?\.?lanzou[six].com/.+
and user_pat to https?://[a-zA-Z0-9-]*?\.?lanzou[six].com/i[a-zA-Z0-9]{5,}/? ; the same applies to folders.

base_pat = r'https?://.*?\.?lanzou[six].com/.+' # the subdomain can be customized or may be absent

user_pat = r'https?://.*?\.?lanzou[six].com/b[a-z0-9]{7,}/?'
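
As a rough check of the proposed file pattern, the sketch below runs it against the two share links from this issue. The pattern is copied verbatim from the suggestion above; the helper name `looks_like_file_url` and the use of `re.fullmatch` are assumptions made for illustration, not the project's actual implementation.

```python
import re

# Proposed file-link pattern, copied verbatim from the issue.
USER_PAT = r'https?://[a-zA-Z0-9-]*?\.?lanzou[six].com/i[a-zA-Z0-9]{5,}/?'


def looks_like_file_url(url: str) -> bool:
    """Hypothetical helper: True if the whole string matches USER_PAT."""
    return re.fullmatch(USER_PAT, url) is not None


samples = [
    'https://www.lanzoux.com/iRGL1em4prc',   # uppercase letters in the file ID
    'https://hb-az.lanzous.com/izijcenly5c',  # hyphen in the custom subdomain
    'https://lanzous.com/izijcenly5c',        # no subdomain at all
    'https://habo.qq.com/file/showdetail?pk=ADcGYl1sB24IOVs%2BU2M',  # unrelated link
]

for url in samples:
    print(url, looks_like_file_url(url))
```

One detail worth noting: the `.` before `com` is unescaped in the proposal, so it matches any character; writing `\.com` would be stricter.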

@zaxtyson
Owner

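😁 I'm out in the countryside for a couple of days; I'll change it when I get back.
The official site really likes to stir things up ( 0 x 0 )
The URL rules have already changed n times.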

@zaxtyson zaxtyson added the enhancement New feature or request label Jul 20, 2020