-
Notifications
You must be signed in to change notification settings - Fork 10k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[polskieradio] Add thumbnails. #10028
Conversation
Usually the |
@yan12125 Well, it's important that we check it finds the right image. Especially on Polskie Radio the page often contains, aside from the main image, multiple thumbnails for related materials. |
To avoid any possible false-positive, the extraction method itself should be precise. In this case, it's unlikely that |
Agree, yet the test should not be concerned with the implementation details. |
You want to make sure that the extractor always gives the correct thumbnail, but using a full URL does not increase the detection rate of website changes. Thumbnail URLs are constantly changing. If there's a failed test, we can't know, without further investigation, whether it's due to wrong files or just moved/renamed files. Though I don't have a concrete statistical result, moved files are more common than wrong files, and in this case, a moved/renamed file is almost the only possible reason. As a result, if there's a failed test in polskieradio in the future, we will just update the thumbnail URL. It wastes time of developers, so I recommend you to use a regular expression. If you really want a strict check, the correct way is to check the MD5 checksum of thumbnails, which is much less unlikely to change. |
That's a fair point on the MD5, thank you. I made the suggested change. :) |
Well |
@yan12125 That seems like a good idea, sadly I'm rather time constrained now. I've gone with the regular expression approach for now. Is this good to go now? |
Thanks. Merged with minor fixes. |
Ah, of course. Ought to have escaped the dots. Thanks! |
What is the purpose of your pull request?
Description of your pull request and other information
This is a simple change to extract thumbnails from Polskie Radio auditions.