New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix SiCKRAGETV/sickrage-issues/issues/3347 #2905
Conversation
@ncksol That's why I added that log line about "Could not find table of torrents mainblockcontenttt".
|
Turns out simply upgrading soup to v4 fixed the problem. No need for html5 parser. I've just tested this pr on my server it has found the episodes that were missing. |
still does not work to me:
|
here is my html:
|
Yeah, I can recreate the issue from your html. Somehow it's different from the one I am getting served. I am completely buffled atm as to why the parser cannot find the data. This applies to both of my approaches. None of them work now. |
NP! This issue is bugging me for quite a while! |
@ncksol Yeah I had the same problem. It works fine for me but not fernandog and duramato -.- I know sort of what is wrong, its the broken html in each item for the date ie |
Ok. So I've started to play with html that fernandog gets served and found out that if I just take the portion that containts the search result data then parser is fine. I decided to cut out anything before the table and it seem to work now. Both on html fernandog gave and the one I am getting served. |
@ncksol Awesome! It's working!! [HDTorrents] :: Discarding torrent because it doesn't meet the minimum seeders or leechers: Modern.Family.S07E04.720p.HDTV.x264-FLEET (S:0 L:0) |
Size is working:
|
@ncksol can you check what's wrong this this? https://gist.github.com/fernandog/9f4c3c2417bbc776bd69 No results in SR, but 5minutes ago I got results |
I found out that there's no cells in the page
2015-10-18 10:15:38 ERROR SEARCHQUEUE-MANUAL-95011 :: [HDTorrents] :: No cells in page |
@fernandog got the fix for seeders. |
As for the cells not found, I can't seem to be able to recreate this even with your html. If I feed it to my code it just comes back with valid: |
Ah! that's right. Sorry! @miigotu any comments on the PR? |
@ncksol can you squash into one commit please? |
@fernandog unfortunately I have no idea how to do that =D |
You are using only git web? |
No. I have github for windows. But it's pretty basic functionality. Only allows you to push, pull stuff. |
@ncksol it should have installed git with that after that do
and |
HD-Torrents has some invalid html on the page with search results. Using the default html parser wasn't returning the correct data. Substituted it with html5 parser to fix the problem. Update soup to v4 Cutting out invalid portions of html before feeding it to parser. Added error handling and case insensitive match Fixed detection of seeders/leechers and improved size detection
@duramato thanks! looks like i've managed to do it. |
@miigotu any commens before merge? |
I must test this before merge =P |
I tested and its working fine so far. |
@miigotu So did tested? |
Fix SiCKRAGETV/sickrage-issues/issues/3347
Don't know if HDT is passing the correct char to SR or it's only the log: From website: Carnivàle S02 1080p WEB-DL DD5.1 H.264-BS
|
HD-Torrents has some invalid html on the page with search results. Using
the default html parser wasn't returning the correct data. Substituted
it with html5 parser to fix the problem.
P.S. I've also created another PR with a different fix to that issue. Not sure which one would you prefere.