-
Notifications
You must be signed in to change notification settings - Fork 22
Select /html/body/ #38
Comments
I'm in the same boat... I want to pull a comic image from a page that has no classes. This doesn't seem to work (Firebug says it's the xpath to the image tag). "xpath": "html/body/table/tbody/tr[2]/td/table/tbody/tr/td[2]/table/tbody/tr/td/table/tbody/tr[2]/td/img" |
Can you provide a sample url? |
I'm having a similar issue with a page without usable DIV classes. I have found a unique locator but can't seem to get it to pull body text. Here is an example page: http://paddocktalk.com/news/html/story-259326.html |
It seems to me, that it is not a proper xml format, maybe some tags are missing or there is no encoding specified so some characters can not be read successfully. This will lead to errors and the xpath selection is not performed. You can try my version and use the split method. https://github.com/m42e/ttrss_plugin-af_feedmod |
Ok, i digged in deeper. @troydunham try: "xpath" : "td[@width='85%' and @valign='top' and @bgcolor='#FFFFFF']" and you may be near the treasure..... |
Any way to select an entire body of a page? I'm working on one that has no div or classes or much of anything except text wrapped in a body tag.
The text was updated successfully, but these errors were encountered: