Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing content when parsing html #10

Open
lixing123 opened this issue Nov 8, 2011 · 2 comments
Open

Missing content when parsing html #10

lixing123 opened this issue Nov 8, 2011 · 2 comments

Comments

@lixing123
Copy link

hi,
first thank you for hpple. It helps a lot.

Recently when I use hpple to parse a html file, the content, which should have something in it, is null.
Here is part of the html file:

        发信人:
            znslm
            (小白), 信区: Pictures 标 题: Re: 你十六岁喜欢的那个人怎么样了?发信站: 南京大学小百合站 (Mon Nov 7 21:49:51 2011)    小孩上小学了
            
            [:D]
            
            喜欢我的那位呢/ 我猜的 --
        
        
the xpath string is "://tr/td/pre/a" the result is supposed to be "znslm", but it's null. how to fetch the string?
@lixing123
Copy link
Author

the problem is that hpple seems cannot parse "pre" tag. It missed everything associated with the tag.

@samniu
Copy link

samniu commented Dec 10, 2011

fatal error: 'libxml/tree.h' file not found [2]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants