Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use use html data instead of direct URLs #39

Closed
mejo opened this issue May 6, 2013 · 3 comments
Closed

How to use use html data instead of direct URLs #39

mejo opened this issue May 6, 2013 · 3 comments

Comments

@mejo
Copy link

mejo commented May 6, 2013

Older issue mentions 'train_from_htmlpage' method but its not working anymore? What I try to do is provide preprocessed html data (utf8 conversion done to make scrapely work) for scrapely.

@tpeng
Copy link
Contributor

tpeng commented Nov 19, 2013

I think train_from_htmlpage should work. could you check if you pass HtmlPage or raw data?
BTW you can convert the raw html data to HtmlPage simply with HtmlPage(body=raw_body)

@pablohoffman
Copy link
Member

@mejo did you manage to solve your issue with @tpeng suggestion?. Can you close this ticket if so, thanks.

@mejo
Copy link
Author

mejo commented Jan 3, 2014

Did not verify, pursuing other projects. Closing anyway.

@mejo mejo closed this as completed Jan 3, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants