Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

URL download and cleanup #19

Closed
shyams80 opened this issue Oct 26, 2013 · 3 comments
Closed

URL download and cleanup #19

shyams80 opened this issue Oct 26, 2013 · 3 comments

Comments

@shyams80
Copy link

Don't see the code for downloading content from a url, removing the boilerplate etc... Can you point me to the section of the code that does that?

@tkroman
Copy link
Contributor

tkroman commented Oct 26, 2013

I'm sorry, could you be more elaborate on that issue please? It's not clear for me what precisely do you want.

@shyams80
Copy link
Author

The mashape api can take a URL input and return the summarized result. I
couldn't find the section of the code that provides this functionality. I
am assuming that when given a URL, you will need to download the entire
page, extract the main article (i.e. remove the boilerplate) and them pass
the article to the main summarizer routine. How and where is the code
handling this?

On Sat, Oct 26, 2013 at 11:20 PM, Roman Tkalenko
notifications@github.comwrote:

I'm sorry, could you be more elaborate on that issue please? It's not
clear for me what precisely do you want.


Reply to this email directly or view it on GitHubhttps://github.com//issues/19#issuecomment-27151513
.

http://about.me/sunder.struck/bio

@MojoJolo
Copy link
Owner

The code here in Github only accepts the text and the title. The API in Mashape has another layer to extract the text from the webpage. It was using Python Goose for it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants