URL download and cleanup #19

shyams80 · 2013-10-26T07:22:00Z

Don't see the code for downloading content from a url, removing the boilerplate etc... Can you point me to the section of the code that does that?

tkroman · 2013-10-26T17:50:00Z

I'm sorry, could you be more elaborate on that issue please? It's not clear for me what precisely do you want.

shyams80 · 2013-10-27T07:15:54Z

The mashape api can take a URL input and return the summarized result. I
couldn't find the section of the code that provides this functionality. I
am assuming that when given a URL, you will need to download the entire
page, extract the main article (i.e. remove the boilerplate) and them pass
the article to the main summarizer routine. How and where is the code
handling this?

On Sat, Oct 26, 2013 at 11:20 PM, Roman Tkalenko
notifications@github.comwrote:

I'm sorry, could you be more elaborate on that issue please? It's not
clear for me what precisely do you want.

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/19#issuecomment-27151513
.

http://about.me/sunder.struck/bio

MojoJolo · 2013-10-28T18:55:47Z

The code here in Github only accepts the text and the title. The API in Mashape has another layer to extract the text from the webpage. It was using Python Goose for it.

MojoJolo closed this as completed Oct 29, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

URL download and cleanup #19

URL download and cleanup #19

shyams80 commented Oct 26, 2013

tkroman commented Oct 26, 2013

shyams80 commented Oct 27, 2013

MojoJolo commented Oct 28, 2013

URL download and cleanup #19

URL download and cleanup #19

Comments

shyams80 commented Oct 26, 2013

tkroman commented Oct 26, 2013

shyams80 commented Oct 27, 2013

MojoJolo commented Oct 28, 2013