Cannot extract a local website #15335

chrizmc · 2018-02-26T19:27:37Z

Hi,

I want to label the content of a webpage before I extract it with PhantomJS. For that reason, I saved the complete website to my PC with Google Chrome. Then I started PhantomJS from the command line like this:

phantomjs.exe extractorscript.js c:/savedlocalwebsite/www.living.com

I am only interested in the text and CSS.

Problem: PhantomJS hangs because it tries to load some links that point to the web, but requests them with the file scheme:
"url": "file://tpc.googlesyndication.com/sodar/V6zvOIoD.js"

Question: How can I avoid loading these links?
Question: If I cannot avoid it, how can I change the file scheme back to a real URL?

Thanks in advance

ghost · 2018-02-27T08:38:04Z

@chrizmc Please specify the full path to the selected file(s) and you will be good.

chrizmc · 2018-02-27T08:55:38Z

But how do I do that? :-) I need http:// instead of file://

Can I configure PhantomJS this way?

I think the problem is that I am using a locally stored website, so PhantomJS thinks that everything is local. But not all of the linked resources are stored locally.
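
One possible approach to both questions, sketched under assumptions: a URL such as `file://tpc.googlesyndication.com/...` looks like a protocol-relative link (`//host/path`) that was resolved against the `file://` page, so it can be recognized by its non-empty host. PhantomJS lets a script intercept requests through `page.onResourceRequested`, where `networkRequest.abort()` drops a request and `networkRequest.changeUrl()` redirects it. The helper `classifyRequest` and the `ALLOW_REMOTE` switch are hypothetical names for this illustration, and `http` is an assumed scheme because the original one is not known from the saved page.

```javascript
// Sketch for a PhantomJS script (ES5 syntax).
// classifyRequest and ALLOW_REMOTE are hypothetical names, not PhantomJS API.

var ALLOW_REMOTE = false; // false: drop such requests; true: retry them over http

// Decide what to do with a requested URL.
// A local file URL has an empty host ("file:///c:/..."). A file URL with a
// real host name most likely came from a protocol-relative link.
function classifyRequest(url, allowRemote) {
  var match = /^file:\/\/([^\/]+)(\/.*)?$/.exec(url);
  if (!match) {
    return { action: 'keep', url: url }; // http(s) URLs and host-less file URLs
  }
  var host = match[1];
  // Treat "file://c:/..." and "file://localhost/..." as genuinely local.
  if (/^[A-Za-z]:$/.test(host) || host.toLowerCase() === 'localhost') {
    return { action: 'keep', url: url };
  }
  if (!allowRemote) {
    return { action: 'abort', url: url };
  }
  // Assumption: http is used because the original scheme is unknown.
  return { action: 'rewrite', url: 'http://' + host + (match[2] || '/') };
}

// Only wire up the hook when running inside PhantomJS.
if (typeof phantom !== 'undefined') {
  var system = require('system');
  var page = require('webpage').create();

  page.onResourceRequested = function (requestData, networkRequest) {
    var decision = classifyRequest(requestData.url, ALLOW_REMOTE);
    if (decision.action === 'abort') {
      networkRequest.abort();
    } else if (decision.action === 'rewrite') {
      networkRequest.changeUrl(decision.url);
    }
  };

  // system.args[1] is the first argument after the script name.
  page.open(system.args[1], function (status) {
    console.log('load status: ' + status);
    phantom.exit();
  });
}
```

Since only text and CSS are of interest here, aborting the remote requests is probably enough. If the rewrite variant is used, PhantomJS may also need the `--local-to-remote-url-access=true` command-line option so that a page loaded from `file://` is allowed to fetch remote URLs; whether the redirected requests then succeed has not been verified here.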

stale · 2019-12-28T16:55:18Z

Due to our very limited maintenance capacity (see #14541 for more details), we need to prioritize our development focus on other tasks. Therefore, this issue will be automatically closed. In the future, if we see the need to attend to this issue again, then it will be reopened. Thank you for your contribution!

stale bot added the stale label Dec 25, 2019

stale bot closed this as completed Dec 28, 2019
