This repository has been archived by the owner on May 30, 2023. It is now read-only.

Cannot extract a local website #15335

Closed
chrizmc opened this issue Feb 26, 2018 · 3 comments

chrizmc commented Feb 26, 2018

Hi,

I want to label the content of a webpage before I extract it with phantomjs. For that reason I saved the complete website to my PC with Google Chrome. Then I started phantomjs from the command line like this:

phantomjs.exe extractorscript.js c:/savedlocalwebsite/www.living.com

I am only interested in the text and CSS.

Problem: phantomjs hangs because it tries to load some links that point to the web as local file URLs, for example:
"url": "file://tpc.googlesyndication.com/sodar/V6zvOIoD.js"

  1. Question: how can I prevent phantomjs from requesting these links?
  2. Question: if I cannot prevent it, how can I change the file:// URL back to a real URL?

Thanks in advance
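
For the first question, one possible approach is to abort the unwanted requests in the `page.onResourceRequested` callback. The following is only a minimal sketch, not a confirmed fix for this issue. It assumes that the unwanted requests are the ones of the form file://host/path (a host name directly after `file://`), while the saved page itself is loaded from a file:///... URL with an empty host. The helper name `isRemoteFileUrl` is made up for illustration, and the path is taken from the first command-line argument as in the command above.

```javascript
// Sketch for a PhantomJS script. The helper is plain ES5, so it can also
// be checked outside PhantomJS.

// Assumption: a "remote" file URL has a host part (file://host/path) that
// is not a Windows drive letter. Local files have an empty host
// (file:///C:/...) and are left alone.
function isRemoteFileUrl(url) {
    return /^file:\/\/(?![a-z]:)[^\/]+/i.test(url);
}

// Only run the PhantomJS-specific part when the script runs inside PhantomJS.
if (typeof phantom !== 'undefined') {
    var system = require('system');
    var page = require('webpage').create();

    // Called for every resource the page asks for.
    page.onResourceRequested = function (requestData, networkRequest) {
        if (isRemoteFileUrl(requestData.url)) {
            // Drop the request instead of waiting for a host that
            // does not exist on the local file system.
            networkRequest.abort();
        }
    };

    // system.args[0] is the script name, system.args[1] the first argument.
    page.open(system.args[1], function (status) {
        console.log('load status: ' + status);
        phantom.exit(status === 'success' ? 0 : 1);
    });
}
```

Since only the text and CSS are needed here, dropping these requests should not lose anything relevant, provided the stylesheets were saved locally together with the page.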

ghost commented Feb 27, 2018

@chrizmc Please specify the full path to the selected file(s) and you should be fine.

chrizmc commented Feb 27, 2018

But how do I do that? :-) I need http:// instead of file://

Can I configure phantomjs this way?

I think the problem is that I am using a locally stored website, so phantomjs assumes that everything is local. But not all of the linked resources are stored locally.
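
A likely explanation (an assumption, not confirmed in this thread) is that the saved page contains scheme-relative links such as `//tpc.googlesyndication.com/...`, which are resolved against the `file:` scheme of the local page. Under that assumption, a request can be redirected with `networkRequest.changeUrl()` inside `page.onResourceRequested`. The sketch below uses a made-up helper name, `toHttpUrl`, and picks `http://` as the target scheme only because that is what was asked for above.

```javascript
// Sketch for a PhantomJS script. The helper is plain ES5, so it can also
// be checked outside PhantomJS.

// Rewrites file://host/path to http://host/path. URLs with an empty host
// (file:///C:/...) or a drive letter as "host" are returned unchanged.
function toHttpUrl(url) {
    var match = /^file:\/\/((?![a-z]:)[^\/]+)(\/.*)?$/i.exec(url);
    return match ? 'http://' + match[1] + (match[2] || '/') : url;
}

// Only run the PhantomJS-specific part when the script runs inside PhantomJS.
if (typeof phantom !== 'undefined') {
    var system = require('system');
    var page = require('webpage').create();

    page.onResourceRequested = function (requestData, networkRequest) {
        var rewritten = toHttpUrl(requestData.url);
        if (rewritten !== requestData.url) {
            // Point the request at the real web host instead of the
            // local file system.
            networkRequest.changeUrl(rewritten);
        }
    };

    // system.args[0] is the script name, system.args[1] the first argument.
    page.open(system.args[1], function (status) {
        console.log('load status: ' + status);
        phantom.exit(status === 'success' ? 0 : 1);
    });
}
```

Whether a page loaded from a local file may fetch remote URLs can depend on PhantomJS security settings. If the redirected requests are still blocked, starting PhantomJS with `--local-to-remote-url-access=true` may be needed.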

@stale stale bot added the stale label Dec 25, 2019
stale bot commented Dec 28, 2019

Due to our very limited maintenance capacity (see #14541 for more details), we need to prioritize our development focus on other tasks. Therefore, this issue will be automatically closed. In the future, if we see the need to attend to this issue again, then it will be reopened. Thank you for your contribution!

@stale stale bot closed this as completed Dec 28, 2019