Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Terrible documentation #16

Closed
jsDobromir opened this issue Sep 16, 2022 · 7 comments
Closed

Terrible documentation #16

jsDobromir opened this issue Sep 16, 2022 · 7 comments

Comments

@jsDobromir
Copy link

I appreciate your efforts to create this, but have nod idea how to start it, the explanation is more confusing than helpful.

@anilabhadatta
Copy link
Owner

anilabhadatta commented Sep 16, 2022

Clone the repo,
Download the respective Chrome Binary for your operating system from the releases section and extract it inside the Chrome-bin folder,
image

Run python chromedriver.py in a terminal
Run python educative_scraper.py in a separate terminal

Generate the config for first time use.
For urls.txt file path, add the full path to URLs.txt file
eg, C:/Desktop/urls.txt

config file eg
image

In URLs txt file you should add the URLs of the first topic from a specific course.
eg ; image

In urls.txt file it should show like this;
image
Starting Index is only required if you are scraping from a particular topic otherwise not required.

@jsDobromir
Copy link
Author

Tnx for answer.
I am on Ubuntu and getting this error: Main Exception Message: unknown error: no chrome binary at /home/dobromir/Documents/env/educative.io_scraper-master/Chrome-bin/linux/chrome/chrome

Screenshot from 2022-09-16 14-06-10

@jsDobromir
Copy link
Author

After creating /linux/chrome/chrome and dropping the chrome binary there, getting this error: **Main Exception Message: unknown error: Chrome failed to start: exited abnormally.
(unknown error: DevToolsActivePort file doesn't exist)
(The process started from chrome location /home/dobromir/Documents/env/educative.io_scraper-master/Chrome-bin/linux/chrome/chrome is no longer running, so ChromeDriver is assuming that Chrome has crashed.)
**

@anilabhadatta
Copy link
Owner

anilabhadatta commented Sep 16, 2022

@jsDobromir which chrome-bin did you download?
and is your ubuntu running on arm64 or amd64 architecture?
I think you should also restart your OS once , maybe some process is causing issues.

@jsDobromir
Copy link
Author

Its amd64, restarted, same issue. Thats what I get when I try to open it in the browser.

Screenshot from 2022-09-16 14-38-00

@anilabhadatta
Copy link
Owner

@jsDobromir Let me check, i may need you to share your screen

@anilabhadatta
Copy link
Owner

@jsDobromir Is your issue resolved and should I close this issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants