Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request guideline #6

Open
trung1704ptit opened this issue Jun 24, 2022 · 6 comments
Open

Request guideline #6

trung1704ptit opened this issue Jun 24, 2022 · 6 comments

Comments

@trung1704ptit
Copy link

Hi,
Thanks for your sharing repo. But I cannot run crawl indeed job. I already run by docker. When I see the Flower screen it created a job success but nothing happen. The question is how do I know it works.

@yennanliu
Copy link
Owner

hi @trung1704ptit
thx for the feedback! You may need to run indeed scrapping manually.
Kindly try the below commands mentioned in readme

https://github.com/yennanliu/web_scraping

# "indeed_scrap_task" task
$ curl -X POST  http://localhost:5555/api/task/async-apply/tasks.indeed_scrap_task

# "indeed_scrap_api_V1" task
$ curl -X POST -d '{"args":["New+York"]}' http://localhost:5555/api/task/async-apply/tasks.indeed_scrap_api_V1

Cheers,

@yennanliu
Copy link
Owner

yennanliu commented Jun 24, 2022

you can also refer below stand-alone py script, it should be able to execute indeed scrapping as well
https://github.com/yennanliu/web_scraping/blob/master/cron_indeed_scrapping_test.py

@trung1704ptit
Copy link
Author

@yennanliu Thank you, I have already run it. But need to update the code to crawl the job information because indeed updated the format HTML.

@trung1704ptit
Copy link
Author

and more question is if I crawl job successfully, how do I see the result on MongoDB. I don't see the way to see the result

@yennanliu
Copy link
Owner

@yennanliu Thank you, I have already run it. But need to update the code to crawl the job information because indeed updated the format HTML.

yes, you may have to update the code when Indeed updates its HTML.
A PR is welcomed!

@yennanliu
Copy link
Owner

yennanliu commented Jun 26, 2022

and more question is if I crawl job successfully, how do I see the result on MongoDB. I don't see the way to see the result

Hi @trung1704ptit
Not sure if you can find such information in Redis instead. (Redis as CELERY_RESULT_BACKEND)
MongoDB is not used in this service actually.
https://github.com/yennanliu/web_scraping/blob/master/docker-compose.yml#L28
https://github.com/yennanliu/web_scraping/blob/master/Dockerfile#L7

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants