JavaScript Scraping Tutorial

Introduction

For scraping web pages that use JavaScript, we can use Selenium, a browser automation library. Selenium allows us to control a web browser through code, enabling us to navigate pages, interact with elements, and extract dynamic information generated by JavaScript.

In this tutorial, we will create a Python script that uses Selenium to extract links from a web page. Additionally, we will use Docker to run our script in an isolated environment.

Usage

Start Services and Run the Scraping Script

Build the Docker image and start the services:
```
make run-js
```
Run the scraping script withoud javascript:
```
make run
```

This will start the Nginx web server and the scraping script inside a Docker container, extracting the links from the specified web page.

Conclusion

This tutorial demonstrates how to use Selenium and Docker for scraping a web page that uses JavaScript. Selenium enables us to effectively interact with dynamic web pages, while Docker ensures that our environment is isolated and reproducible.

For more details, you can refer to the tutorial News Technology .

Happy scraping!

Let me know if you need any further clarifications or if there's anything else I can help you with! 😊

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
html		html
script		script
Makefile		Makefile
README.md		README.md
stack.yml		stack.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

JavaScript Scraping Tutorial

Introduction

Usage

Start Services and Run the Scraping Script

Conclusion

About

Uh oh!

Releases

Packages

Uh oh!

Languages

danelsan/tutorial-scraping-javascript

Folders and files

Latest commit

History

Repository files navigation

JavaScript Scraping Tutorial

Introduction

Usage

Start Services and Run the Scraping Script

Conclusion

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages