Skip to content

Commit bddf005

Browse files
authoredJun 8, 2020
Update README.md
1 parent 6be4354 commit bddf005

File tree

1 file changed

+8
-3
lines changed

1 file changed

+8
-3
lines changed
 

Diff for: ‎README.md

+8-3
Original file line numberDiff line numberDiff line change
@@ -22,8 +22,9 @@ To run our example scraper, you are going to need these libraries:
2222
## List of contents
2323

2424
- [Introduction](#introduction)
25-
- [Be polite](#be_polite)
26-
- [Let's get to it](#lets_get_to_it)
25+
- [Be polite](#be-polite)
26+
- [Let's get to it](#lets-get-to-it)
27+
-- [Inspecting the site](#inspecting-the-site)
2728

2829
## Introduction
2930

@@ -35,12 +36,16 @@ Just as you are polite and caring in the real world, you should be such online a
3536

3637
## Let’s get to it
3738

38-
### Inspecting the site
3939
In the following tutorial, you will not only see how a basic scraper is written but will also learn how to adjust it to your own needs. Moreover, you will learn how to do it via a proxy!
4040

4141
As mentioned, we will be using these libraries:
4242
Requests
4343
BeautifulSoup 4
4444
The page we’re going to scrape is http://books.toscrape.com/. It doesn’t have robots.txt, but I think we can agree that the name of the site is asking you to scrape it. But before we carry on with the coding part, let's inspect the website first.
4545

46+
### Inspecting the site
47+
4648
So, this is what the main page of the website looks like. We can see it contains books, their titles, prices, ratings, availability information, and a list of genres in the sidebar.
49+
<p align="center">
50+
<img src="https://smartproxy.com/wp-content/themes/smartproxy/images/smartproxy-logo.svg" alt="Smartproxy logo" width="200" height="50">
51+
</p>

0 commit comments

Comments
 (0)
Please sign in to comment.