Commit
Šarūnas Navickas committed Sep 6, 2016
1 parent 3016080, commit 81d947d
Showing 4 changed files with 58 additions and 0 deletions.
Empty file.
@@ -8,3 +8,4 @@ Contents:

installing
usage
config
@@ -0,0 +1,53 @@
Configuration
=============

Config
------

============ ================= ==================================================================
Value        Notes             Description
============ ================= ==================================================================
name                           A user-friendly name for the configuration
description                    Description of the configuration
site_root                      URL of the site to be scraped
start_page                     Page where scraping starts
cookies      (Optional, Array) Cookies to add to requests
headers      (Optional, Array) Headers to add to requests
proxies      (Optional, Array) List of HTTP proxies to use for requests
pages        (Array)           Array of `Page` objects
parser       (Optional)        Parser for web pages. Default: html.parser; lxml is also supported
auth         (Optional)        `Auth` object
============ ================= ==================================================================

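The table above can be read as a schema. The following is only a sketch of how such a configuration might be loaded and checked, not the project's own loader. It assumes the configuration is stored as JSON (the file format is not stated here), treats every key not marked optional as required, and uses a hypothetical ``load_config`` helper. Apart from the ``html.parser`` default, all default values and example values are assumptions.

```python
import copy
import json

# Keys the table does not mark as optional (treated here as required).
REQUIRED_KEYS = ("name", "description", "site_root", "start_page", "pages")

# Assumed defaults for optional keys; only the parser default comes from the table.
OPTIONAL_DEFAULTS = {
    "cookies": [],
    "headers": [],
    "proxies": [],
    "parser": "html.parser",
    "auth": None,
}


def load_config(text):
    """Parse a JSON config string, check required keys, and fill in defaults."""
    raw = json.loads(text)
    missing = [key for key in REQUIRED_KEYS if key not in raw]
    if missing:
        raise ValueError("missing config keys: " + ", ".join(missing))
    config = copy.deepcopy(OPTIONAL_DEFAULTS)
    config.update(raw)
    return config


# Hypothetical example values, for illustration only.
EXAMPLE = """
{
  "name": "Example site",
  "description": "Scrapes article pages from example.com",
  "site_root": "http://example.com",
  "start_page": "/",
  "proxies": ["http://127.0.0.1:8080"],
  "pages": []
}
"""

config = load_config(EXAMPLE)
print(config["parser"])   # html.parser
print(config["proxies"])  # ['http://127.0.0.1:8080']
```

Copying the defaults before merging keeps the shared default lists from being mutated by any one configuration.
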
Auth
----

============ ================= ==================================================================
Value        Notes             Description
============ ================= ==================================================================
url                            Login URL
method       (Optional)        HTTP method for the login request. Default: POST
params       (Array)           Key-value pairs sent with the request
============ ================= ==================================================================

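As a rough illustration of how an ``Auth`` object could be turned into a login request, the sketch below builds (but does not send) a request with the standard library. It assumes ``params`` is a flat mapping of strings, and that for ``GET`` the pairs belong in the query string while other methods send them as a form-encoded body. Neither detail is specified above. ``build_login_request`` is a hypothetical helper name.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_login_request(auth):
    """Build a urllib Request from an Auth-style dict without sending it."""
    # The table gives POST as the default method.
    method = auth.get("method", "POST").upper()
    encoded = urlencode(auth["params"])
    if method == "GET":
        # Assumption: GET parameters are appended to the query string.
        separator = "&" if "?" in auth["url"] else "?"
        return Request(auth["url"] + separator + encoded, method="GET")
    # Assumption: other methods send the pairs as a form-encoded body.
    return Request(auth["url"], data=encoded.encode("utf-8"), method=method)


# Hypothetical credentials and URL, for illustration only.
auth = {
    "url": "http://example.com/login",
    "params": {"user": "bob", "password": "secret"},
}
request = build_login_request(auth)
print(request.get_method())  # POST
print(request.data)          # b'user=bob&password=secret'
```
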
Page
----

============ ================= ==================================================================
Value        Notes             Description
============ ================= ==================================================================
name                           Name of the page
link_pattern                   Pattern used to decide which page parser to use for a link
mappings     (Array)           Array of `Mapping` objects
============ ================= ==================================================================

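The role of ``link_pattern`` can be sketched as a lookup from a URL to the first matching ``Page``. This assumes the pattern is a regular expression searched anywhere in the URL, which the table does not confirm. ``find_page`` and the example pages are hypothetical.

```python
import re


def find_page(pages, url):
    """Return the first Page-style dict whose link_pattern matches the URL."""
    for page in pages:
        # Assumption: link_pattern is a regular expression.
        if re.search(page["link_pattern"], url):
            return page
    return None


# Hypothetical pages, for illustration only.
pages = [
    {"name": "article", "link_pattern": r"/articles/\d+", "mappings": []},
    {"name": "index", "link_pattern": r"/$", "mappings": []},
]

page = find_page(pages, "http://example.com/articles/42")
print(page["name"])  # article
```

With this approach the order of ``pages`` matters: when several patterns match, the first one wins.
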
Mapping
-------

============ ================= ==================================================================
Value        Notes             Description
============ ================= ==================================================================
name                           Name of the mapping
path                           Path to the element
============ ================= ==================================================================
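
The table does not say which syntax ``path`` uses. Purely as an illustration, the sketch below assumes a path in the limited XPath subset supported by Python's ``xml.etree.ElementTree`` and applies it to a small well-formed document. A real page parsed with ``html.parser`` or ``lxml`` would need that parser's own lookup method instead. ``apply_mappings`` and the sample markup are hypothetical.

```python
import xml.etree.ElementTree as ET


def apply_mappings(root, mappings):
    """Collect the text of the element each Mapping-style dict points at."""
    result = {}
    for mapping in mappings:
        # Assumption: path is an ElementTree-style XPath expression.
        element = root.find(mapping["path"])
        result[mapping["name"]] = element.text if element is not None else None
    return result


# Hypothetical markup and mappings, for illustration only.
DOCUMENT = (
    "<html><body>"
    "<h1>Title</h1>"
    "<div class='author'><span>Jane</span></div>"
    "</body></html>"
)
MAPPINGS = [
    {"name": "title", "path": "./body/h1"},
    {"name": "author", "path": ".//div[@class='author']/span"},
    {"name": "missing", "path": "./body/h2"},
]

extracted = apply_mappings(ET.fromstring(DOCUMENT), MAPPINGS)
print(extracted)  # {'title': 'Title', 'author': 'Jane', 'missing': None}
```

Each mapping ``name`` becomes a key in the result, and a path that matches nothing yields ``None`` rather than raising an error.
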