Skip to content

Commit

Permalink
Merge pull request #10 from lawzava/feature/flatten-cli
Browse files Browse the repository at this point in the history
v1.6.0
  • Loading branch information
lawzava committed Mar 6, 2021
2 parents fa9b516 + e134783 commit 46f2bf1
Show file tree
Hide file tree
Showing 15 changed files with 98 additions and 423 deletions.
2 changes: 1 addition & 1 deletion .github/workflows/golang.yml
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@ jobs:
- name: Setup Go for use with actions
uses: actions/setup-go@v2
with:
go-version: 1.15
go-version: 1.16
- uses: actions/checkout@v2
- uses: actions/cache@v2
with:
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/golangci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -10,4 +10,4 @@ jobs:
- name: golangci-lint
uses: golangci/golangci-lint-action@v2
with:
version: v1.35
version: v1.36
2 changes: 1 addition & 1 deletion .github/workflows/release.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ jobs:
name: Set up Go
uses: actions/setup-go@v1
with:
go-version: 1.15.x
go-version: 1.16.x
-
name: Run GoReleaser
uses: goreleaser/goreleaser-action@v2
Expand Down
1 change: 1 addition & 0 deletions .golangci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -89,6 +89,7 @@ linters:
- makezero
- predeclared
- thelper
- ifshort

issues:
exclude-rules:
Expand Down
6 changes: 5 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,11 +23,15 @@ Depends on `chromium` or `google-chrome` being available in path if `--js` is us
```
--async Scrape website pages asynchronously (default true)
-d, --depth int Max depth to follow when scraping recursively (default 3)
--emails Scrape emails (default true)
--follow-external Follow external 3rd party links within website
-h, --help help for scrape
--js Enables JS execution await
--debug Print debug logs
--recursively Scrape website recursively (default true)
-w, --website string Website to scrape (default "https://lawzava.com")
```

### Note about scraper package

For those that are looking for `scraper` package - this repository was intended as a cli-use only thus the scraper package was moved to [lawzava/emailscraper](https://github.com/lawzava/emailscraper).
The `scrape` utility will be maintained as a CLI implementation of `emailscraper` package.
63 changes: 0 additions & 63 deletions cmd/root.go

This file was deleted.

16 changes: 7 additions & 9 deletions go.mod
Original file line number Diff line number Diff line change
@@ -1,25 +1,23 @@
module github.com/lawzava/scrape

go 1.15
go 1.16

require (
github.com/PuerkitoBio/goquery v1.6.1 // indirect
github.com/andybalholm/cascadia v1.2.0 // indirect
github.com/antchfx/htmlquery v1.2.3 // indirect
github.com/antchfx/xmlquery v1.3.3 // indirect
github.com/antchfx/xmlquery v1.3.5 // indirect
github.com/antchfx/xpath v1.1.11 // indirect
github.com/chromedp/chromedp v0.6.5
github.com/chromedp/chromedp v0.6.8
github.com/gobwas/glob v0.2.3 // indirect
github.com/gocolly/colly v1.2.0
github.com/golang/protobuf v1.4.3 // indirect
github.com/kennygrant/sanitize v1.2.4 // indirect
github.com/lawzava/emailscraper v1.0.0 // indirect
github.com/lawzava/go-tld v1.0.0
github.com/saintfish/chardet v0.0.0-20120816061221-3af4cd4741ca // indirect
github.com/spf13/cobra v1.1.1
github.com/spf13/cobra v1.1.3
github.com/temoto/robotstxt v1.1.1 // indirect
golang.org/x/net v0.0.0-20210119194325-5f4716e94777 // indirect
golang.org/x/sys v0.0.0-20210123111255-9b0068b26619 // indirect
golang.org/x/net v0.0.0-20210226172049-e18ecbb05110 // indirect
golang.org/x/sys v0.0.0-20210305230114-8fe3ee5dd75b // indirect
golang.org/x/text v0.3.5 // indirect
google.golang.org/appengine v1.6.7 // indirect
google.golang.org/protobuf v1.25.0 // indirect
)
Loading

0 comments on commit 46f2bf1

Please sign in to comment.