README

A web app to crawl content especially photo assets using curl and websocket written in php, html5 and javascript.

Websocket:

ws://host:port/path (ex. ws://localhost:8000/echo)

Server: PHP websockets https://github.com/ghedipunk/PHP-WebSockets

php -q bin/socket.php

Example usage:

http://localhost:80 Press connect socket button Paste 'https://en.wikipedia.org/wiki/World' to textarea Press Run

Format for input:

url|folder name|selection

https://en.wikipedia.org/wiki/World|wiki-world-images|range(1, 10)

url is the only required portion
folder name if you want to specify name of the folder where images will be saved to
selection can be:
- an array will specify pages to download & skip the rest e.g [11, 14, 15]
- a number will set a starting page to download from & skip the prior ones e.g 11
- range(1, 10) will tell apps to crawl image 1 to 10

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
bin		bin
css		css
fonts		fonts
include		include
js		js
src		src
test		test
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
composer.json		composer.json
composer.lock		composer.lock
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bin

bin

css

css

fonts

fonts

include

include

js

js

src

src

test

test

.gitignore

.gitignore

.gitmodules

.gitmodules

README.md

README.md

composer.json

composer.json

composer.lock

composer.lock

index.html

index.html

Repository files navigation

README

Websocket:

About

Releases

Packages

Languages

nghiaqh/img-crawler-php

Folders and files

Latest commit

History

Repository files navigation

README

Websocket:

About

Topics

Resources

Stars

Watchers

Forks

Languages