nghiaqh/img-crawler-php

A web app for crawling web content, especially image assets, using cURL and WebSockets. Written in PHP, HTML5 and JavaScript.

Websocket:

ws://host:port/path (e.g. ws://localhost:8000/echo)

Server: PHP-WebSockets (https://github.com/ghedipunk/PHP-WebSockets). Start it with:

php -q bin/socket.php
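
For illustration, a browser-side client for a socket server like this could look roughly as follows. This is a minimal sketch, not the project's actual client code: `buildSocketUrl` and `connectAndSend` are hypothetical helper names, and sending the raw input line as a single text message is an assumption about the protocol. Only the standard browser `WebSocket` API is used.

```javascript
// Build a WebSocket URL in the ws://host:port/path form.
// (Hypothetical helper, not part of the project.)
function buildSocketUrl(host, port, path) {
  // Normalize the path so it starts with exactly one slash.
  const cleanPath = '/' + String(path).replace(/^\/+/, '');
  return `ws://${host}:${port}${cleanPath}`;
}

// Open a connection and send one input line once the socket is ready.
// Assumption: the server accepts the raw line as a single text message.
function connectAndSend(url, line, onMessage) {
  const socket = new WebSocket(url);
  socket.addEventListener('open', () => socket.send(line));
  socket.addEventListener('message', (event) => onMessage(event.data));
  socket.addEventListener('error', () => console.error('socket error'));
  return socket;
}

// Example call (left commented out so nothing connects on load):
// connectAndSend(buildSocketUrl('localhost', 8000, 'echo'),
//                'https://en.wikipedia.org/wiki/World', console.log);
```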

Example usage:

  • Open http://localhost:80
  • Press the "connect socket" button
  • Paste 'https://en.wikipedia.org/wiki/World' into the textarea
  • Press Run

Format for input:

url|folder name|selection

https://en.wikipedia.org/wiki/World|wiki-world-images|range(1, 10)
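
As a rough sketch of how such a pipe-delimited line could be split into its three fields, consider the snippet below. `splitInputLine` is a hypothetical name, not a function from this project, and the sketch assumes that the URL itself contains no `|` character and that missing trailing fields are simply left off.

```javascript
// Split "url|folder name|selection" into its parts.
// (Hypothetical helper; assumes the URL contains no '|' character.)
function splitInputLine(line) {
  const [url = '', folder = '', selection = ''] =
    line.split('|').map((part) => part.trim());
  if (url === '') {
    throw new Error('url is required');
  }
  // Missing or empty optional fields are reported as null.
  return { url, folder: folder || null, selection: selection || null };
}
```

Under these assumptions, a line that contains only a URL yields `folder` and `selection` set to `null`.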

  • url is the only required field
  • folder name is optional and sets the name of the folder where images are saved
  • selection is optional and can be:
    • an array, which downloads only the listed pages and skips the rest, e.g. [11, 14, 15]
    • a number, which sets the page to start downloading from and skips the earlier ones, e.g. 11
    • range(1, 10), which tells the app to crawl images 1 to 10
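
The three selection forms can be modelled as a predicate over 1-based item numbers. The following is only a sketch under stated assumptions, not the project's implementation: `parseSelection` is a hypothetical name, an empty selection is assumed to mean "download everything", and `range(a, b)` is assumed to include both endpoints.

```javascript
// Turn a selection string into a function that answers
// "should item number n be downloaded?". (Hypothetical helper.)
function parseSelection(text) {
  const value = (text || '').trim();

  // Assumption: no selection means every item is kept.
  if (value === '') return () => true;

  // range(a, b): keep items a..b, assumed inclusive on both ends.
  const range = value.match(/^range\(\s*(\d+)\s*,\s*(\d+)\s*\)$/);
  if (range) {
    const start = Number(range[1]);
    const end = Number(range[2]);
    return (n) => n >= start && n <= end;
  }

  // [a, b, c]: keep only the listed items.
  const list = value.match(/^\[\s*(\d+(?:\s*,\s*\d+)*)\s*\]$/);
  if (list) {
    const wanted = new Set(list[1].split(',').map(Number));
    return (n) => wanted.has(n);
  }

  // A single number: start from that item and skip the earlier ones.
  if (/^\d+$/.test(value)) {
    const first = Number(value);
    return (n) => n >= first;
  }

  throw new Error(`unrecognized selection: ${value}`);
}
```

A caller could then filter candidate items with, for example, `items.filter((_, i) => keep(i + 1))`, where `keep` is the returned predicate.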
