textarcana/scrapers

Scrapers

Some tools to retrieve text or files from remote Web pages.

Grabit.pl

My first Web scraper. It takes one argument: the name of a file containing a newline-delimited list of URLs. When invoked, it starts an interactive prompt that asks which type of file should be downloaded, then downloads all the files of that type that are linked from each of the listed Web pages.

To use it:

  1. Put the URLs of all the pages you want to scrape, one per line, into a text file named FOO

  2. Run perl grabit.pl FOO

  3. You will be prompted to choose which type of file you want to grab.

  4. Enjoy!
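
The workflow described above (read a list of page URLs, pick a file type, collect the matching links from each page) can be sketched roughly as follows. This is an illustrative Python sketch, not the code in grabit.pl: the names read_url_list, links_of_type and LinkCollector are hypothetical, matching on the file extension is an assumption about how "type of file" is decided, and the fetching and downloading steps are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def read_url_list(text):
    """Parse a newline-delimited URL list, skipping blank lines."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def links_of_type(page_url, html, extension):
    """Return absolute URLs linked from html whose path ends in extension.

    Assumption: the file type is recognised by its extension alone.
    """
    collector = LinkCollector()
    collector.feed(html)
    suffix = "." + extension.lower().lstrip(".")
    # Resolve relative links against the page they were found on.
    absolute = (urljoin(page_url, href) for href in collector.hrefs)
    return [u for u in absolute if urlparse(u).path.lower().endswith(suffix)]
```

Given the contents of FOO as a string, read_url_list returns the pages to visit. For each page, links_of_type(page_url, page_html, "pdf") returns the candidate files to download.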
