davidesantangelo/webinspector

Webinspector

Ruby gem to completely inspect a web page. It scrapes a given URL and returns its title, description, meta tags, links, images, and more.

See it in action!

You can try WebInspector live at this little demo: https://scrappet.herokuapp.com

Installation

Add this line to your application's Gemfile:

gem 'webinspector'

And then execute:

$ bundle

Or install it yourself as:

$ gem install webinspector

Usage

Initialize a WebInspector instance for a URL, like this:

page = WebInspector.new('http://davidesantangelo.com')

Accessing response status and headers

You can check the status and headers from the response like this:

page.response.status  # 200
page.response.headers # { "server"=>"apache", "content-type"=>"text/html; charset=utf-8", "cache-control"=>"must-revalidate, private, max-age=0", ... }
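
The headers come back as key/value pairs, so individual values can be picked apart with ordinary string handling. The sketch below uses a plain Hash holding the sample values shown above as a stand-in for `page.response.headers` (an assumption; the real object may differ) and extracts the character set from the `content-type` header.

```ruby
# Stand-in for page.response.headers: a plain Hash with the sample values
# from the example above (assumption, not the gem's actual object).
headers = {
  'server' => 'apache',
  'content-type' => 'text/html; charset=utf-8',
  'cache-control' => 'must-revalidate, private, max-age=0'
}

# Split "text/html; charset=utf-8" into the media type and its parameters.
media_type, *params = headers['content-type'].split(';').map(&:strip)

# Turn ["charset=utf-8"] into { "charset" => "utf-8" } and look up the charset.
charset = params.map { |param| param.split('=', 2) }.to_h['charset']

puts "media type: #{media_type}"
puts "charset:    #{charset}"
```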

Accessing inspected data

You can see the data like this:

page.url                 # URL of the page
page.scheme              # Scheme of the page (http, https)
page.host                # Hostname of the page (e.g. davidesantangelo.com, without the scheme)
page.port                # Port of the page
page.title               # Title of the page from the head section, as a string
page.description         # description of the page
page.links               # every link found
page.images              # every image found
page.meta                # metatags of the page
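
The `scheme`, `host`, and `port` accessors correspond to the standard components of a URL. As a rough illustration that does not depend on the gem, Ruby's standard `URI` library splits the same example URL into those parts. This only shows what the values look like; it is not necessarily how WebInspector computes them.

```ruby
require 'uri'

# Parse the example URL with Ruby's standard library (not the gem itself).
uri = URI.parse('http://davidesantangelo.com')

url_parts = {
  scheme: uri.scheme, # the protocol part before "://"
  host:   uri.host,   # the hostname, without the scheme
  port:   uri.port    # falls back to the scheme's default port when none is given
}

puts url_parts
```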

Accessing meta tags

page.meta                 # metatags of the page
page.meta['description']  # meta description
page.meta['keywords']     # meta keywords
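
Since meta tags are looked up by name, a page that lacks a given tag needs some fallback handling. The sketch below uses a plain Hash with made-up values as a stand-in for `page.meta` (an assumption; the gem's return type and its behavior for missing tags are not documented here) and shows one way to split keywords and supply defaults.

```ruby
# Stand-in for page.meta: a plain Hash with string keys and made-up values
# (assumption, for illustration only).
meta = {
  'description' => 'Personal site',
  'keywords'    => 'ruby, gem, scraping'
}

# Split the comma-separated keywords into a clean array;
# an absent tag yields an empty array instead of an error.
keywords = meta.fetch('keywords', '').split(',').map(&:strip)

# Fall back to a placeholder when a tag is absent.
author = meta.fetch('author', 'unknown')

puts keywords.inspect
puts author
```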

Find words (as array)

page.find(["word1", "word2"]) # returns {"word1"=>3, "word2"=>1}
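
To make the shape of that result concrete, here is a minimal sketch of counting word occurrences in a string. `count_words` is a hypothetical helper, not part of the gem, and the case-insensitive, whole-word matching is an assumption; the gem's own matching rules may differ.

```ruby
# Hypothetical helper, not part of the gem: counts how often each word
# appears in a piece of text, ignoring case (an assumption).
def count_words(text, words)
  # Break the text into lowercase word tokens.
  tokens = text.downcase.scan(/\w+/)
  # Build a hash mapping each requested word to its number of occurrences.
  words.to_h { |word| [word, tokens.count(word.downcase)] }
end

text = 'word1 and word2, then word1 again, and WORD1 once more'
counts = count_words(text, %w[word1 word2])

p counts
```

With this sample text, `word1` is counted three times and `word2` once, the same shape as the hash shown above.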

License

The webinspector gem is released under the MIT License.

Contributing

  1. Fork it ( https://github.com/[my-github-username]/webinspector/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request
