web-scrape-script-ruby

Clone the repository as:

git clone git@github.com:CuriousSugam/web-scrape-script-ruby.git

docker build -t scraper .

After the docker build is complete, you can the run the container as:

docker run -i scraper bash

This will interactively execute the docker container and you'll be able to run commands in bash. Now, run the script as:

./app.rb stackoverflow.com http://www.google.com

Display the metadata for the given urls along with fetching and saving the urls:

./app.rb --metadata stackoverflow.com http://www.google.com

rspec spec/

I have not added in the specs for the all services. This is a must for a production grade code.
I have assumed there won't be much of a data so intead of using a database I've stored the information in a file.
The generated html files could be saved in a folder so that these files are not scattered in root path.
I've not fetched all the assets of the web page so that it could be run locally without breaking the pages.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
file_db		file_db
service		service
spec		spec
.gitignore		.gitignore
.rspec		.rspec
Dockerfile		Dockerfile
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
app.rb		app.rb
config.rb		config.rb
db.json		db.json

Provide feedback