No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.gitignore
.travis.yml
README.md
bootsnall.py
data_indiblogger.xls
fb_page_feed_check.php
ibawards.py
indiblogger.py
screenshot.py
tests.py
timeout.py
trpt.py

README.md

Yet another thing I came up with because I was lazy! Here's how to run it. Note: Do not use it for commercial purposes.

###Install dependencies:

sudo apt-get install python-bs4

#####Alternate installation:

sudo apt-get install python-pip
sudo pip install beautifulsoup4

#####Installing PyQt (For Screenshots)

pip install PyQt
In case you have isseus installing the package, visit this link-
http://movingthelamppost.com/blog/html/2013/07/12/installing_pyqt____because_it_s_too_good_for_pip_or_easy_install_.html

###Clone the repository:

git clone https://github.com/sdaityari/web-scraping

##Tools #####trpt.py (Obsolete after the restructuring of their site) Prints out trip information about a trip in tripoto. Takes in trip id as input.

azure@ubuntu:~/web-scraping$ python trpt.py
Enter trip id: 273
Getting webpage...
Title: The Handicraft Tour
Travel Agent: Royal Tours India
Places: New Delhi, Agra, Jaipur, Jodhpur, Udaipur

#####bootsnall.py Gets information about all world travelers from travelers.bootsnall.com and saves it in an xls file!

#####fb_page_feed_check.php Run it in your web browser. Fill in the arrays of misleading words and terms to check for in the feed. Also put it the fb page id.

#####indiblogger.py Gets the highest no of posts by any blogger in a given contest

#####screenshot.py Takes screenshot of a given url