Ontario Tech University Web Scrapers

Repository Contents:

Python 3.
Python 3 Libraries:
- pandas: for creating data frames.
- bs4: provides BeautifulSoup, which parses web pages.
- urllib.request: provides the urlopen function, which opens the pages that need to be parsed.
Special Import Case:
- MySQLdb: used by the Important Dates scraper to escape strings.
- re: used for regex matching.
- copy: used to copy and manipulate data.
- datetime: used to convert to datetime.

Important Dates Scraper:
- Used for parsing and creating MySQL queries from the Important Dates page.
- Link to Important Dates page: https://bit.ly/37RmY4m
- Produces a .sql file to be uploaded for mobile app calendar data.
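As a rough illustration of the step from scraped dates to a .sql file, here is a minimal sketch. The table name, column names, and date format are assumptions made for the example, and `escape_sql` is a simplified stand-in for the escaping that MySQLdb provides, not the scraper's actual code.

```python
from datetime import datetime

# Hypothetical table name; the real schema is not given in this README.
TABLE = "important_dates"


def escape_sql(text):
    """Simplified stand-in for MySQLdb escaping: backslashes and single quotes only."""
    return text.replace("\\", "\\\\").replace("'", "\\'")


def to_insert(description, date_text, date_format="%B %d, %Y"):
    """Build one INSERT statement from a scraped description and date string."""
    day = datetime.strptime(date_text, date_format).date()
    return (
        f"INSERT INTO {TABLE} (event_date, description) "
        f"VALUES ('{day.isoformat()}', '{escape_sql(description)}');"
    )


def write_sql(rows, path):
    """Write one INSERT statement per (description, date_text) pair to a .sql file."""
    with open(path, "w", encoding="utf-8") as handle:
        for description, date_text in rows:
            handle.write(to_insert(description, date_text) + "\n")
```

Calling `write_sql([("Lectures begin", "January 6, 2020")], "dates.sql")` would write a single INSERT line to `dates.sql`.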
Accordion Parser:
- This scraper parses any FAQs that use accordions.
- Example of Accordion page: https://bit.ly/33xnEsD
- Produces a CSV file of intents to be uploaded to Watson Assistant.
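The CSV output step could look something like the sketch below. It assumes each FAQ question becomes one example paired with an intent name, one pair per row; the `intent_name` naming scheme is hypothetical, so check the row layout against the current Watson Assistant import documentation before relying on it.

```python
import csv
import re


def intent_name(question):
    """Turn a question into an identifier-style intent name (hypothetical scheme)."""
    words = re.findall(r"[A-Za-z0-9]+", question.lower())
    return "_".join(words[:6])


def write_intents(questions, path):
    """Write one (example, intent) row per question to a CSV file."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        for question in questions:
            writer.writerow([question, intent_name(question)])
```

The `csv` module is used here instead of pandas only to keep the sketch free of third-party packages; a pandas data frame written with `to_csv` would serve the same purpose.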
Strong Tags Parser:
- This scraper parses pages that have information in <strong></strong> tags.
- Example of page with strong tags: https://bit.ly/2OALU8Z
- Produces a CSV file of intents to be uploaded to Watson Assistant.
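To show the idea of pulling text out of <strong></strong> tags, here is a minimal sketch. It deliberately uses the standard library's `html.parser` rather than BeautifulSoup so that it runs without third-party packages; the class and function names are made up for the example and are not taken from the scraper.

```python
from html.parser import HTMLParser


class StrongTextCollector(HTMLParser):
    """Collect the text found inside each <strong>...</strong> element."""

    def __init__(self):
        super().__init__()
        self.depth = 0      # greater than 0 while inside a <strong> element
        self.current = []   # text pieces of the element being read
        self.results = []   # finished strings, in document order

    def handle_starttag(self, tag, attrs):
        if tag == "strong":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "strong" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                # Collapse runs of whitespace and skip empty elements.
                text = " ".join("".join(self.current).split())
                if text:
                    self.results.append(text)
                self.current = []

    def handle_data(self, data):
        if self.depth > 0:
            self.current.append(data)


def strong_texts(html):
    """Return the text of every <strong> element in an HTML string."""
    parser = StrongTextCollector()
    parser.feed(html)
    parser.close()
    return parser.results
```

With BeautifulSoup, the equivalent lookup would typically be a `find_all("strong")` call on the parsed page.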

Files:
- .gitignore
- README.md
- webscraper-Accordion-Tags.py
- webscraper-Important-Dates.py
- webscraper-Strong-Tags.py