-
Python3.
-
Python3 Libraries:
- Pandas: For creating data frames.
- bs4: Provides the BeautifulSoup class, which parses web pages.
- urllib.request: Provides the urlopen function, which opens links to the pages that need to be parsed.
-
Special Import Case:
- MySQLdb: used by the Important Dates scraper to help with escaping strings.
- re: used for regex matching.
- copy: used to copy and manipulate data.
- datetime: used to convert to datetime.
-
Important Dates Scraper:
- Used for parsing and creating MySQL queries from the Important Dates page.
- Link to Important Dates page: https://bit.ly/37RmY4m
- Produces a .sql file to be uploaded for mobile app calendar data.
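
As a rough illustration of the step above, the sketch below turns (event, date) pairs into INSERT statements. The table name `important_dates`, the column names, the date format, and the `escape_sql` helper are all assumptions made for this example; the actual scraper relies on MySQLdb for escaping, and `escape_sql` is only a simplified stand-in so the snippet runs without a database driver.

```python
from datetime import datetime


def escape_sql(text):
    # Simplified stand-in for the escaping the real scraper gets from MySQLdb:
    # escape backslashes first, then single quotes.
    return text.replace("\\", "\\\\").replace("'", "\\'")


def build_insert(event, date_text, table="important_dates"):
    # Assumes dates look like "January 15, 2020"; adjust the format string
    # to whatever the Important Dates page actually uses.
    day = datetime.strptime(date_text, "%B %d, %Y").date()
    return (
        f"INSERT INTO {table} (event, event_date) "
        f"VALUES ('{escape_sql(event)}', '{day.isoformat()}');"
    )


# Hypothetical rows standing in for what the scraper would extract.
rows = [
    ("Last day to drop classes", "January 15, 2020"),
    ("Registrar's deadline", "March 2, 2020"),
]
statements = [build_insert(event, date_text) for event, date_text in rows]

# One statement per line, ready to be written to a .sql file.
sql_script = "\n".join(statements) + "\n"
```

Writing `sql_script` to disk is then a single `open(path, "w")` call; it is left out here so the sketch has no side effects.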
-
Accordion Parser:
- This scraper is used to parse any FAQs that use accordions.
- Example of Accordion page: https://bit.ly/33xnEsD
- Produces a CSV file of intents to be uploaded to Watson Assistant.
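
The CSV output step might look something like the sketch below. It assumes the intents file holds one row per example, with the example text first and the intent name second, and it derives intent names with a hypothetical `intent_name` helper. It also uses the standard `csv` module rather than Pandas so that it runs without extra dependencies; the questions are made-up placeholders for text pulled from accordion headings.

```python
import csv
import io
import re


def intent_name(question):
    # Hypothetical naming scheme: lowercase the question and collapse every
    # run of non-alphanumeric characters into a single underscore.
    return re.sub(r"[^a-z0-9]+", "_", question.lower()).strip("_")


def intents_csv(questions):
    # Build the CSV in memory: one row per question, example text then intent.
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for question in questions:
        writer.writerow([question, intent_name(question)])
    return buffer.getvalue()


# Placeholder questions standing in for scraped accordion headings.
faq_questions = ["How do I reset my password?", "Where is the library?"]
csv_text = intents_csv(faq_questions)
```

Using `csv.writer` means questions that contain commas or quotes are quoted automatically, which string concatenation would not handle.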
-
Strong Tags Parser:
- This scraper parses pages that have information in <strong></strong> tags.
- Example of page with strong tags: https://bit.ly/2OALU8Z
- Produces a CSV file of intents to be uploaded to Watson Assistant.
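
A minimal sketch of the extraction idea follows. To stay dependency-free it uses the standard library's `html.parser` instead of BeautifulSoup, which is what the scraper itself imports; with BeautifulSoup the same lookup is usually a `soup.find_all("strong")` call. The class name `StrongTextCollector` and the sample HTML are made up for illustration.

```python
from html.parser import HTMLParser


class StrongTextCollector(HTMLParser):
    """Collects the text found inside <strong> tags (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.depth = 0    # how many <strong> tags are currently open
        self.parts = []   # text fragments of the <strong> being read
        self.found = []   # finished, whitespace-normalized strings

    def handle_starttag(self, tag, attrs):
        if tag == "strong":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "strong" and self.depth:
            self.depth -= 1
            if self.depth == 0:
                # Join fragments and collapse runs of whitespace.
                text = " ".join("".join(self.parts).split())
                if text:  # skip empty <strong></strong> pairs
                    self.found.append(text)
                self.parts = []

    def handle_data(self, data):
        if self.depth:
            self.parts.append(data)


# Made-up markup standing in for a real page.
sample_html = """
<p><strong>What are the library hours?</strong> Open 8am to 10pm.</p>
<p><strong>Where do I  park?</strong> Lot C.</p>
<p><strong></strong></p>
"""
collector = StrongTextCollector()
collector.feed(sample_html)
strong_texts = collector.found
```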