Generic library functions
clean an HTML source by removing attributes and styles, and return the raw content
check if the file type of a given source path matches a given file type
check if a given date is in a given slice
download a file given the source and destination
make a directory if it does not exist
extract the main domain from a given source path
extract filename from a given source path
convert relative URLs to absolute URLs
convert an HTML string to a queryable document
return the maximum of a slice of positive numbers
return the minimum of a slice of positive numbers
check if a given string exists in a given slice
convert a categories string into a slice
read and extract content from a given PDF source file path
standardize titles to make them URL-compatible by removing error-prone characters
check if a given string is contained in any string in a given slice
check if a given string exists in a given slice
compute the similarity percentage of two given strings
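
A few of the helpers listed above can be sketched as follows. This is a minimal illustration only, assuming Go (suggested by the use of "slice"). The function names containsString, containedInAny, maxPositive and toAbsoluteURL are hypothetical and are not taken from the library itself, and the actual implementations may differ.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// containsString (hypothetical name) reports whether target is exactly
// equal to one of the elements of items.
func containsString(items []string, target string) bool {
	for _, item := range items {
		if item == target {
			return true
		}
	}
	return false
}

// containedInAny (hypothetical name) reports whether target is a substring
// of at least one element of items.
func containedInAny(items []string, target string) bool {
	for _, item := range items {
		if strings.Contains(item, target) {
			return true
		}
	}
	return false
}

// maxPositive (hypothetical name) returns the largest value in a slice of
// positive numbers. It returns 0 for an empty slice, which is unambiguous
// only because every input is assumed to be positive.
func maxPositive(nums []int) int {
	best := 0
	for _, n := range nums {
		if n > best {
			best = n
		}
	}
	return best
}

// toAbsoluteURL (hypothetical name) resolves a possibly relative link
// against the URL of the page it was found on.
func toAbsoluteURL(base, link string) (string, error) {
	b, err := url.Parse(base)
	if err != nil {
		return "", err
	}
	l, err := url.Parse(link)
	if err != nil {
		return "", err
	}
	return b.ResolveReference(l).String(), nil
}

func main() {
	tags := []string{"news", "sports", "tech"}
	fmt.Println(containsString(tags, "tech")) // true
	fmt.Println(containedInAny(tags, "port")) // true ("sports" contains "port")
	fmt.Println(maxPositive([]int{3, 9, 4}))  // 9

	abs, err := toAbsoluteURL("https://example.com/blog/post.html", "../img/logo.png")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(abs) // https://example.com/img/logo.png
}
```

The sketch keeps exact matching (containsString) separate from substring matching (containedInAny), mirroring the two distinct descriptions in the list.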