A machine learning model that extracts the publication date of the page at a given URL, preferably a blog post.
Tools used:
- Beautiful Soup
- scikit-learn
- pandas
- HTML Tag Annotator (https://github.com/SachinKalsi/html_tag_annotator)
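
As a rough illustration of how these tools could fit together, the sketch below uses Beautiful Soup to find tags containing date-like text, pandas to hold one feature row per candidate, and a scikit-learn classifier to decide which candidate is the publication date. It is not the project's actual pipeline: the regular expression, the feature names (`is_time_tag`, `attr_hint`, `depth`), the choice of a decision tree, and the sample HTML are all assumptions made for the example, and the labels, which in practice would come from annotated pages, are hard-coded here.

```python
import re

import pandas as pd
from bs4 import BeautifulSoup
from sklearn.tree import DecisionTreeClassifier

# Hypothetical "looks like a date" pattern; real pages use many more formats.
DATE_RE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b|\b[A-Z][a-z]{2,8} \d{1,2}, \d{4}\b")

# Hypothetical feature columns, chosen only for illustration.
FEATURES = ["is_time_tag", "attr_hint", "depth"]

SAMPLE_HTML = """
<html><body>
<article>
  <h1>Title</h1>
  <span class="post-date">March 14, 2019</span>
  <p>Updated pricing takes effect on 2020-01-01.</p>
</article>
<footer><span id="copyright">Copyright 2024</span></footer>
</body></html>
"""


def candidate_rows(html):
    """Return a DataFrame with one row per text node that contains a date-like string."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for string in soup.find_all(string=DATE_RE):
        tag = string.parent
        # Combine class and id values so they can be searched for naming hints.
        attrs = " ".join(tag.get("class", [])) + " " + (tag.get("id") or "")
        rows.append({
            "text": DATE_RE.search(string).group(0),
            "is_time_tag": int(tag.name == "time"),
            "attr_hint": int(bool(re.search(r"date|publish|posted", attrs, re.I))),
            # Number of ancestors, counting the document root object.
            "depth": len(list(tag.parents)),
        })
    return pd.DataFrame(rows)


df = candidate_rows(SAMPLE_HTML)

# Stand-in labels (1 = publication date); assumed, not taken from real annotations.
labels = [1, 0]

model = DecisionTreeClassifier(random_state=0)
model.fit(df[FEATURES], labels)

# Keep the candidates the classifier marks as publication dates.
predicted = df.loc[model.predict(df[FEATURES]) == 1, "text"].tolist()
print(predicted)
```

With only two training rows this is a toy; a usable model would need labeled candidates from many pages and a held-out set for evaluation.
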
Tutorials on the methodology used:
https://www.youtube.com/playlist?list=PLfSv7CK7EjD2XmStXvZthQjGn1DAhfOaK