This repository has been archived by the owner on Dec 25, 2021. It is now read-only.
Release 1.8.0
— Cleaned up code
— Added availability to use PostgreSQL as archive database
— HTML tags are no longer being removed (except tags like script, style and iframe)
— Added availability to save articles as JSON (but you can still use pickle if you want)
— Some bug fixes
— Removed useless files
If you have problems updating articles try again after removing ~/.tech-parser/articles_dumped file.
You will have to update your current parser configuration.
Take a look at TechParser/parser_config.py.