v0.5.3
π Expanded Publisher Support & Key Bug Fixes π
This release introduces 10 new publishers to fundus, bringing the total to 160 publishers across 37 countries. We've also added a feature that respects publishers' preferences to not be scraped for AI purposes (see our documentation for details).
Additionally, we resolved several bugs related to deadlocks that appeared in specific edge cases within our threading logic.
New Publishers
πΈπͺ Sweden
- Add SE Expressen by @ghostsshadow in #800
π©πͺ Germany
π¬π§ UK
- Add Nature (UK Scientific Journal) by @Kucki2018 in #797
πΊπΈ USA
- Add Rest Of World Publisher by @marten-ti in #801
π§πͺ Belgium
πΏπ¦ South Africa
- Add
Dizindababy @addie9800 in #832 - Add Independent Online newspapers by @addie9800 in #827
Maintained Existing Publishers
- Add
V1_1forSeznamZpravyby @addie9800 in #821 - Fix
ZwanzigMinutenby @addie9800 in #820 - Update
upper_boundary_selectorforNZZby @addie9800 in #819 - Update topics for
Funkeby @addie9800 in #823 - Fix
BoersenZeitungby @addie9800 in #824 - Add
V1_1forZDFby @addie9800 in #825 - Fix
summary_selectorforTheNationby @addie9800 in #831 - Add
V1_1toNTVTRby @addie9800 in #830
New Features
- Add
skip_publishers_disallowing_trainingby @addie9800 in #772 - Update
generic_parsingby @addie9800 in #822
Bug Fixes
- Fix spacing error in
LaVanguardiaby @MaxDall in #795 - Handle malformed XML by @addie9800 in #794
- Fix race conditions and improve exception handling by @MaxDall in #796
- Ignore type check for
MONTHSby @addie9800 in #810 - Update User Agents by @addie9800 in #818
- Fix deadlock in
queue_wrapperby @MaxDall in #833
New Contributors
- @ghostsshadow made their first contribution in #800
- @Kucki2018 made their first contribution in #797
- @marten-ti made their first contribution in #801
- @bresslem made their first contribution in #798
- @rascaria made their first contribution in #811
Full Changelog: v0.5.2...v0.5.3