Skip to content

Conversation

@wofferl
Copy link
Collaborator

@wofferl wofferl commented Dec 12, 2025

Summary

To avoid encoding problems when using full text scraping, the downloaded content is converted to utf-8 before parsing and stripping it with the readability library.

Checklist

Signed-off-by: Wolfgang <github@linux-dude.de>
@wofferl wofferl added 3. to review API Impact API/Backend code labels Dec 12, 2025
@SMillerDev SMillerDev merged commit ff06025 into nextcloud:master Dec 16, 2025
29 checks passed
@Grotax Grotax mentioned this pull request Jan 12, 2026
Grotax added a commit that referenced this pull request Jan 13, 2026
Changed
- Drop Support for Nextcloud 31 (#3485)
- Add Support for Nextcloud 33 (#3485)
- Add feature to Group starred Items per Feed (#3148)
- Add `occ` commands for importing/exporting articles (#3437)
- Add importing/exporting articles to frontend settings (#3437)
- App settings are now displayed in an app settings dialog (#3500)

Fixed
- Special characters may be displayed incorrectly when full text is enabled (#3472)
- Wrong url to feed page (#3486)
- Long articles are cut short (#3489)
- Imported feeds or newly created folders do not disappear immediately when `showAll` is not set (#3500)

Signed-off-by: Benjamin Brahmer <info@b-brahmer.de>
Grotax added a commit that referenced this pull request Jan 13, 2026
Changed
- Drop Support for Nextcloud 31 (#3485)
- Add Support for Nextcloud 33 (#3485)
- Add feature to Group starred Items per Feed (#3148)
- Add `occ` commands for importing/exporting articles (#3437)
- Add importing/exporting articles to frontend settings (#3437)
- App settings are now displayed in an app settings dialog (#3500)

Fixed
- Special characters may be displayed incorrectly when full text is enabled (#3472)
- Wrong url to feed page (#3486)
- Long articles are cut short (#3489)
- Imported feeds or newly created folders do not disappear immediately when `showAll` is not set (#3500)

Signed-off-by: Benjamin Brahmer <info@b-brahmer.de>
@wofferl wofferl deleted the fix_scraper_encoding branch January 13, 2026 14:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3. to review API Impact API/Backend code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

NextCloud news strips the Danish characters æ, ø and å (and their upper-case equivalents Æ, Ø and Å) from articles.

2 participants