Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing error and missing content on theregister.com #862

Open
lgrn opened this issue Mar 19, 2024 · 0 comments
Open

Parsing error and missing content on theregister.com #862

lgrn opened this issue Mar 19, 2024 · 0 comments
Labels
component:readability type:bug Something isn't working

Comments

@lgrn
Copy link

lgrn commented Mar 19, 2024

Data

  • Shiori version: 1.6.0 (build 595cb45)
  • Database Engine: sqlite
  • Operating system: Debian 12
  • CLI/Web interface/Web Extension: None

Describe the bug / actual behavior

Shiori fails to parse quotes, they are not included in the saved content.

Expected behavior

The quotes are a part of the article, and should be included, preferably with some kind of UI indication that they are quotes, but at the very least included at all.

To Reproduce

Steps to reproduce the behavior:

  1. Save the article https://www.theregister.com/2024/03/18/truenas_abandons_freebsd/
  2. Inspect the saved content
  3. Note that the paragraph beginning with "The creator of PC-BSD(...)" has been saved
  4. Note that the following quote beginning with "Right now the plan(...)" is missing

Notes

This is an HTML excerpt of the problematic section -- the <p> within the <div> is not included:

<p>The creator of PC-BSD(...)</p>
<div class="blockextract">
<p>Right now the plan(...)</p>
</div>
@lgrn lgrn added the type:bug Something isn't working label Mar 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:readability type:bug Something isn't working
Projects
Status: To do
Development

No branches or pull requests

2 participants