Obscure parsing error on cheezburger.com #331

kylealwyn · 2023-03-07T01:46:33Z

Hello!

Having an issue parsing https://cheezburger.com/19495941/33-purrfect-cat-memes-for-all-the-grumpy-cats-on-monday-morning-february-27-2023 (or really any article on that site) both locally and on https://extractor-demos.pages.dev/article-extractor. Receiving Cannot read properties of null (reading 'length').

The text was updated successfully, but these errors were encountered:

ndaidong · 2023-03-07T04:18:10Z

@kylealwyn thank you. I've checked and will fix that now.

The problem is that website uses wrong property name for Twitter Cards:

They should use "content" instead of "value".

Anyway we have to handle this case better.

- Fix issue #331 - Update dependencies - Remove unnecessary watermark

ndaidong · 2023-03-07T05:02:54Z

@kylealwyn done. However that website's HTML structure is not convenient for extracting article. You may need to add transformations to improve your extraction result.

kylealwyn · 2023-03-07T06:02:27Z

Appreciate it @ndaidong! Was coming here to share one more site with same issue: https://themerkle.com/exploring-the-potential-of-collateral-network-colt-ethereum-eth-and-quant-qnt/

Will see if 7.2.10 is solve for both!

kylealwyn · 2023-03-07T06:07:11Z

Ope - looks like 7.2.10 isn't published yet. Will wait for that.

Edit: installed from main branch and worked great 👌

ndaidong · 2023-03-07T06:13:48Z

@kylealwyn yes, v7.2.10 has just been published. And it works for themerkle.com too ✔️

ndaidong added a commit that referenced this issue Mar 7, 2023

v7.2.10

3aa1d8f

- Fix issue #331 - Update dependencies - Remove unnecessary watermark

ndaidong mentioned this issue Mar 7, 2023

v7.2.10 #332

Merged

ndaidong closed this as completed Mar 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Obscure parsing error on cheezburger.com #331

Obscure parsing error on cheezburger.com #331

kylealwyn commented Mar 7, 2023

ndaidong commented Mar 7, 2023 •

edited

Loading

ndaidong commented Mar 7, 2023

kylealwyn commented Mar 7, 2023

kylealwyn commented Mar 7, 2023 •

edited

Loading

ndaidong commented Mar 7, 2023

Obscure parsing error on cheezburger.com #331

Obscure parsing error on cheezburger.com #331

Comments

kylealwyn commented Mar 7, 2023

ndaidong commented Mar 7, 2023 • edited Loading

ndaidong commented Mar 7, 2023

kylealwyn commented Mar 7, 2023

kylealwyn commented Mar 7, 2023 • edited Loading

ndaidong commented Mar 7, 2023

ndaidong commented Mar 7, 2023 •

edited

Loading

kylealwyn commented Mar 7, 2023 •

edited

Loading