Youtube: only favicon gets extracted #67
trieloff added a commit to trieloff/unfurl that referenced this issue on Jan 4, 2021:
This is a test for metadata in the body, as exposed by YouTube since the December 2020 update. Test for jacktuck#67.
trieloff added a commit to trieloff/unfurl that referenced this issue on Jan 4, 2021:
This change defers the early termination of the parser only if a title tag has been found in the head of the HTML. Fixes jacktuck#67.
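A minimal sketch of the behaviour this commit message describes, as I read it: the parser still stops at `</head>`, but only when a `title` was found inside the head; otherwise it keeps reading into the body. This is not the code from unfurl. `Meta`, `attr`, and `extract` are hypothetical names, and a regular expression stands in for the real streaming parser.

```typescript
// Hypothetical sketch, not unfurl's implementation. A regular expression
// stands in for the streaming HTML parser the library actually uses.
type Meta = { title?: string; favicon?: string; description?: string };

// Read a double-quoted attribute value out of a raw attribute string.
function attr(attrs: string, name: string): string | undefined {
  const m = new RegExp(`(?:^|\\s)${name}\\s*=\\s*"([^"]*)"`, "i").exec(attrs);
  return m ? m[1] : undefined;
}

function extract(html: string): Meta {
  const meta: Meta = {};
  const tag = /<(\/?)(head|title|meta|link)\b([^>]*)>/gi;
  let inHead = false;
  let titleSeenInHead = false;
  let m: RegExpExecArray | null;

  while ((m = tag.exec(html)) !== null) {
    const closing = m[1] === "/";
    const name = m[2].toLowerCase();
    const attrs = m[3];

    if (name === "head") {
      inHead = !closing;
      // Early termination: stop at </head> only if the head held a title.
      if (closing && titleSeenInHead) break;
      continue;
    }
    if (closing) continue;

    if (name === "title" && meta.title === undefined) {
      // The title text runs from the end of <title> to the next </title.
      const end = html.indexOf("</title", tag.lastIndex);
      if (end !== -1) {
        meta.title = html.slice(tag.lastIndex, end).trim();
        titleSeenInHead = inHead;
      }
    } else if (name === "link" && /\bicon\b/i.test(attr(attrs, "rel") ?? "")) {
      meta.favicon = attr(attrs, "href");
    } else if (name === "meta" && attr(attrs, "name") === "description") {
      meta.description = attr(attrs, "content");
    }
  }
  return meta;
}

// A page shaped like the one in this issue: only the favicon is in the head.
const youtubeLike =
  '<head><link rel="icon" href="/f.ico"></head>' +
  '<body><title>Video</title><meta name="description" content="desc"></body>';
console.log(extract(youtubeLike));
```

With a conventional page, where the `title` sits in the head, the loop stops at `</head>` and never looks at the body, so the optimisation is preserved for the common case.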
🎉 This issue has been resolved in version 5.2.1 🎉
The release is available on:
Your semantic-release bot 📦🚀
If it turns out title is often in the head but other meta is in the body, we could in the future just remove this optimisation altogether, or default to not having it and add an option flag for it.
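The opt-in variant suggested here could look roughly like the sketch below. `ParseOptions`, `stopAtHeadEnd`, and `shouldStopAtHeadEnd` are hypothetical names, not something unfurl exposes, and the sketch assumes the title check is kept when the flag is on.

```typescript
// Hypothetical option; unfurl does not document a flag with this name.
interface ParseOptions {
  stopAtHeadEnd?: boolean; // opt-in optimisation, off by default
}

// Decide, on reaching </head>, whether the parser may stop reading.
function shouldStopAtHeadEnd(opts: ParseOptions, titleSeenInHead: boolean): boolean {
  return opts.stopAtHeadEnd === true && titleSeenInHead;
}
```

Leaving the flag unset means the whole document is always read, which trades some speed for not missing metadata that a site places in the body.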
YouTube changed its HTML a month ago, and since then our tests (adobe/helix-embed#345) have been failing when verifying the output for YouTube.
The underlying issue is a combination of making the reasonable assumption that all metadata is in the `head` here:
unfurl/src/index.ts, lines 270 to 273 in db57429
and YouTube being above convention, standards, and reason:
(HTML reformatted and all `script` and `style` tags removed)
As you can see, most of the interesting metadata (even `title`) is outside the `head`.
I will submit a PR to address that.