Zimit fails to retrieve videos on https://mesquartierschinois.wordpress.com/ #71

Closed
Popolechien opened this issue Dec 17, 2020 · 20 comments

@Popolechien
Contributor

Scraping a blog that embeds a few YouTube/Vimeo videos: none of them is retrieved. For YouTube videos I do get the thumbnail, but trying to play one gives a generic error message ("an error occurred, please try again later"). Vimeo embeds have no thumbnail and simply say "we detected a high number of errors from your connection. To continue please confirm you are a human (and not a spambot)".

@kelson42
Contributor

@Popolechien Can you please give a concrete example?

@Popolechien
Contributor Author

Sorry: same blog as in #70. Here are screenshots:

yt
yt2
vimeo

@kelson42 kelson42 changed the title Zimit fails to retrieve videos Zimit fails to retrieve videos on https://mesquartierschinois.wordpress.com/ Dec 17, 2020
@kelson42 kelson42 pinned this issue Dec 17, 2020
@rgaudin
Member

rgaudin commented Dec 17, 2020

Again:

  • Task ID/link so we can check logs (exact one) and retrieve ZIM
  • Link to online page corresponding to the issue (we can find in-zim one from it)

@Popolechien
Contributor Author

@rgaudin
Member

rgaudin commented Jan 14, 2021

There seems to be a problem with video handling in the scraper at the moment. I've opened an upstream ticket at webrecorder/browsertrix-crawler#4

@Popolechien we can't do much until this is resolved, but your screenshot might indicate an additional problem on Vimeo. We might have hit a quota or something.

@kelson42
Contributor

The upstream bug has been fixed; we need a new webcrawler release.

@stale

stale bot commented Apr 12, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will now be reviewed manually. Thank you for your contributions.

@stale stale bot added the stale label Apr 12, 2021
@kelson42
Contributor

@Popolechien Could you please verify the bug has been fixed?

@stale stale bot removed the stale label May 22, 2021
@Popolechien
Contributor Author

@kelson42 No idea since we've had another regression to cover it up. See 2661

@rgaudin
Member

rgaudin commented May 24, 2021

Can you test it with kiwix-serve? Or share the link to the new ZIM so I can check with kiwix-serve?

@kelson42
Contributor

kelson42 commented Jun 6, 2021

@Popolechien The ZIM file is no longer online, and your bug report on Kiwix Android is not conclusive for the moment. Would you be able to make a new one and test it with kiwix-serve, please?

@Popolechien
Contributor Author

I have no way to run kiwix-serve locally, so at best I can restart Zimit and send you the link?

@Popolechien
Contributor Author

@kelson42
Contributor

kelson42 commented Jun 7, 2021

Thx, still broken as far as I can see.

@rgaudin
Member

rgaudin commented Jun 7, 2021

Sorry to reply only after you tested, but as noted in the mentioned issue, the main issue is openzim/warc2zim#80, which is still open, so there was no reason for this to work.

@stale

stale bot commented Aug 22, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will now be reviewed manually. Thank you for your contributions.

@stale stale bot added the stale label Aug 22, 2021
@kelson42
Contributor

kelson42 commented Jan 5, 2022

@rgaudin Now that openzim/warc2zim#83 is done, what should we do with this ticket?

@stale stale bot removed the stale label Jan 5, 2022
@rgaudin
Member

rgaudin commented Jan 5, 2022

I suggest we close it. It might not work as wanted, but the behavior has probably changed, so we'll reopen once master is released/deployed.

@rgaudin rgaudin closed this as completed Jan 5, 2022
@kelson42
Contributor

kelson42 commented Jan 5, 2022

@Popolechien Counting on you to retest as soon as a new warc2zim release is done.

@kelson42 kelson42 unpinned this issue Jan 5, 2022
@kelson42 kelson42 added this to the 1.2.0 milestone Jun 11, 2022