Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

configuring warc capture #60

Closed
johnknapp opened this issue Jan 16, 2019 · 2 comments
Closed

configuring warc capture #60

johnknapp opened this issue Jan 16, 2019 · 2 comments

Comments

@johnknapp
Copy link

johnknapp commented Jan 16, 2019

I'm excited to adopt warcio in a project but I'm stuck.

Following the warcio warc write examples, the warc files I create do not contain styles, images, fonts, videos. However, the user experience on warcrecorder.io does contain those elements.

I'm not sure if I should be passing args, kwargs, params or a filter_function to achieve my desired result but examining the tests and source has left me without a clue!

I'd be very grateful if you could give me a hint or point me towards some samples.

@wumpus
Copy link
Collaborator

wumpus commented Jan 16, 2019

warcio isn't a browser and only captures exactly the url you asked for, not all of the stuff that a browser will access while displaying that url. You need to use much more complicated software like webrecorder.io if you want everything.

@johnknapp
Copy link
Author

thanks Greg

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants