I started looking into how this could be done technically. My first idea is to fork an existing open-source clipper extension and adapt it to ArchiveBox's needs, e.g. https://github.com/go-shiori/shiori-web-ext
Regarding the upload, I think the best way would be to allow ArchiveBox to import WARCs (also see #160). Then, perhaps, an extension like https://github.com/machawk1/warcreate could be used unchanged, or with only a minimal change to upload the WARC automatically.
In the meantime, as a workaround if you urgently need this: any files placed into the snapshot folder (./archive/<timestamp>/) will be respected by ArchiveBox. So if you have any external WARC, PNG, PDF, etc. files, you can drag them into the snapshot folder manually or write a small script to place them there.
If you overwrite the existing files, or use the default names ArchiveBox uses, it will even display them properly in the UI as part of the snapshot.
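A small script for that workaround might look roughly like the sketch below. It assumes the snapshot folder already exists under the archive root; `add_to_snapshot` and its parameters are hypothetical names for illustration, not part of ArchiveBox itself.

```python
import shutil
from pathlib import Path


def add_to_snapshot(archive_root, timestamp, source, dest_name=None):
    """Copy an externally captured file into <archive_root>/<timestamp>/.

    Hypothetical helper, not an ArchiveBox API. Pass dest_name to store the
    file under a different name (for example, to overwrite an existing output).
    """
    snapshot_dir = Path(archive_root) / str(timestamp)
    if not snapshot_dir.is_dir():
        # Refuse to invent a snapshot folder that ArchiveBox did not create.
        raise FileNotFoundError(f"no snapshot folder at {snapshot_dir}")

    source = Path(source)
    target = snapshot_dir / (dest_name or source.name)
    shutil.copy2(source, target)  # copies contents and file metadata
    return target
```

For example, `add_to_snapshot("./archive", "<timestamp>", "page.warc.gz")` would copy a WARC saved by a browser extension into that snapshot's folder, keeping its original file name.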
I try to respect the UNIX "everything is a file" mentality, and may even move towards supporting more pure filesystem-based manipulation of the archives in future releases.
Type
What is the problem that your feature request solves
Being able to clip webpage contents that are hard to fetch using ArchiveBox (captchas, datacenter IP blocks, authentication).
Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes
Evernote Web Clipper did it perfectly.
What hacks or alternative solutions have you tried to solve the problem?
Using Evernote Web Clipper for pages that ArchiveBox cannot archive. Tried Joplin and it seems to do the job too.
How badly do you want this new feature?