Hello from bookmark-archiver! #20
Comments
Hi, I've looked into bookmark-archiver, and it is a great tool for archiving already-bookmarked collections from a variety of services.
Sure! You're also free to use any code from reminiscence. The code related to tagging/summarization using NLTK is highly modular, and you can use it as is in bookmark-archiver.
I do not use twitter, but I will certainly try to contact you via email to discuss various things related to archiving. You have a pretty good list for improving BA (warc, git, deep crawling, etc.), which I'm also interested in. I'm looking forward to having more interesting conversations with you on various archiving-related topics. You're also free to contact me via email.
That's great! Thanks! By the way, is there any reason to use dramatiq instead of celery? Reminiscence has a built-in task queue manager which is sufficient for regular users, but I think celery is a lot more reliable, which is why an option has been provided for using it.
Great! Email is fine too of course, although my git email usually goes to spam, you can use
Having used Celery at decent scale for several years, I found moving to dramatiq a breath of fresh air, and so far it has been much more reliable. Dramatiq has some guarantees that make sure it never loses tasks that Celery would routinely have dropped, and I find it easier to manage across multiple servers.
I'll close this issue to get it off your open issues list, but I look forward to chatting in the future!
pirate closed this Nov 23, 2018
Oh btw, @kanishka-linux, you should definitely file a PR to get Reminiscence added to https://github.com/iipc/awesome-web-archiving
I just got off the phone with Mark Graham @ archive.org too; he's super friendly and I'm sure would love to discuss Reminiscence with you. You should reach out if you're interested in joining the Archive.org circle of folk who build archiving tools!
Hey, thanks for the list. It is pretty useful.
That sounds great! Once I get some free time, I'll certainly try contacting them.
pirate commented Nov 23, 2018 (edited)
Hi! I maintain https://github.com/pirate/bookmark-archiver, and I just learned that this project exists today from HN & the LWN post about archiving sites!
You have a lot of good ideas in this repo, very similar to how I've been planning to improve bookmark-archiver in the coming months:
You're welcome to use any of the code from bookmark-archiver of course, and I may take inspiration from your repo as well for the UI and NLTK automatic tagging and summarization; we've had tickets open for that for a while.
Best of luck! Please hit me up on twitter (@theSquashSH) if you ever want to chat or cooperate on stuff. I just added a link to reminiscence at the bottom of the BA readme.
P.S. I may meet up with the author of the LWN article in Montreal at some point, I'll talk to him as well about Reminiscence.