Feature request: support export of archive content #1

Open
nxg opened this issue Nov 22, 2010 · 7 comments


@nxg

nxg commented Nov 22, 2010

It's the obvious feature request:

If you were able to provide a way of exporting the content of a google group -- just the dump of all the messages, nothing fancy -- I'm sure you would be a very popular person, worldwide!

(why is it so hard...!?)

Norman

@blanchardjeremy
Owner

Thanks for the request!

I think this would be a bit harder to pull off, or at least it would have to be used with caution. Exporting tons of messages would basically pound their servers to death, because every message would involve a separate page load (or two or three if a thread spans multiple pages).

What formats would you like to see it in? Which details do you need about each post? Do you want the posts threaded the way google groups threads them?

@nxg
Author

nxg commented Nov 24, 2010

I would imagine this being used for occasional archiving dumps of groups, as a slightly paranoid backup, perhaps; or because a group has served its purpose and is being shut down; or because one wants to move a mailing list to a different service, and transfer the history from Google Groups.

For this sort of occasional use, it would be OK to throttle the process, retrieving only a message per second, or every few seconds.
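A minimal sketch of that kind of throttling, in Python purely for illustration (it is not code from this project). The `throttled_map` helper and the `fetch` callable are hypothetical names; `fetch` stands in for whatever actually loads one message page.

```python
import time


def throttled_map(fetch, items, min_interval=1.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Yield fetch(item) for each item, keeping at least
    min_interval seconds between the starts of consecutive calls.

    sleep and clock are injectable so the pacing can be tested
    without really waiting.
    """
    last_start = None
    for item in items:
        if last_start is not None:
            # Wait only for whatever is left of the interval.
            remaining = min_interval - (clock() - last_start)
            if remaining > 0:
                sleep(remaining)
        last_start = clock()
        yield fetch(item)
```

With `min_interval=1.0` this gives roughly one message per second; raising it to 3 or 5 gives the "every few seconds" variant.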

This is the sort of case that I'd imagine being handled by dataliberation.org, but there's no mention of Groups on the list of Google products there. If they have this sort of feature on their roadmap, that would be ideal, but they don't publish a roadmap (intelligibly).

@blanchardjeremy
Owner

dataliberation.org looks awesome. Thanks for that reference.

What format should the data be exported in? RSS? Atom? mbox (I'm not familiar with it)?

Does anyone know what format is popular for this kind of export?

@nxg
Author

nxg commented Nov 25, 2010

Any format would work. Atom or RSS would be nifty, but plain old mbox is the no-frills format into which I'd probably convert a feed for archiving.

mbox (http://en.wikipedia.org/wiki/Mbox) is a semi-standard. It's what mailers usually write out if they're asked to 'save raw email message' or something like that.
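To make the mbox option concrete, here is a rough sketch using Python's standard `mailbox` module. The layout of each post (a dict with `author`, `subject`, `date`, `message_id`, `in_reply_to` and `body` keys) is an assumption made for the example, not something Google Groups or this library provides in that shape, and `export_to_mbox` is a hypothetical name.

```python
import mailbox
from email.message import EmailMessage


def export_to_mbox(posts, path):
    """Append each post to an mbox file at path; return the count written.

    Each post is assumed to be a dict with 'author', 'subject',
    'date' (an RFC 2822 date string), 'message_id' and 'body' keys,
    plus an optional 'in_reply_to' key.
    """
    box = mailbox.mbox(path)  # creates the file if it does not exist
    written = 0
    try:
        for post in posts:
            msg = EmailMessage()
            msg["From"] = post["author"]
            msg["Subject"] = post["subject"]
            msg["Date"] = post["date"]
            msg["Message-ID"] = post["message_id"]
            if post.get("in_reply_to"):
                msg["In-Reply-To"] = post["in_reply_to"]
            msg.set_content(post["body"])
            box.add(msg)
            written += 1
    finally:
        box.close()  # flushes pending writes to disk
    return written
```

The resulting file can be opened by most mail clients, or read back with `mailbox.mbox(path)` for conversion into another format later.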

@blanchardjeremy
Owner

Hmm. Okay. I'd also like to investigate storing the threading of messages rather than just the flat messages. :)
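One possible way to keep the threading, sketched in Python for illustration only: if each exported post carries a message ID and (for replies) the ID of its parent, the flat list can be turned back into a reply tree. Whether Google Groups pages expose that parent information is an assumption here; the `message_id` and `in_reply_to` keys and the `build_threads` helper are hypothetical.

```python
from collections import defaultdict


def build_threads(posts):
    """Group a flat list of posts into nested reply trees.

    Each post is assumed to be a dict with a 'message_id' key and an
    optional 'in_reply_to' key naming its parent. Posts whose parent
    is missing from the export are treated as thread roots.
    """
    known_ids = {post["message_id"] for post in posts}
    children = defaultdict(list)
    roots = []
    for post in posts:
        parent = post.get("in_reply_to")
        if parent in known_ids:
            children[parent].append(post)
        else:
            roots.append(post)

    def attach(post):
        # Recursively nest the replies under their parent post.
        replies = [attach(child) for child in children[post["message_id"]]]
        return {"post": post, "replies": replies}

    return [attach(root) for root in roots]
```

Storing the parent ID alongside each flat message (for example as an `In-Reply-To` header in mbox) would let the threading be rebuilt this way after the export.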

@blanchardjeremy
Owner

Remove util.php requirement in basic_tests. Closed by 2f2d029.

@blanchardjeremy
Owner

Oops. Didn't mean to close this. Sorry!


2 participants