graph storage format #215

jovo · 2016-02-05T20:44:55Z

i believe we decided on the new graph storage format was a pair of files

edge list
json file, with some spec that disa/will would decide upon and then sign off on?

@icoming @disa-mhembere @WillGray @gkiar

wrgr · 2016-02-05T20:52:10Z

Thank you for the reminder. @disa-mhembere - do you want to have a quick call? Or chat on Monday? @gkiar - arguably this solves our current ndmg issue?

Do we have a spec for an attributed edge list somewhere? I like roughly speaking:
(n+1)x(m+2) csv file, where n are the number of edges and m are the attributes (optional). The header row and vertices result in the final (n+1)(m+2) size.

I think Disa has a format already implemented that converts nicely to/from graphml and other formats.

For JSON spec, would be nice to have node ids, node names, edge ids, edge names, and meta data about the graph, to start.

Row 1 -> header
Row 2:n -> vertex1, vertex2, attribute1, atribute2, atribute3,...

disa-mhembere · 2016-02-06T01:53:49Z

This is almost what I had in mind. I think we should chat on monday after
the meeting. Sound ok?

On Fri, Feb 5, 2016 at 3:52 PM, William Gray notifications@github.com
wrote:

Thank you for the reminder. @disa-mhembere
https://github.com/disa-mhembere - do you want to have a quick call? Or
chat on Monday? @gkiar https://github.com/gkiar - arguably this solves
our current ndmg issue?

Do we have a spec for an attributed edge list somewhere? I like roughly
speaking:
(n+1)x(m+2) csv file, where n are the number of edges and m are the
attributes (optional). The header row and vertices result in the final
(n+1)(m+2) size.

I think Disa has a format already implemented that converts nicely to/from
graphml and other formats.

For JSON spec, would be nice to have node ids, node names, edge ids, edge
names, and meta data about the graph, to start.

Row 1 -> header
Row 2:n -> vertex1, vertex2, attribute1, atribute2, atribute3,...

—
Reply to this email directly or view it on GitHub
neurodata/m2g#215 (comment).

jovo · 2016-02-07T02:46:03Z

yup

gkiar · 2016-02-10T00:39:53Z

cross referenced in ndmg

jovo · 2016-03-23T02:36:33Z

i believe this is finalized.
are we doing it now?
can i see an example?

for the DARPA talk on 4/4, would i be able to see some benchmarks for this?
eg, speed reading/writing, compression reading/writing?
or is that not interesting?

disa-mhembere · 2016-03-23T03:14:49Z

No unfortunately, I have not had time to complete all the interfaces to make this happen. I have parallel ingests working, but nothing is tested on the live services. This will be completed only after supercomputing

wrgr closed this as completed Feb 5, 2016

wrgr reopened this Feb 5, 2016

jovo assigned disa-mhembere Feb 7, 2016

jovo added this to the 2016_02_15 milestone Feb 7, 2016

This was referenced Feb 10, 2016

graph format neurodata/m2g#37

Closed

Discuss and document storage plan for graphs #189

Closed

jovo modified the milestones: 2016_04_11, 2016_02_15 Mar 23, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

graph storage format #215

graph storage format #215

jovo commented Feb 5, 2016

wrgr commented Feb 5, 2016

disa-mhembere commented Feb 6, 2016

jovo commented Feb 7, 2016

gkiar commented Feb 10, 2016

jovo commented Mar 23, 2016

disa-mhembere commented Mar 23, 2016

graph storage format #215

graph storage format #215

Comments

jovo commented Feb 5, 2016

wrgr commented Feb 5, 2016

disa-mhembere commented Feb 6, 2016

jovo commented Feb 7, 2016

gkiar commented Feb 10, 2016

jovo commented Mar 23, 2016

disa-mhembere commented Mar 23, 2016