Encode keys for records in mongoDB #492

pld · 2013-03-15T04:35:25Z

In the interests of saving space, we can encode the observation record keys instead of "[some column name]" we could use "1", etc.

This would also require mapping queries, to the coded keys, and mapping the coded keys to the column names on their way into a dframe (we can pass columns= in the dframe constructor).

This assumes we are not planning on moving to a column store (which seems to be true).

The text was updated successfully, but these errors were encountered:

pld · 2013-03-24T16:16:22Z

A couple more things:

this is a change to the way we store data that will require migrating or clearing any current databases
in observations we can remove the reserved key dataset_observation_id, it fulfills the same function as dataset_id
we can tie all our batch* functions closer to the Observation model, we do not expect anyone else to use these (with the potential exclusion of batch_read)
any encoding/decoding should happen at the batch level, to isolate it from the rest of the system (feels like a completely independent mix-in could be in order)

WIP here, modilabs:master...modilabs:encode-492

ghost assigned pld Mar 24, 2013

pld pushed a commit that referenced this issue Mar 25, 2013

ensure encoding/decoding is done properly in observations, closes #492

1be4b43

pld pushed a commit that referenced this issue Mar 29, 2013

ensure encoding/decoding is done properly in observations, closes #492

0281738

pld closed this as completed in ca5c746 Apr 11, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encode keys for records in mongoDB #492

Encode keys for records in mongoDB #492

pld commented Mar 15, 2013

pld commented Mar 24, 2013

Encode keys for records in mongoDB #492

Encode keys for records in mongoDB #492

Comments

pld commented Mar 15, 2013

pld commented Mar 24, 2013