An important part of any algorithm is the data it's working over and the data that it produces.
An important part of working with large scales of data is where the data is stored and how it's accessed.
The smqtk_dataprovider
module contains interfaces and plugins for various core data structures, allowing plugin implementations to decide where and how the underlying raw data should be stored and accessed.
This potentially allows algorithms to handle more data that would otherwise be feasible on a single machine.
The following are the core data representation interfaces included in this package.
- Note:
- It is required that implementations have a common serialization format so that they may be stored or transported by other structures in a general way without caring what the specific implementation is.
For this we require that all implementations be serializable via the
pickle
module functions.
.. autoclass:: smqtk_dataprovider.DataElement :members:
.. autoclass:: smqtk_dataprovider.DataSet :members:
.. autoclass:: smqtk_dataprovider.KeyValueStore :members: