Skip to content

[SYSTEMDS-206] HDF5 File Format#1299

Closed
fathollahzadeh wants to merge 20 commits intoapache:masterfrom
fathollahzadeh:sf-hdf5
Closed

[SYSTEMDS-206] HDF5 File Format#1299
fathollahzadeh wants to merge 20 commits intoapache:masterfrom
fathollahzadeh:sf-hdf5

Conversation

@fathollahzadeh
Copy link
Copy Markdown
Contributor

This commit is an implementation for HDF5 data format files. The HDF5 The file format specification is available from the HDF Group here

Currently, this patch is targeting the HDF5 format with supporting some features of the HDF5. The supported features:

  1. Two Dimension Data (Matrix)
  2. Matrix with FP64 (double) Data Type
  3. Single Dataset
  4. Single Group
  5. Contiguous Dataset
  6. Read/Write and SystemDS Parallel Feature

Copy link
Copy Markdown
Contributor

@Baunsgaard Baunsgaard left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few comments, I have issues with the syntax for @,,, , double array allocations, and the tests. but otherwise it looks fine.
I am unsure how much of the code is actually executed in the tests, maybe you want to see what parts are covered using jacoco?

most importantly the tests are missing verification of the actual values parsed and written to disk since the tests currently only verify that there are no crashes.

return bb;
}

public void toBuffer(H5BufferBuilder bb) {
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this method do nothing?

double[][] A = getRandomMatrix(rows, cols, 0, 10, 1, 714);
writeInputMatrixWithMTD("A", A, false);

runTest(true, false, null, -1);
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing verify of the written matrix.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

R scripts for read/write verification added

add r test to verify r/w
fix code style
…orts, ...

Added read/write verification test with R
Removed some MutableInt variables and replace them with primitive variables
Removed unused codes
@fathollahzadeh
Copy link
Copy Markdown
Contributor Author

A few comments, I have issues with the syntax for @,,, , double array allocations, and the tests. but otherwise it looks fine.
I am unsure how much of the code is actually executed in the tests, maybe you want to see what parts are covered using jacoco?

most importantly the tests are missing verification of the actual values parsed and written to disk since the tests currently only verify that there are no crashes.

Thanks for reviewing PR @Baunsgaard . I really appreciate your review and your comments.

I fixed most of them and pushed them in separate commits. Also, some short responses to resolve the comments.

@Baunsgaard
Copy link
Copy Markdown
Contributor

Thanks for the PR, i have now merged it.
while merging i @ignored a few of the write tests, since these took 1 minute, now instead they take 12sec.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants