-
Notifications
You must be signed in to change notification settings - Fork 68
fix the logic for setting stripe size and stripe count in the lustre … #400
Conversation
…fs module. Takes now also the MPI_Info object into consideration. Conflicts: ompi/mca/fs/lustre/fs_lustre_file_open.c
|
@edgargabriel do we have a test that I could run to check out this feature? |
|
the way I tested it was to take a simple MPI_File I/O test which creates a new file, run on it lustre. If you set the mca parameter e.g. mpirun --mca fs_lustre_stripe_size 8 -np x ./mytest and verify with lfs getstripe 'name of output file' you should see the the file has the stripe size that you set. I tested it on the Lustre Cray system in Stuttgart, and it seemed to work. If you want, I can provide a trivial MPI file I/O test code for that. |
|
simple enough. I'll try that approach. Odd that IU jenkins tests got triggered by my comment. |
|
maybe I should add a testcase to my testsuite, where we do that through an info object. That could be done semi-automatic in that case. |
|
btw, the real problem that this commit fixes was, that the previous code version did not use the system defaults if the user did not specify the parameters, we basically always overwrote the system settings. I suspect that this was one of the contributors to the read performance problem that Fujitsu observed. This fix should keep now the system settings if the user does not specify anything. |
|
Edgar, works for me if I use a stripe size thats a multilpe of 64KB. So I had to use I'm assuming your example with '8' left out a qualifier like 'M'. |
|
sorry, I think my example was meant to be stripe_count, where I force lustre to use 8 servers. Not the stripe size |
|
@jsquyres I'll let you approve and merge. |
fix the logic for setting stripe size and stripe count in the lustre …
|
@hppritcha and @jsquyres I found the issue that lead to the lustre problem, and I have a fix, the vast majority of tests pass now on lustre (except for the new file_delete test, need to double check). Should I file a new pr or issue, or is there a way to reopen this one? |
|
Make a new PR; this one has already been merged, so it's effectively done. |
…fs module. Takes now also the MPI_Info object into consideration.
Conflicts:
ompi/mca/fs/lustre/fs_lustre_file_open.c