VeloC and MPI IO #27

Closed
denisbertini opened this issue Mar 5, 2020 · 9 comments

Comments

@denisbertini

Hi,
Just a beginner question: is it possible to adapt a code that writes its checkpoint files
with MPI collective I/O so that it uses VeloC?
Thanks,
Denis

@gonsie
Member

gonsie commented Mar 24, 2020

Hey @denisbertini

You might be able to use the file-based mode that VeloC offers. You call VELOC_Route_file with the file path that you would normally write to, then write to the path it passes back instead. VeloC then manages the files: it moves them around and provides fault tolerance.
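
The routing step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not a definitive implementation: `USE_VELOC` is a hypothetical build flag, `route_path` and `write_checkpoint` are made-up helper names, and the two-argument form of `VELOC_Route_file` (original path in, routed path out) is assumed to match the description above. Some releases may declare it differently, so check the installed `veloc.h`. Without the flag the sketch writes to the original path, so it builds with the C standard library alone.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

#ifdef USE_VELOC                 /* hypothetical build flag for this sketch */
#include <veloc.h>
#define CKPT_PATH_LEN VELOC_MAX_NAME
#else
#define CKPT_PATH_LEN 1024
#endif

/* Resolve the path to write to. Returns 0 on success, -1 on failure. */
static int route_path(const char *original, char *routed) {
    assert(original != NULL && routed != NULL);
#ifdef USE_VELOC
    /* Assumed form: original path in, routed path out. */
    return VELOC_Route_file(original, routed) == VELOC_SUCCESS ? 0 : -1;
#else
    /* Without VeloC, write to the original path unchanged. */
    if (strlen(original) >= CKPT_PATH_LEN) return -1;
    strcpy(routed, original);
    return 0;
#endif
}

/* Write one checkpoint of `count` doubles. Returns 0 on success, -1 on failure. */
static int write_checkpoint(const char *name, int version, const char *original,
                            const double *data, size_t count) {
    char routed[CKPT_PATH_LEN];
    int ok;
#ifdef USE_VELOC
    /* Let any previous asynchronous flush finish, then open a new version. */
    if (VELOC_Checkpoint_wait() != VELOC_SUCCESS) return -1;
    if (VELOC_Checkpoint_begin(name, version) != VELOC_SUCCESS) return -1;
#else
    (void)name;
    (void)version;
#endif
    ok = (route_path(original, routed) == 0);
    if (ok) {
        /* Any I/O call that takes a path can be used here. */
        FILE *f = fopen(routed, "wb");
        ok = (f != NULL);
        if (f != NULL) {
            if (fwrite(data, sizeof *data, count, f) != count) ok = 0;
            if (fclose(f) != 0) ok = 0;
        }
    }
#ifdef USE_VELOC
    /* Report whether the local write succeeded; VeloC takes over from here. */
    if (VELOC_Checkpoint_end(ok) != VELOC_SUCCESS) return -1;
#endif
    return ok ? 0 : -1;
}
```

A caller would invoke `write_checkpoint("sim", step, "sim.ckpt", field, n)` between `VELOC_Init` and `VELOC_Finalize`. The only change to existing I/O code is that it opens `routed` rather than the original path.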

@denisbertini
Author

Hi @gonsie,
Thanks a lot, I will try to use VeloC for the checkpointing I/O of our plasma physics simulation.
It uses MPI-IO extensively; I hope to get it running. Do you know of any examples I can look at?
Thanks,
Denis

@gonsie
Member

gonsie commented Mar 25, 2020

The only examples I know of are provided in the documentation (at the bottom of the page).

@denisbertini
Author

These examples use fwrite/fread, but I suppose one can substitute the corresponding MPI-IO functions (MPI_File_write/MPI_File_read) in my case...
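
A minimal sketch of that substitution, under stated assumptions: `USE_MPI_IO` is a hypothetical build flag, and `write_doubles` and `read_doubles` are made-up helper names. The MPI branch opens the file on `MPI_COMM_SELF`, on the assumption that each rank writes its own file rather than one shared, collectively written file. Whether a collective write to a single shared file fits VeloC's file-based mode is not settled in this thread. The MPI branch also expects `MPI_Init` to have been called already.

```c
#include <assert.h>
#include <limits.h>
#include <stdio.h>

#ifdef USE_MPI_IO                /* hypothetical build flag for this sketch */
#include <mpi.h>
#endif

/* Write `count` doubles to `path`. Returns 0 on success, -1 on failure. */
static int write_doubles(const char *path, const double *data, size_t count) {
    assert(path != NULL && data != NULL);
    if (count > INT_MAX) return -1;          /* MPI element counts are int */
#ifdef USE_MPI_IO
    MPI_File fh;
    /* Per-rank file: every rank opens its own path independently. */
    if (MPI_File_open(MPI_COMM_SELF, path, MPI_MODE_CREATE | MPI_MODE_WRONLY,
                      MPI_INFO_NULL, &fh) != MPI_SUCCESS) return -1;
    int rc = MPI_File_write(fh, data, (int)count, MPI_DOUBLE, MPI_STATUS_IGNORE);
    if (MPI_File_close(&fh) != MPI_SUCCESS) return -1;
    return rc == MPI_SUCCESS ? 0 : -1;
#else
    FILE *f = fopen(path, "wb");
    if (f == NULL) return -1;
    int ok = (fwrite(data, sizeof *data, count, f) == count);
    if (fclose(f) != 0) ok = 0;
    return ok ? 0 : -1;
#endif
}

/* Read `count` doubles from `path`. Returns 0 on success, -1 on failure. */
static int read_doubles(const char *path, double *data, size_t count) {
    assert(path != NULL && data != NULL);
    if (count > INT_MAX) return -1;
#ifdef USE_MPI_IO
    MPI_File fh;
    MPI_Status st;
    int got = 0;
    if (MPI_File_open(MPI_COMM_SELF, path, MPI_MODE_RDONLY,
                      MPI_INFO_NULL, &fh) != MPI_SUCCESS) return -1;
    int rc = MPI_File_read(fh, data, (int)count, MPI_DOUBLE, &st);
    /* A successful call may still return fewer elements than requested. */
    if (rc == MPI_SUCCESS) MPI_Get_count(&st, MPI_DOUBLE, &got);
    if (MPI_File_close(&fh) != MPI_SUCCESS) return -1;
    return (rc == MPI_SUCCESS && got == (int)count) ? 0 : -1;
#else
    FILE *f = fopen(path, "rb");
    if (f == NULL) return -1;
    int ok = (fread(data, sizeof *data, count, f) == count);
    if (fclose(f) != 0) ok = 0;
    return ok ? 0 : -1;
#endif
}
```

Both helpers take a plain path, so either branch can be handed whatever file name the checkpointing layer provides.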

@bnicolae
Contributor

Is there a particular reason why you need MPI-IO? It should be easier for you to protect the memory regions directly instead of writing them into a file using MPI-IO.

@denisbertini
Author

Well, the program I use already relies on MPI-IO for checkpointing and for dumping the data files.
Is there some limitation in this case, I mean in combining MPI-IO and VeloC?

@denisbertini
Author

When you say "protecting the memory regions directly", do you mean what is done in the heatdis_mem example?

@bnicolae
Contributor

bnicolae commented Jun 1, 2020

Yes, this is what I mean. If you are using MPI-IO, then you are already writing to a parallel file system so there is no point in using VELOC. The idea of using VELOC is to checkpoint asynchronously and avoid paying for expensive I/O (which you do if you wait for MPI-IO to finish).
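
The memory-based approach can be sketched roughly as follows, in the spirit of the heatdis_mem example. Treat it as an illustration under assumptions: `USE_VELOC` is a hypothetical build flag, `sim_state` and the three helpers are made-up names, and the VeloC calls (`VELOC_Mem_protect`, `VELOC_Restart_test`, `VELOC_Restart`, `VELOC_Checkpoint`) are used as I understand them from the API documentation, so verify them against the installed `veloc.h`. Without the flag the helpers do no checkpointing, which keeps the sketch buildable on its own.

```c
#include <assert.h>
#include <stddef.h>

#ifdef USE_VELOC                 /* hypothetical build flag for this sketch */
#include <veloc.h>
#endif

/* Hypothetical application state: a step counter plus one field array. */
typedef struct {
    int step;
    double *field;
    size_t n;
} sim_state;

/* Register the state once, after VELOC_Init. Returns 0 on success, -1 on failure. */
static int protect_state(sim_state *s) {
    assert(s != NULL);
#ifdef USE_VELOC
    if (VELOC_Mem_protect(0, &s->step, 1, sizeof(int)) != VELOC_SUCCESS) return -1;
    if (VELOC_Mem_protect(1, s->field, s->n, sizeof(double)) != VELOC_SUCCESS) return -1;
#endif
    return 0;
}

/* Restore the newest checkpoint if one exists.
 * Returns the step to resume from, or -1 if a restart was attempted and failed. */
static int restore_state(const char *name, sim_state *s) {
    assert(s != NULL);
#ifdef USE_VELOC
    int v = VELOC_Restart_test(name, 0);     /* 0: look for the latest version */
    if (v > 0 && VELOC_Restart(name, v) != VELOC_SUCCESS) return -1;
#else
    (void)name;
#endif
    return s->step;   /* overwritten by the restart if one took place */
}

/* Checkpoint every `interval` steps, using the step number as the version.
 * Returns 1 if a checkpoint was requested, 0 if skipped, -1 on failure. */
static int maybe_checkpoint(const char *name, const sim_state *s, int interval) {
    assert(s != NULL);
    if (interval <= 0 || s->step == 0 || s->step % interval != 0) return 0;
#ifdef USE_VELOC
    return VELOC_Checkpoint(name, s->step) == VELOC_SUCCESS ? 1 : -1;
#else
    (void)name;
    return 1;         /* a VeloC build would have checkpointed here */
#endif
}
```

The application never opens a checkpoint file in this mode. It registers its buffers once with `protect_state`, then calls `maybe_checkpoint` inside the time loop.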

@bnicolae
Contributor

bnicolae commented Jun 1, 2020

Also, please note that we have a mailing list you can subscribe to: veloc-users@lists.mcs.anl.gov.
This is the right place to discuss such considerations. We use github issues primarily for bug reports.

@bnicolae closed this as completed Jun 1, 2020