Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adios_read_open_file seems to stall when not finding file #131

Closed
fouriaux opened this issue May 29, 2017 · 7 comments
Closed

adios_read_open_file seems to stall when not finding file #131

fouriaux opened this issue May 29, 2017 · 7 comments

Comments

@fouriaux
Copy link

fouriaux commented May 29, 2017

adios_read_open_file seems to stall when not finding file

I try to do a restart feature in my program: I want to check if a file success to open for reading variables and continue if not. However my program seems to block completely after adios_read_open_file fail (which should return NULL and continue).

I run my execution via this command line:

srun -n4 frozen_read 

If I run my program without srun, it is behaving properly: the adios_read_open_file call return NULL as expected. Please find below my code...

frozen_read.cc.txt
config.xml.txt
CMakeLists.txt

@pnorbert
Copy link
Contributor

pnorbert commented May 29, 2017 via email

@fouriaux
Copy link
Author

ok :-)
I forgot to mention about the error message that is returned:

ERROR: MPI open failed for not_existing.bp: 'File does not exist, error stack:
ADIOI_UFS_OPEN(69): File not_existing.bp does not exist'
ERROR: File open failed: not_existing.bp

@fouriaux
Copy link
Author

This proble does appears only with adios_read_open_file: adios_read_open function have correct behavior.

@pnorbert
Copy link
Contributor

Fixed this bug. Can you give it a try? Thanks.

@fouriaux
Copy link
Author

Thanks ! It is working 👍
However I got some remaining error messages when I disable Verbosity of the method:

srun -n4 ./1_frozen_read
Invalid error code (269009701) (error ring index 1 invalid)
Invalid error code (269009701) (error ring index 1 invalid)
Invalid error code (269009701) (error ring index 1 invalid)
Invalid error code (269009701) (error ring index 1 invalid)
Invalid error code (269009701) (error ring index 1 invalid)
Invalid error code (269009701) (error ring index 1 invalid)

I am not sure if it is related..

@pnorbert
Copy link
Contributor

I cannot reproduce this message. You meant setting "verbose=0" in adios_read_init_method(), right? My run is quiet in that case.

@fouriaux
Copy link
Author

yes for setting verbose=0.
Ok, so it is probably something with my configuration, I will create a new ticket if I don't find something on that.
Thanks a lot.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants