New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error while using counters in HPX #585
Comments
It's very difficult for me to guess what happens. Thus, I did add more error reporting around this particular problem which should give us more insights of what actually goes wrong for you (see c05ffa6). May I ask you to check out my changes, recompile, and report back with whatever error messages you might see? |
One of our recent commits (see 5a6f43a) might have fixed the issue which was causing your trouble. I'd be interested in seeing whether things do work for you now. |
Hi Hartmut, {stack-trace}: 14 frames: Program received signal SIGABRT, Aborted.
Quit anyway? (y or n) y
On Oct 28, 2012, at 3:30 PM, Hartmut Kaiser notifications@github.com wrote:
^C^C^C ^C^C^C Script done on Mon 29 Oct 2012 04:39:13 PM PDT |
Sameer, unfortunately you cut out the important parts of the error output. Please run again (outside of gdb) and send us the whole output you get. Additionally, may I ask you to add the command line options Could you do an independent run while adding Thanks! |
Hi Hartmut, On Oct 29, 2012, at 5:31 PM, Hartmut Kaiser notifications@github.com wrote:
{stack-trace}: 14 frames: Abort (core dumped) (replace '*' below with the appropriate sequence number)/agas/count/allocate Script done on Mon 29 Oct 2012 05:34:23 PM PDT |
Thanks Sameer. I think I understand what is going on. You're using tcsh which messes with the '{' and '}' in the countername. Could you try to quote the countername or otherwise escape the special characters, please? |
Hi Hartmut, On Oct 29, 2012, at 5:45 PM, Hartmut Kaiser notifications@github.com wrote:
|
Hi,
I am using the HPX (git head) and I can run the code properly on our cluster without counters. When I use counters, I get an error and a stack trace. Matt Anderson does not see this problem with the same binary. What could be going wrong? I see:
{stack-trace}: 10 frames:
0x7f5e6597568c : hpx::detail::backtrace() + 0x5c in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e659983f6 : boost::exception_ptr hpx::detail::get_exceptionhpx::exception(hpx::exception const&, std::string const&, std::string const&, long) + 0x46 in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e659986ba : void hpx::detail::throw_exceptionhpx::exception(hpx::exception const&, std::string const&, std::string const&, long) + 0x1a in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e65cf09db : hpx::util::query_counters::find_counters() + 0x33b in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e65cf0d40 : hpx::util::query_counters::start() + 0x10 in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e65b1359b : hpx::components::server::runtime_support::call_startup_functions(bool) + 0x5b in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e65b29b6a : ??? + 0x7f5e65b29b6a in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e65c1b1e3 : ??? + 0x7f5e65c1b1e3 in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
0x7f5e65c09519 : ??? + 0x7f5e65c09519 in /usr/local/packages/HPX/apps/hpx_matt_boost_1510.un/lib/hpx/libhpx.so.0
{env}: 87 entries:
BOOST_ROOT=/usr/local/packages/HPX/apps/boost_1_51_0/
CMAKE=/usr/local/packages/cmake/2.8.6
CPLUS_INCLUDE_PATH=/usr/local/packages/gcc/4.6.3/include
CTBIN=/opt/clustertest/bin
CTLOGS=/opt/clustertest/logs
CTTEST=/opt/clustertest/logs/under_test
C_INCLUDE_PATH=/usr/local/packages/gcc/4.6.3/include
DISPLAY=localhost:40.0
DYNINSTAPI_RT_LIB=/usr/local/packages/dyninstAPI-7.0.2/x86_64-unknown-linux2.4/lib/libdyninstAPI_RT.so.7
DYNINST_ROOT=/usr/local/packages/dyninstAPI-7.0.2
ENVIRONMENT=BATCH
GCC=/usr/local/packages/gcc/4.6.3
GMP=/usr/local/packages/gmp/5.0.5
GROUP=cas_cis
G_BROKEN_FILENAMES=1
HDF5_CPP=/usr/local/packages/HPX/apps/hdf5-1.8.9_gcc-4.6.3
HDF5_FORTRAN_ROOT=/usr/local/packages/HPX/apps/hdf5-1.8.9_gcc-4.6.3
HDF5_ROOT=/usr/local/packages/HPX/apps/hdf5-1.8.9_gcc-4.6.3
HOME=/home3/sameer
HOST=hn1
HOSTNAME=hn1
HOSTTYPE=x86_64-linux
INFOPATH=/usr/local/packages/gmp/5.0.5/share/info:/usr/local/packages/mpfr/3.1.0/share/info:/usr/local/packages/mpc/0.9-2/share/info:/usr/local/packages/gcc/4.6.3/info
LANG=en_US.UTF-8
LD_DYNAMIC_WEAK=1
LD_LIBRARY_PATH=/ibrix/home3/sameer/apps/instr/mm/mylib:/usr/local/packages/gmp/5.0.5/lib:/usr/local/packages/mpfr/3.1.0/lib:/usr/local/packages/mpc/0.9-2/lib:/usr/local/packages/gcc/4.6.3/lib64:/usr/local/packages/gcc/4.6.3/lib:/home3/sameer/tau2//x86_64/lib:/usr/local/packages/libdwarf-20111030/lib:/usr/local/packages/dyninstAPI-7.0.2/x86_64-unknown-linux2.4/lib
LESSOPEN=|/usr/bin/lesspipe.sh %s
LOADEDMODULES=cmake/2.8.6:gcc/4.6.3
LOGNAME=sameer
LS_COLORS=rs=0:di=01;34:ln=01;36:mh=00:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=40;31;01:mi=01;05;37;41:su=37;41:sg=30;43:ca=30;41:tw=30;42:ow=34;42:st=37;44:ex=01;32:.tar=01;31:.tgz=01;31:.arj=01;31:.taz=01;31:.lzh=01;31:.lzma=01;31:.tlz=01;31:.txz=01;31:.zip=01;31:.z=01;31:.Z=01;31:.dz=01;31:.gz=01;31:.lz=01;31:.xz=01;31:.bz2=01;31:.tbz=01;31:.tbz2=01;31:.bz=01;31:.tz=01;31:.deb=01;31:.rpm=01;31:.jar=01;31:.rar=01;31:.ace=01;31:.zoo=01;31:.cpio=01;31:.7z=01;31:.rz=01;31:.jpg=01;35:.jpeg=01;35:.gif=01;35:.bmp=01;35:.pbm=01;35:.pgm=01;35:.ppm=01;35:.tga=01;35:.xbm=01;35:.xpm=01;35:.tif=01;35:.tiff=01;35:.png=01;35:.svg=01;35:.svgz=01;35:.mng=01;35:.pcx=01;35:.mov=01;35:.mpg=01;35:.mpeg=01;35:.m2v=01;35:.mkv=01;35:.ogm=01;35:.mp4=01;35:.m4v=01;35:.mp4v=01;35:.vob=01;35:.qt=01;35:.nuv=01;35:.wmv=01;35:.asf=01;35:.rm=01;35:.rmvb=01;35:.flc=01;35:.avi=01;35:.fli=01;35:.flv=01;35:.gl=01;35:.dl=01;35:.xcf=01;35:.xwd=01;35:.yuv=01;35:.cgm=01;35:.emf=01;35:.axv=01;35:.anx=01;35:.ogv=01;35:.ogx=01;35:.aac=01;36:.au=01;36:.flac=01;36:.mid=01;36:.midi=01;36:.mka=01;36:.mp3=01;36:.mpc=01;36:.ogg=01;36:.ra=01;36:.wav=01;36:.axa=01;36:.oga=01;36:.spx=01;36:.xspf=01;36:
MACHTYPE=x86_64
MAIL=/var/spool/mail/sameer
MANPATH=/usr/local/packages/gcc/4.6.3/share/man:/usr/local/packages/cmake/2.8.6/man:/opt/torque/man:/usr/local/share/man:/usr/share/man
MODULEPATH=/usr/local/packages/Modules/modulefiles
MODULESHOME=/usr/share/Modules
MPC=/usr/local/packages/mpc/0.9-2
MPFR=/usr/local/packages/mpfr/3.1.0
OMP_NUM_THREADS=4
OSTYPE=linux
PATH=/usr/local/packages/intel/composer_xe_2011_sp1/pkg_bin/intel64:/usr/local/packages/gcc/4.6.3/bin:/usr/local/packages/cmake/2.8.6/bin:/usr/local/packages/openshmem/bin:/usr/local/packages/msmpi/jcl/Bin:/usr/local/packages/mingw64/bin:/home3/sameer/tau2//x86_64/bin:/usr/lib64/qt-3.3/bin:/opt/torque/bin:/usr/local/bin:/bin:/usr/bin:/opt/clustertest/bin
PBS_ENVIRONMENT=PBS_BATCH
PBS_GPUFILE=/opt/torque/aux//186478.hn1gpu
PBS_JOBCOOKIE=1A9F65C8ADDF0ADE59464FDC26D8AF06
PBS_JOBID=186478.hn1
PBS_JOBNAME=STDIN
PBS_MOMPORT=15003
PBS_NODEFILE=/opt/torque/aux//186478.hn1
PBS_NODENUM=0
PBS_NUM_NODES=2
PBS_NUM_PPN=1
PBS_O_HOME=/home3/sameer
PBS_O_HOST=hn1
PBS_O_INITDIR=/home3/sameer
PBS_O_LANG=en_US.UTF-8
PBS_O_LOGNAME=sameer
PBS_O_MAIL=/var/spool/mail/sameer
PBS_O_PATH=/usr/local/packages/intel/composer_xe_2011_sp1/pkg_bin/intel64:/usr/local/packages/gcc/4.6.3/bin:/usr/local/packages/cmake/2.8.6/bin:/usr/local/packages/openshmem/bin:/usr/local/packages/msmpi/jcl/Bin:/usr/local/packages/mingw64/bin:/home3/sameer/tau2//x86_64/bin:/usr/lib64/qt-3.3/bin:/opt/torque/bin:/usr/local/bin:/bin:/usr/bin:/opt/clustertest/bin
PBS_O_QUEUE=gpu
PBS_O_SHELL=/bin/tcsh
PBS_O_WORKDIR=/home3/sameer
PBS_QUEUE=gpu
PBS_SERVER=hn1
PBS_TASKNUM=10
PBS_VERSION=TORQUE-2.5.12
PBS_VNODENUM=0
PBS_WALLTIME=86400
PKG=/usr/local/packages
PLATFORM=x86_64-unknown-linux2.4
PWD=/home3/sameer
QTDIR=/usr/lib64/qt-3.3
QTINC=/usr/lib64/qt-3.3/include
QTLIB=/usr/lib64/qt-3.3/lib
REMOTEHOST=dhcp-207.nic.uoregon.edu
SH=/usr/local/packages/tar/openshmem/examples
SHELL=/bin/tcsh
SHLVL=1
SSH_ASKPASS=/usr/libexec/openssh/gnome-ssh-askpass
SSH_AUTH_SOCK=/tmp/ssh-AjftL17708/agent.17708
SSH_CLIENT=128.223.202.207 54301 22
SSH_CONNECTION=128.223.202.207 54301 128.223.224.254 22
SSH_TTY=/dev/pts/3
T=/home3/sameer/tau2/
TAU_MAKEFILE=/home3/sameer/tau2//include/Makefile
TERM=xterm
USER=sameer
VENDOR=unknown
LMFILES=/usr/local/packages/Modules/modulefiles/cmake/2.8.6:/usr/local/packages/Modules/modulefiles/gcc/4.6.3
{what}: this future has not been initialized: HPX(broken_promise)
{locality-id}: 0
{hostname}: 10.0.1.70:7910
{process-id}: 23850
{function}: future::get
{file}: /home3/sameer/apps/hpx/hpx/lcos/future.hpp
{line}: 134
{os-thread}: 0
{thread-id}: 00007f5e673810b0
{thread-description}:
{version}: V0.9.5-trunk (AGAS: V2.1), Git:
{boost}: V1.51.0
{build-type}: release
{date}: Oct 24 2012 17:20:12
{platform}: linux
{compiler}: GNU C++ version 4.6.3
{stdlib}: GNU libstdc++ version 20120301
pbsdsh: task 0 exit status 262
I should add that Matt Anderson does not see these errors on our system when we both have the same set of modules and use the same binary. What else could affect this execution?
Thanks,
The text was updated successfully, but these errors were encountered: