Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RP (0.35.1) failing on archer #780

Closed
vivek-bala opened this issue Oct 1, 2015 · 49 comments
Closed

RP (0.35.1) failing on archer #780

vivek-bala opened this issue Oct 1, 2015 · 49 comments
Assignees
Labels
Milestone

Comments

@vivek-bala
Copy link
Contributor

I was able to RP scripts on archer this morning, but with no changes from the client side it now seems to fail. Could be some changes on archer (?).

Client side verbose:

Successful build: http://ci.radical-project.org/job/ExTASY-0.1/EXTASY_BRANCH=0.1,EXTASY_HOST=archer,EXTASY_WORKFLOW=cocoamber,PYTHON=System-CPython-2.7/lastBuild/console

Failed/Aborted build: http://ci.radical-project.org/job/ExTASY-0.1/EXTASY_BRANCH=0.1,EXTASY_HOST=archer,EXTASY_WORKFLOW=gromacslsdmap,PYTHON=System-CPython-2.7/121/console

agent.err:

ModuleCmd_Switch.c(172):ERROR:152: Module 'anaconda' is currently not loaded
which: no pip in (/opt/cray/llm/default/bin:/opt/cray/llm/default/etc:/opt/cray/lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1/sbin:/opt/cray/lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1/bin:/opt/cray/MySQL/5.0.64-1.0000.7096.23.2/sbin:/opt/cray/MySQL/5.0.64-1.0000.7096.23.2/bin:/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/sbin:/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/bin:/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/bin:/opt/cray/nodestat/2.2-1.0501.47138.1.78.ari/bin:/usr/local/packages/cse/quickstart/1.0:/home/y07/y07/cse/nano/2.2.6/bin:/usr/local/packages/cse/serialJobs:/usr/local/packages/cse/bolt/0.6/bin:/usr/local/packages/cse/checkDisk:/usr/local/packages/cse/checkQueue:/usr/local/packages/cse/checkScript:/usr/local/packages/cse/budgets:/opt/cray/mpt/7.1.1/gni/bin:/opt/pbs/12.2.401.141761/bin:/opt/cray/atp/1.7.5/bin:/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/bin:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/sbin:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/bin:/opt/cray/dvs/2.4_0.9.0-1.0501.1672.2.122.ari/bin:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/sbin:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/bin:/opt/cray/job/1.5.5-0.1_2.0501.48066.2.43.ari/bin:/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/bin:/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/bin:/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/bin:/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/bin:/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/bin:/opt/cray/cce/8.3.7/cray-binutils/x86_64-unknown-linux-gnu/bin:/opt/cray/cce/8.3.7/craylibs/x86-64/bin:/opt/cray/cce/8.3.7/cftn/bin:/opt/cray/cce/8.3.7/CC/bin:/opt/cray/craype/2.2.1/bin:/opt/cray/switch/1.0-1.0501.47124.1.93.ari/bin:/opt/cray/eslogin/eswrap/1.1.0-1.010400.915.0/bin:/opt/modules/3.2.10.2/bin:/usr/local/bin:/usr/bin:/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/sbin:/usr/sbin:.:/usr/lib/qt3/bin:/opt/cray/bin)
--------------------------------------------------------------------------------
This is a private computing facility. Access to this system is limited to those
who have been granted access by the operating service provider on behalf of the
issuing authority and use is restricted to the purposes for which access was
granted. All access and usage are governed by the terms and conditions of access
agreed to by all registered users and are thus subject to the provisions of the
Computer Misuse Act, 1990 under which unauthorised use is a criminal offence.

If you are not authorised to use this service you must disconnect immediately.
--------------------------------------------------------------------------------

python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
/fs4/e290/shared/shared_pilot_ve_20150429/bin/python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
mkdir: cannot create directory `////////radical//': Read-only file system
default_bootstrapper.sh: line 961: ////////radical//__init__.py: No such file or directory
default_bootstrapper.sh: line 962: ////////radical//__init__.py: No such file or directory
default_bootstrapper.sh: line 963: ////////radical//__init__.py: No such file or directory
default_bootstrapper.sh: line 964: ////////radical//__init__.py: No such file or directory
/fs4/e290/shared/shared_pilot_ve_20150429/bin/python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
/fs4/e290/shared/shared_pilot_ve_20150429/bin/python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
/fs4/e290/shared/shared_pilot_ve_20150429/bin/python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
/fs4/e290/shared/shared_pilot_ve_20150429/bin/python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory
python: error while loading shared libraries: libpython2.7.so.1.0: cannot open shared object file: No such file or directory

agent.out:

--------------------------------------------------------------------------------
*** vb224   Job: 3187804.sdb   started: 01/10/15 16:13:23   host: mom3 ***
*** vb224   Job: 3187804.sdb   started: 01/10/15 16:13:23   host: mom3 ***
*** vb224   Job: 3187804.sdb   started: 01/10/15 16:13:23   host: mom3 ***
*** vb224   Job: 3187804.sdb   started: 01/10/15 16:13:23   host: mom3 ***

--------------------------------------------------------------------------------
# -------------------------------------------------------------------
# Bootstrapper running on host: nid01921.
# Bootstrapper started as     : 'default_bootstrapper.sh -b radical.utils-0.35.tar.gz:saga-python-0.35.tar.gz:radical.pilot-0.35.1.tar.gz -c 24 -d 50 -g /work/e290/shared/shared_pilot_ve_20150429/ -j APRUN -k APRUN -l PBSPRO -m extasy-db.epcc.ed.ac.uk:27017 -n radicalpilot -o POPEN -p pilot.0000 -q CONTINUOUS -r 20 -s rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007 -t multicore -u use -v debug -a extasy:extasyproject -e module switch anaconda python-compute/2.7.6-ucs4 -f 10.60.0.52'
# Environment of bootstrapper process:
#
#
}
ASSEMBLER_X86_64=/opt/cray/cce/8.3.7/cray-binutils/x86_64-unknown-linux-gnu/bin/as
ATP_HOME=/opt/cray/atp/1.7.5
ATP_MRNET_COMM_PATH=/opt/cray/atp/1.7.5/bin/atp_mrnet_commnode_wrapper
ATP_POST_LINK_OPTS=-Wl,-L/opt/cray/atp/1.7.5/lib/
BASH_FUNC_module()=() {  eval `/opt/modules/3.2.10.2/bin/modulecmd bash $*`
BOLT_DIR=/usr/local/packages/cse/bolt/0.6
CC_X86_64=/opt/cray/cce/8.3.7/CC/x86-64
COLORTERM=1
CPU=x86_64
CRAY_ALPS_INCLUDE_OPTS=-I/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/include
CRAY_ALPS_POST_LINK_OPTS=-L/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/lib64
CRAY_BINUTILS_BIN=/opt/cray/cce/8.3.7/cray-binutils/x86_64-unknown-linux-gnu/bin
CRAY_BINUTILS_ROOT=/opt/cray/cce/8.3.7/cray-binutils
CRAY_BINUTILS_VERSION=/opt/cray/cce/8.3.7
CRAY_CC_VERSION=8.3.7
CRAY_CPU_TARGET=ivybridge
CRAY_DMAPP_INCLUDE_OPTS=-I/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/include -I/opt/cray/gni-headers/3.0-1.0501.8317.12.1.ari/include 
CRAY_DMAPP_POST_LINK_OPTS=-L/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/lib64
CRAY_FTN_VERSION=8.3.7
CRAY_GNI_HEADERS_INCLUDE_OPTS=-I/opt/cray/gni-headers/3.0-1.0501.8317.12.1.ari/include
CRAY_LD_LIBRARY_PATH=/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/lib64:/opt/cray/mpt/7.1.1/gni/mpich2-cray/83/lib:/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/lib64:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/lib64:/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/lib64:/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/lib64:/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/lib64:/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/lib64:/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/lib64:/opt/cray/libsci/13.0.1/CRAY/83/x86_64/lib:/opt/cray/cce/8.3.7/CC/x86-64/lib/x86-64:/opt/cray/cce/8.3.7/craylibs/x86-64
CRAY_LIBSCI_BASE_DIR=/opt/cray/libsci/13.0.1
CRAY_LIBSCI_DIR=/opt/cray/libsci/13.0.1
CRAY_LIBSCI_PREFIX_DIR=/opt/cray/libsci/13.0.1/CRAY/83/x86_64
CRAY_LIBSCI_VERSION=13.0.1
CRAYLIBS_X86_64=/opt/cray/cce/8.3.7/craylibs/x86-64
CRAY_LLM_DIR=/opt/cray/llm/default
CRAYLMD_LICENSE_FILE=/opt/cray/cce/cce.lic
CRAY_MPICH2_BASEDIR=/opt/cray/mpt/7.1.1/gni
CRAY_MPICH2_DIR=/opt/cray/mpt/7.1.1/gni/mpich2-cray/83
CRAY_MPICH2_ROOTDIR=/opt/cray/mpt/7.1.1
CRAY_MPICH2_VER=7.1.1
CRAYOS_VERSION=5.1.29
CRAYPE_DIR=/opt/cray/craype/2.2.1
CRAYPE_NETWORK_TARGET=aries
CRAY_PE_TARGET=x86-64
CRAYPE_VERSION=2.2.1
CRAY_PMI_INCLUDE_OPTS=-I/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/include
CRAY_PMI_POST_LINK_OPTS=-L/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/lib64
CRAY_PRE_COMPILE_OPTS=-hnetwork=aries
CRAY_PRGENVCRAY=loaded
CRAY_RCA_INCLUDE_OPTS=-I/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/include -I/opt/cray-hss-devel/7.1.0/include -I/opt/cray/krca/1.0.0-2.0501.47640.3.70.ari/include
CRAY_RCA_POST_LINK_OPTS=-L/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/lib64 -lrca
CRAY_SITE_LIST_DIR=/etc/opt/cray/modules
CRAY_UDREG_INCLUDE_OPTS=-I/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/include
CRAY_UDREG_POST_LINK_OPTS=-L/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/lib64
CRAY_UGNI_INCLUDE_OPTS=-I/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/include
CRAY_UGNI_POST_LINK_OPTS=-L/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/lib64
CRAY_XPMEM_INCLUDE_OPTS=-I/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/include
CRAY_XPMEM_POST_LINK_OPTS=-L/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/lib64
CSHEDIT=emacs
CVS_RSH=ssh
DMAPP_ABORT_ON_ERROR=1
DVS_INCLUDE_OPTS=-I/opt/cray/dvs/2.4_0.9.0-1.0501.1672.2.122.ari/include
DVS_VERSION=0.9.0
ENV=/etc/bash.bashrc
ENVIRONMENT=BATCH
EPCC_PE_RELEASE=/home/y07/y07/cse/cray-pe-release
ESWRAP_LOGIN=mom1
FFTW_SYSTEM_WISDOM_DIR=/opt/cray/libsci/13.0.1
FORTRAN_SYSTEM_MODULE_NAMES=ftn_lib_definitions
FROM_HEADER=
FTN_X86_64=/opt/cray/cce/8.3.7/cftn/x86-64
G_BROKEN_FILENAMES=1
GCC_X86_64=/opt/gcc/4.8.1/snos
G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-15,CP1252
HISTSIZE=1000
HOME=/home/e290/e290/vb224
HOST=eslogin003
HOSTNAME=eslogin003
HOSTTYPE=x86_64
INCLUDE_PATH_X86_64=/opt/cray/cce/8.3.7/craylibs/x86-64/include
INFODIR=/usr/local/info:/usr/share/info:/usr/info
INFOPATH=/usr/local/info:/usr/share/info:/usr/info
INPUTRC=/etc/inputrc
JAVA_BINDIR=/usr/lib64/jvm/jre/bin
JAVA_HOME=/usr/lib64/jvm/jre
JAVA_ROOT=/usr/lib64/jvm/jre
JRE_HOME=/usr/lib64/jvm/jre
LANG=en_US.UTF-8
LD_LIBRARY_PATH=/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/lib64:/opt/cray/job/1.5.5-0.1_2.0501.48066.2.43.ari/lib64
LESS_ADVANCED_PREPROCESSOR=no
LESSCLOSE=lessclose.sh %s %s
LESSKEY=/etc/lesskey.bin
LESS=-M -I
LESSOPEN=lessopen.sh %s
LIBRARYMODULES=/opt/modules/3.2.10.2/init/.librarymodules:acml:alps:cray-dwarf:cray-fftw:cray-ga:cray-hdf5:cray-hdf5-parallel:cray-libsci:cray-libsci_acc:cray-mpich:cray-mpich2:cray-mpich-abi:cray-netcdf:cray-netcdf-hdf5parallel:cray-parallel-netcdf:cray-petsc:cray-petsc-complex:cray-shmem:cray-tpsl:cray-trilinos:cudatoolkit:fftw:ga:hdf5:hdf5-parallel:iobuf:libfast:netcdf:netcdf-hdf5parallel:ntk:onesided:papi:petsc:petsc-complex:pmi:tpsl:trilinos:xt-libsci:xt-mpich2:xt-mpt:xt-papi:/etc/opt/cray/modules/site_librarymodules
LIBSCI_BASE_DIR=/opt/cray/libsci/13.0.1
LIBSCI_VERSION=13.0.1
LINKER_X86_64=/opt/cray/cce/8.3.7/cray-binutils/x86_64-unknown-linux-gnu/bin/ld
_LMFILES_=/opt/modulefiles/modules/3.2.10.2:/opt/modulefiles/eswrap/1.1.0-1.010400.915.0:/opt/cray/ari/modulefiles/switch/1.0-1.0501.47124.1.93.ari:/opt/cray/craype/default/modulefiles/craype-network-aries:/opt/cray/modulefiles/craype/2.2.1:/opt/modulefiles/cce/8.3.7:/opt/cray/modulefiles/cray-libsci/13.0.1:/opt/cray/ari/modulefiles/udreg/2.3.2-1.0501.7914.1.13.ari:/opt/cray/ari/modulefiles/ugni/5.0-1.0501.8253.10.22.ari:/opt/cray/ari/modulefiles/pmi/5.0.6-1.0000.10439.140.2.ari:/opt/cray/ari/modulefiles/dmapp/7.0.1-1.0501.8315.8.4.ari:/opt/cray/ari/modulefiles/gni-headers/3.0-1.0501.8317.12.1.ari:/opt/cray/ari/modulefiles/xpmem/0.1-2.0501.48424.3.3.ari:/opt/cray/ari/modulefiles/job/1.5.5-0.1_2.0501.48066.2.43.ari:/opt/cray/ari/modulefiles/csa/3.0.0-1_2.0501.47112.1.91.ari:/opt/cray/ari/modulefiles/dvs/2.4_0.9.0-1.0501.1672.2.122.ari:/opt/cray/ari/modulefiles/alps/5.1.1-2.0501.8507.1.1.ari:/opt/cray/ari/modulefiles/rca/1.0.0-2.0501.48090.7.46.ari:/opt/cray/modulefiles/atp/1.7.5:/opt/cray/modulefiles/PrgEnv-cray/5.1.29:/opt/modulefiles/pbs/12.2.401.141761:/opt/cray/craype/default/modulefiles/craype-ivybridge:/opt/cray/modulefiles/cray-mpich/7.1.1:/opt/modulefiles/packages-archer:/opt/modules/packages-archer/budgets/1.1:/opt/modules/packages-archer/checkScript/1.1:/opt/modules/packages-archer/checkQueue/1.0:/opt/modules/packages-archer/checkDisk/1.0:/opt/modules/packages-archer/bolt/0.6:/opt/modules/packages-archer/serialJobs/1.0:/opt/modules/packages-archer/nano/2.2.6:/opt/modules/packages-archer/leave_time/1.0.0:/opt/modules/packages-archer/quickstart/1.0:/opt/modules/packages-archer/epcc-tools/3.0:/opt/cray/ari/modulefiles/nodestat/2.2-1.0501.47138.1.78.ari:/opt/cray/ari/modulefiles/sdb/1.0-1.0501.48084.4.48.ari:/opt/cray/ari/modulefiles/alps/5.1.1-2.0501.8471.1.1.ari:/opt/cray/modulefiles/MySQL/5.0.64-1.0000.7096.23.2:/opt/cray/modulefiles/lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1:/opt/modulefiles/hss-llm/7.1.0:/opt/modulefiles/Base-opts/1.0.2-1.0501.47945.4.2.ari:/opt/modules/packages-archer/cse-compute-defaults/3.0
LOADEDMODULES=modules/3.2.10.2:eswrap/1.1.0-1.010400.915.0:switch/1.0-1.0501.47124.1.93.ari:craype-network-aries:craype/2.2.1:cce/8.3.7:cray-libsci/13.0.1:udreg/2.3.2-1.0501.7914.1.13.ari:ugni/5.0-1.0501.8253.10.22.ari:pmi/5.0.6-1.0000.10439.140.2.ari:dmapp/7.0.1-1.0501.8315.8.4.ari:gni-headers/3.0-1.0501.8317.12.1.ari:xpmem/0.1-2.0501.48424.3.3.ari:job/1.5.5-0.1_2.0501.48066.2.43.ari:csa/3.0.0-1_2.0501.47112.1.91.ari:dvs/2.4_0.9.0-1.0501.1672.2.122.ari:alps/5.1.1-2.0501.8507.1.1.ari:rca/1.0.0-2.0501.48090.7.46.ari:atp/1.7.5:PrgEnv-cray/5.1.29:pbs/12.2.401.141761:craype-ivybridge:cray-mpich/7.1.1:packages-archer:budgets/1.1:checkScript/1.1:checkQueue/1.0:checkDisk/1.0:bolt/0.6:serialJobs/1.0:nano/2.2.6:leave_time/1.0.0:quickstart/1.0:epcc-tools/3.0:nodestat/2.2-1.0501.47138.1.78.ari:sdb/1.0-1.0501.48084.4.48.ari:alps/5.1.1-2.0501.8471.1.1.ari:MySQL/5.0.64-1.0000.7096.23.2:lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1:hss-llm/7.1.0:Base-opts/1.0.2-1.0501.47945.4.2.ari:cse-compute-defaults/3.0
LOGNAME=vb224
LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:
LS_OPTIONS=-N --color=none -T 0
MACHTYPE=x86_64-suse-linux
MAIL=/var/mail/vb224
MANPATH=/opt/cray/llm/default/man:/opt/cray/lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1/man:/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/share/man:/home/y07/y07/cse/nano/2.2.6/share/man:/opt/cray/mpt/7.1.1/gni/man/mpich2:/opt/pbs/12.2.401.141761/man:/opt/cray/atp/1.7.5/man:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/share/man:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/man:/opt/cray/job/1.5.5-0.1_2.0501.48066.2.43.ari/man:/opt/cray/libsci/13.0.1/man:/opt/cray/cce/8.3.7/man:/opt/cray/cce/8.3.7/craylibs/man:/opt/cray/cce/8.3.7/CC/man:/opt/cray/cce/8.3.7/cftn/man:/opt/cray/craype/2.2.1/man:/opt/cray/eslogin/eswrap/1.1.0-1.010400.915.0/man:/opt/modules/3.2.10.2/share/man:/opt/cray/share/man:/usr/local/man:/usr/share/man:/usr/man:/opt/cray/share/man:/cm/local/apps/environment-modules/current/share/man
MINICOM=-c on
MODULEPATH=/opt/cray/craype/default/modulefiles:/opt/cray/ari/modulefiles:/opt/cray/modulefiles:/opt/modulefiles:/cm/local/modulefiles:/cm/shared/modulefiles:/opt/modules/packages-archer
MODULESHOME=/opt/modules/3.2.10.2
MODULE_VERSION=3.2.10.2
MODULE_VERSION_STACK=3.2.10.2
MORE=-sl
MPICH_ABORT_ON_ERROR=1
MPICH_DIR=/opt/cray/mpt/7.1.1/gni/mpich2-cray/83
NCPUS=1
NLSPATH=/opt/cray/cce/8.3.7/CC/x86-64/nls/En/%N.cat:/opt/cray/cce/8.3.7/craylibs/x86-64/nls/En/%N.cat:/opt/cray/cce/8.3.7/cftn/x86-64/nls/En/%N.cat
NNTPSERVER=news
NODE_COUNT=1
NUM_DEPTH=1
NUM_PES=24
NUM_PPN=24
OMP_NUM_THREADS=1
OSTYPE=linux
PAGER=less
PATH=/opt/cray/llm/default/bin:/opt/cray/llm/default/etc:/opt/cray/lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1/sbin:/opt/cray/lustre-cray_ari_s/2.4_3.0.80_0.5.1_1.0501.7664.16.1-1.0501.18401.34.1/bin:/opt/cray/MySQL/5.0.64-1.0000.7096.23.2/sbin:/opt/cray/MySQL/5.0.64-1.0000.7096.23.2/bin:/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/sbin:/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/bin:/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/bin:/opt/cray/nodestat/2.2-1.0501.47138.1.78.ari/bin:/usr/local/packages/cse/quickstart/1.0:/home/y07/y07/cse/nano/2.2.6/bin:/usr/local/packages/cse/serialJobs:/usr/local/packages/cse/bolt/0.6/bin:/usr/local/packages/cse/checkDisk:/usr/local/packages/cse/checkQueue:/usr/local/packages/cse/checkScript:/usr/local/packages/cse/budgets:/opt/cray/mpt/7.1.1/gni/bin:/opt/pbs/12.2.401.141761/bin:/opt/cray/atp/1.7.5/bin:/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/bin:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/sbin:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/bin:/opt/cray/dvs/2.4_0.9.0-1.0501.1672.2.122.ari/bin:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/sbin:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/bin:/opt/cray/job/1.5.5-0.1_2.0501.48066.2.43.ari/bin:/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/bin:/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/bin:/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/bin:/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/bin:/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/bin:/opt/cray/cce/8.3.7/cray-binutils/x86_64-unknown-linux-gnu/bin:/opt/cray/cce/8.3.7/craylibs/x86-64/bin:/opt/cray/cce/8.3.7/cftn/bin:/opt/cray/cce/8.3.7/CC/bin:/opt/cray/craype/2.2.1/bin:/opt/cray/switch/1.0-1.0501.47124.1.93.ari/bin:/opt/cray/eslogin/eswrap/1.1.0-1.010400.915.0/bin:/opt/modules/3.2.10.2/bin:/usr/local/bin:/usr/bin:/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/sbin:/usr/sbin:.:/usr/lib/qt3/bin:/opt/cray/bin
PBS_ACCOUNT=e290
PBS_ENVIRONMENT=PBS_BATCH
PBS_JOBCOOKIE=000000005EAFDFEB000000004644E3B4
PBS_JOBDIR=/home/e290/e290/vb224
PBS_JOBID=3187804.sdb
PBS_JOBNAME=SAGA-Python-PBS
PBS_MOMPORT=15003
PBS_NODEFILE=/var/spool/PBS/aux/3187804.sdb
PBS_NODENUM=0
PBS_O_HOME=/home/e290/e290/vb224
PBS_O_HOST=eslogin3-ldap
PBS_O_LANG=en_US.UTF-8
PBS_O_LOGNAME=vb224
PBS_O_MAIL=/var/mail/vb224
PBS_O_PATH=/usr/local/packages/cse/quickstart/1.0:/home/y07/y07/cse/nano/2.2.6/bin:/usr/local/packages/cse/serialJobs:/usr/local/packages/cse/bolt/0.6/bin:/usr/local/packages/cse/checkDisk:/usr/local/packages/cse/checkQueue:/usr/local/packages/cse/checkScript:/usr/local/packages/cse/budgets:/opt/cray/mpt/7.1.1/gni/bin:/opt/pbs/12.2.401.141761/bin:/opt/cray/atp/1.7.5/bin:/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/bin:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/sbin:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/bin:/opt/cray/dvs/2.4_0.9.0-1.0501.1672.2.122.ari/bin:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/sbin:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/bin:/opt/cray/job/1.5.5-0.1_2.0501.48066.2.43.ari/bin:/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/bin:/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/bin:/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/bin:/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/bin:/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/bin:/opt/cray/cce/8.3.7/cray-binutils/x86_64-unknown-linux-gnu/bin:/opt/cray/cce/8.3.7/craylibs/x86-64/bin:/opt/cray/cce/8.3.7/cftn/bin:/opt/cray/cce/8.3.7/CC/bin:/opt/cray/craype/2.2.1/bin:/opt/cray/switch/1.0-1.0501.47124.1.93.ari/bin:/opt/cray/eslogin/eswrap/1.1.0-1.010400.915.0/bin:/opt/modules/3.2.10.2/bin:/usr/local/bin:/usr/bin:/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/sbin:/usr/sbin:.:/usr/lib/qt3/bin:/opt/cray/bin
PBS_O_QUEUE=short
PBS_O_SHELL=/bin/bash
PBS_O_SYSTEM=Linux
PBS_O_WORKDIR=/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000
PBS_QUEUE=S3178110
PBS_TASKNUM=1
PE_CRAY_DEFAULT_FIXED_PKGCONFIG_PATH=/opt/cray/hdf5-parallel/1.8.13/CRAY/83/lib/pkgconfig:/opt/cray/hdf5/1.8.13/CRAY/83/lib/pkgconfig:/opt/cray/netcdf-hdf5parallel/4.3.2/CRAY/83/lib/pkgconfig:/opt/cray/ga/5.1.0.5/CRAY/83/lib/pkgconfig:/opt/cray/netcdf/4.3.2/CRAY/83/lib/pkgconfig:/opt/cray/parallel-netcdf/1.5.0/CRAY/83/lib/pkgconfig
PE_CXX_PKGCONFIG_LIBS=mpichcxx
PE_ENV=CRAY
PE_FFTW_DEFAULT_REQUIRED_PRODUCTS=PE_MPICH
PE_FFTW_DEFAULT_TARGET_haswell=haswell
PE_FFTW_DEFAULT_TARGET_interlagos=interlagos
PE_FFTW_DEFAULT_TARGET_sandybridge=sandybridge
PE_FFTW_DEFAULT_TARGET_x86_64=x86_64
PE_FFTW_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/fftw/3.3.4.1/@PE_FFTW_DEFAULT_TARGET@/lib/pkgconfig
PE_FORTRAN_PKGCONFIG_LIBS=mpichf90
PE_GA_DEFAULT_FIXED_PRGENV=CRAY INTEL
PE_GA_DEFAULT_GENCOMPS_GNU=49 48
PE_GA_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/ga/5.1.0.5/@PRGENV@/@PE_GA_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_GA_DEFAULT_VOLATILE_PRGENV=GNU
PE_HDF5_DEFAULT_FIXED_PRGENV=CRAY INTEL
PE_HDF5_DEFAULT_GENCOMPS_GNU=49 48
PE_HDF5_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/hdf5/1.8.13/@PRGENV@/@PE_HDF5_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_HDF5_DEFAULT_VOLATILE_PRGENV=GNU
PE_HDF5_PARALLEL_DEFAULT_FIXED_PRGENV=CRAY INTEL
PE_HDF5_PARALLEL_DEFAULT_GENCOMPS_GNU=49 48
PE_HDF5_PARALLEL_DEFAULT_REQUIRED_PRODUCTS=PE_MPICH
PE_HDF5_PARALLEL_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/hdf5-parallel/1.8.13/@PRGENV@/@PE_HDF5_PARALLEL_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_HDF5_PARALLEL_DEFAULT_VOLATILE_PRGENV=GNU
PE_INTEL_DEFAULT_FIXED_PKGCONFIG_PATH=/opt/cray/hdf5-parallel/1.8.13/INTEL/140/lib/pkgconfig:/opt/cray/hdf5/1.8.13/INTEL/140/lib/pkgconfig:/opt/cray/netcdf-hdf5parallel/4.3.2/INTEL/140/lib/pkgconfig:/opt/cray/ga/5.1.0.5/INTEL/140/lib/pkgconfig:/opt/cray/netcdf/4.3.2/INTEL/140/lib/pkgconfig:/opt/cray/mpt/7.1.1/gni/mpich2-intel/140/lib/pkgconfig:/opt/cray/parallel-netcdf/1.5.0/INTEL/140/lib/pkgconfig
PE_INTEL_FIXED_PKGCONFIG_PATH=/opt/cray/mpt/7.1.1/gni/mpich2-intel/140/lib/pkgconfig
PE_LEVEL=8.3
PE_LIBSCI_DEFAULT_GENCOMPS_CRAY_haswell=83
PE_LIBSCI_DEFAULT_GENCOMPS_CRAY_interlagos=83
PE_LIBSCI_DEFAULT_GENCOMPS_CRAY_sandybridge=83
PE_LIBSCI_DEFAULT_GENCOMPS_CRAY_x86_64=83
PE_LIBSCI_DEFAULT_GENCOMPS_GNU_haswell=49 48
PE_LIBSCI_DEFAULT_GENCOMPS_GNU_interlagos=49 48
PE_LIBSCI_DEFAULT_GENCOMPS_GNU_sandybridge=49 48
PE_LIBSCI_DEFAULT_GENCOMPS_GNU_x86_64=49 48
PE_LIBSCI_DEFAULT_GENCOMPS_INTEL_haswell=140
PE_LIBSCI_DEFAULT_GENCOMPS_INTEL_interlagos=140
PE_LIBSCI_DEFAULT_GENCOMPS_INTEL_sandybridge=140
PE_LIBSCI_DEFAULT_GENCOMPS_INTEL_x86_64=140
PE_LIBSCI_DEFAULT_OMP_REQUIRES_openmp=_mp
PE_LIBSCI_DEFAULT_PKGCONFIG_VARIABLES=PE_LIBSCI_DEFAULT_OMP_REQUIRES_@openmp@
PE_LIBSCI_DEFAULT_REQUIRED_PRODUCTS=PE_MPICH
PE_LIBSCI_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/libsci/13.0.1/@PRGENV@/@PE_LIBSCI_DEFAULT_GENCOMPS@/@PE_LIBSCI_DEFAULT_TARGET@/lib/pkgconfig
PE_LIBSCI_DEFAULT_VOLATILE_PRGENV=CRAY GNU INTEL
PE_LIBSCI_GENCOMPS_CRAY_haswell=83
PE_LIBSCI_GENCOMPS_CRAY_interlagos=83
PE_LIBSCI_GENCOMPS_CRAY_sandybridge=83
PE_LIBSCI_GENCOMPS_CRAY_x86_64=83
PE_LIBSCI_GENCOMPS_GNU_haswell=49 48
PE_LIBSCI_GENCOMPS_GNU_interlagos=49 48
PE_LIBSCI_GENCOMPS_GNU_sandybridge=49 48
PE_LIBSCI_GENCOMPS_GNU_x86_64=49 48
PE_LIBSCI_GENCOMPS_INTEL_haswell=140
PE_LIBSCI_GENCOMPS_INTEL_interlagos=140
PE_LIBSCI_GENCOMPS_INTEL_sandybridge=140
PE_LIBSCI_GENCOMPS_INTEL_x86_64=140
PE_LIBSCI_MODULE_NAME=cray-libsci/13.0.1
PE_LIBSCI_OMP_REQUIRES_openmp=_mp
PE_LIBSCI_PKGCONFIG_LIBS=libsci_mpi:libsci
PE_LIBSCI_PKGCONFIG_VARIABLES=PE_LIBSCI_OMP_REQUIRES_@openmp@
PE_LIBSCI_REQUIRED_PRODUCTS=PE_MPICH
PE_LIBSCI_VOLATILE_PKGCONFIG_PATH=/opt/cray/libsci/13.0.1/@PRGENV@/@PE_LIBSCI_GENCOMPS@/@PE_LIBSCI_TARGET@/lib/pkgconfig
PE_LIBSCI_VOLATILE_PRGENV=CRAY GNU INTEL
PELOCAL_PRGENV=true
PE_MPICH_DEFAULT_DIR_CRAY_DEFAULT64=64
PE_MPICH_DEFAULT_FIXED_PRGENV=INTEL
PE_MPICH_DEFAULT_GENCOMPS_CRAY=83
PE_MPICH_DEFAULT_GENCOMPS_GNU=49 48
PE_MPICH_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/mpt/7.1.1/gni/mpich2-@PRGENV@@PE_MPICH_DEFAULT_DIR_DEFAULT64@/@PE_MPICH_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_MPICH_DEFAULT_VOLATILE_PRGENV=CRAY GNU
PE_MPICH_DIR_CRAY_DEFAULT64=64
PE_MPICH_FIXED_PRGENV=INTEL
PE_MPICH_GENCOMPS_CRAY=83
PE_MPICH_GENCOMPS_GNU=49 48
PE_MPICH_MODULE_NAME=cray-mpich
PE_MPICH_MULTITHREADED_LIBS_multithreaded=_mt
PE_MPICH_NV_LIBS_nvidia20=-lcudart
PE_MPICH_NV_LIBS_nvidia35=-lcudart
PE_MPICH_PKGCONFIG_VARIABLES=PE_MPICH_NV_LIBS_@accelerator@:PE_MPICH_MULTITHREADED_LIBS_@multithreaded@
PE_MPICH_TARGET_VAR_nvidia20=-lcudart
PE_MPICH_TARGET_VAR_nvidia35=-lcudart
PE_MPICH_VOLATILE_PKGCONFIG_PATH=/opt/cray/mpt/7.1.1/gni/mpich2-@PRGENV@@PE_MPICH_DIR_DEFAULT64@/@PE_MPICH_GENCOMPS@/lib/pkgconfig
PE_MPICH_VOLATILE_PRGENV=CRAY GNU
PE_NETCDF_DEFAULT_FIXED_PRGENV=CRAY INTEL
PE_NETCDF_DEFAULT_GENCOMPS_GNU=49 48
PE_NETCDF_DEFAULT_REQUIRED_PRODUCTS=PE_HDF5_PARALLEL PE_MPICH
PE_NETCDF_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/netcdf/4.3.2/@PRGENV@/@PE_NETCDF_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_NETCDF_DEFAULT_VOLATILE_PRGENV=GNU
PE_NETCDF_HDF5PARALLEL_DEFAULT_FIXED_PRGENV=CRAY INTEL
PE_NETCDF_HDF5PARALLEL_DEFAULT_GENCOMPS_GNU=49 48
PE_NETCDF_HDF5PARALLEL_DEFAULT_REQUIRED_PRODUCTS=PE_HDF5_PARALLEL PE_MPICH
PE_NETCDF_HDF5PARALLEL_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/netcdf-hdf5parallel/4.3.2/@PRGENV@/@PE_NETCDF_HDF5PARALLEL_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_NETCDF_HDF5PARALLEL_DEFAULT_VOLATILE_PRGENV=GNU
PE_PAPI_DEFAULT_ACCEL_LIBS_nvidia35=,-lcupti,-lcudart,-lcuda
PE_PAPI_DEFAULT_PKGCONFIG_VARIABLES=PE_PAPI_ACCEL_LIBS_@accelerator@
PE_PAPI_DEFAULT_TARGET_VAR_nvidia35=,-lcupti,-lcudart,-lcuda
PE_PARALLEL_NETCDF_DEFAULT_FIXED_PRGENV=CRAY INTEL
PE_PARALLEL_NETCDF_DEFAULT_GENCOMPS_GNU=49 48
PE_PARALLEL_NETCDF_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/parallel-netcdf/1.5.0/@PRGENV@/@PE_PARALLEL_NETCDF_DEFAULT_GENCOMPS@/lib/pkgconfig
PE_PARALLEL_NETCDF_DEFAULT_VOLATILE_PRGENV=GNU
PE_PETSC_DEFAULT_GENCOMPS_CRAY_haswell=83
PE_PETSC_DEFAULT_GENCOMPS_CRAY_interlagos=83
PE_PETSC_DEFAULT_GENCOMPS_CRAY_sandybridge=83
PE_PETSC_DEFAULT_GENCOMPS_CRAY_x86_64=83
PE_PETSC_DEFAULT_GENCOMPS_GNU_haswell=49 48
PE_PETSC_DEFAULT_GENCOMPS_GNU_interlagos=49 48
PE_PETSC_DEFAULT_GENCOMPS_GNU_sandybridge=49 48
PE_PETSC_DEFAULT_GENCOMPS_GNU_x86_64=49 48
PE_PETSC_DEFAULT_GENCOMPS_INTEL_haswell=140
PE_PETSC_DEFAULT_GENCOMPS_INTEL_sandybridge=140
PE_PETSC_DEFAULT_GENCOMPS_INTEL_x86_64=140
PE_PETSC_DEFAULT_REQUIRED_PRODUCTS=PE_MPICH:PE_LIBSCI:PE_TPSL
PE_PETSC_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/petsc/3.5.2.1/complex/@PRGENV@/@PE_PETSC_DEFAULT_GENCOMPS@/@PE_PETSC_DEFAULT_TARGET@/lib/pkgconfig
PE_PETSC_DEFAULT_VOLATILE_PRGENV=CRAY GNU INTEL
PE_PKGCONFIG_DEFAULT_PRODUCTS=PE_HDF5_PARALLEL:PE_HDF5:PE_PETSC:PE_NETCDF_HDF5PARALLEL:PE_TRILINOS:PE_FFTW:PE_GA:PE_NETCDF:PE_TPSL:PE_MPICH:PE_LIBSCI:PE_PARALLEL_NETCDF
PE_PKGCONFIG_LIBS=mpich:AtpSigHandler:libsci_mpi:libsci
PE_PKGCONFIG_PRODUCTS_DEFAULT=PE_PAPI
PE_PKGCONFIG_PRODUCTS=PE_MPICH:PE_LIBSCI
PE_PRODUCT_LIST=CRAY_LLM:CRAYPE_IVYBRIDGE:CRAY_RCA:CRAY_ALPS:DVS:CRAY_XPMEM:CRAY_DMAPP:CRAY_PMI:CRAY_UGNI:CRAY_UDREG:CRAY_LIBSCI:CRAYPE:CRAY
PE_SMA_DIR_CRAY_DEFAULT64=64
PE_SMA_DIR_PGI_DEFAULT64=64
PE_SMA_VOLATILE_PKGCONFIG_PATH=/opt/cray/mpt/7.1.1/gni/sma@PE_SMA_DIR_DEFAULT64@/lib64/pkgconfig
PE_TPSL_DEFAULT_GENCOMPS_CRAY_haswell=83
PE_TPSL_DEFAULT_GENCOMPS_CRAY_interlagos=83
PE_TPSL_DEFAULT_GENCOMPS_CRAY_sandybridge=83
PE_TPSL_DEFAULT_GENCOMPS_CRAY_x86_64=83
PE_TPSL_DEFAULT_GENCOMPS_GNU_haswell=49 48
PE_TPSL_DEFAULT_GENCOMPS_GNU_interlagos=49 48
PE_TPSL_DEFAULT_GENCOMPS_GNU_sandybridge=49 48
PE_TPSL_DEFAULT_GENCOMPS_GNU_x86_64=49 48
PE_TPSL_DEFAULT_GENCOMPS_INTEL_haswell=140
PE_TPSL_DEFAULT_GENCOMPS_INTEL_interlagos=140
PE_TPSL_DEFAULT_GENCOMPS_INTEL_sandybridge=140
PE_TPSL_DEFAULT_GENCOMPS_INTEL_x86_64=140
PE_TPSL_DEFAULT_REQUIRED_PRODUCTS=PE_MPICH:PE_LIBSCI
PE_TPSL_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/tpsl/1.4.3/@PRGENV@/@PE_TPSL_DEFAULT_GENCOMPS@/@PE_TPSL_DEFAULT_TARGET@/lib/pkgconfig
PE_TPSL_DEFAULT_VOLATILE_PRGENV=CRAY GNU INTEL
PE_TRILINOS_DEFAULT_GENCOMPS_CRAY_x86_64=83
PE_TRILINOS_DEFAULT_GENCOMPS_GNU_x86_64=49 48
PE_TRILINOS_DEFAULT_GENCOMPS_INTEL_x86_64=140
PE_TRILINOS_DEFAULT_REQUIRED_PRODUCTS=PE_MPICH:PE_HDF5_PARALLEL:PE_NETCDF_HDF5PARALLEL:PE_LIBSCI:PE_TPSL
PE_TRILINOS_DEFAULT_VOLATILE_PKGCONFIG_PATH=/opt/cray/trilinos/11.12.1.0/@PRGENV@/@PE_TRILINOS_DEFAULT_GENCOMPS@/@PE_TRILINOS_DEFAULT_TARGET@/lib/pkgconfig
PE_TRILINOS_DEFAULT_VOLATILE_PRGENV=CRAY GNU INTEL
PKGCONFIG_ENABLED=1
PKG_CONFIG_PATH_DEFAULT=/opt/cray/papi/5.3.2.1/lib64/pkgconfig
PKG_CONFIG_PATH=/opt/cray/MySQL/5.0.64-1.0000.7096.23.2/lib64/pkgconfig:/opt/cray/alps/5.1.1-2.0501.8471.1.1.ari/lib64/pkgconfig:/opt/cray/rca/1.0.0-2.0501.48090.7.46.ari/lib64/pkgconfig:/opt/cray/alps/5.1.1-2.0501.8507.1.1.ari/lib64/pkgconfig:/opt/cray/csa/3.0.0-1_2.0501.47112.1.91.ari/lib64/pkgconfig:/opt/cray/xpmem/0.1-2.0501.48424.3.3.ari/lib64/pkgconfig:/opt/cray/gni-headers/3.0-1.0501.8317.12.1.ari/lib64/pkgconfig:/opt/cray/dmapp/7.0.1-1.0501.8315.8.4.ari/lib64/pkgconfig:/opt/cray/pmi/5.0.6-1.0000.10439.140.2.ari/lib64/pkgconfig:/opt/cray/ugni/5.0-1.0501.8253.10.22.ari/lib64/pkgconfig:/opt/cray/udreg/2.3.2-1.0501.7914.1.13.ari/lib64/pkgconfig:/opt/cray/craype/2.2.1/pkg-config:/opt/cray/switch/1.0-1.0501.47124.1.93.ari/lib64/pkgconfig:/opt/cray/atp/1.7.5/lib/pkgconfig
PRGENVMODULES=/opt/modules/3.2.10.2/init/.prgenvmodules:PrgEnv-cray:PrgEnv-gnu:PrgEnv-intel:PrgEnv-pathscale:PrgEnv-pgi
PROFILEREAD=true
PWD=/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000
PYTHONPATH=/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py:/usr/local/packages/cse/bolt/0.6/modules
PYTHONSTARTUP=/etc/pythonstart
QTDIR=/usr/lib/qt3
QT_SYSTEM_DIR=/usr/share/desktop-data
RCLOCAL_BASEOPTS=true
RCLOCAL_PRGENV=true
SAGA_PPN=24
SHELL=/bin/bash
SHLVL=4
SHMEM_ABORT_ON_ERROR=1
SSH_CLIENT=74.88.204.39 45561 22
SSH_CONNECTION=74.88.204.39 45561 193.62.216.44 22
SSH_SENDS_LOCALE=yes
SSH_TTY=/dev/pts/27
TARGETMODULES=/opt/modules/3.2.10.2/init/.targetmodules:craype-abudhabi:craype-abudhabi-cu:craype-accel-host:craype-accel-nvidia20:craype-accel-nvidia30:craype-accel-nvidia35:craype-barcelona:craype-broadwell:craype-haswell:craype-hugepages128K:craype-hugepages128M:craype-hugepages16M:craype-hugepages256M:craype-hugepages2M:craype-hugepages32M:craype-hugepages4M:craype-hugepages512K:craype-hugepages512M:craype-hugepages64M:craype-hugepages8M:craype-intel-knc:craype-interlagos:craype-interlagos-cu:craype-istanbul:craype-ivybridge:craype-mc12:craype-mc8:craype-mic-knl:craype-network-aries:craype-network-gemini:craype-network-infiniband:craype-network-none:craype-network-seastar:craype-sandybridge:craype-shanghai:craype-target-compute_node:craype-target-local_host:craype-target-native:craype-xeon:xtpe-barcelona:xtpe-interlagos:xtpe-interlagos-cu:xtpe-istanbul:xtpe-mc12:xtpe-mc8:xtpe-network-gemini:xtpe-network-seastar:xtpe-shanghai:xtpe-target-native:xtpe-xeon:/etc/opt/cray/modules/site_targetmodules
TERM=vt100
TMPDIR=/var/tmp/pbs.3187804.sdb
TOOLMODULES=/opt/modules/3.2.10.2/init/.toolmodules:apprentice:apprentice2:atp:chapel:cray-lgdb:craypat:craypkg-gen:cray-snplauncher:ddt:gdb:iobuf:papi:perftools:perftools-lite:stat:totalview:xt-craypat:xt-lgdb:xt-papi:xt-totalview:/etc/opt/cray/modules/site_toolmodules
TZ=Europe/London
USERMODULES=/opt/modules/3.2.10.2/init/.usermodules:acml:alps:apprentice:apprentice2:atp:blcr:cce:chapel:cray-ccdb:cray-fftw:cray-ga:cray-hdf5:cray-hdf5-parallel:cray-lgdb:cray-libsci:cray-libsci_acc:cray-mpich:cray-mpich2:cray-mpich-compat:cray-netcdf:cray-netcdf-hdf5parallel:cray-parallel-netcdf:craypat:craype:cray-petsc:cray-petsc-complex:craypkg-gen:cray-shmem:cray-snplauncher:cray-tpsl:cray-trilinos:cudatoolkit:ddt:fftw:ga:gcc:hdf5:hdf5-parallel:intel:iobuf:java:lgdb:libfast:libsci_acc:mpich1:netcdf:netcdf-hdf5parallel:netcdf-nofsync:netcdf-nofsync-hdf5parallel:ntk:onesided:papi:parallel-netcdf:pathscale:perftools:perftools-lite:petsc:petsc-complex:pgi:pmi:PrgEnv-cray:PrgEnv-gnu:PrgEnv-intel:PrgEnv-pathscale:PrgEnv-pgi:stat:totalview:tpsl:trilinos:xt-asyncpe:xt-craypat:xt-lgdb:xt-libsci:xt-mpich2:xt-mpt:xt-papi:xt-shmem:xt-totalview:/etc/opt/cray/modules/site_usermodules
USER=vb224
_=/usr/bin/env
WINDOWMANAGER=
XCURSOR_THEME=crystalwhite
XDG_CONFIG_DIRS=/etc/xdg
XDG_DATA_DIRS=/usr/share:/etc/opt/kde3/share:/opt/kde3/share
XKEYSYMDB=/usr/share/X11/XKeysymDB
XNLSPATH=/usr/share/X11/nls
XTOS_VERSION=5.1.29
XTPE_NETWORK_TARGET=aries
# -------------------------------------------------------------------

# -------------------------------------------------------------------
#
# Running pre-process command
# cmd: module switch anaconda python-compute/2.7.6-ucs4
#
#
# SUCCESS
#
# -------------------------------------------------------------------
VIRTENV : /work/e290/shared/shared_pilot_ve_20150429/
VIRTENV : /fs4/e290/shared/shared_pilot_ve_20150429 (normalized)
# -------------------------------------------------------------------
# Setting up forward tunnel for MongoDB to 10.60.0.52.

################################################################################
## Searching for available TCP port for tunnel in range 23000..23100.
## Found available port: 23000
PYTHON: /usr/bin/python
PIP   : 
obtained lock /fs4/e290/shared/shared_pilot_ve_20150429.lock
virtenv_create   : FALSE
virtenv_update   : FALSE
rp install sources:  radical.utils-0.35/ saga-python-0.35/ radical.pilot-0.35.1/
rp install target : SANDBOX
do not create virtenv /fs4/e290/shared/shared_pilot_ve_20150429
PYTHON: /fs4/e290/shared/shared_pilot_ve_20150429/bin/python
PIP   : /fs4/e290/shared/shared_pilot_ve_20150429/bin/pip
PYTHON INTERPRETER: /fs4/e290/shared/shared_pilot_ve_20150429/bin/python
PYTHON_VERSION    : 
VE_MOD_PREFIX     : 
PIP installer     : /fs4/e290/shared/shared_pilot_ve_20150429/bin/pip
PIP version       : 
activated virtenv
VIRTENV      : /fs4/e290/shared/shared_pilot_ve_20150429
VE_MOD_PREFIX: ///////
RP_MOD_PREFIX: ///////
PYTHONPATH   : ///////:/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py:/usr/local/packages/cse/bolt/0.6/modules
do not update virtenv /fs4/e290/shared/shared_pilot_ve_20150429
Using RADICAL-Pilot install sources ' radical.utils-0.35/ saga-python-0.35/ radical.pilot-0.35.1/'
VE_MOD_PREFIX: ///////
VIRTENV      : /fs4/e290/shared/shared_pilot_ve_20150429
SANDBOX      : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000
VE_LOC_PREFIX: 
using local install tree
PYTHONPATH: ///////::/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py:/usr/local/packages/cse/bolt/0.6/modules
rp_install: ///////
radicalmod: ////////radical/
created radical namespace in ////////radical//__init__.py

# -------------------------------------------------------------------
#
# update radical.utils-0.35/ via pip
# cmd: /fs4/e290/shared/shared_pilot_ve_20150429/bin/pip install  --src '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install/src' --build '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install/build' --install-option='--prefix=/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install' radical.utils-0.35/
#
#
# ERROR
# no fallback command available
#
# -------------------------------------------------------------------
Couldn't install radical.utils-0.35/! Lets see how far we get ...

# -------------------------------------------------------------------
#
# update saga-python-0.35/ via pip
# cmd: /fs4/e290/shared/shared_pilot_ve_20150429/bin/pip install  --src '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install/src' --build '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install/build' --install-option='--prefix=/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install' saga-python-0.35/
#
#
# ERROR
# no fallback command available
#
# -------------------------------------------------------------------
Couldn't install saga-python-0.35/! Lets see how far we get ...

# -------------------------------------------------------------------
#
# update radical.pilot-0.35.1/ via pip
# cmd: /fs4/e290/shared/shared_pilot_ve_20150429/bin/pip install  --src '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install/src' --build '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install/build' --install-option='--prefix=/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016709.0007-pilot.0000/rp_install' radical.pilot-0.35.1/
#
#
# ERROR
# no fallback command available
#
# -------------------------------------------------------------------
Couldn't install radical.pilot-0.35.1/! Lets see how far we get ...
removed `/fs4/e290/shared/shared_pilot_ve_20150429.lock'

---------------------------------------------------------------------

 (/fs4/e290/shared/shared_pilot_ve_20150429/bin/python)
PYTHONPATH: ///////::/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py:/usr/local/packages/cse/bolt/0.6/modules
install failed!
--------------------------------------------------------------------------------

Resources requested: ncpus=24,place=free,walltime=00:20:00
Resources allocated: cpupercent=0,cput=00:00:00,mem=0kb,ncpus=24,vmem=0kb,walltime=00:00:05

*** vb224   Job: 3187804.sdb   ended: 01/10/15 16:13:28   queue: S3178110 ***
*** vb224   Job: 3187804.sdb   ended: 01/10/15 16:13:28   queue: S3178110 ***
*** vb224   Job: 3187804.sdb   ended: 01/10/15 16:13:28   queue: S3178110 ***
*** vb224   Job: 3187804.sdb   ended: 01/10/15 16:13:28   queue: S3178110 ***
--------------------------------------------------------------------------------
@vivek-bala
Copy link
Contributor Author

This is reproducible. All attempts since the failed build have lead to the same error.

@marksantcroos
Copy link
Contributor

ARCHER admins removed the default python module.
Can you try 55cfbd7?

@vivek-bala
Copy link
Contributor Author

I tried that commit for rp. devel for saga and radical utils. I still see the same error.

client verbose:

$ python getting_started_remote.py epsrc.archer
running on epsrc.archer
2015-10-01 20:56:41,853: radical.pilot       : MainProcess                     : MainThread     : ERROR   : The 'database_name' parameter is deprecated - please specify an URL path
create session rp.session.ip-10-184-31-85.vivek.016709.0010                   ok
session id: rp.session.ip-10-184-31-85.vivek.016709.0010
create pilot manager                                                          ok
create pilot description                                                      ok
submit 1 pilot(s).                                                            ok
create unit manager                                                           ok
add 1 pilot(s)                                                                ok
submit 8 unit(s)........[Callback]: ComputePilot 'pilot.0000' state: Launching.
                                                      ok
wait for 8 unit(s)[Callback]: unit unit.000002 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000003 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000001 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000007 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000004 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000000 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000005 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000006 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000004 on pilot.0000: StagingInput.
[Callback]: unit unit.000000 on pilot.0000: StagingInput.
[Callback]: unit unit.000004 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000000 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000005 on pilot.0000: StagingInput.
[Callback]: unit unit.000006 on pilot.0000: StagingInput.
[Callback]: unit unit.000005 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000006 on pilot.0000: AgentStagingInputPending.
[Callback]: ComputePilot 'pilot.0000' state: PendingActive.
[Callback]: ComputePilot 'pilot.0000' state: Done.

agent_0.err:

Traceback (most recent call last):
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0010-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 6101, in <module>
    bootstrap_3()
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0010-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 5943, in bootstrap_3
    _, mongo_db, _, _, _  = ru.mongodb_connect(cfg['mongodb_url'])
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0010-pilot.0000/rp_install/lib/python2.7/site-packages/radical/utils/misc.py", line 88, in mongodb_connect
    mongo = pymongo.MongoClient (host=host, port=port)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/mongo_client.py", line 377, in __init__
    raise ConnectionFailure(str(e))
pymongo.errors.ConnectionFailure: [Errno 111] Connection refused

bootstrap_1.err :

bootstrap_1.sh: line 1240: 26768 Terminated              ( /bin/bash -c "(>/dev/tcp/$host/$port)" 2> /dev/null )
Bad local forwarding specification '127.0.0.1:23000:extasy-db.epcc.ed.ac.uk:'
Python 2.7.6
kill: 26769: No such process

@marksantcroos
Copy link
Contributor

I still see the same error.

Hmmm, this is a rather different error, isnt it?

> 2015-10-01 20:56:41,853: radical.pilot       : MainProcess                     : MainThread     : ERROR   : The 'database_name' parameter is deprecated - please specify an URL path
Bad local forwarding specification '127.0.0.1:23000:extasy-db.epcc.ed.ac.uk:'

How do you configure the mongodb to use?

@vivek-bala
Copy link
Contributor Author

I set

session = rp.Session(database_url='mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk/radicalpilot',database_name='vivek')

Any reason why database_name is being deprecated ?

@marksantcroos
Copy link
Contributor

Any reason why database_name is being deprecated ?

For easier parsing, and for cases like this :)
Any reason why you have been ignoring the warning for ages? ;-)

@vivek-bala
Copy link
Contributor Author

This is the first time I am seeing/noticing the warning :) But is the parameter deprecated or not supported anymore ? Should I set the database_name along with the database_url ?

@andre-merzky
Copy link
Member

Yes, please set the database name as part of the URL...

@marksantcroos
Copy link
Contributor

Should I set the database_name along with the database_url ?

Technically you already did, as the "radicalpilot" part of the url is already the database name :-)

@vivek-bala
Copy link
Contributor Author

Hmm.. yes ! Didn't even realize I used the extasy db url :-| Running it again now.

@vivek-bala
Copy link
Contributor Author

client verbose remains same.

agent_0.err:

Traceback (most recent call last):
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 6101, in <module>
    bootstrap_3()
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 5943, in bootstrap_3
    _, mongo_db, _, _, _  = ru.mongodb_connect(cfg['mongodb_url'])
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/lib/python2.7/site-packages/radical/utils/misc.py", line 88, in mongodb_connect
    mongo = pymongo.MongoClient (host=host, port=port)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/mongo_client.py", line 377, in __init__
    raise ConnectionFailure(str(e))
pymongo.errors.ConnectionFailure: [Errno 111] Connection refused

bootstrap_1.err

bootstrap_1.sh: line 1240: 24834 Terminated              ( /bin/bash -c "(>/dev/tcp/$host/$port)" 2> /dev/null )
Bad local forwarding specification '127.0.0.1:23000:extasy-db.epcc.ed.ac.uk:'
Python 2.7.6
kill: 24835: No such process

agent.out:

2015-10-01 23:47:59,020: radical.saga        : MainProcess                     : MainThread     : INFO    : python.interpreter   version: 2.7.6 (default, Mar 10 2014, 14:13:45) [GCC 4.8.1 20130531 (Cray Inc.)]
2015-10-01 23:47:59,020: radical.saga        : MainProcess                     : MainThread     : INFO    :                      pid: 30175
2015-10-01 23:47:59,020: radical.saga        : MainProcess                     : MainThread     : INFO    :                      tid: MainThread
2015-10-01 23:47:59,020: radical.saga        : MainProcess                     : MainThread     : INFO    : radical.saga         version: v0.35-26-g8305ab2@devel
2015-10-01 23:47:59,032: radical.pilot       : MainProcess                     : MainThread     : INFO    : python.interpreter   version: 2.7.6 (default, Mar 10 2014, 14:13:45) [GCC 4.8.1 20130531 (Cray Inc.)]
2015-10-01 23:47:59,032: radical.pilot       : MainProcess                     : MainThread     : INFO    :                      pid: 30175
2015-10-01 23:47:59,032: radical.pilot       : MainProcess                     : MainThread     : INFO    :                      tid: MainThread
2015-10-01 23:47:59,032: radical.pilot       : MainProcess                     : MainThread     : INFO    : radical.pilot        version: v0.35-511-g55cfbd7@detached-55cfbd7
---------------------------------------------------------------------

PYTHONPATH: ['/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/bin', '/work/y07/y07/cse/python/modules/cython/0.21.1/lib/python2.7/site-packages/Cython-0.21.1-py2.7-linux-x86_64.egg', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/setuptools-0.6c11-py2.7.egg', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pip-1.3-py2.7.egg', '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/lib/python2.7/site-packages', '/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000', '/work/y07/y07/cse/pycairo/1.10.0/lib/python2.7/site-packages', '/work/y07/y07/cse/pygobject/2.21.3/lib/python2.7/site-packages', '/work/y07/y07/cse/pygtk/2.24.0/lib/python2.7/site-packages/gtk-2.0', '/work/y07/y07/cse/yaml/pyyaml/3.11/lib/python2.7/site-packages', '/work/y07/y07/cse/python/modules/cython/0.21.1/lib/python2.7/site-packages', '/work/y07/y07/cse/mpi4py/1.3.1/lib/python2.7/site-packages', '/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py', '/usr/local/packages/cse/bolt/0.6/modules', '/work/y07/y07/cse/pygobject/2.21.3/lib/python2.7/site-packages/gtk-2.0', '/work/e290/shared/shared_pilot_ve_20150924/lib/python27.zip', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/plat-linux2', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-tk', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-old', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-dynload', '/work/y07/y07/cse/python/2.7.6/lib/python2.7', '/work/y07/y07/cse/python/2.7.6/lib/python2.7/plat-linux2', '/work/y07/y07/cse/python/2.7.6/lib/python2.7/lib-tk', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages', '/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py']
python: 2.7.6 (default, Mar 10 2014, 14:13:45) 
[GCC 4.8.1 20130531 (Cray Inc.)]
utils : v0.35-50-gd8796d4@devel : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/lib/python2.7/site-packages/radical/utils/__init__.pyc
saga  : v0.35-26-g8305ab2@devel : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/lib/python2.7/site-packages/saga/__init__.pyc
pilot : v0.35-511-g55cfbd7@detached-55cfbd7 : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/rp_install/lib/python2.7/site-packages/radical/pilot/__init__.pyc
        type  : multicore
        gitid : $Id$

---------------------------------------------------------------------

startup agent agent_0 : /fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/agent_0.cfg
Agent config (/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.ip-10-184-31-85.vivek.016709.0012-pilot.0000/agent_0.cfg):
{'agent_launch_method': 'ORTE',
 'agent_layout': {'agent_0': {'bridges': ['agent_staging_input_queue',
                                          'agent_scheduling_queue',
                                          'agent_executing_queue',
                                          'agent_staging_output_queue',
                                          'agent_unschedule_pubsub',
                                          'agent_reschedule_pubsub',
                                          'agent_command_pubsub',
                                          'agent_state_pubsub'],
                              'components': {'AgentStagingInputComponent': 1,
                                             'AgentStagingOutputComponent': 1},
                              'pull_units': True,
                              'sub_agents': ['agent_1'],
                              'target': 'local'},
                  'agent_1': {'components': {'AgentExecutingComponent': 1,
                                             'AgentSchedulingComponent': 1},
                              'target': 'node'}},
 'agent_name': 'agent_0',
 'bulk_collection_time': 1.0,
 'cores': 8,
 'db_poll_sleeptime': 0.1,
 'debug': 40,
 'heartbeat_interval': 10,
 'lrms': 'PBSPRO',
 'max_io_loglength': 1024,
 'mongodb_url': 'mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk/radicalpilot',
 'mpi_launch_method': 'ORTE',
 'network_interface': 'ipogif0',
 'pilot_id': 'pilot.0000',
 'runtime': 10,
 'scheduler': 'CONTINUOUS',
 'session_id': 'rp.session.ip-10-184-31-85.vivek.016709.0012',
 'spawner': 'POPEN',
 'staging_area': 'staging_area',
 'staging_scheme': 'staging',
 'task_launch_method': 'ORTE'}

session created as follows:

session = rp.Session(database_url='mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk/radicalpilot')

@andre-merzky
Copy link
Member

bootstrap_1.out is long, but can you post it somewhere, or attach it here? Thanks!

@vivek-bala
Copy link
Contributor Author

@andre-merzky
Copy link
Member

Vivek, could you please change mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk/radicalpilot to mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk:27017/radicalpilot? If that works (or creates a different error), then I think I know what needs fixing. Thanks!

@andre-merzky
Copy link
Member

In now committed what I think is a fix in the branch fix/issue_780. You may want to give it a try (w/o the port number).

@vivek-bala
Copy link
Contributor Author

Do I install devel saga and ru with fix/issue_780 ?

@marksantcroos
Copy link
Contributor

Yes, its similar to devel modulo the fix. (I just merged devel back into it)

@vivek-bala
Copy link
Contributor Author

Ok, I don't get the forwarding error anymore. I get the module error. Should I be testing the devel branch now ? (fixes for both mongodb+archer module changes)

@marksantcroos
Copy link
Contributor

Did you pull after my message?

@vivek-bala
Copy link
Contributor Author

I think so. I used the fix_issue_780 branch of rp.

@marksantcroos
Copy link
Contributor

That should have the module fix.

@vivek-bala
Copy link
Contributor Author

Also the number of files on the agent side are lesser. As in there are no agent_0.err or .out files. Is this expected ?

vb224@eslogin007:~/work/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0002-pilot.0000> ls
agent_0.cfg      radical.pilot-v0.36.RC1-3-g576b11d-fix-issue-780         rp_install
bootstrap_1.err  radical.pilot-v0.36.RC1-3-g576b11d-fix-issue-780.tar.gz  saga-python-v0.36.RC1-devel
bootstrap_1.out  radical.utils-v0.36.RC1-devel                            saga-python-v0.36.RC1-devel.tar.gz
bootstrap_1.sh   radical.utils-v0.36.RC1-devel.tar.gz

@vivek-bala
Copy link
Contributor Author

Ok, looks like I might have pulled before the merge. I pulled again now, I see some changes. Wil give it another go

@vivek-bala
Copy link
Contributor Author

Now I get,

agent_0.out:

---------------------------------------------------------------------

PYTHONPATH: ['/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/bin', '/work/y07/y07/cse/python/modules/cython/0.21.1/lib/python2.7/site-packages/Cython-0.21.1-py2.7-linux-x86_64.egg', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/setuptools-0.6c11-py2.7.egg', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pip-1.3-py2.7.egg', '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/lib/python2.7/site-packages', '/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000', '/work/y07/y07/cse/pycairo/1.10.0/lib/python2.7/site-packages', '/work/y07/y07/cse/pygobject/2.21.3/lib/python2.7/site-packages', '/work/y07/y07/cse/pygtk/2.24.0/lib/python2.7/site-packages/gtk-2.0', '/work/y07/y07/cse/yaml/pyyaml/3.11/lib/python2.7/site-packages', '/work/y07/y07/cse/python/modules/cython/0.21.1/lib/python2.7/site-packages', '/work/y07/y07/cse/mpi4py/1.3.1/lib/python2.7/site-packages', '/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py', '/usr/local/packages/cse/bolt/0.6/modules', '/work/y07/y07/cse/pygobject/2.21.3/lib/python2.7/site-packages/gtk-2.0', '/work/e290/shared/shared_pilot_ve_20150924/lib/python27.zip', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/plat-linux2', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-tk', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-old', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-dynload', '/work/y07/y07/cse/python/2.7.6/lib/python2.7', '/work/y07/y07/cse/python/2.7.6/lib/python2.7/plat-linux2', '/work/y07/y07/cse/python/2.7.6/lib/python2.7/lib-tk', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages', '/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py']
python: 2.7.6 (default, Mar 10 2014, 14:13:45) 
[GCC 4.8.1 20130531 (Cray Inc.)]
utils : v0.36.RC1@devel : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/lib/python2.7/site-packages/radical/utils/__init__.pyc
saga  : v0.36.RC1@devel : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/lib/python2.7/site-packages/saga/__init__.pyc
pilot : v0.36.RC1-28-g944275b@fix-issue_780 : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/lib/python2.7/site-packages/radical/pilot/__init__.pyc
        type  : multicore
        gitid : $Id$

---------------------------------------------------------------------

startup agent agent_0 : /fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/agent_0.cfg
Agent config (/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/agent_0.cfg):
{'agent_launch_method': 'ORTE',
 'agent_layout': {'agent_0': {'bridges': ['agent_staging_input_queue',
                                          'agent_scheduling_queue',
                                          'agent_executing_queue',
                                          'agent_staging_output_queue',
                                          'agent_unschedule_pubsub',
                                          'agent_reschedule_pubsub',
                                          'agent_command_pubsub',
                                          'agent_state_pubsub'],
                              'components': {'AgentStagingInputComponent': 1,
                                             'AgentStagingOutputComponent': 1},
                              'pull_units': True,
                              'sub_agents': ['agent_1'],
                              'target': 'local'},
                  'agent_1': {'components': {'AgentExecutingComponent': 1,
                                             'AgentSchedulingComponent': 1},
                              'target': 'node'}},
 'agent_name': 'agent_0',
 'bulk_collection_time': 1.0,
 'cores': 8,
 'db_poll_sleeptime': 0.1,
 'debug': 40,
 'heartbeat_interval': 10,
 'lrms': 'PBSPRO',
 'max_io_loglength': 1024,
 'mongodb_url': 'mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk/radicalpilot',
 'mpi_launch_method': 'ORTE',
 'network_interface': 'ipogif0',
 'pilot_id': 'pilot.0000',
 'runtime': 10,
 'scheduler': 'CONTINUOUS',
 'session_id': 'rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003',
 'spawner': 'POPEN',
 'staging_area': 'staging_area',
 'staging_scheme': 'staging',
 'task_launch_method': 'ORTE'}


Error running agent: LRMS has no nodes left to run units
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 6024, in bootstrap_3
    logger = log)
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 2335, in create
    return impl(cfg, logger)
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 2547, in __init__
    LRMS.__init__(self, cfg, logger)
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0003-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 2282, in __init__
    raise RuntimeError('LRMS has no nodes left to run units')

atexit

agent_0.err:

2015-10-02 14:35:37,016: radical.saga        : MainProcess                     : MainThread     : INFO    : python.interpreter   version: 2.7.6 (default, Mar 10 2014, 14:13:45) [GCC 4.8.1 20130531 (Cray Inc.)]
2015-10-02 14:35:37,017: radical.saga        : MainProcess                     : MainThread     : INFO    :                      pid: 14642
2015-10-02 14:35:37,017: radical.saga        : MainProcess                     : MainThread     : INFO    :                      tid: MainThread
2015-10-02 14:35:37,017: radical.saga        : MainProcess                     : MainThread     : INFO    : radical.saga         version: v0.36.RC1@devel
2015-10-02 14:35:37,027: radical.pilot       : MainProcess                     : MainThread     : INFO    : python.interpreter   version: 2.7.6 (default, Mar 10 2014, 14:13:45) [GCC 4.8.1 20130531 (Cray Inc.)]
2015-10-02 14:35:37,028: radical.pilot       : MainProcess                     : MainThread     : INFO    :                      pid: 14642
2015-10-02 14:35:37,028: radical.pilot       : MainProcess                     : MainThread     : INFO    :                      tid: MainThread
2015-10-02 14:35:37,028: radical.pilot       : MainProcess                     : MainThread     : INFO    : radical.pilot        version: v0.36.RC1-28-g944275b@fix-issue_780

@marksantcroos
Copy link
Contributor

raise RuntimeError('LRMS has no nodes left to run units')

This was discussed in yesterdays meeting. You need to allocate enough nodes to allow for an agent node. On ARCHER this means at least 48 cores. (I think you now have 24)

@vivek-bala
Copy link
Contributor Author

Ok, I guess then we can't use the short queue (24 core limit). I'll try again on the standard one.

@marksantcroos
Copy link
Contributor

Per http://www.archer.ac.uk/documentation/user-guide/batch.php#sec-5.13

Jobs can range from 1-8 nodes (24-192 cores) and can have a maximum walltime of 20 minutes. The queue is only enabled between the hours of 0900-1700 UK time, Mon-Fri.

@vivek-bala
Copy link
Contributor Author

When I use the short queue with 48 cores, I got :

agent_0.out:

---------------------------------------------------------------------

PYTHONPATH: ['/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/bin', '/work/y07/y07/cse/python/modules/cython/0.21.1/lib/python2.7/site-packages/Cython-0.21.1-py2.7-linux-x86_64.egg', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/setuptools-0.6c11-py2.7.egg', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pip-1.3-py2.7.egg', '/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/lib/python2.7/site-packages', '/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000', '/work/y07/y07/cse/pycairo/1.10.0/lib/python2.7/site-packages', '/work/y07/y07/cse/pygobject/2.21.3/lib/python2.7/site-packages', '/work/y07/y07/cse/pygtk/2.24.0/lib/python2.7/site-packages/gtk-2.0', '/work/y07/y07/cse/yaml/pyyaml/3.11/lib/python2.7/site-packages', '/work/y07/y07/cse/python/modules/cython/0.21.1/lib/python2.7/site-packages', '/work/y07/y07/cse/mpi4py/1.3.1/lib/python2.7/site-packages', '/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py', '/usr/local/packages/cse/bolt/0.6/modules', '/work/y07/y07/cse/pygobject/2.21.3/lib/python2.7/site-packages/gtk-2.0', '/work/e290/shared/shared_pilot_ve_20150924/lib/python27.zip', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/plat-linux2', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-tk', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-old', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/lib-dynload', '/work/y07/y07/cse/python/2.7.6/lib/python2.7', '/work/y07/y07/cse/python/2.7.6/lib/python2.7/plat-linux2', '/work/y07/y07/cse/python/2.7.6/lib/python2.7/lib-tk', '/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages', '/opt/cray/sdb/1.0-1.0501.48084.4.48.ari/lib64/py']
python: 2.7.6 (default, Mar 10 2014, 14:13:45) 
[GCC 4.8.1 20130531 (Cray Inc.)]
utils : v0.36.RC1@devel : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/lib/python2.7/site-packages/radical/utils/__init__.pyc
saga  : v0.36.RC1@devel : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/lib/python2.7/site-packages/saga/__init__.pyc
pilot : v0.36.RC1-28-g944275b@fix-issue_780 : /work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/lib/python2.7/site-packages/radical/pilot/__init__.pyc
        type  : multicore
        gitid : $Id$

---------------------------------------------------------------------

startup agent agent_0 : /fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/agent_0.cfg
Agent config (/fs4/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/agent_0.cfg):
{'agent_launch_method': 'ORTE',
 'agent_layout': {'agent_0': {'bridges': ['agent_staging_input_queue',
                                          'agent_scheduling_queue',
                                          'agent_executing_queue',
                                          'agent_staging_output_queue',
                                          'agent_unschedule_pubsub',
                                          'agent_reschedule_pubsub',
                                          'agent_command_pubsub',
                                          'agent_state_pubsub'],
                              'components': {'AgentStagingInputComponent': 1,
                                             'AgentStagingOutputComponent': 1},
                              'pull_units': True,
                              'sub_agents': ['agent_1'],
                              'target': 'local'},
                  'agent_1': {'components': {'AgentExecutingComponent': 1,
                                             'AgentSchedulingComponent': 1},
                              'target': 'node'}},
 'agent_name': 'agent_0',
 'bulk_collection_time': 1.0,
 'cores': 48,
 'db_poll_sleeptime': 0.1,
 'debug': 40,
 'heartbeat_interval': 10,
 'lrms': 'PBSPRO',
 'max_io_loglength': 1024,
 'mongodb_url': 'mongodb://extasy:extasyproject@extasy-db.epcc.ed.ac.uk/radicalpilot',
 'mpi_launch_method': 'ORTE',
 'network_interface': 'ipogif0',
 'pilot_id': 'pilot.0000',
 'runtime': 10,
 'scheduler': 'CONTINUOUS',
 'session_id': 'rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005',
 'spawner': 'POPEN',
 'staging_area': 'staging_area',
 'staging_scheme': 'staging',
 'task_launch_method': 'ORTE'}


Error running agent: Not enough cores available (24) to satisfy allocation request (48).
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 6024, in bootstrap_3
    logger = log)
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 2335, in create
    return impl(cfg, logger)
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 2547, in __init__
    LRMS.__init__(self, cfg, logger)
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0005-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 2310, in __init__
    % (str(cores_avail), str(self.requested_cores)))

atexit

So I thought that 24 cores was the limit for "short". I get the same with the standard queue.

Error running agent: Not enough cores available (24) to satisfy allocation request (48).

Not sure why this is the case.

@marksantcroos
Copy link
Contributor

Oh man, what a rabbithole.

Educated guess, you don't have RADICAL_PILOT_PROFILE set?

Can you do:

setenv RADICAL_PILOT_PROFILE="ohman"

If it works then I know what the problem is.

@vivek-bala
Copy link
Contributor Author

Yup, now works with the profile variable.

There seems to be some extra callbacks in the client side of the type:

[Callback]: #      1
...................................................

This is not in the executing script and I am not sure where this is coming from. But seems to take a lot of time (just for this callback).

Client verbose:

$ python getting_started_remote.py epsrc.archer
running on epsrc.archer
create session rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0006         ok
session id: rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0006
create pilot manager                                                          ok
create pilot description                                                      ok
submit 1 pilot(s).                                                            ok
create unit manager                                                           ok
add 1 pilot(s)                                                                ok
submit 48 unit(s)................................................[Callback]: ComputePilot 'pilot.0000' state: Launching.
[Callback]: unit unit.000031 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000036 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000037 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000034 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000010 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000011 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000042 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000012 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000023 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000005 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000004 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000031 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000036 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000037 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000011 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000028 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000024 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000041 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000038 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000034 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000029 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000028 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000025 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000024 on pilot.0000: StagingInput.
[Callback]: unit unit.000041 on pilot.0000: StagingInput.
[Callback]: unit unit.000040 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000018 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000020 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000025 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000024 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000038 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000041 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000027 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000039 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000029 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000019 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000040 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000033 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000018 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000043 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000020 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000000 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000027 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000039 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000045 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000019 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000001 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000035 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000021 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000002 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000000 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000033 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000045 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000006 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000014 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000044 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000043 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000001 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000035 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000003 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000021 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000030 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000044 on pilot.0000: StagingInput.
[Callback]: unit unit.000003 on pilot.0000: StagingInput.
[Callback]: unit unit.000002 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000022 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000026 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000006 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000014 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000016 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000044 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000003 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000015 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000013 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000026 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000030 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000015 on pilot.0000: StagingInput.
[Callback]: unit unit.000016 on pilot.0000: StagingInput.
[Callback]: unit unit.000017 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000009 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000022 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000007 on pilot.0000: PendingInputStaging.
             ok
[Callback]: unit unit.000015 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000016 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000017 on pilot.0000: StagingInput.
[Callback]: unit unit.000008 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000032 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000009 on pilot.0000: StagingInput.
[Callback]: unit unit.000047 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000046 on pilot.0000: PendingInputStaging.
[Callback]: unit unit.000013 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000007 on pilot.0000: AgentStagingInputPending.
wait for 48 unit(s)[Callback]: unit unit.000017 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000008 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000032 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000009 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000047 on pilot.0000: AgentStagingInputPending.
[Callback]: unit unit.000046 on pilot.0000: AgentStagingInputPending.
[Callback]: ComputePilot 'pilot.0000' state: PendingActive.
[Callback]: ComputePilot 'pilot.0000' state: Active.
[Callback]: unit unit.000018 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000019 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000030 on pilot.0000: Allocating.
[Callback]: unit unit.000031 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000036 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000037 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000034 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000033 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000010 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000011 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000012 on pilot.0000: Executing.
[Callback]: unit unit.000039 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000014 on pilot.0000: Allocating.
[Callback]: unit unit.000015 on pilot.0000: Allocating.
[Callback]: unit unit.000016 on pilot.0000: Allocating.
[Callback]: unit unit.000017 on pilot.0000: Allocating.
[Callback]: unit unit.000008 on pilot.0000: Allocating.
[Callback]: unit unit.000043 on pilot.0000: Allocating.
[Callback]: unit unit.000028 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000001 on pilot.0000: Allocating.
[Callback]: unit unit.000032 on pilot.0000: Allocating.
[Callback]: unit unit.000035 on pilot.0000: Allocating.
[Callback]: unit unit.000009 on pilot.0000: Allocating.
[Callback]: unit unit.000003 on pilot.0000: Allocating.
[Callback]: unit unit.000002 on pilot.0000: Allocating.
[Callback]: unit unit.000038 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000047 on pilot.0000: Allocating.
[Callback]: unit unit.000046 on pilot.0000: Allocating.
[Callback]: unit unit.000045 on pilot.0000: Allocating.
[Callback]: unit unit.000044 on pilot.0000: Allocating.
[Callback]: unit unit.000029 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000042 on pilot.0000: Executing.
[Callback]: unit unit.000041 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000040 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000025 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000024 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000027 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000000 on pilot.0000: Allocating.
[Callback]: unit unit.000021 on pilot.0000: Allocating.
[Callback]: unit unit.000020 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000005 on pilot.0000: ExecutingPending.
[Callback]: unit unit.000004 on pilot.0000: Executing.
[Callback]: unit unit.000013 on pilot.0000: Allocating.
[Callback]: unit unit.000023 on pilot.0000: Executing.
[Callback]: unit unit.000022 on pilot.0000: Allocating.
[Callback]: unit unit.000007 on pilot.0000: Allocating.
[Callback]: unit unit.000026 on pilot.0000: Allocating.
[Callback]: unit unit.000006 on pilot.0000: Allocating.
[Callback]: unit unit.000031 on pilot.0000: Executing.
[Callback]: unit unit.000036 on pilot.0000: Executing.
[Callback]: unit unit.000037 on pilot.0000: Executing.
[Callback]: unit unit.000034 on pilot.0000: Executing.
[Callback]: unit unit.000010 on pilot.0000: Executing.
[Callback]: unit unit.000011 on pilot.0000: Executing.
[Callback]: unit unit.000028 on pilot.0000: Executing.
[Callback]: unit unit.000041 on pilot.0000: Executing.
[Callback]: unit unit.000005 on pilot.0000: Executing.
[Callback]: unit unit.000018 on pilot.0000: Executing.
[Callback]: unit unit.000019 on pilot.0000: Executing.
[Callback]: unit unit.000033 on pilot.0000: Executing.
[Callback]: unit unit.000039 on pilot.0000: Executing.
[Callback]: unit unit.000038 on pilot.0000: Executing.
[Callback]: unit unit.000029 on pilot.0000: Executing.
[Callback]: unit unit.000040 on pilot.0000: Executing.
[Callback]: unit unit.000025 on pilot.0000: Executing.
[Callback]: unit unit.000024 on pilot.0000: Executing.
[Callback]: unit unit.000027 on pilot.0000: Executing.
[Callback]: unit unit.000020 on pilot.0000: Executing.
[Callback]: unit unit.000042 on pilot.0000: AgentStagingOutputPending.
[Callback]: unit unit.000045 on pilot.0000: Executing.
[Callback]: unit unit.000042 on pilot.0000: Done.
[Callback]: #      1
...................................................[Callback]: unit unit.000004 on pilot.0000: AgentStagingOutputPending.
.[Callback]: unit unit.000000 on pilot.0000: Executing.
[Callback]: unit unit.000004 on pilot.0000: PendingOutputStaging.
.[Callback]: unit unit.000004 on pilot.0000: Done.
[Callback]: #      2
........
        ............................[Callback]: unit unit.000043 on pilot.0000: Executing.
[Callback]: unit unit.000023 on pilot.0000: AgentStagingOutput.
........[Callback]: unit unit.000012 on pilot.0000: AgentStagingOutput.
[Callback]: unit unit.000001 on pilot.0000: Executing.
[Callback]: unit unit.000023 on pilot.0000: Done.
[Callback]: #      3
............[Callback]: unit unit.000012 on pilot.0000: Done.
[Callback]: #      4
........................
        ................[Callback]: unit unit.000005 on pilot.0000: AgentStagingOutputPending.
....[Callback]: unit unit.000035 on pilot.0000: Executing.
[Callback]: unit unit.000005 on pilot.0000: PendingOutputStaging.
....[Callback]: unit unit.000005 on pilot.0000: Done.
[Callback]: #      5
................................................
        .........................................................[Callback]: unit unit.000010 on pilot.0000: AgentStagingOutputPending.
...............
        .....[Callback]: unit unit.000010 on pilot.0000: Done.
[Callback]: #      6
[Callback]: unit unit.000021 on pilot.0000: Executing.
...................................................................
        ........................................................................
        .....[Callback]: unit unit.000011 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000006 on pilot.0000: Executing.
......[Callback]: unit unit.000011 on pilot.0000: Done.
[Callback]: #      7
.............................................................
        ........................................................................
        ........................................................................
        ......................................................[Callback]: unit unit.000002 on pilot.0000: Executing.
[Callback]: unit unit.000027 on pilot.0000: PendingOutputStaging.
.......[Callback]: unit unit.000027 on pilot.0000: Done.
[Callback]: #      8
........[Callback]: unit unit.000035 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000044 on pilot.0000: Executing.
...
        .....[Callback]: unit unit.000035 on pilot.0000: Done.
[Callback]: #      9
....................................[Callback]: unit unit.000043 on pilot.0000: AgentStagingOutputPending.
...........................[Callback]: unit unit.000003 on pilot.0000: Executing.
[Callback]: unit unit.000043 on pilot.0000: Done.
[Callback]: #     10
....
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ..................[Callback]: unit unit.000020 on pilot.0000: AgentStagingOutputPending.
..........[Callback]: unit unit.000014 on pilot.0000: Executing.
[Callback]: unit unit.000020 on pilot.0000: PendingOutputStaging.
..........[Callback]: unit unit.000020 on pilot.0000: Done.
[Callback]: #     11
..................................
        ..........[Callback]: unit unit.000019 on pilot.0000: AgentStagingOutputPending.
...........[Callback]: unit unit.000019 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000026 on pilot.0000: Executing.
............................................[Callback]: unit unit.000019 on pilot.0000: Done.
[Callback]: #     12
.......
        ........................................................................
        ........................................................................
        .....[Callback]: unit unit.000033 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000030 on pilot.0000: Executing.
................................................[Callback]: unit unit.000033 on pilot.0000: Done.
[Callback]: #     13
...................
        .......[Callback]: unit unit.000000 on pilot.0000: AgentStagingOutputPending.
.............[Callback]: unit unit.000000 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000022 on pilot.0000: Executing.
....................................................
        [Callback]: unit unit.000015 on pilot.0000: Executing.
[Callback]: unit unit.000016 on pilot.0000: Executing.
[Callback]: unit unit.000028 on pilot.0000: AgentStagingOutput.
[Callback]: unit unit.000000 on pilot.0000: Done.
[Callback]: #     14
[Callback]: unit unit.000006 on pilot.0000: AgentStagingOutputPending.
.........................................................[Callback]: unit unit.000028 on pilot.0000: Done.
.[Callback]: #     15
.............[Callback]: unit unit.000006 on pilot.0000: Done.
[Callback]: #     16
.
        ........................................................................
        .......................................[Callback]: unit unit.000045 on pilot.0000: AgentStagingOutputPending.
.................................
        ...............................[Callback]: unit unit.000045 on pilot.0000: Done.
[Callback]: #     17
.................[Callback]: unit unit.000013 on pilot.0000: Executing.
........................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        .....................................[Callback]: unit unit.000037 on pilot.0000: AgentStagingOutput.
[Callback]: unit unit.000007 on pilot.0000: Executing.
...................................
        .................................[Callback]: unit unit.000037 on pilot.0000: PendingOutputStaging.
.................[Callback]: unit unit.000037 on pilot.0000: Done.
[Callback]: #     18
......................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ................................[Callback]: unit unit.000014 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000017 on pilot.0000: Executing.
........................................
        ................................[Callback]: unit unit.000014 on pilot.0000: Done.
[Callback]: #     19
......................................[Callback]: unit unit.000036 on pilot.0000: AgentStagingOutputPending.
..
        ........................................................................
        ..[Callback]: unit unit.000036 on pilot.0000: Done.
[Callback]: #     20
[Callback]: unit unit.000009 on pilot.0000: Executing.
......................................................................
        ........................................................................
        ..........................................................[Callback]: unit unit.000032 on pilot.0000: Executing.
[Callback]: unit unit.000021 on pilot.0000: Done.
[Callback]: #     21
..............
        ........................................................................
        ...................[Callback]: unit unit.000030 on pilot.0000: StagingOutput.
[Callback]: unit unit.000015 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000047 on pilot.0000: Executing.
[Callback]: unit unit.000008 on pilot.0000: Executing.
.....................[Callback]: unit unit.000030 on pilot.0000: Done.
[Callback]: #     22
[Callback]: unit unit.000015 on pilot.0000: Done.
[Callback]: #     23
................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ....................................................................[Callback]: unit unit.000002 on pilot.0000: AgentStagingOutputPending.
....
        ........................................................................
        ................[Callback]: unit unit.000002 on pilot.0000: AgentStagingOutput.
[Callback]: unit unit.000046 on pilot.0000: Executing.
........................................................
        ...........................................................[Callback]: unit unit.000002 on pilot.0000: PendingOutputStaging.
.............
        ..........[Callback]: unit unit.000002 on pilot.0000: Done.
[Callback]: #     24
..............................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ..................................[Callback]: unit unit.000047 on pilot.0000: AgentStagingOutputPending.
......................................
        ..........[Callback]: unit unit.000047 on pilot.0000: Done.
[Callback]: #     25
[Callback]: unit unit.000022 on pilot.0000: AgentStagingOutputPending.
..............................................................
        ......................................[Callback]: unit unit.000022 on pilot.0000: Done.
[Callback]: #     26
..................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ..........................................[Callback]: unit unit.000013 on pilot.0000: AgentStagingOutputPending.
..............................
        ........................................................................
        ..[Callback]: unit unit.000013 on pilot.0000: Done.
[Callback]: #     27
......................................................................
        ........................................................................
        ........................................................................
        .............................................................[Callback]: unit unit.000031 on pilot.0000: PendingOutputStaging.
.[Callback]: unit unit.000017 on pilot.0000: PendingOutputStaging.
..........
        ......................................[Callback]: unit unit.000031 on pilot.0000: Done.
[Callback]: #     28
[Callback]: unit unit.000017 on pilot.0000: Done.
[Callback]: #     29
..................................
        ........................................................................
        ..........[Callback]: unit unit.000008 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000026 on pilot.0000: PendingOutputStaging.
..............................................................
        ........................................................................
        ...........[Callback]: unit unit.000008 on pilot.0000: Done.
[Callback]: #     30
[Callback]: unit unit.000026 on pilot.0000: Done.
[Callback]: #     31
.............................................................
        ........................................................................
        ........................................................................
        ............[Callback]: unit unit.000046 on pilot.0000: PendingOutputStaging.
...............................[Callback]: unit unit.000046 on pilot.0000: Done.
[Callback]: #     32
.............................
        ........................................................................
        ...........................[Callback]: unit unit.000034 on pilot.0000: AgentStagingOutputPending.
.............................................
        ...................[Callback]: unit unit.000034 on pilot.0000: AgentStagingOutput.
.....................................................
        ........................................................................
        ...[Callback]: unit unit.000034 on pilot.0000: Done.
[Callback]: #     33
.....................................................................
        ........................................................................
        ........................................................................
        ..................[Callback]: unit unit.000024 on pilot.0000: AgentStagingOutputPending.
.................................[Callback]: unit unit.000009 on pilot.0000: Done.
[Callback]: #     34
.....................
        .............[Callback]: unit unit.000024 on pilot.0000: Done.
[Callback]: #     35
...........................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ...[Callback]: unit unit.000038 on pilot.0000: AgentStagingOutputPending.
...................................[Callback]: unit unit.000038 on pilot.0000: PendingOutputStaging.
..................................
        .[Callback]: unit unit.000038 on pilot.0000: StagingOutput.
......................................................................[Callback]: unit unit.000038 on pilot.0000: Done.
[Callback]: #     36
.
        ...................................[Callback]: unit unit.000041 on pilot.0000: AgentStagingOutputPending.
.....................................
        ........................................................................
        ...................................[Callback]: unit unit.000041 on pilot.0000: PendingOutputStaging.
....................................[Callback]: unit unit.000041 on pilot.0000: Done.
[Callback]: #     37
.
        ........................................................................
        .[Callback]: unit unit.000025 on pilot.0000: AgentStagingOutput.
.....................................[Callback]: unit unit.000025 on pilot.0000: PendingOutputStaging.
..................................
        ...[Callback]: unit unit.000025 on pilot.0000: Done.
[Callback]: #     38
.....................................................................
        ........................................................................
        ...........[Callback]: unit unit.000001 on pilot.0000: AgentStagingOutputPending.
.............................................................
        ........................................................................
        .........................................................[Callback]: unit unit.000001 on pilot.0000: Done.
[Callback]: #     39
...............
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        .........[Callback]: unit unit.000007 on pilot.0000: AgentStagingOutputPending.
.......................................[Callback]: unit unit.000007 on pilot.0000: Done.
[Callback]: #     40
........................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................[Callback]: unit unit.000044 on pilot.0000: AgentStagingOutputPending.
................................
        ........................................................................
        ........................................................[Callback]: unit unit.000044 on pilot.0000: Done.
[Callback]: #     41
................
        .........................[Callback]: unit unit.000029 on pilot.0000: AgentStagingOutputPending.
...............................................
        ........................................................................
        .............................................[Callback]: unit unit.000029 on pilot.0000: Done.
[Callback]: #     42
...........................
        ........................................................................
        ........................................................................
        .......................................[Callback]: unit unit.000003 on pilot.0000: PendingOutputStaging.
.................................
        ...................................................[Callback]: unit unit.000003 on pilot.0000: Done.
[Callback]: #     43
.....................
        ........................................................................
        ........................................................................
        .......[Callback]: unit unit.000018 on pilot.0000: PendingOutputStaging.
...........................................[Callback]: unit unit.000018 on pilot.0000: Done.
[Callback]: #     44
......................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        ..............................[Callback]: unit unit.000032 on pilot.0000: AgentStagingOutputPending.
..........................................
        ..[Callback]: unit unit.000032 on pilot.0000: PendingOutputStaging.
......................................................................
        ..................[Callback]: unit unit.000032 on pilot.0000: Done.
[Callback]: #     45
.............................................[Callback]: unit unit.000016 on pilot.0000: AgentStagingOutputPending.
.........
        ........................................................................
        .........[Callback]: unit unit.000016 on pilot.0000: PendingOutputStaging.
...............................................................
        ........................................................................
        .............................................[Callback]: unit unit.000016 on pilot.0000: Done.
[Callback]: #     46
...........................
        ........................................................................
        ........................................................................
        ........................................................................
        ........................................................................
        .....................................................[Callback]: unit unit.000039 on pilot.0000: PendingOutputStaging.
[Callback]: unit unit.000040 on pilot.0000: StagingOutput.
...................
        ...........................[Callback]: unit unit.000039 on pilot.0000: Done.
[Callback]: #     47
[Callback]: unit unit.000040 on pilot.0000: Done.
[Callback]: #     48
.............................................
        ...                                                                   ok
* Task unit.000000 state Done, exit code: 0, started: None, finished: None
* Task unit.000001 state Done, exit code: 0, started: None, finished: None
* Task unit.000002 state Done, exit code: 0, started: None, finished: None
* Task unit.000003 state Done, exit code: 0, started: None, finished: None
* Task unit.000004 state Done, exit code: 0, started: None, finished: None
* Task unit.000005 state Done, exit code: 0, started: None, finished: None
* Task unit.000006 state Done, exit code: 0, started: None, finished: None
* Task unit.000007 state Done, exit code: 0, started: None, finished: None
* Task unit.000008 state Done, exit code: 0, started: None, finished: None
* Task unit.000009 state Done, exit code: 0, started: None, finished: None
* Task unit.000010 state Done, exit code: 0, started: None, finished: None
* Task unit.000011 state Done, exit code: 0, started: None, finished: None
* Task unit.000012 state Done, exit code: 0, started: None, finished: None
* Task unit.000013 state Done, exit code: 0, started: None, finished: None
* Task unit.000014 state Done, exit code: 0, started: None, finished: None
* Task unit.000015 state Done, exit code: 0, started: None, finished: None
* Task unit.000016 state Done, exit code: 0, started: None, finished: None
* Task unit.000017 state Done, exit code: 0, started: None, finished: None
* Task unit.000018 state Done, exit code: 0, started: None, finished: None
* Task unit.000019 state Done, exit code: 0, started: None, finished: None
* Task unit.000020 state Done, exit code: 0, started: None, finished: None
* Task unit.000021 state Done, exit code: 0, started: None, finished: None
* Task unit.000022 state Done, exit code: 0, started: None, finished: None
* Task unit.000023 state Done, exit code: 0, started: None, finished: None
* Task unit.000024 state Done, exit code: 0, started: None, finished: None
* Task unit.000025 state Done, exit code: 0, started: None, finished: None
* Task unit.000026 state Done, exit code: 0, started: None, finished: None
* Task unit.000027 state Done, exit code: 0, started: None, finished: None
* Task unit.000028 state Done, exit code: 0, started: None, finished: None
* Task unit.000029 state Done, exit code: 0, started: None, finished: None
* Task unit.000030 state Done, exit code: 0, started: None, finished: None
* Task unit.000031 state Done, exit code: 0, started: None, finished: None
* Task unit.000032 state Done, exit code: 0, started: None, finished: None
* Task unit.000033 state Done, exit code: 0, started: None, finished: None
* Task unit.000034 state Done, exit code: 0, started: None, finished: None
* Task unit.000035 state Done, exit code: 0, started: None, finished: None
* Task unit.000036 state Done, exit code: 0, started: None, finished: None
* Task unit.000037 state Done, exit code: 0, started: None, finished: None
* Task unit.000038 state Done, exit code: 0, started: None, finished: None
* Task unit.000039 state Done, exit code: 0, started: None, finished: None
* Task unit.000040 state Done, exit code: 0, started: None, finished: None
* Task unit.000041 state Done, exit code: 0, started: None, finished: None
* Task unit.000042 state Done, exit code: 0, started: None, finished: None
* Task unit.000043 state Done, exit code: 0, started: None, finished: None
* Task unit.000044 state Done, exit code: 0, started: None, finished: None
* Task unit.000045 state Done, exit code: 0, started: None, finished: None
* Task unit.000046 state Done, exit code: 0, started: None, finished: None
* Task unit.000047 state Done, exit code: 0, started: None, finished: None
closing session
closing session rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0006[Callback]: ComputePilot 'pilot.0000' state: Canceled.
.       ok

@marksantcroos
Copy link
Contributor

Yup, now works with the profile variable.

Ok, thanks. I will take care of that.

There seems to be some extra callbacks in the client side of the type:

[Callback]: # 1
...................................................

The dots are not callbacks, but are coming from the demo facility. That is fixed in RU feature/demo.

@vivek-bala
Copy link
Contributor Author

The dots are not callbacks, but are coming from the demo facility. That is fixed in RU feature/demo.

Okk. The units took quite a while to complete (sleep 0). It seemed that these dots were produced as part of some loop.

@marksantcroos
Copy link
Contributor

They come from the unit_manager.wait(), one dot per completed CU.

@marksantcroos
Copy link
Contributor

Ok, you should now be able to run without RADICAL_PILOT_PROFILE.

@marksantcroos
Copy link
Contributor

I've created a PR for the fixes, Andre can look at it and then roll a new RC.

@marksantcroos
Copy link
Contributor

ps. You have 30 min left to get into the short queue until you'll have to wait until Monday! ;)

@vivek-bala
Copy link
Contributor Author

Pulled and tried it again. I now get the following error:

agent_0.err:

t br2015-10-02 16:34:57,723: radical.saga        : MainProcess                     : MainThread     : INFO    : python.interpreter   version: 2.7.6 (default, Mar 10 2014, 14:13:45) [GCC 4.8.1 20130531 (Cray Inc.)]
2015-10-02 16:34:57,723: radical.saga        : MainProcess                     : MainThread     : INFO    :                      pid: 10613
2015-10-02 16:34:57,724: radical.saga        : MainProcess                     : MainThread     : INFO    :                      tid: MainThread
2015-10-02 16:34:57,724: radical.saga        : MainProcess                     : MainThread     : INFO    : radical.saga         version: v0.36.RC1@devel
2015-10-02 16:34:57,734: radical.pilot       : MainProcess                     : MainThread     : INFO    : python.interpreter   version: 2.7.6 (default, Mar 10 2014, 14:13:45) [GCC 4.8.1 20130531 (Cray Inc.)]
2015-10-02 16:34:57,734: radical.pilot       : MainProcess                     : MainThread     : INFO    :                      pid: 10613
2015-10-02 16:34:57,734: radical.pilot       : MainProcess                     : MainThread     : INFO    :                      tid: MainThread
2015-10-02 16:34:57,734: radical.pilot       : MainProcess                     : MainThread     : INFO    : radical.pilot        version: v0.36.RC1-32-g420c9c3@fix-issue_780
Traceback (most recent call last):
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0007-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 6122, in <module>
    bootstrap_3()
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0007-pilot.0000/rp_install/bin/radical-pilot-agent-multicore.py", line 5978, in bootstrap_3
    _, mongo_db, _, _, _  = ru.mongodb_connect(cfg['mongodb_url'])
  File "/work/e290/e290/vb224/radical.pilot.sandbox/rp.session.vivek-Lenovo-IdeaPad-Y480.vivek.016710.0007-pilot.0000/rp_install/lib/python2.7/site-packages/radical/utils/misc.py", line 95, in mongodb_connect
    db.authenticate (user, pwd)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/database.py", line 978, in authenticate
    self.connection._cache_credentials(self.name, credentials)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/mongo_client.py", line 467, in _cache_credentials
    auth.authenticate(credentials, sock_info, self.__simple_command)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/auth.py", line 475, in authenticate
    auth_func(credentials[1:], sock_info, cmd_func)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/auth.py", line 452, in _authenticate_default
    return _authenticate_mongo_cr(credentials, sock_info, cmd_func)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/auth.py", line 445, in _authenticate_mongo_cr
    cmd_func(sock_info, source, query)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/mongo_client.py", line 703, in __simple_command
    helpers._check_command_response(response, None, msg)
  File "/work/e290/shared/shared_pilot_ve_20150924/lib/python2.7/site-packages/pymongo/helpers.py", line 182, in _check_command_response
    raise OperationFailure(msg % errmsg, code, response)
pymongo.errors.OperationFailure: command SON([('authenticate', 1), ('user', u'extasy'), ('nonce', u'ad9df8b925cc26ca'), ('key', u'1e97392625796b90e1cf42f8711ebab1')]) on namespace radicalpilot.$cmd failed: auth failed

@vivek-bala
Copy link
Contributor Author

Pilot goes from PendingActive to Failed.

@marksantcroos
Copy link
Contributor

Its impressive the number and variety of problems that you run into ...

This seems unrelated though. Are the credentials (still) valid and/or can you use the same credentials to run on a different resource?

@vivek-bala
Copy link
Contributor Author

I know. At every run, I am thinking "now what's it going to be !" :)

credentials of the mongodb right ? They seem to be ok. I ran the getting_started_local example with the same url.

@marksantcroos
Copy link
Contributor

Im not really suspecting the agent here, especially given that you got past this point multiple times.

@marksantcroos
Copy link
Contributor

The units took quite a while to complete (sleep 0).

FYI: ARCHER and RP are not a good match currently. Probably filesystem related. To be continued.

@vivek-bala
Copy link
Contributor Author

Ok, this time the execution was successful. No changes to the script in all 3 trials.

@andre-merzky
Copy link
Member

This will be a fun tutorial :/

@marksantcroos
Copy link
Contributor

This will be a fun tutorial :/

Do we have any data on the reliability of "our" MongoDB vs the Extasy one?

@andre-merzky
Copy link
Member

No - but I somehow doubt that this was an issue. I share your sentiments on the Archer FS though -- this is incredibly slow at times...

@vivek-bala
Copy link
Contributor Author

Do we have any data on the reliability of "our" MongoDB vs the Extasy one?

I haven't had an issues with the extasy mongodb yet. All my experiments and tests (even some rp tests) are using the extasy url. Not sure if I have any quantitative data for that.

@marksantcroos
Copy link
Contributor

Such intuition is good enough for now.

@marksantcroos
Copy link
Contributor

Can we close this ticket thus?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants