logo       
Google Custom Search
    AddThis Social Bookmark Button

Re: estrange messages from mpich2 (tight integrated): msg#00043

Subject: Re: estrange messages from mpich2 (tight integrated)
Am 05.09.2007 um 22:53 schrieb Baudilio Tejerina:

Hi ALL:

I have just installed MPICH2 (1.0.5) in a cluster (ROCKS) and I'd like to have all the parallel jobs under GE control. For that, I've followed Reuti's "demonless" procedure described in http://gridengine.sunsource.net/howto/mpich2-integration/mpich2- integration.html

The script bellow is the one I used for testing the installation and next is the contents of the error file.

The weird messages that i don't know how to interpret nor where they may come from are those with permission denied.

Does anyone have any explanation for this?

Assuming the path to main.x is in your default $PATH, can you read this file if you login to a node? Is the mountpoint not mounted with "noexec" on the node?

-- Reuti


Thank you
Baudilio




#!/bin/sh
#$ -S /bin/sh
#$ -cwd
#$ -j y
#$ -pe mpich2 4
#$ -o TEST_MPICH2.err

#export MPIEXEC_RSH=rsh
#export PATH=/home/reuti/local/mpich2_smpd/bin:$PATH

export MPIEXEC_RSH=rsh
#Make sure we use the rsh wrapper provided by the PE. A copy of it will be placed on $TMPDIR:
export PATH=$TMPDIR:$PATH
echo BTA: my RSH is: `which rsh`


export PATH=/opt/local/mpich2-1.0.5/bin:$PATH
echo BTA: my MPIEXEC is: `which mpiexec`
mpiexec -rsh -nopm -np $NSLOTS -machinefile $TMPDIR/machines main.x

exit 0









THIS IS THE CONTENTS of the error FILE:







-catch_rsh /opt/gridengine/default/spool/compute-0-1/active_jobs/ 565.1/pe_hostfile
compute-0-1
compute-0-1
compute-0-0
compute-0-0
BTA: my RSH is: /state/partition1/565.1.all.q/rsh
BTA: my MPIEXEC is: /opt/local/mpich2-1.0.5/bin/mpiexec
env: main.x: Permission deniedenv: main.x: Permission denied
env: main.x: Permission denied
env: main.x: Permission denied/opt/gridengine/bin/lx26-x86/qrsh - inherit compute-0-1 env PMI_RANK=0 PMI_SIZE=4 PM\ I_KVS=19B130D14D0B84936F8CE02422E66098 PMI_ROOT_HOST=compute-0-1.local PMI_ROOT_PORT=33703 PMI_ROOT_LOCAL=0 PMI_A\
PPNUM=0 main.x
/opt/gridengine/bin/lx26-x86/qrsh -inherit compute-0-1 env PMI_RANK=1 PMI_SIZE=4 PMI_KVS=19B130D14D0B84936F8CE024\ 22E66098 PMI_ROOT_HOST=compute-0-1.local PMI_ROOT_PORT=33703 PMI_ROOT_LOCAL=0 PMI_APPNUM=0 main.x /opt/gridengine/bin/lx26-x86/qrsh -inherit compute-0-0 env PMI_RANK=2 PMI_SIZE=4 PMI_KVS=19B130D14D0B84936F8CE024\ 22E66098 PMI_ROOT_HOST=compute-0-1.local PMI_ROOT_PORT=33703 PMI_ROOT_LOCAL=0 PMI_APPNUM=0 main.x /opt/gridengine/bin/lx26-x86/qrsh -inherit compute-0-0 env PMI_RANK=3 PMI_SIZE=4 PMI_KVS=19B130D14D0B84936F8CE024\ 22E66098 PMI_ROOT_HOST=compute-0-1.local PMI_ROOT_PORT=33703 PMI_ROOT_LOCAL=0 PMI_APPNUM=0 main.x


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@xxxxxxxxxxxxxxxxxxxxxxxx
For additional commands, e-mail: users-help@xxxxxxxxxxxxxxxxxxxxxxxx



Try Searching:
servers, voip, java, networking, microsoft ...
<Prev in Thread] Current Thread [Next in Thread>