Dear all,
does anybody know what could have caused the 'Connection refused'
messages given in the
server_logs section below. Any job submitted to the batch queue stucks
with status 'Q' instead 'R'
Please note that
- pbs_server, pbs_sched and pbs_mom are installed on the same computer
(sgsl001)
- there is no firewall running on the system (OS = SLES 9)
Section of the daily server_log:
...
09/19/2006 15:18:43;0086;PBS_Server;Svr;PBS_Server;Recovered queue batch
09/19/2006 15:18:43;0002;PBS_Server;Svr;PBS_Server;Expected 1, recovered
1 queues
09/19/2006 15:18:43;0002;PBS_Server;Svr;PBS_Server;Expected 0, recovered
0 jobs
09/19/2006 15:18:43;0006;PBS_Server;Svr;PBS_Server;Using ports
Server:15001 Scheduler:15004 MOM:15002
09/19/2006 15:18:43;0002;PBS_Server;Svr;PBS_Server;Server Ready, pid =
4129, loglevel=0
09/19/2006 15:18:43;0004;PBS_Server;Svr;WARNING;ALERT: unable to contact
node sgsl001
09/19/2006 15:18:43;0001;PBS_Server;Svr;PBS_Server;Connection refused
(111) in contact_sched, Could not contact Scheduler - port 15004
09/19/2006 15:18:48;0040;PBS_Server;Req;ping_nodes;starting
09/19/2006 15:18:48;0040;PBS_Server;Req;ping_nodes;ping attempting to
contact 1 nodes
09/19/2006 15:18:48;0040;PBS_Server;Req;update_node_state;adjusting
state for node sgsl001 - state=514, newstate=2
09/19/2006 15:18:48;0040;PBS_Server;Req;ping_nodes;sending ping to
sgsl001 (new stream 0)
09/19/2006 15:18:48;0040;PBS_Server;Req;ping_nodes;successful ping to
node sgsl001 (stream 0)
09/19/2006 15:18:49;0040;PBS_Server;Req;do_rpp;rpp request received on
stream 0
09/19/2006 15:18:49;0040;PBS_Server;Req;do_rpp;corrupt rpp request
received on stream 0 (invalid protocol)
09/19/2006 15:18:49;0001;PBS_Server;Svr;PBS_Server;stream_eof,
connection to sgsl001 is bad, remote service may be down, message may be
corrupt, or connection may have been dropped remotely (End of File).
setting node state to down
...
Thanks in advance
G. Bachler
--
-----
Dr. Günter Bachler
Technical Expert System Administration CAE
Product Development IT
Corporate Functions / Information Technology
e-mail: guenter.bachler@xxxxxxx
Phone: +43-316-787-3425
Fax: +43-316-787-1847
AVL LIST GMBH
A-8020 Graz, Hans-List-Platz 1
http://www.avl.com
|