Please take our Survey
logo       

Choosing A Webhost:
A web hosting service is a type of Internet hosting service that allows individuals and organizations to provide their own website accessible via the World Wide Web. Web hosts are companies that provide space on a server they own for use by their clients as well as providing Internet connectivity, typically in a data center. Web hosts can also provide data center space and connectivity to the Internet for servers they do not own to be located in their data center, called colocation. more...

torque launchs more jobs than number of virtual proc per node: msg#00078

clustering.torque.user

Subject: torque launchs more jobs than number of virtual proc per node

Hi all,

I have installed torque 2.1.0p0 on 20 dual socket dual-core nodes, and using
pbs_sched. in my nodes files i have specified:

node1 np=4
node2 np=4
.
.
node20 np=4

All my jobs are single process jobs that needs to run on one core/virtual
processor, and tend to finish about the same time. I can't get torque to stop
launching just 4 jobs per node. If my queue is not full, this seems to work;
but if I have, say, 300 jobs in the queue, with majority of the jobs queued up
behind the first "wave" of jobs, some of the jobs from the 2nd "wave" would
launch as many as 8 jobs on a single node, therefore substantially slowing down
all the jobs on this node. When I try to set $max_load in the mom_priv/config
(tried to set at 3.5), the nodes gets the job-exclusive,busy state, but would
still continue to take on jobs. It seems like, once there are jobs queued up,
torque no longer check each node's state before launching more jobs to it...

I've read posts similar (not exactly same behavior) to this, and a recompile of
torque without optimization helped. I just ran ./configure and make - where
should I take out the optimization?

Would using the maui scheduler (instead of pbs_sched) help?

any suggestion from the list would be helpful. thanks in advance!

adrian


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

Recently Viewed:
version-control...    qnx.openqnx.dev...    redhat.rhn.user...    ietf.openpgp/20...    mail.mutt.user/...    web.microformat...    java.sync4j.use...    education.ezpro...    user-groups.blu...    solaris.manager...    org.fitug.debat...    technology.erps...    politics.activi...    linux.redhat.fe...    bug-tracking.ma...    xfce.user/2004-...    hams/2004-11/ms...    kde.users.pim/2...    culture.cooking...    freebsd.devel.x...    gnu.m4.adhoc/20...    ngpt.user/2002-...    apple.fink.deve...   
Home | advertise | OSDir is an inevitable website. super tiny logo

Free Magazines

Cisco News
Receive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business.
subscribe

Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field.
subscribe

The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business.
subscribe

Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company.
subscribe

Total Telecom Total Telecom is "The Economist of the communications industry".
subscribe

Navigation