
Re: More than one job per CPU: msg#00070

clustering.torque.user

Subject: Re: More than one job per CPU

On Tue, Sep 11, 2007 at 04:16:54PM -0500, Jeremy Mann alleged:
> I've been searching the mail archive most of the day and I haven't found
> anything regarding what our problem, well we call it a problem, is.
>
> We have a program that we run on our cluster a few hundred iterations at a
> time. We nice the program 19 so it won't interfere with any other program.
> So far, we've been doing this manually. Now we want to incorporate it into
> PBS/Maui. The problem we are coming into is even though we submit it with
> -l nice=19, PBS still says that compute node is state=busy and all other
> jobs stay in the queue. We run the program niced 19 because it usually
> runs for about 5-6 days on our 20 nodes, so we need the ability to run
> other things during this time.
>
> What I've been trying to accomplish for a few days now is to somehow make
> PBS submit a job to a compute node that has this niced 19 job running on
> it. I've tried everything I can think of and what I've found in the
> manpages.
>
> The changes I've tried are:
>
> In maui.cfg I've added:
> NODEACCESSPOLICY SHARED
> NODEALLOCATIONPOLICY MINRESOURCE
> NODECFG[DEFAULT] PRIORITYF=JOBCOUNT
> NODEMAXLOAD 4.00
>
> USERCFG[tigre] QDEF=tigre
> USERCFG[abarca] QDEF=gasbor
> QOSCFG[gasbor] PRIORITY=-100 FLAGS=PREEMPTEE
> QOSCFG[tigre] PRIORITY=100 FLAGS=PREEMPTOR:IGNMAXJOB
>
> My idea here was to create two QoS's, where the gasbor job (the niced 19
> job) would preempt in favor of the tigre jobs. This however has never
> worked.
>
> I took one compute node offline and edited its mom_priv/config file and
> added '$ideal_load 4.0'. My thinking here was that by telling PBS this node
> will run at a 4.0 load, it will execute more jobs on this node. Again,
> this never worked either.

If the node is "busy" in torque, then maui won't run a job on it. End of story.

So you want to keep the node from being busy with the $ideal_load and $max_load
options. You mentioned that you tried the former, but did you also set the
latter?
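
One way to picture how the two options interact, going by the OpenPBS-derived
description of $max_load as a high-water mark and $ideal_load as a low-water
mark, is the small Python sketch below. It is only a model of that behavior,
not pbs_mom's actual code; the function name next_busy_state and the
thresholds 4.5 and 5.0 are made up for illustration.

```python
def next_busy_state(busy, load_avg, ideal_load, max_load):
    """Return the node's new 'busy' flag for the given load average.

    Models a high/low water mark (hysteresis); an assumption based on the
    documented meaning of $max_load and $ideal_load, not on pbs_mom source.
    """
    if load_avg > max_load:
        return True   # load went above the high-water mark: report busy
    if load_avg < ideal_load:
        return False  # load fell below the low-water mark: report free
    return busy       # between the two marks: keep the previous state


# Hypothetical load samples on a node where the niced job alone sits near 4.0.
state = False
for load in (0.5, 4.0, 4.6, 5.2, 4.6, 3.8):
    state = next_busy_state(state, load, ideal_load=4.5, max_load=5.0)
    print(f"load={load:.1f} busy={state}")
```

Under that model, and assuming the niced job by itself produces a load of
about 4.0, mom_priv/config lines such as '$ideal_load 4.5' and '$max_load 5.0'
would leave the node reported as free while only the niced job is running.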


_______________________________________________
torqueusers mailing list
torqueusers@xxxxxxxxxxxxxxxx
http://www.supercluster.org/mailman/listinfo/torqueusers