RE: question on memory allocation: msg#00205

Subject: RE: question on memory allocation
There is a Maui parameter that controls which resource figures the scheduler
looks at:

NODEAVAILABILITYPOLICY

It can be set to dedicated, utilized, or both (the former being what you say
the job will take, the latter what it's actually using).

http://www.clusterresources.com/products/maui/docs/a.fparameters.shtml

There is another related parameter around that one, I think.
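
To make the dedicated/utilized distinction concrete, here is a minimal Python sketch of the two ways of computing free memory on a node. It does not call Maui or Torque; the `Job` and `Node` classes and the `available_mb` function are hypothetical names for illustration, and treating "both" as the smaller of the two figures is an assumption rather than something taken from the Maui documentation.

```python
from dataclasses import dataclass, field


@dataclass
class Job:
    requested_mb: int  # what the job asked for, e.g. "-l mem=1gb"
    used_mb: int       # what the job is actually using right now


@dataclass
class Node:
    total_mb: int
    jobs: list = field(default_factory=list)


def available_mb(node, policy):
    """Free memory on a node under a given availability policy (illustrative)."""
    # "dedicated": subtract what running jobs said they would take.
    dedicated_free = node.total_mb - sum(j.requested_mb for j in node.jobs)
    # "utilized": subtract what running jobs are really using.
    utilized_free = node.total_mb - sum(j.used_mb for j in node.jobs)
    if policy == "dedicated":
        return dedicated_free
    if policy == "utilized":
        return utilized_free
    if policy == "both":
        # Assumption: "both" means a resource must be free by both measures.
        return min(dedicated_free, utilized_free)
    raise ValueError(f"unknown policy: {policy}")


# A 3gb node running two jobs that each asked for 1gb but use only 600mb.
node = Node(total_mb=3072, jobs=[Job(1024, 600), Job(1024, 600)])
for policy in ("dedicated", "utilized", "both"):
    print(policy, available_mb(node, policy))
```

With over-estimated requests, the dedicated figure is the one that makes a node look "full" sooner than it really is.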

also:

This reminds me of a request wrt torque and enforcing resource requirements.
Could a future feature let me say, at least globally if not per resource
(walltime, memory, etc), which limits are soft and which are enforced? Say a
job needs 1.4gb of mem and a walltime of 2 days. I'd like the 1.4gb to be
entirely soft, meaning it is only used in scheduling (ie, each node has 3gb,
so only 2 jobs of this type should go there), but the walltime of 2 days to
be enforced (no way on earth our jobs should take more than that, more like
12 hours max).
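
A rough Python sketch of the behaviour being asked for, under stated assumptions: memory is used only to decide how many jobs fit on a node, while walltime is the only limit that gets a job killed. This is not how Torque behaves today and uses no Torque API; `jobs_that_fit`, `should_kill` and the `hard` set are hypothetical names, and 1.4gb is rounded to 1434 MB.

```python
def jobs_that_fit(node_mem_mb, job_mem_mb):
    """Soft use of memory: only for packing jobs onto a node."""
    return node_mem_mb // job_mem_mb


def should_kill(elapsed_s, used_mb, limits, hard=frozenset({"walltime"})):
    """Enforce only the limits listed in `hard`; the rest stay soft."""
    if "walltime" in hard and elapsed_s > limits["walltime_s"]:
        return True
    if "mem" in hard and used_mb > limits["mem_mb"]:
        return True
    return False


TWO_DAYS_S = 2 * 24 * 3600
limits = {"walltime_s": TWO_DAYS_S, "mem_mb": 1434}

# Each node has 3gb, so only 2 jobs of this type should go there.
print(jobs_that_fit(3072, 1434))

# A job that grows to 2.5gb after an hour is left alone (it may swap).
print(should_kill(elapsed_s=3600, used_mb=2560, limits=limits))

# The same job still running after 3 days is killed on walltime.
print(should_kill(elapsed_s=3 * 24 * 3600, used_mb=2560, limits=limits))
```

Passing `hard={"walltime", "mem"}` would turn the memory request back into an enforced limit, which is the per-resource choice described above.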

Last time Garrick and I talked about this, the code for enforcing these
resource 'limits' was only enabled if certain ones were defined.  Memory did
not enable it, but walltime did.  So we currently only specify walltime, but
we sometimes get hung jobs due to transient issues and would love to set a
max walltime of 12-24 hrs.  If a process with a 1.4gb mem setting hits 2.5gb,
let it swap (but still don't shoot yourself in the foot by knowingly
putting 3 x 1.4gb procs on a 3 or 4 proc box).

Has this changed since 2.1.2?  Any plans?

Garrick?  Anyone?

Thx,

-sr


Sam Rash
srash@xxxxxxxxxxxxx
408-349-7312
vertigosr37

-----Original Message-----
From: torqueusers-bounces@xxxxxxxxxxxxxxxx
[mailto:torqueusers-bounces@xxxxxxxxxxxxxxxx] On Behalf Of Lippert, Kenneth B.
Sent: Thursday, October 26, 2006 11:24 AM
To: torqueusers@xxxxxxxxxxxxxxxx
Subject: [torqueusers] question on memory allocation

Quick question:

I put memory requirements a la "-l mem=1gb" in all of my job scripts.

When Torque/Maui is deciding where to send the job, does it look at the
ACTUAL free physical memory available on the nodes, or does it look at
the memory on each node minus the sum of the "mem" values of the jobs
already running on that node?  (All the nodes have several np's, from 2
to 8.)

If my memory estimates were exact it wouldn't matter, but they aren't.
I have to estimate a little high, so if the algorithm is the latter,
Torque could be thinking a node was "full" when it really wasn't.

Looking at the documentation, I am thinking it is the first way, but I
want to be sure.

Thanks very much.

-k
_______________________________________________
torqueusers mailing list
torqueusers@xxxxxxxxxxxxxxxx
http://www.supercluster.org/mailman/listinfo/torqueusers

