Weird queue behavior.: msg#00124 clustering.torque.user
Hi,

I have two queues, parallel and dedicated. Parallel is supposed to catch any job requesting less than nodes=32:ppn=4 and dedicated gets any job larger than that. But I'm getting weird behavior like this:

    # A nine node, 4 processor per node job works:
    griznog@uinta ~ $ qsub -I -l nodes=9:ppn=4
    qsub: waiting for job 10349.uinta.hpc.usu.edu to start

    # A ten node, 4 ppn job doesn't.
    griznog@uinta ~ $ qsub -I -l nodes=10:ppn=4
    qsub: Job rejected by all possible destinations

    # However, a 20 node 2 ppn job does
    griznog@uinta ~ $ qsub -I -l nodes=20:ppn=2
    qsub: waiting for job 10351.uinta.hpc.usu.edu to start

What am I doing wrong here that allows > ~36 CPU jobs unless I pack all the processors on each node?

Queue configuration follows.

Thanks,

jbh

    Qmgr: p q parallel
    #
    # Create queues and set their attributes.
    #
    #
    # Create and define queue parallel
    #
    create queue parallel
    set queue parallel queue_type = Execution
    set queue parallel resources_max.nodect = 32
    set queue parallel resources_max.nodes = 32:ppn=4
    set queue parallel resources_max.walltime = 24:00:00
    set queue parallel resources_min.nodect = 1
    set queue parallel resources_min.nodes = 1:ppn=2
    set queue parallel resources_default.nodes = 1:ppn=2
    set queue parallel resources_default.walltime = 01:00:00
    set queue parallel resources_available.nodect = 62
    set queue parallel resources_available.nodes = 62:ppn=4
    set queue parallel max_user_run = 8
    set queue parallel enabled = True
    set queue parallel started = True

    Qmgr: p q dedicated
    #
    # Create queues and set their attributes.
    #
    #
    # Create and define queue dedicated
    #
    create queue dedicated
    set queue dedicated queue_type = Execution
    set queue dedicated resources_max.nodect = 62
    set queue dedicated resources_max.nodes = 62:ppn=4
    set queue dedicated resources_max.walltime = 08:00:00
    set queue dedicated resources_min.nodect = 33
    set queue dedicated resources_min.nodes = 33:ppn=4
    set queue dedicated resources_default.nodes = 33:ppn=4
    set queue dedicated resources_default.walltime = 01:00:00
    set queue dedicated resources_available.nodect = 62
    set queue dedicated resources_available.nodes = 62
    set queue dedicated enabled = True
    set queue dedicated started = True
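
For reference, the arithmetic behind the three test submissions can be written out as a minimal Python sketch. It only models the intended policy, using the nodect limits copied from the configuration above. It does not reproduce how Torque actually compares the `nodes` resource, and it says nothing about why the ten-node job was rejected. The helper names (`parse_nodes`, `intended_queue`, `summarize`) are hypothetical, and treating a missing `ppn` as 1 is an assumption.

```python
import re

# Node-count limits copied from resources_min.nodect / resources_max.nodect
# in the posted queue configuration.
QUEUE_NODECT_LIMITS = {
    "parallel": (1, 32),
    "dedicated": (33, 62),
}


def parse_nodes(spec):
    """Parse a simple 'N' or 'N:ppn=P' request into (nodes, ppn).

    Assumption: ppn is taken as 1 when it is omitted. Richer node specs
    (host names, properties, '+'-joined parts) are not handled here.
    """
    match = re.fullmatch(r"(\d+)(?::ppn=(\d+))?", spec)
    if match is None:
        raise ValueError(f"unsupported nodes spec: {spec!r}")
    nodes = int(match.group(1))
    ppn = int(match.group(2)) if match.group(2) else 1
    return nodes, ppn


def intended_queue(spec):
    """Return the queue the stated policy would pick, judged by node count only."""
    nodes, _ppn = parse_nodes(spec)
    for queue, (low, high) in QUEUE_NODECT_LIMITS.items():
        if low <= nodes <= high:
            return queue
    return None


def summarize(spec):
    """Collect node count, ppn, total CPUs and the intended queue for a request."""
    nodes, ppn = parse_nodes(spec)
    return {
        "nodes": nodes,
        "ppn": ppn,
        "cpus": nodes * ppn,
        "intended_queue": intended_queue(spec),
    }


if __name__ == "__main__":
    # The three requests from the post.
    for spec in ("9:ppn=4", "10:ppn=4", "20:ppn=2"):
        print(spec, summarize(spec))
```

Under this model all three requests fall inside the parallel queue's node-count range. The rejected `nodes=10:ppn=4` job and the accepted `nodes=20:ppn=2` job both total 40 CPUs, so total CPU count alone does not separate the accepted jobs from the rejected one.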