logo       

Re: Load problem with ims 5.2: msg#00046

mail.ims.general

Subject: Re: Load problem with ims 5.2


Something else to look at is the counterutil stats. Which should give you a clue if you are reaching session limits.

counterutil -o imapstat

Will run endlessly until you give it the ^C

Do you offer shared folders? I seem to recall a similar problem around shared folders and disabling the feature brought the universe back to normal. Its been a while, not sure if that was a 5.0 or 5.2 issue.


Regards,
Anthony

Rich Bishop wrote:
Thanks to everyone who replied. Our disk service times don't look too bad - mostly under 15ms.

service.imap.maxthreads is 250
service.imap.maxsessions is 4000


The pstack output is quite large, so I've put them at http://www.pages.drexel.edu/~rjb38/ims-pstack.zip . It seems that lots of the threads are waiting for a mutex inside mboxlist_dofind.

We just moved dbtmpdir to tmpfs, but it doesn't really seem to have helped (load has climbed back to 200).

Rich

Chris Vervais wrote:
Don't rely on top for gauging how slow/fast your I/O is going. It's better to run a iostat -xzn 5 and look for service times on the disks the store resides on to be below 20ms. Over 20ms the mailstore starts hating life and chews on it's self.

Can you post the following from configutil:

service.imap.maxsessions
service.imap.maxthreads

Also, nab some pstacks of the IMAPD processes so we can see what they're spinning on.

Usually a corrupt database / index results in a crash, not a performance issue. Something else to check is, is the ims-ms queue building up? If it is that usually means the store is performing poorly.

Also, what do you have your dbtmpdir set to?

-Chris

On Oct 15, 2007, at 6:35 PM, Rich Bishop wrote:

Hello,

We're running IMS 5.2 (I know it's old!) on Solaris 8 and suddenly seeing some major load issues. It's one of two mailstores running on a Sunfire 280R with 6GB. We've been on this hardware for a couple of years and never noticed any problems until a couple of weeks ago.

Suddenly on this one machine we see very high loads (up to 700). imapd processes are always consuming most of the CPU in top, and the activity is pretty evenly divided between user and kernel (ie. no or minimal iowait). Here's a top snapshot - load is low right now but it shows imapd chewing up all the cpu. At this time we had only 700 imap connections.



last pid: 9266; load averages: 26.25, 26.14, 44.50 21:29:56
86 processes: 78 sleeping, 6 running, 2 on cpu
CPU states: 0.0% idle, 58.4% user, 41.6% kernel, 0.0% iowait, 0.0% swap
Memory: 6144M real, 596M swap in use, 4513M swap free

PID USERNAME THR PRI NICE SIZE RES STATE TIME CPU COMMAND
8248 mailsrv 33 53 -2 175M 43M run 3:06 51.89% imapd
8219 mailsrv 16 50 -2 184M 41M run 3:37 12.44% imapd
8231 mailsrv 28 50 -2 178M 42M run 3:58 8.79% imapd
8300 mailsrv 23 58 0 28M 21M run 0:42 3.59% tcp_smtp_server
8230 mailsrv 30 59 -2 232M 41M sleep 3:50 2.84% imapd
23887 root 11 58 0 142M 141M sleep 105:58 0.97% bpbkar
8269 mailsrv 10 50 -2 181M 54M run 0:12 0.86% mshttpd
8272 mailsrv 13 59 -2 218M 61M sleep 0:23 0.59% mshttpd

As this came on so suddenly without any configuration changes or major addition of users, I was wondering if we have some corrupt database / index somewhere, but I really have no idea where to look. Any suggestions would greatly appreciated.

Thanks,

Rich





<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | FAQ | advertise