|
Re: test hung after 36 hours: msg#00024linux.redhat.cluster
On Mon, Apr 11, 2005 at 05:13:06PM -0700, Daniel McNeil wrote: > I started my mount/tar/rm/ tests on Apr 4 17:41 and I hit > a problem at Apr 6 05:30. So the test ran for 36 hours. > cl030 and cl031 were getting "SM: process_reply invalid" > messages and cl032 got "No response" and "Missed too many > heartbeats" The SM messages are an effect of CMAN removing nodes. There's a fair chance that this recent fix will help: http://sources.redhat.com/ml/cluster-cvs/2005-q2/msg00018.html -- Dave Teigland <teigland@xxxxxxxxxx> |
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | test hung after 36 hours: 00024, Daniel McNeil |
|---|---|
| Next by Date: | Iozone tests on gfs with 29th march gfs snapshot: 00024, Hansjoerg Maurer |
| Previous by Thread: | test hung after 36 hoursi: 00024, Daniel McNeil |
| Next by Thread: | Re: oops after 12 hours during umount: 00024, Daniel McNeil |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
| News | FAQ | advertise |