logo       

[UPDATE] SUMMARY: E420R unexplaned panic after UE error: msg#00061

os.solaris.managers.summaries

Subject: [UPDATE] SUMMARY: E420R unexplaned panic after UE error

G'day All.

A little update on my earlier summary, which I ended with:

As to the qla error messages in the log, Kevin reinforced my opinion that forceloading the drivers is not necessary. None of these devices contain boot partitions. In the mean time we have been able to trace at least some of those to a faulty UPS that the storage array is plugged into (the panicky server is not plugged in there, though).


QLogic have in the mean time done a little investigation in this and provided me with an explanation for the qla2300 messages. To refresh your memories, these are expamles of the messages I'm talking about:

Feb 2 12:11:35 Slarty qla2300: [ID 175527 kern.info] qla2300(1):
configure_loop, 2 gigabit data rate connection
Feb 2 12:11:35 Slarty qla2300: [ID 467028 kern.info] qla2300(1):
configure_loop, F-PORT connection
Feb 2 12:11:35 Slarty qla2300: [ID 465925 kern.info] qla2300(1): status_entry,
check condition sense data t1d0
Feb 2 12:11:35 Slarty 70h 0h 6h 0h 0h 0h 0h 6h 0h 0h 0h 0h 29h 0h
0h 0h 0h 20h


Lyle Merdan of QLogic provided me with the following explanation of the last
two lines (thanks Lyle) :

The t##d## is indicative of the disk that is reporting the check condition.
Then at the beginning of the entry is the HBA instance. The example you gave
tells me it's HBA instance 6.

qla2300: [ID 465925 kern.info] qla2300(6): status_entry

Q) What are these check conditions that appear when extended logging is
enabled?
qla2300: [ID 465925 kern.info] qla2300(6): status_entry, check condition
sense data t94d0
70h 0h 6h 42h 55h 5ah 5ah ah 0h 0h 0h 0h 29h 0h 1h 0h 0h A) These are errors returned from the storage to the HBA. There are two parts to a check
condition. The ASC and ASCQ. The ASC is byte 12 and the ASCQ is byte 13.
Start counting
at 0. So in the above example the ASC is 29 and ASCQ is 0. These values can
be looked up
on this website: http://www.t10.org/lists/asc-num.htm
As to what exactly the reported errors mean, you'll have to contact the storage
vendor.

Now the reason you're getting the check conditions is you have extended logging
enabled in the driver.
To disable extended logging you have to edit the /kernel/drv/qla2300.conf file
and either add a line that explicitly
disables extended logging for HBA driver instance 6 OR use a GUI to turn
extended logging off.

You could just add this line:
hba6-extended-logging=0;
---

The website that Lyle mentiones has full explenation of all SCSI ASC/ASCQ
combinations possible. It transpires then that all messages are caused by
faults on the CLARIION. We'll persue this further with Dell.

Cheers,

--
Tony van Lingen
Technical Consultant

Technology One Limited,
67 High Street Toowong Qld 4066

Mobile: 0413 701 284
Phone: +61 7 3377 7300(TechOne), +61 7 3234 1972 (EPA)
Fax: +61 7 3377 7301(TechOne), +61 7 3227 6534 (EPA)

E-mail: tvlingen@xxxxxxxxxxxxxx
Visit our home page at: http://www.TechnologyOneCorp.com
Technology One's entire liability will be limited to resupplying the material
enclosed. No other warranties are provided

Technology One designs, develops, implements and supports intelligent
enterprise wide software applications using Internet, eBusiness and Client
Server technologies for both corporate and government organisations

*********************************** Confidentiality Statement
****************************************
The information transmitted in this email is only for the recipient referred in
this email and may contain confidential and/or privileged material.

If you are not the intended recipient (or responsible for delivery of the
message to such person), you may not copy or deliver this message to anyone. In
such case any review, retransmission, dissemination or other use of, or taking
of any action in reliance upon, this information by persons or entities other
than the intended recipient is prohibited. If you received this in error,
please contact the sender and delete the material from the computer.


Opinions, conclusions and other information in this message that do not relate
to the official business of the company shall be understood as neither given
nor endorsed by it.

Technology One's entire liability will be limited to resupplying the material
enclosed. No other warranties are provided

We use virus scanning software but exclude all liability for viruses or similar
in any attachment.




___________________________
Disclaimer

This e-mail, including attachments if any, has originated from a Queensland
government agency and may contain information that is confidential, or covered
by legal professional privilege, and is intended for the named recipient(s)
only. If you have received this message in error, you are asked to inform the
sender as quickly as possible and delete this message and any copies of this
message from your computer system network.

Any form of disclosure, modification, distribution and/or publication of this
e-mail, including attachments is prohibited. Unless otherwise stated, this
e-mail, including attachments represents the views of the sender and not the
views of the Environmental Protection Agency.

Although this e-mail has been checked for the presence of computer viruses, the
Environmental Protection Agency provides no warranty that all possible viruses
have been detected and cleaned. Any use of this e-mail could harm your
computer system.
___________________________


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | FAQ | advertise