|
Re: Problems with multipathd: msg#00016linux.kernel.device-mapper.devel
===> I found some settings in /sys/module/qla2xxx/parameters/..., Ok, I have a 6 seconds timeout now :-) I have commented this line, but udev still has difficulties to create this I'm sure of not beeing the only one who has problems with missing /dev/... links. It's possible that multipath installs a device-mapper table without errors, but kpartx fails because udev doesn't create links in /dev/... So, I think multipath.dev should execute kpartx with /dev/mapper/... instead of /dev/... by default. ===> Without "udevstart" udev doesn't create the /dev/150gb*You can still identify the udev problems keeping the node creation multipath is able to react. I don't understand why I have to execute udevstart. ===> First multipathd says "8:0: tur checker reports After lowering the timeouts to 6 seconds multipathd shows the same behavior. ===> Multipathing seems to work without but not with multipathd. What do you means with "In your case..."? Because 2.6 and udev are multipath-tools dependencies all systems running multipath have the same environment. They all use kernel 2.6 and udev, that is hotplug-driven. The kernel starts this hotplug process and udev executes multipath. Sorry, but I have to ask again: Does we really need multipathd? After lowering dev_loss_tmo timeouts and stopping multipathd I have a working multipath environment :-))) I tested this with a little perl script and a mysql database: My trafficmaker-host executed this script 27 times (parallel): ... for(my $count=1;$count<=1000000;$count++) { ... my $sql="INSERT INTO $table VALUES($id,\"$value\")"; my $return=$dbh->do($sql); ... } ... { my $sql="SELECT COUNT(*) FROM $table WHERE id=$id"; my $sth=$dbh->prepare($sql); my $return=$sth->execute(); ... $selectCount=$sth->fetchrow_array(); ...; } The database host had to insert this 30 byte strings and I have started some copy-jobs (cp -a /usr/* /partition_mounted_with_multipath/ etc.) to increase the I/O load. During this test I have disabled and enabled the different HBA-Switch-Ports with the following result: It took 6 to 15 seconds before "multipath -l" showed that a path is down (15 seconds because the host had a 30.0 CPU load and responded very slowly), but no INSERT got lost :-))) But sometimes multipath seems to be a bit confused... 1.) one path disabled In the majority of cases multipath prints... testhalde2 sbin # multipath -l 150gb () [size=150 GB][features="0"][hwhandler="0"] \_ round-robin 0 [active] \_ #:#:#:# 8:0 [active] \_ 1:0:0:1 sdb 8:16 [active] But sometimes I get... testhalde2 usr # multipath -l 150gb () [size=150 GB][features="0"][hwhandler="0"] \_ round-robin 0 [active] \_ 4:0:0:1 sdb 8:16 [active] 2.) all paths enabled (default) In the majority of cases multipath prints... testhalde2 sbin # multipath -l 150gb () [size=150 GB][features="0"][hwhandler="0"] \_ round-robin 0 [enabled] \_ 1:0:0:1 sdb 8:16 [active] \_ 0:0:0:1 sdc 8:32 [active] But sometimes I get... testhalde2 usr # multipath -l 150gb () [size=150 GB][features="0"][hwhandler="0"] \_ round-robin 0 [active] \_ 0:0:0:1 sdb 8:16 [active] \_ round-robin 0 [enabled] \_ 4:0:0:1 sdc 8:32 [active] Regards Simon |
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | EMC CX 300 multipath problem: 00016, Nicola Murino |
|---|---|
| Next by Date: | Re: EMC Cx300 multipath seems to work: 00016, gistolero |
| Previous by Thread: | Re: Re: Problems with multipathdi: 00016, christophe varoqui |
| Next by Thread: | Progress on LVM2 snapshots - kernel 2.6.x: 00016, Hubert |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
| News | FAQ | advertise |