logo       

Re: RAID-6: help wanted: msg#00202

linux.raid

Subject: Re: RAID-6: help wanted

> >There is currently a data-corruption bug in the RAID-6 md layer
> >(raid6main.c). I have so far not been successful in locating it,
> >although it is easily reproducible, thanks to a set of scripts by Jim
> >Paris. I suspect it is a race condition between raid6d and the rest
> >of the kernel.
>
> Great, can we get a copy of the scripts to try and assist?

Sure:

---

Date: Fri, 6 Aug 2004 00:04:39 -0400
From: Jim Paris <jim@xxxxxxxx>
To: "H. Peter Anvin" <hpa@xxxxxxxxx>
Cc: linux-raid@xxxxxxxxxxxxxxx
Subject: Re: Kernel panic, FS corruption Was: Re: Call for RAID-6 users

> If you can reproduce it with ext2/3 it would make debugging simpler,
> because I understand the ext code and data structures a lot better.

This demonstrates it on ext2. I can't seem to reproduce it with just
simple use of 'dd', but it shows up if I untar a ton of data.

This script:
- creates five 100MB "disks" through loopback
- puts them in a six-disk RAID-6 array (resulting size=400MB, degraded)
- untars about 350MB of data to the array
- runs e2fsck, which shows filesystem errors

Usage:
- put r6ext.sh and big.tar.bz2 in a directory
- run r6ext.sh as root

Sorry for the huge files, but e2fsck didn't show any problems when I
scaled everything down by a factor of 10. You could probably make
your own big.tar.bz2 and see the same problem, as there's nothing
special about this data.

http://stonewall.mit.edu/~jim/r6ext.sh
http://stonewall.mit.edu/~jim/big.tar.bz2 (77MB)

-jim
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html



<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | FAQ | advertise