osdir.com
mailing list archive

Subject: Re: How to calculate the value of K and Lambda for two sequence alignment - msg#00050

List: science.biology.informatics.bulletin

Date: Prev Next Index Thread: Prev Next Index
I have no clue :-) but I typed "smith waterman k lamda" into google
and clicked on "I'm feeling lucky":

got the NCBI page where blast is explained
http://www.people.virginia.edu/~wrp/cshl02/Altschul/Altschul-3.html:

---------------
K and lambda are statistical parameters dependent upon the scoring
system and the background amino acid frequences of the sequences being
compared. While FASTA estimates these parameters from the scores
generated by actual database searches, BLAST estimates them beforehand
for specific scoring schemes by comparing many random sequences
generated using a standard protein amino acid composition [12].
For example, using BLOSUM-62 amino acid substitution scores [13],
and affine gap costs [14-16] in which a gap of length k is assigned a
score of -(10 + k), we generated 10,000 pairs of length-1000 random
protein sequences, and used the Smith-Waterman algorithm to calculate
10,000 optimal local alignment scores. From these scores, lambda was
estimated at 0.252 and K at 0.035 by the method of maximum-likelihood
[17]. In general, given M samples from an extreme value distribution,
the ratio of the maximum-likelihood estimate of lambda to its actual
value is approximately normally distributed, with mean 1.0 and
standard deviation 0.78/sqrt(M) [17]. Thus the standard error for our
estimate of lambda is about 0.002, or less than 1%.
---------------

>From what I understood, you generate many alignments, plot the
generated scores for the current matrix, assume that they follow your
function E and then approximate lambda.

The addison wesley BLAST book goes into details and gives an example
PERL program to calculate lambda and says that the value of k doesn't
really matter:
(this sample chapter is free)
http://www.oreilly.com/catalog/blast/chapter/ch04.pdf

Don't know if it helped or if it is completelywrong, it took 10
minutes and I found it interesting... :-)

Max


On 11/02/06, Ryan Golhar <golharam@xxxxxxxxx> wrote:
> Did anyone ever respond to you on this? K and lambda. I forget where K
> comes from. Lambda is dependent on the scoring matrix you are using. I
> believe it is given with the matrix. BLOSUM uses 0.347.
>
> Ryan
>
>
> -----Original Message-----
> From:
> bio_bulletin_board-bounces+golharam=umdnj.edu@xxxxxxxxxxxxxxxxxx
> [mailto:bio_bulletin_board-bounces+golharam=umdnj.edu@xxxxxxxxxxxxxxxxxx]
> On Behalf Of shohag md
> Sent: Friday, February 03, 2006 6:26 AM
> To: bio_bulletin_board@xxxxxxxxxxxxxxxxxx
> Subject: [BiO BB] How to calculate the value of K and Lambda for two
> sequence alignment
>
>
>
> Hi Everybody
>
>
>
> Using Smith Waterman algorithm I want to align two sequences. Aftet that I
> want to calculate the expectation value. For calculating the expectation
> value we know that
>
>
>
> E = Kmn e - lx
>
>
>
> But how can I calculate the value of K and l .
>
>
> Is there any formula that can help me to calculate the value of K and l ,
> and then the expectation value.
>
> Thanking all in advance
>
> Shoyaib
>
> _______________________________________________
> Bioinformatics.Org general forum -
> BiO_Bulletin_Board@xxxxxxxxxxxxxxxxxx
> https://bioinformatics.org/mailman/listinfo/bio_bulletin_board
>
>
>


--
Maximilian Haeussler,
CNRS Gif-sur-Yvette, Paris
tel: +33 6 12 82 76 16
icq: 3825815 -- msn: maximilian.haeussler@xxxxxxxxxxxxxxxxxx
skype: maximilianhaeussler
_______________________________________________
Bioinformatics.Org general forum - BiO_Bulletin_Board@xxxxxxxxxxxxxxxxxx
https://bioinformatics.org/mailman/listinfo/bio_bulletin_board


Was this page helpful?
Yes No
Thread at a glance:

Previous Message by Date: click to view message preview

How to calculate the value of k and lambda for calculating the expectation value?

Hi everybody   After aligning two/more sequences I want to find the expectation value. But for calculating the expectation value we know that there are two constanats, k and lambda.(E= kmn e-lambda *s, where m and n are the lengths of sequences) Now I want to know how to calculate the value of k and lambda. I want to know the mathematics , not any tool .   Are there any one who can help me? Thanks in advance   shoyaib _______________________________________________ Bioinformatics.Org general forum - BiO_Bulletin_Board@xxxxxxxxxxxxxxxxxx https://bioinformatics.org/mailman/listinfo/bio_bulletin_board

Next Message by Date: click to view message preview

Constructing a multiple aligner, similar to Smith-Waterman

Hi people, I am trying to figure out how to construct my own multiple aligner. Not because I expect it to be better than other aligners, but because I need to learn about alignment for a larger project. I need to get this all figured out. I know that when aligning a Smith-Waterman based aligner, we start from the highest cell in the matrix, and trace backwards. But what if the trace actually has multiple highest cells? Or what if it goes like this: 13, 11, 13, 10, 9, 6, 3, 1, 0 That is, the trace has multiple highest cells within it, two 13's. Do we use the shorter back trace? OK, so let's assume we got a single alignment done. How do we then find the other sections to align? Would a process of elimination do it? That is, search the remaining matrix (Excluding the portion already aligned) for the highest cell? All this Smith-Waterman understanding I'm gaining is going into go into an experimental project I am working on, that I have no idea if it will be practical or not. I know it will work, I just don't know that it will give anyone any benefits over what they already got :) Needless to say, it's exciting stuff for people like me who like giving something new a go, I suppose the danger of it not being useful is half the excitement. _______________________________________________ Bioinformatics.Org general forum - BiO_Bulletin_Board@xxxxxxxxxxxxxxxxxx https://bioinformatics.org/mailman/listinfo/bio_bulletin_board

Previous Message by Thread: click to view message preview

RE: How to calculate the value of K and Lambda for two sequence alignment

Did anyone ever respond to you on this?  K and lambda.  I forget where K comes from.  Lambda is dependent on the scoring matrix you are using.  I believe it is given with the matrix.  BLOSUM uses 0.347.   Ryan   -----Original Message-----From: bio_bulletin_board-bounces+golharam=umdnj.edu@xxxxxxxxxxxxxxxxxx [mailto:bio_bulletin_board-bounces+golharam=umdnj.edu@xxxxxxxxxxxxxxxxxx] On Behalf Of shohag mdSent: Friday, February 03, 2006 6:26 AMTo: bio_bulletin_board@xxxxxxxxxxxxxxxxxxSubject: [BiO BB] How to calculate the value of K and Lambda for two sequence alignment Hi Everybody   Using Smith Waterman algorithm I want to align two sequences. Aftet that I want to calculate the expectation value. For calculating the expectation value we know that   E = Kmn e - lx   But how can I calculate the value of K and l .   Is there any formula that can help me to calculate the value of K and l ,  and then the expectation value.   Thanking all in advanceShoyaib _______________________________________________ Bioinformatics.Org general forum - BiO_Bulletin_Board@xxxxxxxxxxxxxxxxxx https://bioinformatics.org/mailman/listinfo/bio_bulletin_board

Next Message by Thread: click to view message preview

Sequence Editor for Unix?

Hi, in 2000 and 2002 someone asked on this list if there was a good replacement for the Bioedit sequence editor for Unix. Some time has passed from this. So I'm asking the question again: What are you using as a sequence editor on Linux if you have a Biologist standing next to who who'd like to see a graphical view of the alignment of a couple of sequences in Blast, Clustal, etc? Bioedit could do all of this easily and quickly without any hassles. I've tried a couple of programs. Personally, I don't like java swing...interfaces feel rather sluggish. Apollo/Artemis take ages to load if you just want to have a look on an alignment or run a Blast. Strap has a strange interface. Seqpup hasn't been updated for years. Clustalx is OK for alignment viewing can only run ClustalW. Maybe just a small executable, C-style, that can plot blast/gff-files and zoom, might be sufficient... What are you using? Thanks a lot, Max _______________________________________________ Bioinformatics.Org general forum - BiO_Bulletin_Board@xxxxxxxxxxxxxxxxxx https://bioinformatics.org/mailman/listinfo/bio_bulletin_board
Loading Comments...
Home | News | Patents | Sitemap | FAQ | advertise

Advertising by