
Re: optimizations [for non-debugging] amd64 kernels: msg#00006

Subject: Re: optimizations [for non-debugging] amd64 kernels
On Tue, Sep 11, 2007 at 07:09:31AM -0400, Blair Sadewitz wrote:

> Also, at:
> 
> http://bahar.aydogan.net/~blair/amd64-string.diff
> 
> is an enhancement for x86_64 memcpy/bzero/bcopy functions in
> common/libc.  This is authored by fuyuki@xxxxxxxxxx and is a slight
> modification of the latest version (<see
> http://www.hadaly.org/fuyuki>) of what was originally posted in a PR
> back around Jan/Feb.
...
> I'd appreciate it if someone who actually knew x86_64 assembly would
> take a look at this and/or if others would test it so we could get it
> in the tree at some point.

The setup and teardown for stos/movs/cmps are really expensive, and for small
strings (say, under 512 bytes) you're better off with really simple loops
using the arithmetic instructions.

Andrew
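
To make the point above concrete, here is a rough sketch in C, not taken from
the posted diff. The function name and the SMALL_COPY_MAX cutoff are
hypothetical; the cutoff just reuses the "under 512 bytes" figure from the
message, and the real break-even point would have to be measured on the
target CPU. The call to memcpy() stands in for whatever string-instruction
(rep movs) path the bulk case would use.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical cutoff, borrowed from the "under 512 bytes" remark.
 * Assumption only: measure on real hardware before trusting it. */
#define SMALL_COPY_MAX 512

/* Hypothetical helper: copy short buffers with plain load/store loops,
 * and hand long ones to the bulk routine. */
static void *
copy_small_or_bulk(void *dst, const void *src, size_t len)
{
	unsigned char *d = dst;
	const unsigned char *s = src;

	if (len >= SMALL_COPY_MAX)
		return memcpy(dst, src, len);	/* stand-in for the rep movs path */

	/* Word-at-a-time loop. The fixed-size memcpy() calls are the portable
	 * way to express an unaligned load and store; compilers typically
	 * turn each one into a single mov. */
	while (len >= sizeof(unsigned long)) {
		unsigned long w;

		memcpy(&w, s, sizeof(w));
		memcpy(d, &w, sizeof(w));
		s += sizeof(w);
		d += sizeof(w);
		len -= sizeof(w);
	}

	/* Byte loop for the remaining tail. */
	while (len > 0) {
		*d++ = *s++;
		len--;
	}

	return dst;
}
```

An optimizing compiler may recognize these loops and replace them with its
own memcpy call, so a comparison like this is probably only meaningful in
hand-written assembly or with that transformation disabled.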


