rgathright wrote:
> What kind of gains would I get if I used the Intel Compiler to create an SSE 4.X
> binary specific for each of my processors?
I have no personal experience but was curious enough to read up
a bit on SSE4 now. On one extreme, a DivX encoding algorithm which
was specially optimized with SSE4 using special instructions dedicated
for just that purpose gave a speed increase of about 40%. On the other
hand, a povray compile of 3.6 in 2007 using Intel compiler 10 was
reported to be 16% slower than an SSE3 version Of course, the
SSE4 support of the compiler should be better now. But I don't
think you'll be getting more than a few percent speedup.
I assume you are already using the 3.7 beta SSE2 build?
Post a reply to this message
|