|
![](/i/fill.gif) |
Warp wrote:
>
> You have to link with "cc -xarch=v9" not with "cc" only.
*SMACK* = the sound made when ones open hand hits ones forehead soundly.
OK .. I hope no one at work sees this.
Thanks.
Next issue, what is the difference between the sparc v9 and v9a processor? The
UltraSparc I is referred to as the v8plusa yes? Then what is the v9a?
UltraSPARC II I guess. I'm going to compile for that architecture as well and
then test with your sample file until I get the fastest time. The sample file I
am using is :
----------8<----------8<----------8<----------8<----------8<----------
camera { location <0,2,-2> look_at y*.2 angle 35 }
light_source { <10,10,-5>, 1 }
#declare WPRadius = 1;
#declare WPCurvRadius = 1.5;
intersection
{ #declare IPH = sqrt(pow(WPCurvRadius,2)-pow(WPRadius,2));
torus { WPRadius, WPCurvRadius translate -y*IPH}
cylinder { 0, y*(WPCurvRadius-IPH), WPRadius }
pigment
{ spiral1 1 color_map { [0 rgb <.5,.75,1>][1 rgb <0,.5,1>] }
turbulence .1
rotate x*90
scale <.1,1,.1>
}
normal
{ spiral1 1 .15 slope_map
{ [0 <0,0>][.25 <.5,1>][.5 <1,0>][.75 <.5,-1>][1 <0,0>] }
rotate x*90
scale <.2,1,.2>
}
finish { specular .5 reflection .3 }
}
----------8<----------8<----------8<----------8<----------8<----------
which comes from one of your earlier posts. The system that I'm testing
with is a Sun Ultra 2 with two 300MHz processors. I believe that the CPUs have
2Mb of L2 cache each. The WorkShop compiler will allow me to specify a target
system and that will, in theory, tailor a few other options to maximize
performance on that system. I really don't know how much I believe that but any
binary built for the Ultra 2 will probably show poor performance on a SunBlade
100 with only 256Kb of cache.
I'll post my results. At the moment I'm trying something a tad extreme.
CFLAGS=-dalign -fsimple=0 -xO4 -ftrap=%none -libmil
-s -temp=/tmp/compiler -v -Xa -xarch=v9a
Dennis Clarke
Post a reply to this message
|
![](/i/fill.gif) |