"He who knows not, and knows not that he knows not, is a fool...shun him.
He who knows not, and knows that he knows not, is willing...teach him.
He who knows, and knows not that he knows, is asleep...awaken him.
He who knows, and knows that he knows, is wise...follow him."
- Chinese Probverb

MetaBalls demo updated
Wednesday, October 19, 2005 | Permalink

I just recently got myself a new dual-core Athlon64 3800+, so naturally I had to try taking advantage of this extra processing power. Most of my demos are more GPU limited than CPU though, but MetaBalls is an exception to that rule. So I implemented threading into it to improve performance. Everything couldn't be parallelized though, so the gain is fairly moderate, about 15-20%. Another reason for the moderate increase is that the bottleneck apparently shifted from computations to cache/memory. The gain is larger when running the slower FPU path than the 3DNow path, about 25%.
This CPU also supports SSE3, so naturally I threw in an SSE3 path as well. 3DNow is still a tiny bit faster though.

[ 22 comments | Last comment by Míša (2006-06-14 16:04:22) ]