Notes on Cula and Nvidia GTX titan
4 posts
• Page 1 of 1
Re: Notes on Cula and Nvidia GTX titan
From the raw numbers of the GTX Titan and the the GTX 480 I would guess, that your numbers are only slightly off.
The SP performance of these cards are 4.5 TFLOPS and 1.345 TFLOPS, respectively. So you can only get a maximum speedup of ~3.
The nice thing with the GTX Titan is the unlocked DP performance. While the GTX 480 is limited to 0.168 TFLOPS, the GTX Titan can still reach 1.3 TFLOPS which is a speedup of ~8.
Maybe you should compare the results of the DP examples and try to increase the size of the problems.
[Numbers are taken from english and german wikipedia
]
The SP performance of these cards are 4.5 TFLOPS and 1.345 TFLOPS, respectively. So you can only get a maximum speedup of ~3.
The nice thing with the GTX Titan is the unlocked DP performance. While the GTX 480 is limited to 0.168 TFLOPS, the GTX Titan can still reach 1.3 TFLOPS which is a speedup of ~8.
Maybe you should compare the results of the DP examples and try to increase the size of the problems.
[Numbers are taken from english and german wikipedia

- coruun
- Posts: 5
- Joined: Wed Mar 27, 2013 8:17 am
Re: Notes on Cula and Nvidia GTX titan
Note how your speedup factors are all still continuing to grow as you go from 4k -> 8k. As NVIDIA keeps adding cores, we need larger and larger problems to saturate all those cores. You can change the sizes that are run on the command line:
benchmark 1024 16384 512 (etc)
benchmark 1024 16384 512 (etc)
- john
- Administrator
- Posts: 587
- Joined: Thu Jul 23, 2009 2:31 pm
Re: Notes on Cula and Nvidia GTX titan
Following your advice, I get the figure below comparing the titan to the i7-3820:
The jump up at the end for several of the benchmarks I guess is caused by a good match between the titan hardware and 16384 matrix size. The benchmark numbers start to flatten out for matrix sizes of about 10,000 - one gets the most out of one's titan with large arrays!
- cula_bench.jpg (90.71 KiB) Viewed 14493 times
The jump up at the end for several of the benchmarks I guess is caused by a good match between the titan hardware and 16384 matrix size. The benchmark numbers start to flatten out for matrix sizes of about 10,000 - one gets the most out of one's titan with large arrays!
- Boxed Cylon
- Posts: 48
- Joined: Fri Oct 16, 2009 8:57 pm
4 posts
• Page 1 of 1
Return to General CULA Discussion
Who is online
Users browsing this forum: No registered users and 3 guests