multiple threads - culaDeviceSgesvd

General CULA Dense (LAPACK & BLAS) support and troubleshooting. Use this forum if you are having a general problem or have encountered a bug.

multiple threads - culaDeviceSgesvd

Postby gvarela13 » Sat Mar 09, 2013 8:14 pm

Hi, just to confirm if I got it right while I was reading the forum:

1- I understood that "culaDeviceSgesvd" cannot be called from a __device__ function to work within each thread inside a kernel.

Because of these, I cannot use CULA R.16b to calculate the SVD() of 128*128*70 # 3x3 matrices.

Suggestions & comments.

Regards! :)
Posts: 1
Joined: Mon Mar 04, 2013 12:55 pm

Re: multiple threads - culaDeviceSgesvd

Postby john » Wed Mar 13, 2013 8:41 am

That's correct - we have no _device_ calls in our lib. We do have device-pointer calls that are called from the host. But our algorithms are designed for larger matrices - tuning for a large batch of 3x3 matrices requires completely different code. We could build this as custom code for you, but I'd advise you that based on prior experiments that the solution time would likely be break-even against the CPU.
Posts: 587
Joined: Thu Jul 23, 2009 2:31 pm

Return to CULA Dense Support

Who is online

Users browsing this forum: Google [Bot] and 2 guests