Yes.
Some hints? :)
If I understood correctly how thrust works, the multiplications should be non-blocking (asynchronous); if your CUDA device has enough resources (available threads), they will run in parallel. You can also ask on the Thrust forums (or mailing lists) to be sure.
Considering an implementation using the thrust::complex type, how would you call cublasCgemm, which is the cuComplex version of cublasSgemm? I mean, how would you cast between thrust::complex and cuComplex?
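A common answer (sketched here as an assumption, not code from the tutorial): thrust::complex&lt;float&gt; and cuComplex are both plain structs holding two consecutive floats (real, imag), so in practice people reinterpret_cast the raw device pointer. The function name cgemm_example and the square-matrix dimensions are mine for illustration:

```cpp
#include <cublas_v2.h>
#include <cuComplex.h>
#include <thrust/complex.h>
#include <thrust/device_vector.h>

// Multiply two n x n complex matrices held in thrust device vectors with
// cublasCgemm, by reinterpreting thrust::complex<float>* as cuComplex*.
void cgemm_example(cublasHandle_t handle, int n,
                   thrust::device_vector<thrust::complex<float>> &d_A,
                   thrust::device_vector<thrust::complex<float>> &d_B,
                   thrust::device_vector<thrust::complex<float>> &d_C) {
    cuComplex alpha = make_cuComplex(1.0f, 0.0f);
    cuComplex beta  = make_cuComplex(0.0f, 0.0f);
    cublasCgemm(handle, CUBLAS_OP_N, CUBLAS_OP_N, n, n, n, &alpha,
        reinterpret_cast<const cuComplex*>(thrust::raw_pointer_cast(d_A.data())), n,
        reinterpret_cast<const cuComplex*>(thrust::raw_pointer_cast(d_B.data())), n,
        &beta,
        reinterpret_cast<cuComplex*>(thrust::raw_pointer_cast(d_C.data())), n);
}
```

This relies on the two types having identical size and layout, which is the usual assumption when mixing Thrust with cuBLAS.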
Awesome tutorial!
Very nice example! However, your print_matrix code does not work with the thrust additions. Doesn't the data need to be moved back to the host by copying to a host_vector before it can be printed out?
When you work with thrust vectors you can access the vector elements directly (no matter whether the vector is on the CPU or on the GPU).
I got bus errors until I added code that looked as follows:
thrust::host_vector<float> h_A = d_A;
print_matrix(thrust::raw_pointer_cast(&h_A[0]), nr_rows_A, nr_cols_A);
Thanks for the follow-up;
maybe your answer will help other people with the same problem.
On my machine I was able to print directly elements from d_A.
This is very useful. Thanks!
One minor problem: shouldn't the seed initialization move out of the GPU_fill_rand() function?
You can move it outside if you want; as implemented now, it will reinitialize the seed on every call, twice in this case.
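For readers who want to do that, here is one way to hoist the generator (and its seed) out of GPU_fill_rand, so the seed is set once per program instead of once per call. This is a sketch following the tutorial's function names; the caller now owns the curandGenerator_t:

```cpp
#include <curand.h>

// The generator is created and seeded once by the caller; this function only
// fills the device array A with nr_rows_A * nr_cols_A uniform random floats.
void GPU_fill_rand(curandGenerator_t prng, float *A, int nr_rows_A, int nr_cols_A) {
    curandGenerateUniform(prng, A, nr_rows_A * nr_cols_A);
}

// Usage sketch:
// curandGenerator_t prng;
// curandCreateGenerator(&prng, CURAND_RNG_PSEUDO_DEFAULT);
// curandSetPseudoRandomGeneratorSeed(prng, (unsigned long long) clock());
// GPU_fill_rand(prng, d_A, nr_rows_A, nr_cols_A);
// GPU_fill_rand(prng, d_B, nr_rows_B, nr_cols_B);
// curandDestroyGenerator(prng);
```

Seeding once also means the two matrices are guaranteed to come from different parts of the random stream, rather than depending on the clock value at each call.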
I was looking all over the web for an easy to understand matrix multiplication example using CUBLAS. It was almost hopeless, but you got it. Thanks!
Thanks a lot for these tutorials, they are very helpful !
Did you mean to say "CPU" for the first comment?
Yes, definitely CPU :). Thanks.
Very nice tutorial! I've one question: Can one use Thrust to perform lots of matrix-vector multiplications in parallel?
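Thrust itself has no batched matrix-vector primitive, but one common approach (sketched here as an assumption, not part of the tutorial) is cuBLAS's cublasSgemmBatched, treating each vector as an n-by-1 matrix. The function name batched_matvec is mine; note the pointer arrays themselves must live in device memory:

```cpp
#include <cublas_v2.h>
#include <thrust/device_vector.h>

// d_As[i], d_xs[i], d_ys[i] are device pointers to the i-th n x n matrix,
// input vector, and output vector; the pointer arrays are device-resident.
void batched_matvec(cublasHandle_t handle, int n,
                    thrust::device_vector<const float*> &d_As,
                    thrust::device_vector<const float*> &d_xs,
                    thrust::device_vector<float*>       &d_ys,
                    int batch_count) {
    const float alpha = 1.0f, beta = 0.0f;
    // y_i (n x 1) = A_i (n x n) * x_i (n x 1), for all i at once
    cublasSgemmBatched(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                       n, 1, n,
                       &alpha,
                       thrust::raw_pointer_cast(d_As.data()), n,
                       thrust::raw_pointer_cast(d_xs.data()), n,
                       &beta,
                       thrust::raw_pointer_cast(d_ys.data()), n,
                       batch_count);
}
```

Whether this beats launching many independent gemv calls depends on the matrix sizes and batch count; for many small matrices the batched call usually wins.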