Questions tagged [gpu]
Graphical Processing Unit - a specialized, relatively inexpensive hardware unit built for fast graphical computations and highly data-parallel scientific computations.
103
questions
0
votes
1
answer
121
views
recommended simple linear solver on gpu
I'm looking for recommendations for a simple GPU linear equation system solver that is a dropin replacement for scipy.linalg.solve. Right now, I'd rather not go the Petsc/TriLinos/Slate route. ...
0
votes
0
answers
71
views
How is kernel fusion done?
I have a computational graph (DAG) consisting of element-wise operations (potentially with broadcasting) and reshape/reduce operations (reshaping/sum/max). I'm trying to understand how vertical kernel ...
2
votes
1
answer
421
views
How to leverage the GPU for parallel 3-body problem computations
I have a 3-body simulation which must run millions of times.
As far as I know, the GPU shines when it gets to preform simple operations on huge matrices/arrays. Currently I'm debugging and running my ...
5
votes
0
answers
139
views
Single precision vs double precision conjugate gradients
I tested my conjugate gradients implementation with float and double precision and contrary to my guess the double code was twice faster than the single precision code. The reason is that I need many ...
0
votes
1
answer
289
views
Questions on the theory of distributed numerical algebraic computation
I'm trying to build a pure python distributed numerical algebra computation kernel based on GPU. but after I've learnt most of the software engineering, I realise that I'm seriously lacking in ...
2
votes
0
answers
83
views
OpenMP Fortran array operations in target regions
I am asking this here for the possibility that someone here is familiar with OpenMP offloading. Is there some other way to parallelise Fortran array operations outside of unrolling the loops of the ...
0
votes
1
answer
350
views
How can I convert a C program into a CUDA program?
Suppose I already have a C program that solves a specific computational problem. I want to convert that into a CUDA program.
What steps should I follow to do that?
For instance, can I think as follows?...
2
votes
0
answers
171
views
Why GPU scaling (speedup) is usually worse than CPU scaling?
Let's define scaling, as how linear the speedup of using more than one GPU or CPU is. For example, having 2 GPUs gives you 2x faster execution time.
I have noticed that in many software (e.g. ...
0
votes
0
answers
58
views
Calculate the time taken to run an algorithm on GPU
I want to calculate the total time taken for a fixed code run using an NVIDIA GPU (for instance, Tesla K40). The code has to run 1 million single-bit comparisons. All the comparisons are independent ...
3
votes
0
answers
192
views
How can I improve this matrix product calculation in OpenCL?
I am trying to compute a matrix-matrix product of N stacked complex double N x N matrices. For simplicity, I assume N = 512. I have written code in C++ parallelized with OMP and using OpenBLAS for the ...
3
votes
2
answers
6k
views
GPU vs CPU FLOP counts
I apologise if this is somewhat of a rookie question. So, from my understanding, on a GPU board, far more of the space is allocated to ALUs compared to CPUs which have far more cache available. This ...
3
votes
1
answer
455
views
Solving DAE in Julia using GPUs
I'm trying to solve a Differential Algebraic Equation (DAE) in Julia which is very computationally expensive using GPUs. I'm brand new to Julia and don't have much experience coding with GPUs. The ...
4
votes
0
answers
181
views
How amenable is this 2D Frenkel–Kontorova-like energy minimization problem in Python to the use of a modest PC + GPU? (Heavy reliance on indexing)
@Richard's answer to Going to try to move some of my scipy/numpy calculation to a new GPU, how to avoid disappointing results? is quite helpful, and as promised I've added a simple running example ...
4
votes
1
answer
596
views
Going to try to move some of my scipy/numpy calculation to a new GPU, how to avoid disappointing results?
update: I've refactored the question based on helpful advice in the linked meta.
I'm a heavy user of Python's NumPy and SciPy (and not much else) and for years I could run anything I need on my laptop....
6
votes
1
answer
506
views
PETSc-like library for Julia
I want to build an application for Material Point Method (and probably other meshfree methods too) in Julia and I am looking for library for direct and iterative solvers that can help me with it. One ...