Skip to main content

Questions tagged [gpu]

Graphical Processing Unit - a specialized, relatively inexpensive hardware unit built for fast graphical computations and highly data-parallel scientific computations.

0 votes
1 answer
121 views

recommended simple linear solver on gpu

I'm looking for recommendations for a simple GPU linear equation system solver that is a dropin replacement for scipy.linalg.solve. Right now, I'd rather not go the Petsc/TriLinos/Slate route. ...
NNN's user avatar
  • 760
0 votes
0 answers
71 views

How is kernel fusion done?

I have a computational graph (DAG) consisting of element-wise operations (potentially with broadcasting) and reshape/reduce operations (reshaping/sum/max). I'm trying to understand how vertical kernel ...
ilya's user avatar
  • 121
2 votes
1 answer
421 views

How to leverage the GPU for parallel 3-body problem computations

I have a 3-body simulation which must run millions of times. As far as I know, the GPU shines when it gets to preform simple operations on huge matrices/arrays. Currently I'm debugging and running my ...
Remeraze's user avatar
5 votes
0 answers
139 views

Single precision vs double precision conjugate gradients

I tested my conjugate gradients implementation with float and double precision and contrary to my guess the double code was twice faster than the single precision code. The reason is that I need many ...
lightxbulb's user avatar
  • 2,267
0 votes
1 answer
289 views

Questions on the theory of distributed numerical algebraic computation

I'm trying to build a pure python distributed numerical algebra computation kernel based on GPU. but after I've learnt most of the software engineering, I realise that I'm seriously lacking in ...
Haitao Xiao's user avatar
2 votes
0 answers
83 views

OpenMP Fortran array operations in target regions

I am asking this here for the possibility that someone here is familiar with OpenMP offloading. Is there some other way to parallelise Fortran array operations outside of unrolling the loops of the ...
Touko Puro's user avatar
0 votes
1 answer
350 views

How can I convert a C program into a CUDA program?

Suppose I already have a C program that solves a specific computational problem. I want to convert that into a CUDA program. What steps should I follow to do that? For instance, can I think as follows?...
user366312's user avatar
2 votes
0 answers
171 views

Why GPU scaling (speedup) is usually worse than CPU scaling?

Let's define scaling, as how linear the speedup of using more than one GPU or CPU is. For example, having 2 GPUs gives you 2x faster execution time. I have noticed that in many software (e.g. ...
Daniel Wiczew's user avatar
0 votes
0 answers
58 views

Calculate the time taken to run an algorithm on GPU

I want to calculate the total time taken for a fixed code run using an NVIDIA GPU (for instance, Tesla K40). The code has to run 1 million single-bit comparisons. All the comparisons are independent ...
guptasonal's user avatar
3 votes
0 answers
192 views

How can I improve this matrix product calculation in OpenCL?

I am trying to compute a matrix-matrix product of N stacked complex double N x N matrices. For simplicity, I assume N = 512. I have written code in C++ parallelized with OMP and using OpenBLAS for the ...
sqrt6's user avatar
  • 31
3 votes
2 answers
6k views

GPU vs CPU FLOP counts

I apologise if this is somewhat of a rookie question. So, from my understanding, on a GPU board, far more of the space is allocated to ALUs compared to CPUs which have far more cache available. This ...
Enforce's user avatar
  • 141
3 votes
1 answer
455 views

Solving DAE in Julia using GPUs

I'm trying to solve a Differential Algebraic Equation (DAE) in Julia which is very computationally expensive using GPUs. I'm brand new to Julia and don't have much experience coding with GPUs. The ...
steve3051980's user avatar
4 votes
0 answers
181 views

How amenable is this 2D Frenkel–Kontorova-like energy minimization problem in Python to the use of a modest PC + GPU? (Heavy reliance on indexing)

@Richard's answer to Going to try to move some of my scipy/numpy calculation to a new GPU, how to avoid disappointing results? is quite helpful, and as promised I've added a simple running example ...
uhoh's user avatar
  • 1,068
4 votes
1 answer
596 views

Going to try to move some of my scipy/numpy calculation to a new GPU, how to avoid disappointing results?

update: I've refactored the question based on helpful advice in the linked meta. I'm a heavy user of Python's NumPy and SciPy (and not much else) and for years I could run anything I need on my laptop....
uhoh's user avatar
  • 1,068
6 votes
1 answer
506 views

PETSc-like library for Julia

I want to build an application for Material Point Method (and probably other meshfree methods too) in Julia and I am looking for library for direct and iterative solvers that can help me with it. One ...
lokit khemka's user avatar

15 30 50 per page
1
2 3 4 5
7