Use CUDA unified memory for Random123 state #595

olupton · 2021-07-20T15:05:11Z

Description
This changes how we allocate state variables for Random123 in GPU-enabled builds: the state now lives in CUDA unified memory. This simplifies the allocation code and enables use cases like the one in #528, where CPU code needs to manipulate device-side state in ways that the existing API did not allow.

Because the state is now accessible from both CPU and GPU code, this also closes #345.
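
As an illustration of the idea described above, here is a minimal sketch of allocating a per-stream state so that the same pointer is valid on both host and device. It is not the code from this PR: `StreamState`, `allocate_shared`, `deallocate_shared`, `new_stream` and `delete_stream` are hypothetical names, and the sketch assumes that `__CUDACC__` is defined when the file is compiled as CUDA. Only `cudaMallocManaged` and `cudaFree` are real CUDA runtime calls. Without CUDA it falls back to ordinary host memory, so a single code path serves both kinds of build.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <cstring>
#include <new>
#ifdef __CUDACC__
#include <cuda_runtime.h>
#endif

// Hypothetical stand-in for a per-stream Random123 state (not CoreNEURON's struct).
struct StreamState {
    std::uint32_t counter[4];  // Philox4x32-style counter words
    std::uint32_t key[2];      // Philox4x32-style key words
};

// Return zero-initialised storage. In a CUDA build the storage is unified
// (managed) memory, addressable from host and device; otherwise it is host memory.
inline void* allocate_shared(std::size_t bytes) {
    assert(bytes > 0);
    void* ptr = nullptr;
#ifdef __CUDACC__
    if (cudaMallocManaged(&ptr, bytes) != cudaSuccess) {
        throw std::bad_alloc{};
    }
#else
    ptr = std::malloc(bytes);
    if (ptr == nullptr) {
        throw std::bad_alloc{};
    }
#endif
    std::memset(ptr, 0, bytes);  // host-side write; valid for managed memory too
    return ptr;
}

// Release storage obtained from allocate_shared.
inline void deallocate_shared(void* ptr) {
#ifdef __CUDACC__
    cudaFree(ptr);
#else
    std::free(ptr);
#endif
}

// Create a stream whose key words are set directly from CPU code.
inline StreamState* new_stream(std::uint32_t id1, std::uint32_t id2) {
    auto* s = static_cast<StreamState*>(allocate_shared(sizeof(StreamState)));
    s->key[0] = id1;
    s->key[1] = id2;
    return s;
}

inline void delete_stream(StreamState* s) {
    deallocate_shared(s);
}
```

With this layout, host code can read or update fields such as `s->counter[0]` directly and a kernel can dereference the same pointer, without a separate host-to-device copy API.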

How to test this?
Build and run with/without GPU support enabled.

Test System

OS: BB5
Compiler: NVHPC 21.2
Version: master
Backend: CPU/GPU

Use certain branches for the SimulationStack CI

CI_BRANCHES:NEURON_BRANCH=olupton/random123-unified-memory,

bbpbuildbot · 2021-07-20T15:34:29Z

Logfiles from GitLab pipeline #11298 (:no_entry:) have been uploaded here!

bbpbuildbot · 2021-07-21T08:05:29Z

Logfiles from GitLab pipeline #11336 (:no_entry:) have been uploaded here!

pramodk

LGTM!

coreneuron/utils/randoms/nrnran123.h

coreneuron/utils/randoms/nrnran123.cu

bbpbuildbot · 2021-07-21T10:31:23Z

Logfiles from GitLab pipeline #11367 (:white_check_mark:) have been uploaded here!

bbpbuildbot · 2021-07-21T12:39:44Z

Logfiles from GitLab pipeline #11394 (:white_check_mark:) have been uploaded here!

codecov-commenter · 2021-07-21T12:42:22Z

Codecov Report

Merging #595 (f8cc054) into master (9bd58f6) will decrease coverage by 0.01%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #595      +/-   ##
==========================================
- Coverage   56.46%   56.45%   -0.02%     
==========================================
  Files          99      100       +1     
  Lines        8021     8030       +9     
==========================================
+ Hits         4529     4533       +4     
- Misses       3492     3497       +5

Impacted Files	Coverage Δ
coreneuron/utils/randoms/nrnran123.cpp	`36.36% <ø> (-0.78%)`	⬇️
coreneuron/utils/randoms/nrnran123.h	`100.00% <100.00%> (ø)`
coreneuron/network/multisend_setup.cpp	`81.45% <0.00%> (+0.07%)`	⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

bbpbuildbot · 2021-07-21T13:08:49Z

Logfiles from GitLab pipeline #11415 (:no_entry:) have been uploaded here!

pramodk

Once channel-benchmark is validated to run with a single exe on CPU as well as GPU, this is good to go!

pramodk · 2021-07-23T10:06:00Z

Retest this please

olupton · 2021-07-23T10:20:49Z

It was tested locally that channel-benchmark produces consistent results with/without --gpu in a GPU-enabled build.

Note that a bug in NVHPC compiler versions 21.3 and 21.5 means that version 21.2 (or older?) or 21.7 (or hopefully newer!) must be used.
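
A build could guard against the affected compiler releases at compile time. The following is only a sketch: `nvhpc_release_reported_broken` is a hypothetical helper, the version list is taken solely from the comment above, and the check assumes the NVHPC predefined macros `__NVCOMPILER_MAJOR__` and `__NVCOMPILER_MINOR__` are available. With any other compiler the guard is skipped.

```cpp
#include <cassert>

// True for the NVHPC releases reported in this thread as affected (21.3 and 21.5).
// Other releases are not classified here; the thread only names these two.
constexpr bool nvhpc_release_reported_broken(int major, int minor) {
    return major == 21 && (minor == 3 || minor == 5);
}

// Assumption: NVHPC defines these macros; the guard is a no-op elsewhere.
#if defined(__NVCOMPILER_MAJOR__) && defined(__NVCOMPILER_MINOR__)
static_assert(!nvhpc_release_reported_broken(__NVCOMPILER_MAJOR__, __NVCOMPILER_MINOR__),
              "NVHPC 21.3 and 21.5 are reported as affected; use 21.2 or 21.7");
#endif
```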

bbpbuildbot · 2021-07-23T11:05:10Z

Logfiles from GitLab pipeline #11636 (:no_entry:) have been uploaded here!

bbpbuildbot · 2021-07-23T12:56:56Z

Logfiles from GitLab pipeline #11650 (:white_check_mark:) have been uploaded here!

iomaganaris

LGTM 👍

bbpbuildbot · 2021-07-23T18:06:07Z

Logfiles from GitLab pipeline #11694 (:no_entry:) have been uploaded here!

pramodk · 2021-07-23T18:24:26Z

Replayed the rxd-mpi tests and they passed: https://bbpgitlab.epfl.ch/hpc/coreneuron/-/jobs/38730 (the previous CI run hit a timeout)

* Unified Memory in Random123.
* Use OMP_Mutex instead of std::atomic, drop detail.
* Load gcc module for CUDA in Jenkins CI.

CoreNEURON Repo SHA: BlueBrain/CoreNeuron@864b712

olupton force-pushed the olupton/random123-unified-memory branch from ce4bc03 to 8576f8d Compare July 21, 2021 07:23

pramodk reviewed Jul 21, 2021

olupton force-pushed the olupton/random123-unified-memory branch from 5375bbd to f8cc054 Compare July 21, 2021 11:36

olupton mentioned this pull request Jul 21, 2021

Psolve direct neuronsimulator/nrn#1192

Merged

olupton force-pushed the olupton/random123-unified-memory branch from f8cc054 to fdc8aaf Compare July 21, 2021 12:58

olupton marked this pull request as ready for review July 21, 2021 13:04

pramodk approved these changes Jul 21, 2021

pramodk closed this Jul 23, 2021

pramodk reopened this Jul 23, 2021

olupton mentioned this pull request Jul 23, 2021

GPU-enabled builds should not call GPU APIs without --gpu #599

Closed

olupton added 3 commits July 23, 2021 13:04

Unified Memory in Random123.

9ca7ab9

Use OMP_Mutex instead of std::atomic, drop detail.

f8fd104

Load gcc module for CUDA in Jenkins CI.

769352a

olupton force-pushed the olupton/random123-unified-memory branch from fdc8aaf to 769352a Compare July 23, 2021 11:07

olupton mentioned this pull request Jul 23, 2021

Follow CoreNEURON change to use unified memory for Random123 neuronsimulator/nrn#1379

Merged

1 task

olupton closed this Jul 23, 2021

olupton reopened this Jul 23, 2021

olupton closed this Jul 23, 2021

olupton reopened this Jul 23, 2021

iomaganaris approved these changes Jul 23, 2021

Merge branch 'master' into olupton/random123-unified-memory

bdcee7a

pramodk merged commit 864b712 into master Jul 23, 2021

pramodk deleted the olupton/random123-unified-memory branch July 23, 2021 18:24

pramodk mentioned this pull request Jul 26, 2021

Efficient setup of random123 streams on GPU #587

Open

This was referenced Jul 29, 2021

Do not use __managed__ in GPU builds. Add utils. #606

Merged

Random123 global state is not propagated correctly #607

Closed

olupton mentioned this pull request Aug 12, 2021

Modernise CMake/CUDA and fix link issues #609

Merged

7 tasks

olupton mentioned this pull request Sep 3, 2021

NMODL generates OpenACC code that crashes on machines without GPUs. BlueBrain/nmodl#727

Open

olupton mentioned this pull request Apr 26, 2022

CoreNEURON memory allocation routines are confusing #806

Open

6 participants
