[cxxmodules] Implement global module indexing to improve performance. #4016

vgvassilev · 2019-06-30T17:43:55Z

The global module index represents an efficient on-disk hash table which stores identifier->module mapping. Every time clang finds a unknown identifier we are informed and we can load the corresponding module on demand.

This way we can provide minimal set of loaded modules. Currently, we see that for hsimple.C only the half of the modules are loaded. This can be further improved because we currently load all modules which have an identifier, that is when looking for (for example TPad) we will load all modules which have the identifier TPad, including modules which contain only a forward declaration of it.

Kudos Arpitha Raghunandan (@arpi-r)!

We still need some performance measurements but the preliminary results are promising.

Performance

Methodology

We have a forwarding root.exe which essentially calls /usr/bin/time -v root.exe $@. We have processed and stored this information in csv files. We have run in three modes:

root master without modules (modulesoff)
root master with modules (moduleson)
root master with this PR with modules (gmi)

Run on Ubuntu 18.10 on Intel® Core™ i5-8250U CPU @ 1.60GHz × 8

Results Interpretation

A general comparison between 2) and 3) show that this PR makes ROOT about 3% faster and 25% more memory efficient.

A general comparison between 1) and 3) shows that modules are still less efficient in a few cases which is expected because the PR loads more modules than it should. This will be addressed in subsequent PRs. A good trend is that some test already show that 3) is better than 1).

The raw data could be found here. [work was done by Arpitha Raghunandan (@arpi-r)]

Depends on #4005.

phsft-bot · 2019-06-30T17:44:08Z

Starting build on ROOT-performance-centos7-multicore/default, ROOT-fedora27/noimt, ROOT-fedora29/python3, ROOT-ubuntu16/rtcxxmod, mac1014/cxx17, windows10/default
How to customize builds

phsft-bot · 2019-06-30T17:57:10Z

Build failed on ROOT-performance-centos7-multicore/default.
See console output.

Warnings:

/data/sftnight/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

And 18 more

phsft-bot · 2019-06-30T17:57:25Z

Build failed on mac1014/cxx17.
See console output.

Warnings:

/build/jenkins/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: unused function 'RegisterCxxModules' [-Wunused-function]

Failing tests:

And 13 more

phsft-bot · 2019-06-30T18:03:29Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

And 89 more

phsft-bot · 2019-06-30T18:04:16Z

Build failed on ROOT-fedora29/python3.
See console output.

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

And 13 more

phsft-bot · 2019-06-30T18:05:02Z

Build failed on ROOT-fedora27/noimt.
See console output.

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

And 13 more

phsft-bot · 2019-06-30T18:28:24Z

Starting build on ROOT-performance-centos7-multicore/default, ROOT-fedora27/noimt, ROOT-fedora29/python3, ROOT-ubuntu16/rtcxxmod, mac1014/cxx17, windows10/default
How to customize builds

phsft-bot · 2019-06-30T18:33:11Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Errors:

FAILED: cd /mnt/build/workspace/root-pullrequests-build/build/math/mathcore/test && /usr/local/bin/cmake -E env LD_LIBRARY_PATH=/mnt/build/workspace/root-pullrequests-build/build/lib: ROOTIGNOREPREFIX=1 /mnt/build/workspace/root-pullrequests-build/build/bin/rootcling -rootbuild -v2 -f G__TrackMathCoreUnitDict.cxx -cxxmodule -s /mnt/build/workspace/root-pullrequests-build/build/math/mathcore/test/libTrackMathCoreUnitDict.so -m libCore_rdict.pcm -m libMathCore_rdict.pcm -m libRIO_rdict.pcm -m libGenVector_rdict.pcm -excludePath /mnt/build/workspace/root-pullrequests-build/root -excludePath /mnt/build/workspace/root-pullrequests-build/build -I/mnt/build/workspace/root-pullrequests-build/root -I/mnt/build/workspace/root-pullrequests-build/build/etc/cling/ -I/mnt/build/workspace/root-pullrequests-build/build/include -I/mnt/build/workspace/root-pullrequests-build/root/math/mathcore/test -I/mnt/build/workspace/root-pullrequests-build/root/builtins/zlib -I/mnt/build/workspace/root-pullrequests-build/root/core/base/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/clib/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/cont/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/foundation/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/macosx/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/unix/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/winnt/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/clingutils/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/meta/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/textinput/inc -I/mnt/build/workspace/root-pullrequests-build/root/math/mathcore/v7/inc -I/mnt/build/workspace/root-pullrequests-build/root/math/mathcore/inc -I/mnt/build/workspace/root-pullrequests-build/root/math/mathcore -I/mnt/build/workspace/root-pullrequests-build/root/io/io/inc -I/mnt/build/workspace/root-pullrequests-build/root/core/clib/res -I/mnt/build/workspace/root-pullrequests-build/root/io/io/res -I/mnt/build/workspace/root-pullrequests-build/root/math/genvector/inc /mnt/build/workspace/root-pullrequests-build/root/math/mathcore/test/stress/TrackMathCore.h /mnt/build/workspace/root-pullrequests-build/root/math/mathcore/test/stress/TrackMathCoreLinkDef.h
Error: Had to build non-system module Smatrix implicitly. You first need to

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

phsft-bot · 2019-06-30T18:42:22Z

Build failed on ROOT-performance-centos7-multicore/default.
See console output.

Warnings:

/data/sftnight/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

phsft-bot · 2019-06-30T18:44:23Z

Build failed on ROOT-fedora29/python3.
See console output.

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

phsft-bot · 2019-06-30T18:47:01Z

Build failed on ROOT-fedora27/noimt.
See console output.

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

phsft-bot · 2019-06-30T18:58:20Z

Starting build on ROOT-performance-centos7-multicore/default, ROOT-fedora27/noimt, ROOT-fedora29/python3, ROOT-ubuntu16/rtcxxmod, mac1014/cxx17, windows10/default
How to customize builds

phsft-bot · 2019-06-30T19:17:18Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Warnings:

/mnt/build/workspace/root-pullrequests-build/root/core/metacling/src/TCling.cxx:1096:13: warning: ‘void RegisterCxxModules(cling::Interpreter&)’ defined but not used [-Wunused-function]

Failing tests:

And 74 more

phsft-bot · 2019-06-30T19:30:28Z

Starting build on ROOT-performance-centos7-multicore/default, ROOT-fedora27/noimt, ROOT-fedora29/python3, ROOT-ubuntu16/rtcxxmod, mac1014/cxx17, windows10/default
How to customize builds

phsft-bot · 2019-06-30T19:49:22Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Failing tests:

And 61 more

phsft-bot · 2019-06-30T20:10:06Z

Starting build on ROOT-performance-centos7-multicore/default, ROOT-fedora27/noimt, ROOT-fedora29/python3, ROOT-ubuntu16/rtcxxmod, mac1014/cxx17, windows10/default
How to customize builds

phsft-bot · 2019-06-30T20:29:13Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Failing tests:

And 61 more

vgvassilev · 2019-06-30T20:39:58Z

@phsft-bot build just on ROOT-ubuntu16/rtcxxmod with flags -DCTEST_TEST_EXCLUDE_NONE=On

phsft-bot · 2019-06-30T20:40:09Z

Starting build on ROOT-ubuntu16/rtcxxmod with flags -DCTEST_TEST_EXCLUDE_NONE=On
How to customize builds

phsft-bot · 2019-06-30T21:06:30Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Errors:

66/1871 Test Moved TMath's fundamental constants to TMathBase #475: tutorial-hist-fillrandom ..........................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.23 sec
544/1871 Test Temp workaround for ccache home dir issue #429: tutorial-graphs-gerrors ...........................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.22 sec
545/1871 Test Fix relocation problems in Aarch64 (backport) #430: tutorial-graphs-gerrors2 ..........................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.21 sec
547/1871 Test Update Enable If pattern in GenVector Vc test #431: tutorial-graphs-graph .............................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.23 sec
555/1871 Test [TDF] optimizations to the event loop #442: tutorial-graphs-hlGraph1 ..........................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.26 sec
569/1871 Test Patch GetVariableSettings() method to correctly report parameter limits. #455: tutorial-graphs-timeonaxis ........................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.19 sec
570/1871 Test [cling] Keep deferred decls #456: tutorial-graphs-timeonaxis2 .......................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.20 sec
611/1871 Test Avoid double increment to loop variable #499: tutorial-hist-ratioplotOld ........................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.23 sec
619/1871 Test Change the form of the separation calculation for histograms to be consistent with the form of the separation calculation for PDFs. #506: tutorial-hist-th2polyEurope .......................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.29 sec
630/1871 Test Fix warnings introduced by previous commit #514: tutorial-histfactory-hf001_example ................................................................***Failed Error regular expression found in output. Regex=[: error:] 0.81 sec

And 15 more

Failing tests:

And 119 more

phsft-bot · 2019-07-04T20:31:36Z

Starting build on ROOT-ubuntu16/rtcxxmod with flags -DCTEST_TEST_EXCLUDE_NONE=On
How to customize builds

phsft-bot · 2019-07-04T20:33:29Z

Build failed on ROOT-ubuntu16/rtcxxmod.
See console output.

Errors:

error: Failed to merge in the changes.
CMake Error at /mnt/build/workspace/root-pullrequests-build/rootspi/jenkins/root-build.cmake:861 (message):

phsft-bot · 2020-02-23T13:52:48Z

Build failed on ROOT-fedora30/cxx14.
See console output.

Failing tests:

projectroot.runtutorials.tutorial_v7_histops.cxx

phsft-bot · 2020-02-23T14:27:32Z

Build failed on ROOT-ubuntu18.04-i386/cxx14.
See console output.

Failing tests:

projectroot.runtutorials.tutorial_v7_histops.cxx

The global module index represents an efficient on-disk hash table which stores identifier->module mapping. Every time clang finds a unknown identifier we are informed and we can load the corresponding module on demand. This way we can provide minimal set of loaded modules. Currently, we see that for hsimple.C only the half of the modules are loaded. This can be further improved because we currently load all modules which have an identifier, that is when looking for (for example TPad) we will load all modules which have the identifier TPad, including modules which contain only a forward declaration of it. Kudos Arpitha Raghunandan (arpi-r)!

In cases where we have the module cache path and prebuilt module path pointing to the same location, the FileManager should not cache the module file lookup failure because it may currently be building the module. This patch is necessary because the global module index is built from the module cache path and it is loaded from the prebuilt module path. In a full explicit or implicit module build infrastructure this is not a problem. However, in a mixed scenario (where modules for third-party dependencies are built implicitly) such as ours this is problematic. One of the reasons is that we cannot configure the prebuilt modules paths or module cache paths externally. This is because the interpreter (at construction time) #includes RuntimeUniverse which may trigger module build. This patch allows us to refactor some of the code working around this issue.

When we are generating code, CodeGen automatically tries to complete decl's redeclaration chain. This ends up a call to the external sources, one of which is our global module index (GMI). The semantics of the GMI is to aid the frontend, that is to automatically import missing modules based on the typed *by the user* identifiers. It does not intend to aid the code generation by any means. Currently this happens when completing chains for existing identifiers such as 'volume' which happens to be an indentifier part of TMVA.pcm, too. Thus, CodeGen unintentionally loads the module of TMVA. This is an overkill, but worse -- it brings recursiveness to the identifier resolution system. This patch disables looking in the GMI at codegen time.

All reasibility checks are done in the routine itself and we also call it, when we need to complete type through LookupObject(TagDecl*).

The issue is that, it does not contain the identifier for TH1.

It does not contain identifier TFrame

Fixes some failing tests

This does not require resolving conflicts or bloating the history.

We do that to fix the: +Warning in <TClass::Init>: no dictionary for class TLorentzVector is available errors. This is due to the fact that TLorentzVector and such identifiers do not get to the module.idx. They are read in ::readIndex and visited in ::writeIndex however it seems that the on-disk writer trait is filtering somehow them out.

This is needed due to see this commit: fc24e98

By setting the ROOT_EXPERIMENTAL_GMI env variable. This is useful for comparing performance results from both implementations. ROOT_EXPERIMENTAL_GMI is temporary and will go away as soon as we switch to the GMI by default.

phsft-bot · 2020-02-24T08:52:06Z

Starting build on ROOT-performance-centos7-multicore/default, ROOT-fedora27/noimt, ROOT-fedora29/python3, ROOT-fedora30/cxx14, ROOT-ubuntu16/nortcxxmod, ROOT-ubuntu18.04-i386/cxx14, mac1014/cxx17, windows10/cxx14 with flags -DCTEST_TEST_EXCLUDE_NONE=On
How to customize builds

phsft-bot · 2020-02-24T09:16:45Z

Build failed on mac1014/cxx17.
See console output.

Failing tests:

projectroot.runtutorials.tutorial_v7_histops.cxx

phsft-bot · 2020-02-24T09:21:06Z

Build failed on ROOT-fedora30/cxx14.
See console output.

Failing tests:

projectroot.runtutorials.tutorial_v7_histops.cxx

phsft-bot · 2020-02-24T09:34:51Z

Build failed on ROOT-fedora27/noimt.
See console output.

phsft-bot · 2020-02-24T09:55:53Z

Build failed on ROOT-ubuntu18.04-i386/cxx14.
See console output.

Failing tests:

projectroot.runtutorials.tutorial_v7_histops.cxx

vgvassilev · 2020-02-24T10:12:17Z

The test failures are not related to this PR, they are present in the nightly builds.

vgvassilev requested review from Axel-Naumann and yamaguchi1024 June 30, 2019 17:43

vgvassilev requested review from amadio and pcanal as code owners June 30, 2019 17:43

oshadura assigned vgvassilev Jun 30, 2019

vgvassilev force-pushed the cxxmodules_global_index branch from b6a2f4c to 167d978 Compare June 30, 2019 18:28

vgvassilev force-pushed the cxxmodules_global_index branch from 167d978 to 4363501 Compare June 30, 2019 18:58

vgvassilev force-pushed the cxxmodules_global_index branch from 01c1356 to a2dea2b Compare June 30, 2019 20:09

vgvassilev and others added 20 commits February 24, 2020 10:51

[cxxmodules] Preload only common modules.

7c14fcf

[tcling] Modernize header file virtual -> override.

3db4b28

[tcling] Modernize header file: use inline initialization.

e96522a

[cxxmodules] Tighten the findInGlobalModuleIndex routine.

66d6e47

All reasibility checks are done in the routine itself and we also call it, when we need to complete type through LookupObject(TagDecl*).

[cxxmodules] Preload Hist because it has the same issue as Gpad.

9822d02

The issue is that, it does not contain the identifier for TH1.

Preload Graf for the same reason as Hist and Gpad

0766260

It does not contain identifier TFrame

Add GenVector FIXMEModules

a77d5ed

Fixes some failing tests

Do not resolve from require complete type.

661f9dd

If the identifier is unknown, return false.

1d59f46

Rename the interface, add documentation, make it bool.

4b38143

Preload modules which are not in the index.

9c06b0d

Add help how to automatically incorporate clang-format changes.

d84e4a0

This does not require resolving conflicts or bloating the history.

Preload Tree to fix the messy roottest-root-io-newstl-make

da1ec92

Do not load recursively modules.

c0014fd

Preload Smatrix, TreePlayer, Proof and Geom.

0eea44a

This is needed due to see this commit: fc24e98

[cxxmodules] Make the global module index opt-in.

774c08f

By setting the ROOT_EXPERIMENTAL_GMI env variable. This is useful for comparing performance results from both implementations. ROOT_EXPERIMENTAL_GMI is temporary and will go away as soon as we switch to the GMI by default.

vgvassilev force-pushed the cxxmodules_global_index branch from 7d4aee1 to 774c08f Compare February 24, 2020 08:51

vgvassilev merged commit a98c539 into root-project:master Feb 24, 2020

[cxxmodules] Implement global module indexing to improve performance. #4016

[cxxmodules] Implement global module indexing to improve performance. #4016

Uh oh!

Conversation

vgvassilev commented Jun 30, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Performance

Methodology

Results Interpretation

Uh oh!

phsft-bot commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Errors:

Warnings:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Warnings:

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Failing tests:

Uh oh!

phsft-bot commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Failing tests:

Uh oh!

vgvassilev commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Uh oh!

phsft-bot commented Jun 30, 2019

Errors:

Failing tests:

Uh oh!

phsft-bot commented Jul 4, 2019

Uh oh!

phsft-bot commented Jul 4, 2019

Errors:

Uh oh!

phsft-bot commented Feb 23, 2020

vgvassilev commented Jun 30, 2019 •

edited

Loading