[TMVA] Add multiprocessing to parallelise CV #1869
When doing CV, we reuse the same DataSet (and the corresponding list of results). Thus we need to empty the DataSet results between folds. A convenient way of doing this was missing.
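To illustrate why the shared results need emptying, here is a minimal sketch. It assumes a toy dataset class: `ToyDataSet`, `ClearAllResults` and `RunFolds` are hypothetical names for illustration only and are not the TMVA interface touched by this commit.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a dataset that caches per-method results.
// This is NOT the TMVA::DataSet interface.
class ToyDataSet {
public:
   void AddResult(const std::string &method, double value) { fResults[method].push_back(value); }

   std::size_t NResults(const std::string &method) const
   {
      auto it = fResults.find(method);
      return it == fResults.end() ? 0 : it->second.size();
   }

   // Drop every cached result so the next fold starts from a clean state.
   void ClearAllResults() { fResults.clear(); }

private:
   std::map<std::string, std::vector<double>> fResults;
};

// Run nFolds folds on one shared dataset and record how many results
// the dataset holds after each fold.
std::vector<std::size_t> RunFolds(ToyDataSet &data, std::size_t nFolds,
                                  std::size_t eventsPerFold, bool clearBetweenFolds)
{
   std::vector<std::size_t> counts;
   for (std::size_t fold = 0; fold < nFolds; ++fold) {
      if (clearBetweenFolds)
         data.ClearAllResults();
      for (std::size_t i = 0; i < eventsPerFold; ++i)
         data.AddResult("BDT", 0.5); // placeholder response value
      counts.push_back(data.NResults("BDT"));
   }
   return counts;
}
```

Without the clearing step, the result count grows with every fold, so a later fold would see entries left over from an earlier one.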
In CV, a bug caused erroneous results to be added to the DataSet (the size of those results was smaller than expected).
Sometimes methods add extra results to the DataSet, e.g. if `CreateMvaPDFs` is specified. This was not correctly picked up by CV.
- Variable renaming ce -> cv.
- Remove extraneous text output from method logs.
Build failed on slc6/gcc62.
Build failed on ubuntu16/native.
Build failed on mac1013/native.
@phsft-bot build
Build failed on centos7/gcc49.
Build failed on mac1013/native.
Build failed on centos7/gcc49.
@phsft-bot build
Build failed on slc6/gcc62.
Build failed on ubuntu16/native.
Based on PR #858. Implements only parts relevant for CV.
A rudimentary performance benchmark with ~60000 events and 2-fold CV shows a 4-second speed-up.
In a different example, with a deeper BDT forest and 10-fold CV, the required time is almost halved on my machine :)
For the second example, these changes to `TMVACrossValidation.C` were used.
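The idea of evaluating folds in separate processes can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from this PR: it assumes a POSIX system, uses plain `fork` and `pipe` rather than any ROOT facility, and `RunFoldsInProcesses` and the `fold` callback are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <stdexcept>
#include <vector>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

// Evaluate fold(i) for i in [0, nFolds), one child process per fold.
// Each child sends its score back to the parent through a pipe.
// Hypothetical helper for illustration; POSIX only.
std::vector<double> RunFoldsInProcesses(std::size_t nFolds,
                                        const std::function<double(std::size_t)> &fold)
{
   std::vector<int> readEnds(nFolds, -1);
   std::vector<pid_t> children(nFolds, -1);

   for (std::size_t i = 0; i < nFolds; ++i) {
      int fds[2];
      if (pipe(fds) != 0)
         throw std::runtime_error("pipe failed");
      pid_t pid = fork();
      if (pid < 0) {
         close(fds[0]);
         close(fds[1]);
         throw std::runtime_error("fork failed");
      }
      if (pid == 0) {
         // Child: compute one fold, report the score, exit without
         // running the parent's cleanup handlers.
         close(fds[0]);
         double score = fold(i);
         ssize_t n = write(fds[1], &score, sizeof(score));
         close(fds[1]);
         _exit(n == static_cast<ssize_t>(sizeof(score)) ? 0 : 1);
      }
      // Parent: keep only the read end and move on to the next fold.
      close(fds[1]);
      readEnds[i] = fds[0];
      children[i] = pid;
   }

   // Collect the scores in fold order and reap the children.
   std::vector<double> scores(nFolds, 0.0);
   for (std::size_t i = 0; i < nFolds; ++i) {
      double score = 0.0;
      ssize_t n = read(readEnds[i], &score, sizeof(score));
      close(readEnds[i]);
      int status = 0;
      waitpid(children[i], &status, 0);
      if (n != static_cast<ssize_t>(sizeof(score)))
         throw std::runtime_error("fold did not report a score");
      scores[i] = score;
   }
   return scores;
}
```

A caller would pass the number of folds and a callable that trains and evaluates one fold, for example `RunFoldsInProcesses(10, trainAndEvaluateFold)` with a user-supplied `trainAndEvaluateFold`. Because each child works on its own copy of the parent's memory, folds cannot overwrite each other's results, at the cost of having to send every result back explicitly.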