
Conversation

@weixuanfu commented May 2, 2017

What does this PR do?

  1. Change the backend of the max_eval_time_mins parameter, which controls how many minutes TPOT has to optimize a single pipeline. The new backend is based on the stopit module (see the first sketch after this list).

  2. Change the multiprocessing backend from joblib to dask (see the second sketch below).

  3. Fix an old issue on Windows: TPOT now allows Control+C to interrupt the optimization process when n_jobs != 1 on Windows.

  4. Add a new unit test for the timeout function.

  5. Add check_X_y to validate the input dataset format (see the third sketch below).

  6. Update documentation
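
For reviewers, here is a minimal sketch of how the stopit-based timeout works. evaluate_pipeline is a hypothetical stand-in for TPOT's internal evaluation routine, not its real name:

import stopit

def evaluate_with_timeout(pipeline, X, y, max_eval_time_mins):
    # Run the evaluation inside a ThreadingTimeout block; stopit raises an
    # asynchronous exception in the thread once the time limit elapses.
    with stopit.ThreadingTimeout(max_eval_time_mins * 60, swallow_exc=True) as ctx:
        score = evaluate_pipeline(pipeline, X, y)  # hypothetical helper
    if ctx.state != ctx.EXECUTED:
        return -float('inf')  # a timed-out pipeline counts as a failed evaluation
    return score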
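
And a sketch of the dask-based parallel evaluation that replaces joblib, under the same assumption about evaluate_pipeline. Recent dask releases take the scheduler= keyword shown here; older releases used get=dask.multiprocessing.get instead:

import dask

def evaluate_population(pipelines, X, y, n_jobs):
    # One delayed task per candidate pipeline; the process-based scheduler
    # fans the tasks out across n_jobs worker processes.
    tasks = [dask.delayed(evaluate_pipeline)(p, X, y) for p in pipelines]
    return dask.compute(*tasks, scheduler='processes', num_workers=n_jobs)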
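
The check_X_y validation itself is a one-liner from sklearn.utils:

from sklearn.utils import check_X_y

# Raises a clear ValueError for malformed input up front, instead of
# failing deep inside pipeline evaluation; also flattens y to 1d.
X, y = check_X_y(X, y, accept_sparse=True)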

Where should the reviewer start?

base.py

How should this PR be tested?

# coding: utf-8
from sklearn.datasets import make_classification
from tpot import TPOTClassifier
# Make a large dataset to reproduce the freezing behavior
X, y = make_classification(n_samples=50000, n_features=200,
                           n_informative=20, n_redundant=20,
                           n_classes=5, random_state=42)

# max_eval_time_mins=0.1 sets a 6-second limit for evaluating a single pipeline
tpot = TPOTClassifier(generations=5, population_size=50, offspring_size=100,
                      random_state=42, n_jobs=2, max_eval_time_mins=0.1,
                      verbosity=3)
tpot.fit(X, y)

Any background context you want to provide?

  • parallel processes freezing when matrices are too big
  • Joblib hangs without crashing

What are the relevant issues?

#436 #422



Questions:

  • Do the docs need to be updated? Yes; the docs are already updated in this PR.
  • Does this PR add new (Python) dependencies? Yes: stopit and dask.

@weixuanfu closed this May 2, 2017
@rhiever commented May 2, 2017

I think the best workaround (for now) is to make TPOT not use multiprocessing when n_jobs=1, and put warnings in the documentation that enabling multiprocessing (n_jobs!=1) may be slow and prone to freezing with very large datasets.

sklearn has this same problem, right? e.g. if you use cross_val_score with n_jobs!=1 for a very large dataset, it will also be slow and/or freeze.
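
For reference, the sklearn pattern in question is just (a minimal sketch; RandomForestClassifier is only an example estimator):

from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# With a very large X this can also be slow or hang outright, since the
# data must be shipped to each joblib worker process.
scores = cross_val_score(RandomForestClassifier(), X, y, cv=5, n_jobs=2)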

@weixuanfu changed the title from "Disable memmaping of large arrays for freezing issue in multiprocessing" to "Issue about large arrays for freezing in multiprocessing" May 2, 2017
@coveralls

Coverage Status

Coverage decreased (-0.9%) to 85.983% when pulling a4956d4 on weixuanfu2016:joblib_timeout into 7bea1ee on rhiever:development.

@rhiever commented May 23, 2017

Looks like something broke with the rest of the merges. :-(

@weixuanfu (Author)

Conflicts fixed

@coveralls

Coverage Status

Coverage decreased (-0.9%) to 86.186% when pulling 211eed9 on weixuanfu2016:joblib_timeout into 305701c on rhiever:development.

@weixuanfu (Author) commented May 23, 2017

I also added the patches from the master branch (0.7.5) to this PR.

@coveralls

Coverage Status

Coverage decreased (-1.03%) to 86.1% when pulling d8e1904 on weixuanfu2016:joblib_timeout into 305701c on rhiever:development.

@coveralls

Coverage Status

Coverage decreased (-0.3%) to 86.8% when pulling af01d55 on weixuanfu2016:joblib_timeout into 305701c on rhiever:development.

@rhiever closed this Jun 14, 2017
@weixuanfu deleted the joblib_timeout branch June 15, 2017 17:08
@weixuanfu restored the joblib_timeout branch June 26, 2017 14:08
