Less freezing time in multiprocessing with large arrays #440
A patch for config_dict parameter
I think the best workaround (for now) is to make TPOT not use multiprocessing when sklearn has this same problem, right? e.g. if you use
…r_fix A hot patch for TPOT MDR for supporting latest version of scikit-MDR
A patch for fixing the issue 454
Looks like something broke with the rest of the merges. :-(
Conflicts fixed.
I also added the patches from the master branch (0.7.5) to this PR.
What does this PR do?
Change the backend of the max_eval_time_mins parameter, which controls how many minutes TPOT has to optimize a single pipeline. The new backend is based on the stopit module.
Change the multiprocessing backend from joblib to dask.
Fix an old issue on Windows: TPOT now allows Control+C during the optimization process when n_jobs != 1.
Add a new unit test for the timeout function.
Add check_X_y for checking the dataset format.
Update documentation.
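For reviewers unfamiliar with thread-based timeouts, here is a minimal sketch of the per-pipeline time limit described above. It is not the code in this PR and does not use stopit. It is a standard-library stand-in that raises an exception asynchronously in the evaluating thread once the budget is spent. The names `run_with_time_limit`, `EvaluationTimeout`, and `slow_pipeline`, as well as the `"Timeout"` sentinel, are assumptions made for illustration.

```python
import ctypes
import threading
import time


class EvaluationTimeout(Exception):
    """Raised inside the evaluating thread when its time budget runs out."""


def run_with_time_limit(func, seconds, default="Timeout"):
    """Call func() and return its result, or `default` if it exceeds `seconds`.

    Hypothetical helper (CPython only). The exception is delivered between
    bytecode instructions, so a long-running C call is not interrupted until
    it returns to the interpreter.
    """
    worker_id = threading.get_ident()

    def interrupt():
        # Ask the interpreter to raise EvaluationTimeout in the worker thread.
        ctypes.pythonapi.PyThreadState_SetAsyncExc(
            ctypes.c_ulong(worker_id), ctypes.py_object(EvaluationTimeout)
        )

    timer = threading.Timer(seconds, interrupt)
    timer.start()
    try:
        return func()
    except EvaluationTimeout:
        return default
    finally:
        # Stop the timer if the call finished within its budget.
        timer.cancel()


def slow_pipeline():
    """Stand-in for a pipeline evaluation that never finishes on its own."""
    while True:
        time.sleep(0.01)


fast_result = run_with_time_limit(lambda: 42, seconds=5)
slow_result = run_with_time_limit(slow_pipeline, seconds=0.2)
```

With a sentinel return value, the caller can treat a timed-out pipeline as a failed evaluation and continue with the rest of the population instead of hanging.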
Where should the reviewer start?
base.py
How should this PR be tested?
Any background context you want to provide?
parallel processes freezing when matrices are too big
Joblib hangs without crashing
What are the relevant issues?
#436 #422
Questions:
stopit and dask