always serialize the function using cloudpickle #281

basnijholt · 2020-05-28T12:15:29Z

Description

This was a suggestion of @jbweston some time ago.

Using this, the learners are picklable regardless of the serialization package chosen.

This also means that we can e.g. use lambda functions when using a ProcessPoolExecutor and do not need loky per se.

I am making these changes now because I am finding that in some cases it's not preferable to use cloudpickle to serialize simple dictionaries (see cloudpipe/cloudpickle#375).

Checklist

Fixed style issues using pre-commit run --all (first install using pip install pre-commit)
pytest passed

Type of change

Check relevant option(s).

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

@jbweston

This was a suggestion of @jbweston some time ago. Using this, the leaners are picklable regardless of the serialization package chosen.

codecov-commenter · 2020-05-28T12:21:41Z

Codecov Report

Merging #281 into master will increase coverage by 0.15%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #281      +/-   ##
==========================================
+ Coverage   80.57%   80.72%   +0.15%     
==========================================
  Files          34       34              
  Lines        4591     4601      +10     
  Branches      828      828              
==========================================
+ Hits         3699     3714      +15     
+ Misses        769      764       -5     
  Partials      123      123

Impacted Files	Coverage Δ
adaptive/learner/average_learner.py	`86.51% <100.00%> (+0.30%)`	⬆️
adaptive/learner/integrator_learner.py	`91.42% <100.00%> (+0.05%)`	⬆️
adaptive/learner/learner1D.py	`92.85% <100.00%> (+0.04%)`	⬆️
adaptive/learner/learner2D.py	`79.43% <100.00%> (+0.14%)`	⬆️
adaptive/learner/sequence_learner.py	`88.46% <100.00%> (+0.30%)`	⬆️
adaptive/runner.py	`70.56% <0.00%> (+0.70%)`	⬆️
adaptive/_version.py	`47.66% <0.00%> (+2.80%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c339f01...574ba35. Read the comment docs.

ogrisel · 2020-05-29T10:02:17Z

As noted in cloudpipe/cloudpickle#375, the performance problem when serializing large collections (list, dicts, sets) of small object is caused by the fact that prior to Python 3.8, cloudpickle had not other choice than subclass the Python implementation of the Pickler class which is slow.

Since Python 3.8, cloudpickle can subclass the C implementation of the Pickler class which should fix this performance problem.

basnijholt · 2020-05-29T10:09:53Z

@ogrisel, thanks a lot for your explanation (also in cloudpipe/cloudpickle#375 (comment))!

jbweston

LGTM, but what about LearnerND?

always serialize the function using cloudpickle

Loading status checks…

574ba35

This was a suggestion of @jbweston some time ago. Using this, the leaners are picklable regardless of the serialization package chosen.

jbweston self-requested a review May 29, 2020

jbweston approved these changes May 29, 2020

View changes

python-adaptive / adaptive

always serialize the function using cloudpickle #281

always serialize the function using cloudpickle #281

basnijholt commented May 28, 2020 •

edited

codecov-commenter commented May 28, 2020

ogrisel commented May 29, 2020

basnijholt commented May 29, 2020

jbweston left a comment

python-adaptive / adaptive

always serialize the function using cloudpickle #281

always serialize the function using cloudpickle #281

Conversation

basnijholt commented May 28, 2020 • edited

Description

Checklist

Type of change

codecov-commenter commented May 28, 2020

Codecov Report

ogrisel commented May 29, 2020

basnijholt commented May 29, 2020

jbweston left a comment

basnijholt commented May 28, 2020 •

edited