TST Speed-up test suite when using pytest-xdist #25918
Conversation
Thanks for working on this, looking forward to seeing the impacts of those changes.
I agree that using only physical cores by default is probably a good idea.
```python
max_n_threads = min(
    omp_get_max_threads(),
    cpu_count(only_physical_cores=only_physical_cores)
)
```
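A minimal self-contained sketch of the capping logic in the snippet above. It uses `os.cpu_count()` as a stand-in for loky's `cpu_count(only_physical_cores=...)` and takes the OpenMP maximum as a plain argument in place of `omp_get_max_threads()`; the halving heuristic for physical cores is an assumption for illustration, not how the real helper works:

```python
import os

def max_n_threads(omp_max_threads, only_physical_cores=False):
    """Cap the OpenMP thread budget at the number of available cores."""
    n_cores = os.cpu_count() or 1  # logical cores; loky can report physical ones
    if only_physical_cores:
        # Crude stand-in for a physical-core count on a typical
        # hyper-threaded CPU; the real helper inspects the CPU topology.
        n_cores = max(1, n_cores // 2)
    return min(omp_max_threads, n_cores)
```

With `only_physical_cores=False` (the default chosen in this PR), the behavior is unchanged from simply capping at the logical core count.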
This change impacts scikit-learn in general, not just the test suite. I think we should document it in the changelog.
I set the default to False, so it has no impact on scikit-learn. We can discuss whether or not we want to change the default, but I think that requires more care and some experiments (IIRC some discussions in histgbt weren't conclusive about that).
Indeed I had missed that. +1 for keeping the PR focused on the tests for now.
We just found a case of this problem here: #25714 (comment).
Well, I think it'll be hard to see in a single CI run because the fluctuations are huge (several minutes from one run to another). For reference, on my laptop it reduced the duration from 14min to 11min30s.
And somehow the macOS jobs and some of the Linux ones are crazy high...
Starting to look pretty good (need to do some clean-up now).
It's now a fixture with a session scope, but it's not used if we don't run the test suite for the whole package. It seems to be the same issue that we had for
I tried it locally and it seems to work even when limiting the tests to a given submodule / test pattern. I tried:
and I could check that it is actually limiting the number of threads to 2 (on a machine with 8 cores) by raising an exception with the content of
LGTM. It's both cleaner than before and seems to yield a measurable average speed-up when running the full test suite, although there is a lot of variability between CI runs.
Here are the Azure timings of this PR (left) vs the last run of #25930 (right):

This is a uniform improvement, although the slowest macOS run was not improved (but not degraded either).
Smaller but also uniform improvements on the Cirrus CI runs (this PR left vs #25930 right).
For the wheel builder workflow on GitHub Actions:

although I do not understand those numbers, because the individual "Build wheel for..." jobs do not seem to be uniformly improved: many have the same duration and some are degraded.
I redid the experiment described in #25918 (comment) locally with the new method (using

@lesteve I think your concern was taken into account, and based on the above timing results I think we can merge.
Let me just check if the conftest is discovered in the wheel builder.
All builds are red in #25930, so that shows that
Yep. Actually I just realized that the jobs of the wheel builder don't use pytest-xdist, so it's expected that the durations are not impacted, since the configuration in `pytest_configure` does the same thing as the previous configuration with the env vars.
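For illustration, the env-var configuration being referred to can be sketched like this. The variable names are the standard OpenMP/OpenBLAS/MKL thread controls; the function name is hypothetical, not scikit-learn's actual conftest code:

```python
import os

def limit_native_threads(n_threads):
    # These variables are only honored if set before the native runtimes
    # (OpenMP, OpenBLAS, MKL) are loaded, which is why a
    # pytest_configure-style hook has to run this as early as possible.
    for var in ("OMP_NUM_THREADS", "OPENBLAS_NUM_THREADS", "MKL_NUM_THREADS"):
        os.environ[var] = str(n_threads)

limit_native_threads(1)
```

Once a process has already initialized its BLAS/OpenMP runtime, these variables have no effect, which is where threadpoolctl's runtime limits come in instead.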
OK, let's merge this one, thanks a lot!
With this PR, most CI jobs will be running with single-threaded OpenMP/BLAS. I think only the wheel building jobs still run with multiple threads, because they do not install

Can we remove
It was already the case for OpenMP, since `_openmp_effective_n_threads` returned 2 and the number of xdist workers is >= 2 (the ratio being 1). I'm not opposed to removing pytest-xdist from one job, but we must do it on a fast job, because the reason for this PR was that the CI duration was becoming more and more frustrating :)
I'm okay with removing
+1 but let's be very explicit (in a comment) as to why we remove it then. |
I opened #25943 |
[EDITED]

`pytest_runtest_setup` is called once per test, but `_openmp_effective_n_threads` and `threadpool_limits` are not cheap. This brings a significant overhead for very quick tests. This PR uses a fixture with a session scope instead. I tried to use `pytest_sessionstart` but it was not run (I don't know exactly why, maybe because we run from a test folder).

`only_physical_cores` in `cpu_count`: restricting the number of OpenMP threads to the number of physical cores will probably speed things up. At least it does on my laptop, where I have an Intel CPU with hyper-threading. Anyway, even when it doesn't bring a speed-up, I'm pretty sure it won't bring a slow-down.
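A hedged sketch of what such a session-scoped fixture could look like in a `conftest.py`. This is an illustration, not the actual scikit-learn conftest: the fixture name is made up, it assumes threadpoolctl is available, and the limit of 2 mirrors the value checked earlier in the discussion:

```python
# conftest.py (sketch, assuming threadpoolctl is installed)
import pytest

@pytest.fixture(scope="session", autouse=True)
def _limit_threadpools():
    # Runs once per test session instead of once per test, avoiding the
    # per-test overhead of calling threadpool_limits repeatedly.
    try:
        from threadpoolctl import threadpool_limits
    except ImportError:
        yield  # nothing to cap if threadpoolctl is not installed
        return
    with threadpool_limits(limits=2):
        yield  # the limit stays in effect for the whole session
```

Because the fixture lives in a `conftest.py`, it only applies to tests collected under that directory, which matches the observation above that it is not used when running a submodule from elsewhere.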