[SPARK-50331][INFRA][FOLLOW-UP] Skip Torch/DeepSpeed tests in MacOS PySpark Daily test #48900

zhengruifeng · 2024-11-20T03:59:32Z

What changes were proposed in this pull request?

Skip Torch/DeepSpeed tests in MacOS PySpark Daily test

https://github.com/apache/spark/actions/runs/11921746968/job/33226552068

Why are the changes needed?

we don't need to test them on MacOS

Does this PR introduce any user-facing change?

no, test only

How was this patch tested?

They should be skipped due to no installation:

spark/python/pyspark/ml/torch/tests/test_data_loader.py

Line 36 in b724955

@unittest.skipIf(not have_torch, "torch is required")

manually test in my local MacOS:

(spark_312) ➜  spark git:(master) python/run-tests -k --python-executables python3 --testnames 'pyspark.ml.torch.tests.test_data_loader'
Running PySpark tests. Output is in /Users/ruifeng.zheng/Dev/spark/python/unit-tests.log
Will test against the following Python executables: ['python3']
Will test the following Python tests: ['pyspark.ml.torch.tests.test_data_loader']
python3 python_implementation is CPython
python3 version is: Python 3.12.7
Starting test(python3): pyspark.ml.torch.tests.test_data_loader (temp output: /Users/ruifeng.zheng/Dev/spark/python/target/c7bf947a-a746-4519-b475-aed24a1f8cec/python3__pyspark.ml.torch.tests.test_data_loader__1yq21_l5.log)
Finished test(python3): pyspark.ml.torch.tests.test_data_loader (0s) ... 1 tests were skipped
Tests passed in 0 seconds

Skipped tests in pyspark.ml.torch.tests.test_data_loader with python3:
      test_data_loader (pyspark.ml.torch.tests.test_data_loader.TorchDistributorDataLoaderUnitTests.test_data_loader) ... skip (0.001s)

Was this patch authored or co-authored using generative AI tooling?

No

LuciferYang

LGTM

zhengruifeng · 2024-11-20T04:40:41Z

thanks, merged to master

…val` not installed ### What changes were proposed in this pull request? #48900 skipped relevant test cases on macOS by no longer installing dependencies related to `torch` and `deepseed`. Subsequently, some test cases in `ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests` failed due to the absence of `torch/torcheval`: ``` ERROR (3.456s) test_crossvalidator_with_fold_col (pyspark.ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests.test_crossvalidator_with_fold_col) ... ERROR (2.550s) test_fit_maximize_metric (pyspark.ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests.test_fit_maximize_metric) ... ERROR (0.550s) test_fit_minimize_metric (pyspark.ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests.test_fit_minimize_metric) ... ERROR (0.544s) test_gen_avg_and_std_metrics (pyspark.ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests.test_gen_avg_and_std_metrics) ... ok (0.539s) ====================================================================== ERROR [9.991s]: test_copy (pyspark.ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests.test_copy) ---------------------------------------------------------------------- Traceback (most recent call last): File "/Users/runner/work/spark/spark/python/pyspark/ml/tests/connect/test_legacy_mode_tuning.py", line 119, in test_copy cvModel = cv.fit(dataset) ^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/base.py", line 105, in fit return self._fit(dataset) ^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/tuning.py", line 435, in _fit for j, metric in pool.imap_unordered(lambda f: f(), tasks): File "/opt/homebrew/Cellar/python3.11/3.11.10/Frameworks/Python.framework/Versions/3.11/lib/python3.11/multiprocessing/pool.py", line 873, in next raise value File "/opt/homebrew/Cellar/python3.11/3.11.10/Frameworks/Python.framework/Versions/3.11/lib/python3.11/multiprocessing/pool.py", line 125, in worker result = (True, func(*args, **kwds)) ^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/tuning.py", line 435, in <lambda> for j, metric in pool.imap_unordered(lambda f: f(), tasks): ^^^ File "/Users/runner/work/spark/spark/python/pyspark/util.py", line 423, in wrapped return f(*args, **kwargs) # type: ignore[misc, operator] ^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/tuning.py", line 186, in single_task metric = evaluator.evaluate( ^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/base.py", line 254, in evaluate return self._evaluate(dataset) ^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/evaluation.py", line 57, in _evaluate torch_metric = self._get_torch_metric() ^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/evaluation.py", line 125, in _get_torch_metric import torcheval.metrics as torchmetrics ModuleNotFoundError: No module named 'torcheval' ====================================================================== ERROR [3.456s]: test_crossvalidator_on_pipeline (pyspark.ml.tests.connect.test_legacy_mode_tuning.CrossValidatorTests.test_crossvalidator_on_pipeline) ---------------------------------------------------------------------- Traceback (most recent call last): File "/Users/runner/work/spark/spark/python/pyspark/ml/tests/connect/test_legacy_mode_tuning.py", line 210, in test_crossvalidator_on_pipeline cv_model = cv.fit(train_dataset) ^^^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/base.py", line 105, in fit return self._fit(dataset) ^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/tuning.py", line 435, in _fit for j, metric in pool.imap_unordered(lambda f: f(), tasks): File "/opt/homebrew/Cellar/python3.11/3.11.10/Frameworks/Python.framework/Versions/3.11/lib/python3.11/multiprocessing/pool.py", line 873, in next raise value File "/opt/homebrew/Cellar/python3.11/3.11.10/Frameworks/Python.framework/Versions/3.11/lib/python3.11/multiprocessing/pool.py", line 125, in worker result = (True, func(*args, **kwds)) ^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/tuning.py", line 435, in <lambda> for j, metric in pool.imap_unordered(lambda f: f(), tasks): ^^^ File "/Users/runner/work/spark/spark/python/pyspark/util.py", line 423, in wrapped return f(*args, **kwargs) # type: ignore[misc, operator] ^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/tuning.py", line 185, in single_task model = estimator.fit(train, param_map) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/base.py", line 103, in fit return self.copy(params)._fit(dataset) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/pipeline.py", line 201, in _fit model = stage.fit(dataset) # type: ignore[attr-defined] ^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/base.py", line 105, in fit return self._fit(dataset) ^^^^^^^^^^^^^^^^^^ File "/Users/runner/work/spark/spark/python/pyspark/ml/connect/classification.py", line 219, in _fit import torch ModuleNotFoundError: No module named 'torch' .... ``` Therefore, this pull request adds corresponding conditions to `CrossValidatorTests` to skip tests when `torch/torcheval` is not installed. ### Why are the changes neede Skip `CrossValidatorTests` if `torch/torcheval` not installed ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Pass Github Actions ### Was this patch authored or co-authored using generative AI tooling? No Closes #48901 from LuciferYang/pyspark-on-macos-split-mltest. Authored-by: yangjie01 <[email protected]> Signed-off-by: Ruifeng Zheng <[email protected]>

dongjoon-hyun · 2024-11-20T16:58:15Z

Thank you, @zhengruifeng .

test

d37c2f4

zhengruifeng requested a review from LuciferYang November 20, 2024 03:59

github-actions bot added the INFRA label Nov 20, 2024

LuciferYang approved these changes Nov 20, 2024

View reviewed changes

zhengruifeng closed this in 19b8250 Nov 20, 2024

zhengruifeng deleted the macos_py_lib branch November 20, 2024 04:40

LuciferYang mentioned this pull request Nov 20, 2024

[SPARK-50362][PYTHON][ML][TESTS] Skip CrossValidatorTests if torch/torcheval not installed #48901

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-50331][INFRA][FOLLOW-UP] Skip Torch/DeepSpeed tests in MacOS PySpark Daily test #48900

[SPARK-50331][INFRA][FOLLOW-UP] Skip Torch/DeepSpeed tests in MacOS PySpark Daily test #48900

Uh oh!

zhengruifeng commented Nov 20, 2024 •

edited

Loading

Uh oh!

LuciferYang left a comment

Uh oh!

zhengruifeng commented Nov 20, 2024

Uh oh!

dongjoon-hyun commented Nov 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-50331][INFRA][FOLLOW-UP] Skip Torch/DeepSpeed tests in MacOS PySpark Daily test #48900

[SPARK-50331][INFRA][FOLLOW-UP] Skip Torch/DeepSpeed tests in MacOS PySpark Daily test #48900

Uh oh!

Conversation

zhengruifeng commented Nov 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

LuciferYang left a comment

Choose a reason for hiding this comment

Uh oh!

zhengruifeng commented Nov 20, 2024

Uh oh!

dongjoon-hyun commented Nov 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zhengruifeng commented Nov 20, 2024 •

edited

Loading