-
Notifications
You must be signed in to change notification settings - Fork 33
Update databricks 17.3 [skip ci] #1017
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 7 commits
2e37fa2
39bab9a
7a0a824
c71e947
2621a40
d9a4cca
9e5cab0
cd18ca4
fe653b1
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
|
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Please update copyright date to 2026 while you are changing this file. |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,5 +1,5 @@ | ||
| #!/bin/bash | ||
| # Copyright (c) 2025, NVIDIA CORPORATION. | ||
| # Copyright (c) 2026, NVIDIA CORPORATION. | ||
| # | ||
| # Licensed under the Apache License, Version 2.0 (the "License"); | ||
| # you may not use this file except in compliance with the License. | ||
|
|
@@ -15,21 +15,16 @@ | |
|
|
||
| set -ex | ||
|
|
||
| # IMPORTANT: specify RAPIDS_VERSION fully 23.10.0 and not 23.10 | ||
| # also in general, RAPIDS_VERSION (python) fields should omit any leading 0 in month/minor field (i.e. 23.8.0 and not 23.08.0) | ||
| # while SPARK_RAPIDS_VERSION (jar) should have leading 0 in month/minor (e.g. 23.08.2 and not 23.8.2) | ||
| RAPIDS_VERSION=25.12.0 | ||
| SPARK_RAPIDS_VERSION=25.12.0 | ||
| # IMPORTANT: specify RAPIDS_VERSION fully 26.4.0 and not 26.4 | ||
| # also in general, RAPIDS_VERSION (python) fields should omit any leading 0 in month/minor field (i.e. 26.4.0 and not 26.04.0) | ||
| # while SPARK_RAPIDS_VERSION (jar) should have leading 0 in month/minor (e.g. 26.04.2 and not 26.4.2) | ||
| # | ||
| # Note also that sometimes the jar and python packages will have different patch versions published and available at any time, | ||
| # so the versions may not perfectly align. This is expected and should not cause issues. | ||
| RAPIDS_VERSION=26.4.0 | ||
|
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think we need to keep this at 25.12.0 for now, till we do an update to match cuML changes. This is forthcoming for 26.6.0. The Spark rapids version bump is fine.
muzza-lovelytics marked this conversation as resolved.
Outdated
|
||
| SPARK_RAPIDS_VERSION=26.04.2 | ||
|
|
||
| curl -L https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/${SPARK_RAPIDS_VERSION}/rapids-4-spark_2.12-${SPARK_RAPIDS_VERSION}-cuda12.jar -o /databricks/jars/rapids-4-spark_2.12-${SPARK_RAPIDS_VERSION}.jar | ||
|
|
||
| # install cudatoolkit 12.2 via runfile approach | ||
| wget https://developer.download.nvidia.com/compute/cuda/12.2.2/local_installers/cuda_12.2.2_535.104.05_linux.run | ||
| sh cuda_12.2.2_535.104.05_linux.run --silent --toolkit | ||
|
|
||
| # reset symlink and update library loading paths | ||
| rm /usr/local/cuda | ||
| ln -s /usr/local/cuda-12.2 /usr/local/cuda | ||
| curl -L https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.13/${SPARK_RAPIDS_VERSION}/rapids-4-spark_2.13-${SPARK_RAPIDS_VERSION}-cuda12.jar -o /databricks/jars/rapids-4-spark_2.13-${SPARK_RAPIDS_VERSION}.jar | ||
|
|
||
| # upgrade pip | ||
| /databricks/python/bin/pip install --upgrade pip | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rapids-4-spark_2.13-26.4.2.jar(no leading zero), but the init script setsSPARK_RAPIDS_VERSION=26.04.2and downloads the jar to/databricks/jars/rapids-4-spark_2.13-26.04.2.jar(with leading zero). The comment in the script even documents this convention: "SPARK_RAPIDS_VERSION (jar) should have leading 0 in month/minor (e.g. 26.04.2 and not 26.4.2)". At runtime, Spark executors will fail to start because the PYTHONPATH points to a file that was never written to disk.