Skip to content

Commit

Permalink
set package repository
Browse files Browse the repository at this point in the history
Given the demise of Bintray, we now need to specificy the packages repository. See: https://spark.apache.org/news/new-repository-service.html
  • Loading branch information
michaelkamprath committed May 17, 2021
1 parent f2e398c commit 0a2c693
Show file tree
Hide file tree
Showing 2 changed files with 9 additions and 3 deletions.
7 changes: 5 additions & 2 deletions spark-qfs-swarm/jupyter-server/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -6,13 +6,15 @@ FROM qfs-master:latest
# /data/spark - Spark's data directory
#

ENV GRAPHFRAMES_VERSION 0.8.1-spark3.0-s_2.12

USER root
RUN apt-get install -y g++
RUN pip3 install \
notebook==5.7.9 \
jupyter_nbextensions_configurator \
jupyter_contrib_nbextensions \
yapf
yapf wget

COPY start-jupyter.sh /

Expand All @@ -24,6 +26,7 @@ RUN jupyter nbextension enable codefolding/main
RUN jupyter nbextension enable execute_time/ExecuteTime

RUN mkdir -p /home/spark/jupyter/runtime \
&& mkdir -p /home/spark/jupyter/notebooks
&& mkdir -p /home/spark/jupyter/notebooks \
&& mkdir -p /home/spark/jars/

CMD ["/bin/bash", "/start-jupyter.sh"]
5 changes: 4 additions & 1 deletion spark-qfs-swarm/jupyter-server/start-jupyter.sh
Original file line number Diff line number Diff line change
Expand Up @@ -4,4 +4,7 @@ SHELL=/bin/bash \
XDG_RUNTIME_DIR=/home/spark/jupyter/runtime \
PYSPARK_DRIVER_PYTHON=jupyter \
PYSPARK_DRIVER_PYTHON_OPTS="notebook --no-browser --port=7777 --notebook-dir=/home/spark/jupyter/notebooks --ip=0.0.0.0 --NotebookApp.password='' --NotebookApp.token=''" \
$SPARK_HOME/bin/pyspark --packages graphframes:graphframes:0.8.1-spark3.0-s_2.12 --master spark://spark-master:7077
$SPARK_HOME/bin/pyspark \
--packages graphframes:graphframes:$GRAPHFRAMES_VERSION \
--repositories https://repos.spark-packages.org/ \
--master spark://spark-master:7077

0 comments on commit 0a2c693

Please sign in to comment.