[Bug]: Issue with Running Hop Pipeline on Beam Spark Pipeline Engine #4752
Comments
Have you tried the steps from our guide, and do they give this error? https://hop.apache.org//manual/latest/pipeline/beam/beam-samples-spark.html#_get_spark It seems like a jar might be missing, but it could also be a mismatch in Spark/Scala versions between Hop and your cluster.
Apache Beam Version: 2.61.0
Apache Spark Version: 3.5.1
Apache Hop Version: apache-hop-client-2.11.0
These are the versions we are using.
Can you tell us which versions we should use?
Our team has been working on this for more than 10 days, and we are following the same documentation.
decoders@decoders-Latitude-7480:~/Downloads/apache-hop-client-2.11.0/hop$ spark-submit
Error find metadata class for key null The metadata plugin for key null could not be found in the plugin registry org.apache.hop.core.exception.HopException: Error find metadata class for key null The metadata plugin for key null could not be found in the plugin registry
Caused by: org.apache.hop.core.exception.HopException: The metadata plugin for key null could not be found in the plugin registry
Caused by: org.apache.hop.core.exception.HopException:
25/01/03 17:26:51 INFO ShutdownHookManager: Shutdown hook called
This is the issue we are facing. We are following the documentation you shared.
seems to be working here...
Make sure to also follow the steps here; your metadata.json is wrong. They are also arguments to the fat-jar, so no need to add
decoders@decoders-Latitude-7480:~/Downloads/apache-hop-client-2.11.0/hop$ spark-submit
Error find metadata class for key null The metadata plugin for key null could not be found in the plugin registry org.apache.hop.core.exception.HopException: Error find metadata class for key null The metadata plugin for key null could not be found in the plugin registry
Caused by: org.apache.hop.core.exception.HopException: The metadata plugin for key null could not be found in the plugin registry
Caused by: org.apache.hop.core.exception.HopException:
This is the issue I'm facing.
The metadata is wrong. You need to export it from the client; it is not the same as the project-config.json, as explained in the second link I sent.
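To see for yourself that two JSON files are not interchangeable, you can compare their top-level keys. The following is only a minimal sketch: the file names in the usage comment are hypothetical, and the code assumes nothing about which keys Hop actually writes, only that each file holds a JSON object.

```python
import json


def top_level_keys(path):
    """Return the sorted top-level keys of the JSON object stored at `path`."""
    with open(path, encoding="utf-8") as handle:
        data = json.load(handle)
    if not isinstance(data, dict):
        raise ValueError(f"{path} does not contain a JSON object")
    return sorted(data)


def key_difference(first_path, second_path):
    """Return (keys only in the first file, keys only in the second file)."""
    first = set(top_level_keys(first_path))
    second = set(top_level_keys(second_path))
    return sorted(first - second), sorted(second - first)


# Hypothetical usage; replace the paths with your own files:
# only_in_export, only_in_config = key_difference("metadata.json", "project-config.json")
```

If the two lists are largely disjoint, the file passed to spark-submit is probably not the one exported from the client.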
Apache Hop version?
2.61.0
Java version?
17
Operating system
Linux
What happened?
I am reaching out to seek your assistance with an issue I am encountering while running a simple Hop pipeline using the Beam Spark pipeline engine. The pipeline executes without any issues when using the default local runner, but it fails when I switch to the Beam Spark engine.
Below are the details of my setup:
Apache Beam Version: 2.61.0
Apache Spark Version: 3.5.1
Apache Hop Version: apache-hop-client-2.11.0
The error logs are as follows:
Error:
2025/01/02 15:30:53 - Hop - Pipeline opened.
2025/01/02 15:30:53 - Hop - Launching pipeline [hop_trans2]...
2025/01/02 15:30:53 - Hop - Started the pipeline execution.
java.lang.NoClassDefFoundError: org/apache/spark/metrics/source/Source
at java.base/java.lang.Class.getDeclaredMethods0(Native Method)
...
2025/01/02 15:30:53 - Hop - ERROR: hop_trans2: preparing pipeline execution failed
2025/01/02 15:30:53 - Hop - ERROR: org.apache.hop.core.exception.HopException:
2025/01/02 15:30:53 - Hop - Error preparing remote pipeline
2025/01/02 15:30:53 - Hop - Error converting Hop pipeline to Beam
2025/01/02 15:30:53 - Hop -
2025/01/02 15:30:53 - Hop - Caused by: java.lang.NoClassDefFoundError: org/apache/spark/metrics/source/Source
2025/01/02 15:30:53 - Hop - at java.base/java.lang.Class.getDeclaredMethods0(Native Method)
...
It appears that the issue is related to the org.apache.spark.metrics.source.Source class not being found.
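A NoClassDefFoundError of this kind generally means that no jar on the classpath provides the class. One way to check is to scan a directory of jars for the class file. This is a minimal sketch, not part of Hop or Spark: the function name and the directory in the usage comment are hypothetical, and the only fact taken from the log is the class name.

```python
import zipfile
from pathlib import Path

# Class reported missing in the stack trace, written as a path inside a jar.
MISSING_CLASS = "org/apache/spark/metrics/source/Source.class"


def jars_containing(directory, class_entry=MISSING_CLASS):
    """Return the jar files under `directory` that contain `class_entry`."""
    hits = []
    for jar in sorted(Path(directory).rglob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as archive:
                if class_entry in archive.namelist():
                    hits.append(jar)
        except zipfile.BadZipFile:
            # Skip files that are named .jar but are not valid archives.
            continue
    return hits


# Hypothetical usage; point it at your Hop or Spark installation directory:
# for jar in jars_containing("/path/to/install"):
#     print(jar)
```

An empty result for the directory the engine loads its jars from would be consistent with a missing jar; a hit in a jar built for a different Spark or Scala version would point towards a version mismatch instead.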
Issue Priority
Priority: 3
Issue Component
Component: Hop Config