Skip to content

[kerberized HDFS] One of --proxy-user or --principal can be provided #2783

Answered by pan3793
fuhaiq asked this question in Q&A
Discussion options

You must be logged in to vote

Generally, we encourage using proxy user mode instead of providing superuser keytab to Spark application directly for security purposes. The reason is, that users can access all resources including keytab if you provided throw Scala/Spark api, in other words, providing a superuser keytab to Spark application means exposing your superuser keytab to anyone.

In practice, I know there are two approaches that have been adopted widely,

  1. Job submitter service(Kyuubi plays this role) holds the superuser keytab, and uses proxy user to submit jobs
  2. Job submitter service holds every user keytabs, and uses their own keytab to submit jobs

You should choose one of two.

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
2 replies
@fuhaiq
Comment options

@pan3793
Comment options

pan3793 May 31, 2022
Collaborator

Answer selected by fuhaiq
Comment options

You must be logged in to vote
1 reply
@fuhaiq
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants