-
Notifications
You must be signed in to change notification settings - Fork 509
[SPARK] Support azure reads using vended credentials #416
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK] Support azure reads using vended credentials #416
Conversation
"fs.azure.account.auth.type" -> "SAS", | ||
"fs.azure.account.hns.enabled" -> "true", | ||
"fs.azure.sas.token.provider.type" -> "io.unitycatalog.connectors.spark.AbfsVendedTokenProvider", | ||
"fs.azure.sas.fixed.token" -> azCredentials.getSasToken, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
shall we put these string constants in AbfsVendedTokenProvider
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good call! I cleaned that up, and pulled in the constants from abfs in cases where we were using already established conf values.
|
||
import static java.lang.String.format; | ||
|
||
public class AbfsVendedTokenProvider implements SASTokenProvider { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not related to this PR, do we have document to tell uses how to set this provider in Spark configs?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not yet -- @dennyglee and I have been working on docs for this. I'm planning on getting that polished up once we get changes in and confirm everything is working as expected
**PR Checklist** - [x] A description of the changes is added to the description of this PR. - [ ] If there is a related issue, make sure it is linked to this PR. - [x] If you've fixed a bug or added code that should be tested, add tests! - [ ] If you've added or modified a feature, documentation in `docs` is updated **Description of changes** Updates the spark connector to support vended credentials for azure cloud storage (`abfs` / `abfss`). Requires `hadoop-azure`, e.g.: ``` spark.jars.packages=org.apache.hadoop:hadoop-azure:3.3.6 ``` *Note*: this currently doesn't work with hadoop-azure:3.4.0, this issue might be addressed in 3.4.1:https://issues.apache.org/jira/browse/HADOOP-19208 Signed-off-by: Vikas Sharma <[email protected]>
**PR Checklist** - [x] A description of the changes is added to the description of this PR. - [ ] If there is a related issue, make sure it is linked to this PR. - [x] If you've fixed a bug or added code that should be tested, add tests! - [ ] If you've added or modified a feature, documentation in `docs` is updated **Description of changes** Updates the spark connector to support vended credentials for azure cloud storage (`abfs` / `abfss`). Requires `hadoop-azure`, e.g.: ``` spark.jars.packages=org.apache.hadoop:hadoop-azure:3.3.6 ``` *Note*: this currently doesn't work with hadoop-azure:3.4.0, this issue might be addressed in 3.4.1:https://issues.apache.org/jira/browse/HADOOP-19208 Signed-off-by: Kevin Wang <[email protected]>
PR Checklist
docs
is updatedDescription of changes
Updates the spark connector to support vended credentials for azure cloud storage (
abfs
/abfss
).Requires
hadoop-azure
, e.g.:Note: this currently doesn't work with hadoop-azure:3.4.0, this issue might be addressed in 3.4.1:https://issues.apache.org/jira/browse/HADOOP-19208