linkis icon indicating copy to clipboard operation
linkis copied to clipboard

[Feature][EngineConn] Enable customized and isolated python environment for Pyspark

Open saLeox opened this issue 3 years ago • 2 comments

Search before asking

  • [X] I had searched in the issues and found no similar feature requirement.

Problem Description

For Python developers, using a customized and isolated Python runtime environment is an indispensable requirement.

You and your colleagues may want to use different versions of python and python packages and don’t want to affect each others’ environment.

Thus it's necessary to support using customized and isolated Python environment in hadoop yarn cluster and achieve this for PySpark.

Description

No response

Use case

No response

solutions

From the official Spark Configuration, we can know that

Property spark.pyspark.driver.python take precedence than PYSPARK_DRIVER_PYTHON if it is set.

Property spark.pyspark.python take precedence than PYSPARK_PYTHON if it is set.

And Property archives is the same as spark.yarn.dist.archives.

We can leverage the global settings module, to let user define the variables.

By default, these variables can be empty, and in that case EnvVar in spark-defaults.conf or spark-env.sh will take precedence.

If user defines, the defined variables will be isolated from other users, and take precedence.

They will be appended into the Spark engine start script, when them can be read to overwrite the default setting in spark-defaults.conf or spark-env.sh.

Anything else

No response

Are you willing to submit a PR?

  • [X] Yes I am willing to submit a PR!

saLeox avatar Sep 15 '22 06:09 saLeox

:blush: Welcome to the Apache Linkis (incubating) community!! We are glad that you are contributing by opening this issue.

Please make sure to include all the relevant context. We will be here shortly.

If you are interested in contributing to our website project, please let us know! You can check out our contributing guide on :point_right: How to Participate in Project Contribution.

WeChat Group:

image Mailing Lists:

name description Subscribe Unsubscribe archive
[email protected] community activity information subscribe unsubscribe archive

github-actions[bot] avatar Sep 15 '22 06:09 github-actions[bot]