wuyi comments

Results 20 comments of


                                            wuyi

[SPARK-39955][CORE] Improve LaunchTask process to avoid Stage failures caused by fail-to-send LaunchTask messages

@mridulm The massive disconnection issue is an intermittent issue that can't be reproduced. I tend to believe it's not a Spark's issue but due to the bad nodes. The current...

[SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled

We should also update `ResourceProfileBuilder` to provide the API for user to create `TaskResourceProfile`, e.g., ``` ResourceProfileBuilder().taskOnly().require(taskReqs).build() ``` or we could also extend `ResourceProfileBuilder` to have `TaskResourceProfileBuilder`.

[SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode

The change generally looks good to me. It'd be good if @dongjoon-hyun could take a look since he has more knowledge in K8s.

[SPARK-39957][CORE] Delay onDisconnected to enable Driver receives ExecutorExitCode

Thanks @kevin85421 @mridulm , merged to Master!

[LIVY-585][SERVER] Avoid duplicate stopping log when stop/interrupt a session

@vanzin please have a look, thx.

[LIVY-585][SERVER] Avoid duplicate stopping log when stop/interrupt a session

@mgaido91 Thank you for your review. I've updated it.

[LIVY-585][SERVER] Avoid duplicate stopping log when stop/interrupt a session

> If we are calling stop when it is not necessary, I think we should rather avoid calling it in those cases. I was thinking about that way, but I...

[wip][SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error

https://github.com/apache/spark/blob/39b65b414c4ba36ada478369149f54452d90dd7b/core/src/main/scala/org/apache/spark/executor/CoarseGrainedExecutorBackend.scala#L169-L176 The issue seems to be that `Executor` construction failed due to the fatal error thrown during plugin initialization. And the fatal error doesn't fail the executor process, which leaves...

[wip][SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error

> The throw should result in uncaught exception handler killing the jvm - and if it does not, then the re-enqueue in prev step will cause the message to be...

[wip][SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error

> Essentially, since we are leveraging a ThreadPoolExecutor, it does not result in killing the thread with the exception/error thrown - but rather, will call ThreadPoolExecutor.afterExecute with the cause for...