otterc
@wankunde > Send finalize RPCs will block the main thread due to creating connection to some unreachable nodes. Which main thread are you referring to here? Could you please explain...
Also, your solution adds shuffle service nodes to an excluded list, which isn't what the description says. Could you please explain, with examples/logs, what problems you are facing...
So the issue is that the wait period timer doesn't take into account the time for connection creation, which is a bug. However, in this PR you are adding another...
Gentle ping @wankunde. Do you think you can update the PR soon? Please let us know if you need any help.
@thejdeep The test failures are related to this change. Please fix them. ``` /home/runner/work/spark/spark/core/src/main/scala/org/apache/spark/status/protobuf/TaskDataWrapperSerializer.scala:92:5: not enough arguments for constructor TaskDataWrapper: (taskId: Long, index: Int, attempt: Int, partitionId: Int, launchTime: Long,...
I have a general question: why does the user identifier need to be shared with Celeborn? I can't find any information in the linked JIRA: [CELEBORN-1285](https://issues.apache.org/jira/browse/CELEBORN-1285). Also, I think if registration is...