Colin comments

Results 28 comments of


                                            Colin

[Improvement]record split across writer buffer to save memory

@summaryzb For this optimization, I think there has once more `System.arraycopy` to add record, and it will impact performance a lot with previous test. For this PR, I think it...

[Improvement]record split across writer buffer to save memory

> > @summaryzb For this optimization, I think there has once more `System.arraycopy` to add record, and it will impact performance a lot with previous test. For this PR, I...

[Improvement]record split across writer buffer to save memory

In spark client, all memory are requested from executor, so there shouldn't have critical problem, eg, memory leak, oom, etc. Can you show the case how this PR improve the...

[Improvement] Can we support AE of SPARK2.4

I think AE is an experience feature in Spark 2.4, and it was officially announced in Spark 3. So it isn't planned to be supported in Spark 2.4 in short...

[Improvement] Read shuffle data fail because read index file fail

> I found if use `MEMORY_LOCALFILE`, `finishShuffle` will not be called, and buffer in server side may not flush in time, and than reader will fail because read index file...

[Improvement] Read shuffle data fail because read index file fail

> We had set `spark.rss.data.replica.write=2` and `spark.rss.data.replica=3`.But we found all shuffle server of a partition have not flush in time today and we have found in two applications. It may...

[Improvement] Read shuffle data fail because read index file fail

@frankliee can you do more clarification about how to config `spark.rss.data.replica.write` & `spark.rss.data.replica.read` ?

Benchmark: ESS and Uniffle

@zuston You can refer this [blog](https://cloud.tencent.com/developer/article/1943179) for the benchmark related.

Benchmark: ESS and Uniffle

@zuston The benchmark of blog is based on Spark 2.4.6. If there has no random disk IO problem with ESS, Uniffle is expected has **poor performance** than ESS

[Improvement] Disallow sendShuffleData if requireBufferId expired

I think @xianjingfeng is right, with current implementation, OOM will happen if `requireBufferId` was expired in Shuffle Server already, this maybe caused by GC, network problem, high workload in shuffle...