l.zonghai
> @baitian77 @microeastcowboy we will open it next month

Hi @bdyx123, will the MR version be released this month?
> 1. CSS push data should use two replicas, so we should start at least two workers
> 2. Does the dir /home/aa/css/logs exist? Or are there dir permission issues?
> 3. the...
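As a quick sanity check for the directory question above, a small shell helper can report whether a path exists and is writable before the worker is started. This is a sketch; the `check_dir` name and messages are illustrative, not part of CSS/RSS, and `/home/aa/css/logs` is simply the path mentioned in the thread:

```shell
# check_dir: report whether a directory exists and is writable.
# Helper name and messages are illustrative only.
check_dir() {
  if [ ! -d "$1" ]; then
    echo "missing: $1"
  elif [ ! -w "$1" ]; then
    echo "not writable: $1"
  else
    echo "ok: $1"
  fi
}

check_dir /home/aa/css/logs
```

Running this as the same user that launches the worker also catches permission mismatches, which plain `ls` as root would hide.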
> Would you try `spark.shuffle.rss.replicas=2`?

Thanks @hiboyang, it works! It seems `replicas=1` actually means just the original data itself with no extra replication; `replicas=2` is OK.
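For reference, the replicas setting discussed above is a job-side Spark conf, not a server option. A minimal sketch of wiring it into a submission (the shuffle-manager class name, application class, and jar path here are assumptions to be checked against the RSS README):

```shell
# Sketch: run a Spark job against RSS with two shuffle replicas.
# Class names and jar paths below are placeholders, not verified values.
spark-submit \
  --conf spark.shuffle.manager=org.apache.spark.shuffle.RssShuffleManager \
  --conf spark.shuffle.rss.replicas=2 \
  --class com.example.MyApp \
  my-app.jar
```

With `replicas=2`, each map task pushes its shuffle data to two StreamServers, so losing a single server still leaves one complete copy for reducers to read.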
> Hi @hiboyang, I set `replicas=2` but another exception is thrown: when mapper-A sends data to StreamServer5 and a replica to StreamServer3
> * if I kill StreamServer5, mapper-A will use...
Hi @hiboyang, I posted two apps with different exceptions, but both failed on StreamServer5.

**application-1** failed on job10-Stage14; 26 tasks (including retry attempts) failed for the same reason, the...
> Thanks @Lobo2008 for the debugging info! I checked the source code again. The [code](https://github.com/uber/RemoteShuffleService/blob/7220c23694e0175e01719621707680a2718173cf/src/main/java/com/uber/rss/clients/ReplicatedWriteClient.java#L145) in RSS is supposed to try another server if it hits an error with one server, including...
> Yes, if that replicas setting does not work for you.
>
> Another option: you could use the `spark.shuffle.rss.excludeHosts` setting to exclude the server with the bad disk.

Thanks, but we may...
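For completeness, the exclude-hosts option mentioned above is also a job-side conf. A sketch, assuming the failing server resolves to the hypothetical hostname `streamserver5.example.com` (the shuffle-manager class, application class, and jar path are likewise placeholders):

```shell
# Sketch: keep the job off a StreamServer with a bad disk.
# Hostname and class/jar names are hypothetical placeholders.
spark-submit \
  --conf spark.shuffle.manager=org.apache.spark.shuffle.RssShuffleManager \
  --conf spark.shuffle.rss.excludeHosts=streamserver5.example.com \
  --class com.example.MyApp \
  my-app.jar
```

The trade-off raised in the thread applies: excluding hosts is a manual, per-job workaround, whereas replication handles server loss automatically.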
> Hi @hiboyang, how about the `1885GB` of `stage-889`? I suppose when `stage-896` is still running, or has some task failed, or the whole stage failed and need...
> Hi @Lobo2008, you are right. It could track the stage dependency and clean up stage shuffle files selectively. Need someone to work on this :)

Thanks for the reply!...
> RSS cannot use multiple disks so far, since it can only be configured with one directory. Again, this part could be changed as well; contributions welcome.
>
> ...
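To make the single-directory limitation concrete, a hedged sketch of starting a StreamServer with its one data directory. The `-rootDir` flag name and the other options here are assumptions from memory of the server CLI and should be verified against the RSS README or the server's help output:

```shell
# Sketch: StreamServer startup with a single data directory.
# All flag names below are unverified assumptions; note there is only
# ONE directory, hence shuffle data lands on one disk.
java -cp uber-rss.jar com.uber.rss.StreamServer \
  -port 12222 \
  -serviceRegistry standalone \
  -dataCenter dc1 \
  -rootDir /data1/rss
```

Multi-disk support would mean accepting a list of directories here and spreading shuffle partitions across them, which is the contribution the maintainer invites above.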