ps-lite icon indicating copy to clipboard operation
ps-lite copied to clipboard

How to implement the 'Bounded Delay' consistency model in ps-lite?

Open blazerye opened this issue 9 years ago • 4 comments

'Bounded Delay' consistency model is mentioned in the paper "Scaling Distributed Machine Learning with the Parameter Server". How to implement it in ps-lite? Can give some tips? Thx.

blazerye avatar Jul 05 '16 02:07 blazerye

there is an example,

t1 = push(..)
t2 = push(..)
wait(t1)
t3 = push(...)
wait(t2)

at this point, t1 and t2 are guaranteed to be finished.

mli avatar Jul 15 '16 21:07 mli

@mli thx, I want to implement it in MXNet. (1) if I implement it as you say, which file I should modify? kvstore_dist.h? (2) if implement it in kvstore server not in worker can be better? if I implement it in server which file I should modify? kv_app.h?

blazerye avatar Jul 18 '16 11:07 blazerye

I think the 'Bounded Delay' consistency model means that waiting for all the previous \tau updates (pushes) of all the workers have been applied. It seems that the wait function only waits for the local push to be done? Or, the timestamp returned by the push function is a global timestamp, so that the wait function is actually something like a global barrier?

congxie1108 avatar Nov 13 '16 00:11 congxie1108

As @xcgoner said, the code above is not a SSP algorithm.

Local wait only guarantees memory that received parameters be available .

zhangyafeikimi avatar Jan 30 '17 14:01 zhangyafeikimi