Miscellaneous server-side improvements
task_pool
- [x] remove iterate_minibatches, once https://github.com/learning-at-home/hivemind/pull/506 is merged
- [ ] batch sequences of similar length to reduce padding waste (see the sketch below)
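A minimal sketch of one way to batch sequences of similar length: sort incoming sequences by length and slice off fixed-size batches, so each batch only pads to its own maximum. The `batch_by_length` name and the flat list-of-token-ids input format are illustrative assumptions, not the actual task_pool API:

```python
from typing import List, Sequence, Tuple

import torch


def batch_by_length(
    sequences: List[Sequence[int]], batch_size: int
) -> List[Tuple[List[int], torch.Tensor]]:
    """Group sequences of similar length so each batch pads only to its own max.

    Returns (original_indices, padded_batch) pairs so results can be
    reordered back to arrival order after processing.
    """
    order = sorted(range(len(sequences)), key=lambda i: len(sequences[i]))
    batches = []
    for start in range(0, len(order), batch_size):
        indices = order[start : start + batch_size]
        max_len = max(len(sequences[i]) for i in indices)
        padded = torch.zeros(len(indices), max_len, dtype=torch.long)
        for row, i in enumerate(indices):
            padded[row, : len(sequences[i])] = torch.as_tensor(sequences[i], dtype=torch.long)
        batches.append((indices, padded))
    return batches
```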
runtime
- [x] do not log "0 parameters" on init (misleading)
- [x] consider removing Runtime, once https://github.com/learning-at-home/hivemind/pull/505 is merged
handler
- [x] verify what happens if servers have mixed torch_dtype and/or compression (see the sketch after this list)
- [x] actually follow forward/backward/inference schema instead of hard-coding
- [x] extract the code for adding prompts into a separate file
- [x] consider merging the code from hivemind's ConnectionHandler instead of inheriting
- [x] add a test that covers _rpc_inference with prompts
- [x] add a test that covers _rpc_inference with hypo-ids
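A minimal sketch of the kind of check the mixed-dtype item above refers to: when consecutive servers in a chain advertise different torch_dtype or compression, activations get re-cast at every hop, which is worth at least a warning. The `server_infos` list-of-dicts structure, the field names, and the `warn_on_mixed_precision` helper are assumptions for illustration, not the actual handler API:

```python
import logging
from typing import Dict, List

logger = logging.getLogger(__name__)


def warn_on_mixed_precision(server_infos: List[Dict[str, str]]) -> None:
    """Warn if consecutive servers in one chain advertise different
    torch_dtype or compression: activations would be re-cast (and could
    lose precision) at every such hop."""
    for prev, curr in zip(server_infos, server_infos[1:]):
        for field in ("torch_dtype", "compression"):
            if prev.get(field) != curr.get(field):
                logger.warning(
                    "Mixed %s between consecutive servers: %s -> %s",
                    field, prev.get(field), curr.get(field),
                )
```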
MemoryCache
- [x] when running inference over multiple layers on the same server, avoid passing layer activations between cpu<->gpu by storing them in MemoryCache
  - before implementing this, first check whether it will bring any performance benefit
- [ ] LRU-offload stale cache from gpu to ram (see the sketch below)
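A minimal sketch of LRU offloading for the item above: GPU-resident entries are tracked in recency order, the least recently used ones are moved to host RAM once a budget is exceeded, and offloaded entries migrate back to the GPU on access. The class name, the per-entry budget, and the string-key scheme are illustrative assumptions, not MemoryCache's actual interface:

```python
from collections import OrderedDict

import torch


class LRUOffloader:
    """Keep at most `max_gpu_entries` cache tensors on the GPU; offload the
    least recently used ones to CPU RAM and bring them back on access."""

    def __init__(self, max_gpu_entries: int, device: str = "cuda"):
        self.max_gpu_entries = max_gpu_entries
        self.device = device
        self._entries: "OrderedDict[str, torch.Tensor]" = OrderedDict()

    def put(self, key: str, tensor: torch.Tensor) -> None:
        self._entries[key] = tensor.to(self.device)
        self._entries.move_to_end(key)  # most recently used goes last
        self._offload_stale()

    def get(self, key: str) -> torch.Tensor:
        tensor = self._entries.pop(key)
        if tensor.device.type == "cpu":
            tensor = tensor.to(self.device)  # reload a previously offloaded entry
        self._entries[key] = tensor  # re-insert as most recently used
        self._offload_stale()
        return tensor

    def _offload_stale(self) -> None:
        # OrderedDict iteration order is recency order: least recently used first
        on_gpu = [k for k, t in self._entries.items() if t.device.type != "cpu"]
        while len(on_gpu) > self.max_gpu_entries:
            stale = on_gpu.pop(0)
            self._entries[stale] = self._entries[stale].to("cpu")
```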
point system
- [x] make sure points are integers everywhere
- [ ] implement a nonzero prioritizer :) (see the sketch below)
- [x] move client-side spending policy to sequence_manager
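A minimal sketch of a nonzero prioritizer for the item above, assuming the hivemind-style convention that lower priority values are dequeued first; the class name and the tie-breaking scheme are illustrative assumptions, not an existing petals API:

```python
import time

import torch


class PointsTaskPrioritizer:
    """Give tasks that spend more points a better (lower) priority value,
    breaking ties by arrival time so zero-point tasks cannot starve."""

    def prioritize(self, *inputs: torch.Tensor, points: float = 0.0, **kwargs) -> float:
        # Lower return value = served earlier (assumed queue convention).
        # -points puts high bidders first; the tiny monotonic-clock term
        # keeps FIFO order among tasks that bid the same number of points.
        return -float(points) + time.monotonic() * 1e-9
```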