Rohit Singh Rathaur
[StatLog(German Credit Data)](https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data))
feat: Add LeCun normal weight initialization
- Implemented `lecun_normal` function for weight initialization based on fan-in.
- Provided default initializer for flexibility.
- Marked `lecun_normal` as non-differentiable using ChainRulesCore.
It...
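For readers unfamiliar with the scheme, here is a minimal sketch of fan-in based LeCun normal initialization, written in plain Python rather than the language of the original change. It assumes the common definition in which weights are drawn from a zero-mean normal distribution with standard deviation `sqrt(1 / fan_in)`; some libraries use a truncated normal instead. The signature and the `rng` parameter are illustrative assumptions, not the API added by the commit.

```python
import math
import random


def lecun_normal(fan_out, fan_in, rng=None):
    """Return a fan_out x fan_in weight matrix (list of rows).

    Hypothetical signature for illustration only. Each entry is drawn
    from N(0, 1 / fan_in), i.e. standard deviation sqrt(1 / fan_in).
    """
    if fan_in <= 0 or fan_out <= 0:
        raise ValueError("fan_in and fan_out must be positive")
    # Fall back to a fresh generator when the caller does not supply one.
    rng = rng if rng is not None else random.Random()
    std = math.sqrt(1.0 / fan_in)
    return [[rng.gauss(0.0, std) for _ in range(fan_in)] for _ in range(fan_out)]
```

Passing a seeded `random.Random` makes the draw reproducible, which is convenient in tests.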
- Integrated the `latency` property into the `OracleLightningNetwork` class
update-modules because those were not maintained.
It resolves issue #295
https://github.com/RohitRathore1/elk/blob/84e99a36a5050881d85f1510a2486ce46ac1f942/tests/test_smoke_eval.py#L19C1-L20C35
It fixes issue #1232. These changes fix the `IndexError` caused by invalid token IDs in `allowed_tokens_concat` by handling `eos_token_id` appropriately and adjusting token handling in `CFGGuide`. The updates maintain backward...
In continuation of issue #194. @erfanzar The current `RMSNorm` implementation in [rms_norm](https://github.com/erfanzar/EasyDeL/blob/main/easydel/kernels/rms_norm.py) uses `Pallas` for TPUs and falls back to basic JAX operations for other platforms. On NVIDIA GPUs, a...
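As a reference point for the operation under discussion, the sketch below shows the standard RMSNorm computation in plain Python: each element is divided by the root mean square of the vector and scaled by a learned weight. This is not the EasyDeL kernel and uses neither JAX nor Pallas; the function name, the `eps` default, and the list-based interface are assumptions made for illustration.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """Normalize a vector by its root mean square, then scale per element.

    Illustrative reference only: y_i = x_i / sqrt(mean(x^2) + eps) * weight_i.
    """
    if len(x) != len(weight):
        raise ValueError("x and weight must have the same length")
    if not x:
        raise ValueError("x must not be empty")
    mean_sq = sum(v * v for v in x) / len(x)
    # eps keeps the division stable when the input is all zeros.
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]
```

With unit weights and a negligible `eps`, the mean of the squared outputs is 1, which is a quick sanity check for any faster kernel that replaces this computation.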
Fixes #165422. The issue was that `ProcessGroupGloo` operations were capturing the timeout value at `AsyncWork` construction time and storing it in a const member variable. When operations executed, they used...
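The general pattern described here, a value snapshotted at construction time going stale before it is used, can be illustrated with a toy Python sketch. All class names below are hypothetical and do not correspond to PyTorch code; the deferred variant is shown only as one possible contrast, not as the change made in the PR.

```python
class Group:
    """Hypothetical stand-in for an object that owns a mutable timeout."""

    def __init__(self, timeout):
        self.timeout = timeout


class WorkCaptured:
    """Copies the timeout when constructed, so later updates are not seen."""

    def __init__(self, group):
        self._timeout = group.timeout  # snapshot taken here

    def run(self):
        return self._timeout


class WorkDeferred:
    """Keeps a reference to the group and reads the timeout when it runs."""

    def __init__(self, group):
        self._group = group

    def run(self):
        return self._group.timeout  # looked up at execution time
```

If the group's timeout is changed between constructing the work object and running it, `WorkCaptured.run()` still reports the old value while `WorkDeferred.run()` reports the new one.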
Fixes #164929. cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @pragupta @msaroufim @dcci @LucasLLC @pradeepfn