Jared Willard
The following code does not execute for me and produces the stack trace shown after it. It appears I cannot even declare dask storage on a LocalCUDACluster. ``` from dask.distributed import...
### Version
source

### On which installation method(s) does this occur?
Source

### Describe the issue
When calling `earth2mip.datasets.hindcast.open_forecast()` I expected the `chunks` argument to function as it would it...
I am trying to implement the vllm flag in this reference implementation; however, as I understand it, I have to manually spin up vllm outside of running the benchmark. However, the...
The `DistributedCheckpointer` class seems to have argument conflicts with llama2 when using `full_finetune_distributed.py`. In the config I have
```
checkpointer:
  _component_: torchtune.training.DistributedCheckpointer
  checkpoint_dir: /pscratch/sd/t/thes/lingua/llama2_7b/distributed
  output_dir: ${output_dir}
  model_type: LLAMA2
```
but...