linyubupa
**Describe the bug** deepspeed_stage_3 was used in pytorch_lightning. During initialization, it consumes a huge amount of CPU memory. The amount of CPU memory used = gpu_number * 2 * model_size **To Reproduce** Steps...
### Bug description When using a Hugging Face pretrained model with multiple GPUs, the model parameters are duplicated in RAM for every GPU ### How to reproduce the bug ```python trainer = Trainer( max_epochs=1,...
Notice: In order to resolve issues more efficiently, please raise issues following the template and fill in the details. ## ❓ Questions and Help ### Before asking: 1. search the issues. 2. search the...
2024-07-11 14:13:05.423 | ERROR | pipecat.pipeline.task:_process_up_queue:135 - Error running app: Error getting audio (status: 422, error: {"detail":[{"type":"string_type","loc":["body","stream_chunk_size"],"msg":"Input should be a valid string","input":20}]})
When I run the simple-bot example on a cloud machine, I get these errors: 2024-07-11 10:54:32.151 | ERROR | pipecat.transports.services.daily:join:280 - Time out joining https://autoark.daily.co/1r4by424NvuhJS8hxxxx {"timestamp":"2024-07-11T02:54:44.715005Z","level":"ERROR","fields":{"message":"Failed to fetch room information: GET failed:...
When I use faster-whisper to load deepdml/faster-whisper-large-v3-turbo-ct2, I find that every segment.no_speech_prob is 0.0.
### Your current environment The output of `python collect_env.py` ```text Your output of `python collect_env.py` here ``` ### 🐛 Describe the bug Device: 8 * H100 python3 -m vllm.entrypoints.openai.api_server --host...
Is this project unable to use Hugging Face's Trainer? When using Trainer, I get stuck at "Initializing global memoery buffer." and then get the error below [rank0]:[E ProcessGroupNCCL.cpp:523]...
### Before Reporting - [x] I have pulled the latest code from the main branch and run it again, and the bug still exists. - [x] I have read the [README](https://github.com/tinyvision/DAMO-YOLO/blob/master/README.md)...