ThomaswellY
ThomaswellY
Dear Jian Wei I've tried to build this project in conda envs with python3.6 and pytorch1.0.0 . But encountered failure when compiling the cuda dependencies using commands "python setup.py build...
I have been using mmpretrain project [https://github.com/open-mmlab/mmpretrain](url), which consists of abundant of classification scripts. However, they use torch.distributed.launch to start distributed training, I wonder is there any method under kubeflow...
Hi, i have been using mpi-operator to achieve distributed training recently。 the most command i used is “kubectl apply -f yaml”. Let me take the mpi-operator yaml for example apiVersion:...
Hi , Thanks for your owesome work ! i was doing experiment on your code, and found the standard way to start training is limited to torch.distributde.run. And if i...
Hello, I am using Mediapipe's Pose module in Python to extract region masks from images. My environment includes VS Code for debugging. Despite my efforts to suppress logs, I am...
Thanks for your amazing work on ALBEF! While reviewing the paper and analyzing the provided implementation, I noticed a potential inconsistency between the paper's description of KL divergence computation and...
### Question Thanks for the wonderful work from your team ! I have been following the advanced VLM for text-image retrieval research, and i wonder does LLaVA support text-image retrieval?...
Hello Qwen-VL-Chat Team! I am currently exploring multi-image understanding tasks with Qwen-VL-Chat. I understand that the model already supports two-image comparison questions, for example: `query = tokenizer.from_list_format([ {'image': 'assets/mm_tutorial/Chongqing.jpeg'}, {'image':...
If I want to perform text-image retrieval, which models from the LAVIS series are supported? BLIP-2 might not be the most advanced option.
Thank you for your outstanding work! I am trying to use evaluate.py to evaluate a face recognition model, but I encountered an error while extracting the validation dataset using the...