XPretrain
Multi-modality pre-training
Is each line in 'lfvila8m_clipid.jsonl' a video clip–sentence pair? I also see a varying number of video clips per row, so how are the video clips in 'lfvila8m_clipid.jsonl' divided from the original...
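A minimal sketch of how one might inspect rows with a varying number of clips in a JSONL file. The field names (`clip_ids`, `text`) and the sample rows are assumptions for illustration, not necessarily the actual schema of 'lfvila8m_clipid.jsonl':

```python
import json

# Hypothetical rows: each line is one JSON object pairing N clip ids
# with a sentence; N can differ from row to row.
rows = [
    '{"clip_ids": ["c1", "c2"], "text": "a person cooking"}',
    '{"clip_ids": ["c3"], "text": "a dog running"}',
]

# Count clips per row to see the variable-length structure.
counts = [len(json.loads(r)["clip_ids"]) for r in rows]
# counts == [2, 1]
```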
May I request the transcript text processing code? [[email protected]](mailto:[email protected]) Thank you very much!
Thanks for your great work! I want to follow your work, but I ran into some problems with the Dockerfile. It seems the image nvidia/cuda:10.1-devel-ubuntu18.04 does not exist. Can you provide a...
The padding seems not right, or maybe I made a mistake:
```
# padding
_, _, D, H, W = x.size()
if H % self.patch_size[0] != 0:
    x = F.pad(x,...
```
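The snippet above pads a 5-D tensor so its spatial dims become multiples of the patch size before patch embedding. As a pure-Python stand-in (no torch; `pad_amount` and the example sizes are hypothetical, not the repo's actual code), the per-dimension pad can be computed like this. Note that in `torch.nn.functional.pad` the pad tuple is ordered from the last dimension backwards, which is a common source of off-by-one-axis mistakes like the one this issue may be describing:

```python
def pad_amount(size: int, patch: int) -> int:
    """Extra length needed so `size` becomes a multiple of `patch`."""
    return (patch - size % patch) % patch

# Example: a (D, H, W) = (9, 30, 30) volume with patch size (2, 4, 4).
patch_size = (2, 4, 4)
dims = (9, 30, 30)
pads = [pad_amount(d, p) for d, p in zip(dims, patch_size)]
# pads == [1, 2, 2]: pad depth by 1, height by 2, width by 2
```

With torch, these amounts would then be passed to `F.pad` starting from the W pads, then H, then D, e.g. `F.pad(x, (0, pads[2], 0, pads[1], 0, pads[0]))`.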
Update Dockerfile for Issue #34
When we run `horovodrun -np 1 python src/pretrain/run_pretrain.py --config src/configs/msrvtt_retrieval/msrvtt_retrieval_vip_base_32.json` we get the following error:
```
Traceback (most recent call last):
  File "src/pretrain/run_pretrain.py", line 22, in <module>
    from transformers import...
```
First of all - amazing work on this one. I'm getting a bit lost in the repo; may I request a simple few-line script that does something like the...
Bumps [transformers](https://github.com/huggingface/transformers) from 4.30.0 to 4.36.0. Release notes Sourced from transformers's releases. v4.36: Mixtral, Llava/BakLlava, SeamlessM4T v2, AMD ROCm, F.sdpa wide-spread support New model additions Mixtral Mixtral is the new...
Excellent work! I'm trying to deploy some attacks on your models, but I cannot fine-tune your pretrained ones on my local server due to a VRAM shortage. Could you please...
When running the command inside the Docker image for finetuning LF-VILA, the following error occurs:
```
root@8dccc81930c3:/LF-VILA# deepspeed src/tasks/run_video_classification.py --distributed --blob_mount_dir /blob_mount --config $CONFIG_PATH --deepspeed
[2023-10-17 11:11:02,765] [WARNING] [runner.py:132:fetch_hostfile] Unable to...
```