Jack Wu
Jack Wu
> Hi! > > We also encountered OOM issue while training the tokenizer. To overcome this problem, we sampled 10 x 10^6 random subsequences from the whole dataset to train...
> ValueError: With n_samples=0, test_size=0.2 and train_size=None, the resulting train set will be empty. Adjust any of the aforementioned parameters. > > @jeya-maria-jose > > I got this error while...
Thank you very much for your reply! I'm sure this calculation is happening on the GPU and not the CPU. I used nvitop to monitor the GPU usage in my...
> Can you try encoding with different batch sizes using any [sentence-transformer models](https://huggingface.co/models?library=sentence-transformers&sort=trending)? Similar to LLM2Vec, you have to call `encode` function with list of sentences. This will help us...
>  常规设置中 非常感谢!
Well, I found a way that might solve this problem. First, when using Reszie, use ToTenser first, that is: ``` train_transform = transforms.Compose([ transforms.ToTensor(), transforms.Resize((32, 32), antialias=True), transforms.Normalize((0.5, 0.5, 0.5),...
> Hey, First of all, thank you very much for the excellent work by the authors. I am currently reading through the paper and the code, and I have encountered...
抱歉,其实应该是在访达中通过右键来打开就好,情况如下: 