Jack Wu

8 comments by Jack Wu

> Hi!
>
> We also encountered an OOM issue while training the tokenizer. To overcome this problem, we sampled 10 x 10^6 random subsequences from the whole dataset to train...
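The quoted comment is cut off, so the exact sampling procedure is not shown. As a minimal sketch of the general idea (drawing a fixed number of random windows from a corpus so that tokenizer training fits in memory), something like the following could work. The function name `sample_subsequences`, the `length` parameter, and the toy corpus are all assumptions for illustration, not the commenter's actual code.

```python
import random


def sample_subsequences(sequences, n_samples, length, seed=0):
    """Draw n_samples random fixed-length windows from a list of sequences.

    Hypothetical helper: the name, the fixed window length, and the uniform
    choice of source sequence are assumptions, not taken from the comment.
    """
    rng = random.Random(seed)
    # Only sequences at least `length` long can supply a full window.
    eligible = [s for s in sequences if len(s) >= length]
    if not eligible:
        raise ValueError("no sequence is long enough for the requested length")
    samples = []
    for _ in range(n_samples):
        seq = rng.choice(eligible)
        # Pick a start index so the window stays inside the sequence.
        start = rng.randrange(len(seq) - length + 1)
        samples.append(seq[start:start + length])
    return samples


# Toy corpus; a real run would stream from the full dataset instead.
corpus = ["the quick brown fox", "jumps over the lazy dog", "hi"]
subset = sample_subsequences(corpus, n_samples=5, length=8)
```

The resulting `subset` can then be passed to the tokenizer trainer in place of the full dataset. Choosing the source sequence uniformly over-represents short sequences relative to their share of the data; weighting by sequence length would be one alternative.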

> ValueError: With n_samples=0, test_size=0.2 and train_size=None, the resulting train set will be empty. Adjust any of the aforementioned parameters.
>
> @jeya-maria-jose
>
> I got this error while...

Thank you very much for your reply! I'm sure this calculation is happening on the GPU and not the CPU. I used nvitop to monitor the GPU usage in my...

> Can you try encoding with different batch sizes using any [sentence-transformer models](https://huggingface.co/models?library=sentence-transformers&sort=trending)? Similar to LLM2Vec, you have to call the `encode` function with a list of sentences. This will help us...

> ![image](https://private-user-images.githubusercontent.com/78214553/311603466-eef50d5a-f9c2-4abb-bd14-bf1ea3c210a8.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTAyMDIxNzIsIm5iZiI6MTcxMDIwMTg3MiwicGF0aCI6Ii83ODIxNDU1My8zMTE2MDM0NjYtZWVmNTBkNWEtZjljMi00YWJiLWJkMTQtYmYxZWEzYzIxMGE4LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDAzMTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwMzEyVDAwMDQzMlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTNkZDQwMDZlYjNhMTQ0YjcxNmY4ZTI5YjllNTJiYTI4NmU5YzRlYjcwZjU3MTcxMDU4MTBmMTM1NTlmMDI2MTgmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.HIHjIgxh55nR2pNG54hSOAgVLn5Hjab2Akt3-R-vwik) In the General settings. Thank you very much!

Well, I found a way that might solve this problem. First, when using Resize, apply ToTensor before it, that is: ``` train_transform = transforms.Compose([ transforms.ToTensor(), transforms.Resize((32, 32), antialias=True), transforms.Normalize((0.5, 0.5, 0.5),...

> Hey, First of all, thank you very much for the excellent work by the authors. I am currently reading through the paper and the code, and I have encountered...

Sorry, you actually just need to open it from the right-click menu in Finder, as shown below: ![Image](https://github.com/user-attachments/assets/a61901c8-4654-46db-ba51-8d5dff4027df)