Federico Nanni
Federico Nanni
Hi! It seems there's an issue with the path of the embedding file. Could you check two things: 1. if the file is correctly in that folder. You should download...
Can you re-download the embeddings file making sure it is downloaded properly? (it seems the file is broken there). Note that the file size should be around 1.3G
From here it is a bit hard to debug. I have just reinstalled it all and it seems to be working for me using that input embedding file and the...
Ah - check the order of the commands! You should have: - input folder (where your documents sit) - embedding file - output file Your examples has embeddings first and...
I just noticed that this is wrong in the documentation. Above we say the correct order, but here they are inverted! Sorry for this, I'll fix it now:
Fixed it - let me know if this works now:
I see, maybe you could group tweets together by author to reduce the number of files. So one file for each user - this way you'll be scaling users, not...
No worries and all the best with your work!
Starting point, reading this recent NAACL student workshop paper: https://aclanthology.org/2024.naacl-srw.2.pdf
Also: https://arxiv.org/pdf/2402.07927