jiaohui
jiaohui
请问如何合并多个指标,如合并train、test的loss,或者train的loss,acc。教程里只给出了不同实验的相同train的相同metric可以合并。 我看别人以前写的教程是支持的,现在没了吗?? 
what is this pretrainedmodels in models fpn_inception.py
### 请提出你的问题 您好! 我在运行run_moe_pretrain的时候报错:ImportError: cannot import name '_legacy_C_ops' from 'paddle',这该怎么解决呢? 除此之外,使用moe进行模型并行只要用group_sharded_parallel替换DataParallel吗?? 
I have reimplemented llama with the paddlepaddle framework and provided an example of running 7b using aistudio free computing power, feel free to test and suggest improvements. repo: [ppllama](https://github.com/MiuGod0126/ppllama)
## 🐛 Bug ### To Reproduce 1. env conda create -n seq python=3.6 conda activate seq conda install pytorch==1.5.0 torchvision==0.6.0 cudatoolkit=10.1 -c pytorch git clone https://github.com/pytorch/fairseq cd fairseq git checkout...
**问题描述 / Problem Description** 用简洁明了的语言描述这个问题 / Describe the problem in a clear and concise manner. 在使用webui加载并上传txt后出现了红色的error,在终端报faiss的错误。 **复现问题的步骤 / Steps to Reproduce** 1. python webui.py 2. 打开gradio 3. 上传txt **预期的结果 /...
## 🐛 Bug ype "help", "copyright", "credits" or "license" for more information. >>> from fairseq.modules import LayerNorm, MultiheadAttention Traceback (most recent call last): File "", line 1, in File "/home/jiahui/workspace/nmt/bert_nmt/fairseq/fairseq/modules/__init__.py",...
Error Message: root@dsw-252469-64d6fd99c7-hfsns:/mnt/workspace/XenC# make Consolidate compiler generated dependencies of target XenC [ 1%] Building CXX object CMakeFiles/XenC.dir/src/Xen.cpp.o In file included from /mnt/workspace/XenC/src/../include/utils/../kenlm/lm/model.hh:13, from /mnt/workspace/XenC/src/../include/utils/../XenLMken.h:43, from /mnt/workspace/XenC/src/../include/utils/../eval.h:41, from /mnt/workspace/XenC/src/../include/utils/xenio.h:37, from /mnt/workspace/XenC/src/../include/Xen.h:39,...
“VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning”这篇论文的代码是否有开源的打算?如果有,计划啥时候开源?
I encountered an error, how should I resolve it? [WARNING|trainer.py:1272] 2024-04-27 12:04:25,428 >> Activated GaLoRE fine-tuning, depending on your model size and hardware, the training might take a while before...