dshwei

Results 15 issues of dshwei

sublayerout = layerNorm(x + sublayer(x)) **The residual connection comes first, then layer normalization.** In your code, in sublayer.py, it should be: def forward(self, x, sublayer): "Apply residual connection to any sublayer with the same size." # return x + self.dropout(sublayer(self.norm(x))) return self.norm( x +...
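The difference between the two orderings in this issue can be sketched in plain Python. This is only an illustration under stated assumptions: `layer_norm` here has no learned gain or bias, dropout is omitted, and `double` is a hypothetical stand-in for a real attention or feed-forward sublayer; none of these names come from the repository being discussed.

```python
import math

def layer_norm(x, eps=1e-6):
    """Normalize a vector to zero mean and (near) unit variance.
    Assumption: no learned gain/bias, unlike a real LayerNorm module."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def post_ln(x, sublayer):
    """Residual add first, then layer norm: LayerNorm(x + sublayer(x))."""
    y = sublayer(x)
    return layer_norm([a + b for a, b in zip(x, y)])

def pre_ln(x, sublayer):
    """Layer norm first, then residual add: x + sublayer(LayerNorm(x))."""
    y = sublayer(layer_norm(x))
    return [a + b for a, b in zip(x, y)]

def double(v):
    # Hypothetical sublayer used only for this illustration.
    return [2.0 * t for t in v]

x = [1.0, 2.0, 3.0, 4.0]
post = post_ln(x, double)  # output is normalized, so its mean is ~0
pre = pre_ln(x, double)    # output keeps the un-normalized residual x
```

With the post-norm ordering the block's output is always normalized, while the pre-norm ordering (the commented-out line in the snippet) passes the raw input through the residual path unchanged.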

pandas==1.1.5 Keras==2.3.1 keras_bert==0.83.0 numpy==1.16.4 scikit_learn==0.24.0 scipy==1.4.1 tensorboard=2.3.0 tensorflow-estimator==2.3.0 tensorflow==2.2 config_path = './chinese_L-12_H-768_A-12/bert_config.json' checkpoint_path = './chinese_L-12_H-768_A-12/bert_model.ckpt' dict_path = './chinese_L-12_H-768_A-12/vocab.txt' model = load_trained_model_from_checkpoint(config_path,checkpoint_file=checkpoint_path) : 8.9) Traceback (most recent call last): File "model_train.py",...

bug

**Before submitting a bug, please make sure the issue hasn't been already addressed by searching through the [FAQs](https://ai.meta.com/llama/faq/) and [existing/past issues](https://github.com/facebookresearch/llama/issues)** ## Describe the bug Model is CodeLlama-7b-Instruct $ cat...

## 🐛 Bug ```text File "/demo/project/pre_train_ft_7b.py", line 200, in run_training trainer = Trainer( File "/demo/miniconda3/envs/sqlcode/lib/python3.9/site-packages/transformers/trainer.py", line 503, in __init__ self.callback_handler = CallbackHandler( File "/demo/miniconda3/envs/sqlcode/lib/python3.9/site-packages/transformers/trainer_callback.py", line 313, in __init__ self.add_callback(cb) File...

type / bug
help wanted

When running StarCoder full-parameter fine-tuning on multiple GPUs: File "starcoder-git/finetune.py", line 44, in on_save kwargs["model"].save_pretrained(checkpoint_folder) File "/miniconda3/envs/sqlcode/lib/python3.9/site-packages/transformers/modeling_utils.py", line 2480, in save_pretrained os.remove(full_filename) FileNotFoundError: [Errno 2] No such file...

The Slurm command is as follows: ``` #!/bin/bash #SBATCH --job-name=pretrain_7 # name #SBATCH --nodes=2 # nodes #SBATCH -w server-gpu-[10,15] #SBATCH --ntasks-per-node=1 # crucial - only 1 task per dist per node!...

1. select distinct employees.firstname, employees.lastname from employees as employees inner join employeeterritories as employeeterritories on employees.employeeid = employeeterritories.employeeid inner join territories as territories on employeeterritories.territoryid = territories.territoryid inner join region...

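### Items that must be checked before submitting - [X] Make sure you are using the latest code from the repository (git pull); some issues have already been resolved and fixed. - [X] I have read the [project documentation](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/wiki) and the [FAQ section](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/wiki/常见问题), and I have searched the existing issues without finding a similar problem or solution. - [X] Third-party plugin issues: for example [llama.cpp](https://github.com/ggerganov/llama.cpp), [LangChain](https://github.com/hwchase17/langchain), [text-generation-webui](https://github.com/oobabooga/text-generation-webui), etc.; it is also recommended to look for solutions in the corresponding project. ### Issue type None ### Base model None ### Operating system None ### Detailed description of the problem ``` # Paste the code you ran here (inside this code block) ``` ### Dependencies (required for code-related issues) ```...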

stale

### Self Checks - [x] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [x] I have searched for existing...

🐞 bug