Seaton comments

Results 10 comments of


                                            Seaton

CapsGNN (Loss=nan)?

> > In layers.py, add a line` b_ij = b_ij + u_vj1` before line 143 `b_max = torch.max(b_ij, dim = 2, keepdim = True)` > > Hi, > > Thank...

CapsGNN (Loss=nan)?

> > In layers.py, add a line` b_ij = b_ij + u_vj1` before line 143 `b_max = torch.max(b_ij, dim = 2, keepdim = True)` > > same problem > when...

CapsGNN (Loss=nan)?

> Graph level classification, how to add batchsize？ If the graph classification algorithm uses the DGL framework, it can divided a graph into mini-batches to accelerate the training.However, in my...

I was trying to run this code but got this error. See pic below

> @huntingingit 请问您的问题有得到解决吗我也遇到了同样的问题，如果解决了的话，能否告知一下方法解决表象的方式是这样的，首先squash函数那里，s = (mag_sq / (1.0 + mag_sq)) * (s / mag + 1e-8) 添加个1e-8，然后在动态路由方法里给b做一个归一化b_max = torch.max(b_ij, dim=2, keepdim=True) ; b_ij = b_ij / b_max.values （这部分是其他人提出来的，实际上应该是放大b的值）...

列表、函数参数以及缺省值的问题

谢谢老师，我知道了。测试了一下，发现append_list()函数在首次被调用之后，再次调用时，无论调用多少次，lst和lst_updated都指向了同样的地址。 ![image](https://user-images.githubusercontent.com/56856685/81890178-ea527080-95d7-11ea-805c-50507729a825.png) ![image](https://user-images.githubusercontent.com/56856685/81890185-efafbb00-95d7-11ea-925f-6a28741158e7.png) ![image](https://user-images.githubusercontent.com/56856685/81890192-f3dbd880-95d7-11ea-872c-cc24b410c3a4.png) 老师在设计函数时的建议，让我收获很多。我在采用老师的建议编写代码时，还遇到一个小问题：我在使用递归的思想设计“查找树的所有后代节点”这个函数（部分代码见下图）时，确实需要修改参数中的列表。而这样做必须每次都先创建一个空列表descendants=[ ]，然后再传入函数中进行修改、获得所有后代节点。这样每次使用该函数时都需要先创建一个空列表。我觉得为了这样程序的安全性，这样设计的函数使用起来不那么直接，似乎有一点别扭。不知道有没有即直接又安全的方法设计这样的函数？ ![image](https://user-images.githubusercontent.com/56856685/81891483-1cb19d00-95db-11ea-999d-a5d6e95d5ecd.png)

列表、函数参数以及缺省值的问题

之前写的时候没有考虑到extend函数，现在明白了，谢谢老师！

Upgrade hotprotein framework from ESM1 to ESM2, and train it with deepspeed

Hi, Thank you for your integrating ESM2 work. I have some questions, as follows: 1.Does the key point SAP in the paper only provide the model parameters, such that it's...

Upgrade hotprotein framework from ESM1 to ESM2, and train it with deepspeed

老哥好呀，我也在深圳。昨天简单看了你的repo，发现你只是把esm的代码照搬过来了，没有get到这篇论文的精髓。他ESM的模型参数是冻住了，但是在每个注意力层的q和v都添加了用于微调的LoRA层（这些层可学习的，Hu E J, Shen Y, Wallis P, et al. Lora: Low-rank adaptation of large language models[J]. arXiv preprint arXiv:2106.09685, 2021.），还有一些sparse层也是可以学习的。建议你看看这个仓库esm/module/Transformers以及esm/sparse_multihead_attention/SparseMultiheadAttention 呢。注意rank参数和use_sparse参数，这些都是微调相关的。祝好运！

讯飞星火模型回复的内容是乱码

> 好像是随机验证的，通过接口请求报需验证时，网页端发个消息就好了，还没找到验证的代码我这边报错是一样的，但是网页端发了消息后，chathub中还是这样

Is facebook going to discontinue the esm development?

True, but those guys has startup a new company.