magic-hya

Results 49 comments of magic-hya

使用的是runp部署方式 ``` secretflow-registry.cn-hangzhou.cr.aliyuncs.com/secretflow/kuscia-secretflow $ kuscia -v kuscia version 6994ca0 ```

我使用的是RunP模式,官方给出的镜像是[这个](https://www.secretflow.org.cn/zh-CN/docs/kuscia/v0.11.0b0/deployment/K8s_deployment_kuscia/deploy_with_runp_cn) ``` 下文将以物理机和 K8s 两种部署环境为例,来介绍基于 RunP 的部署流程。 在物理机上部署 完整的详细流程请参考 [多机部署中心化集群](https://www.secretflow.org.cn/zh-CN/docs/kuscia/v0.11.0b0/deployment/Docker_deployment_kuscia/deploy_master_lite_cn) 和 [多机部署点对点集群](https://www.secretflow.org.cn/zh-CN/docs/kuscia/v0.11.0b0/deployment/Docker_deployment_kuscia/deploy_p2p_cn)。 其中,使用 RunP 部署的不同点是: 使用 kuscia-secretflow 镜像。 export KUSCIA_IMAGE=secretflow-registry.cn-hangzhou.cr.aliyuncs.com/secretflow/kuscia-secretflow ```

测试结果 ``` $ env PYTHONPATH=$PYTHONPATH:$PWD/.. python update_meta.py INFO:root:1. Update secretflow comp list. INFO:root:2. Update translation. $ git diff translation.json diff --git a/docker/translation.json b/docker/translation.json index 149a130..b581b5d 100644 --- a/docker/translation.json +++ b/docker/translation.json...

@warriorpaw 分支被我弄坏了,我重新弄个分支提交

现在日志只能看到center的,其他的日志从哪里获取

1.进入节点,输入命令kubectl get kj报错 ``` bash-5.2# kubectl get kj E0925 11:14:52.820799 28753 memcache.go:265] couldn't get current server API group list: Get "http://localhost:8080/api?timeout=32s": dial tcp 127.0.0.1:8080: connect: connection refused E0925 11:14:52.821673 28753...

1.数据是自定义的数据,并没有看到自带的数据 [psi_guest.csv](https://github.com/user-attachments/files/17126156/psi_guest.csv) [psi_host.csv](https://github.com/user-attachments/files/17126157/psi_host.csv) 2.内存配置如下 ``` [root@k8s-master73 ~]# free -h total used free shared buff/cache available Mem: 251G 49G 34G 21G 167G 171G Swap: 0B 0B [root@k8s-master74 ~]# free -h total...

在报错的74节点拿到日志文件 [kuscia.log](https://github.com/user-attachments/files/17128255/kuscia.log) [internal.log](https://github.com/user-attachments/files/17128257/internal.log) [external.log](https://github.com/user-attachments/files/17128258/external.log)

重新部署master notls后,求交报错 ``` 2024-09-25 19:28:53 INFO the jobId=anpo, taskId=anpo-oronmgzc-node-3 start ... 2024-09-25 19:29:45 INFO the jobId=anpo, taskId=anpo-oronmgzc-node-3 failed: party kvlohttz failed msg: container[secretflow] terminated state reason "Error", message: " log...