deepflow icon indicating copy to clipboard operation
deepflow copied to clipboard

[BUG] deepflow agent无法上报

Open unionhu opened this issue 5 months ago • 2 comments

Search before asking

  • [x] I had searched in the issues and found no similar feature requirement.

DeepFlow Component

Agent

What you expected to happen

虚拟机批量安装deepflow agent后,部分节点出现以下日志,发现agent无法上报 agent version:6.5 server version:6.5

"[2025-08-20 11:41:14.657106 +08:00] INFO [src/platform/synchronizer.rs:285] local version 1752727196, remote version 0, about to resync", "[2025-08-20 11:41:14.657142 +08:00] INFO [src/platform/synchronizer.rs:260] local version is 1752727196, will send whole message", "[2025-08-20 11:41:15.797895 +08:00] INFO [src/config/handler.rs:1731] analyzer_ip not set, remote log disabled", "[2025-08-20 11:41:24.752672 +08:00] INFO [src/platform/synchronizer.rs:285] local version 1752727196, remote version 0, about to resync", "[2025-08-20 11:41:24.752711 +08:00] INFO [src/platform/synchronizer.rs:260] local version is 1752727196, will send whole message", "[2025-08-20 11:41:28.051662 +08:00] INFO [src/config/handler.rs:1731] analyzer_ip not set, remote log disabled", "[2025-08-20 11:41:34.855512 +08:00] INFO [src/platform/synchronizer.rs:285] local version 1752727196, remote version 0, about to resync", "[2025-08-20 11:41:34.855560 +08:00] INFO [src/platform/synchronizer.rs:260] local version is 1752727196, will send whole message"

配置: max_collect_pps: 100 max_cpus: 1 max_millicpus: 200 max_memory: 200 system_load_circuit_breaker_threshold: 0.8 system_load_circuit_breaker_recover: 0.6 tap_interface_regex: ^(tap.|vvport.|cali.|veth.|eth.|en[osipx].|lxc.*)$ vtap_flow_1s_enabled: 0 l4_log_tap_types:

  • -1 platform_enabled: 1 log_retention: 7 l7_metrics_enabled: 0 l7_log_store_tap_types:
  • -1 log_file_size: 100 external_agent_http_proxy_enabled: 0 static_config: ebpf: disabled: true on-cpu-profile: disabled: true off-cpu-profile: disabled: true

How to reproduce

No response

DeepFlow version

No response

DeepFlow agent list

No response

Kubernetes CNI

No response

Operation-System/Kernel version

No response

Anything else

No response

Are you willing to submit a PR?

  • [ ] Yes I am willing to submit a PR!

Code of Conduct

unionhu avatar Aug 20 '25 03:08 unionhu

agent log 这部分日志是说没有注册上,从 agent-group-config 来看,部署在主机上时,需要添加一个专用的配置: https://www.deepflow.io/docs/zh/configuration/agent/#inputs.resources.private_cloud.hypervisor_resource_enabled

不过你用的是 v6.5 的版本(agent 和 server 版本需要一致),打开链接后使用 Upgrade from old version 里面的参数,从上面配置来看好像写到了 l4_log_tap_types 下面,具体格式参考: https://github.com/deepflowio/deepflow/blob/v6.5/server/agent_config/example.yaml#L339

另外,v6.5 版本已经是三个版本前的 LTS 了,已经过了支持周期,建议升级到目前最新的 LTS v7.0 版本: https://www.deepflow.io/docs/zh/release-notes/release-timeline/

1473371932 avatar Aug 26 '25 02:08 1473371932

agent log 这部分日志是说没有注册上,从 agent-group-config 来看,部署在主机上时,需要添加一个专用的配置: https://www.deepflow.io/docs/zh/configuration/agent/#inputs.resources.private_cloud.hypervisor_resource_enabled

不过你用的是 v6.5 的版本(agent 和 server 版本需要一致),打开链接后使用 Upgrade from old version 里面的参数,从上面配置来看好像写到了 l4_log_tap_types 下面,具体格式参考: https://github.com/deepflowio/deepflow/blob/v6.5/server/agent_config/example.yaml#L339

另外,v6.5 版本已经是三个版本前的 LTS 了,已经过了支持周期,建议升级到目前最新的 LTS v7.0 版本: https://www.deepflow.io/docs/zh/release-notes/release-timeline/

上面的配置是格式问题,实际上我已经配置了platform_enabled: 1, 补充一下说明,deepflow agent的上报失联,初次安装都没问题,而是运行一段时候出现上报失联,然后查看日志就报上面的日志

unionhu avatar Sep 04 '25 08:09 unionhu