doris icon indicating copy to clipboard operation
doris copied to clipboard

[Bug] 官网提供的Docker Compose script无法启动

Open mano233 opened this issue 2 years ago • 1 comments

Search before asking

  • [X] I had searched in the issues and found no similar issues.

Version

1.2.1

What's Wrong?

按照官网提供的1FE-1BE compose模板启动,BE无法注册到FE

What You Expected?

BE报错信息如下:

doris-fe     | 2023-03-01 08:02:18,249 WARN (heartbeat-mgr-pool-2|142) [HeartbeatMgr$BackendHeartbeatHandler.call():268] backend heartbeat got exception
doris-fe     | org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused (Connection refused)
doris-fe     |  at org.apache.thrift.transport.TSocket.open(TSocket.java:226) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe     |  at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe     |  at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe     |  at org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62) ~[commons-pool2-2.2.jar:2.2]
doris-fe     |  at org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012) ~[commons-pool2-2.2.jar:2.2]
doris-fe     |  at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356) ~[commons-pool2-2.2.jar:2.2]
doris-fe     |  at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) ~[commons-pool2-2.2.jar:2.2]
doris-fe     |  at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe     |  at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:225) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe     |  at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:203) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe     |  at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_342]
doris-fe     |  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_342]
doris-fe     |  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_342]
doris-fe     |  at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_342]
doris-fe     | Caused by: java.net.ConnectException: Connection refused (Connection refused)
doris-fe     |  at java.net.PlainSocketImpl.socketConnect(Native Method) ~[?:1.8.0_342]
doris-fe     |  at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350) ~[?:1.8.0_342]
doris-fe     |  at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206) ~[?:1.8.0_342]
doris-fe     |  at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188) ~[?:1.8.0_342]
doris-fe     |  at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[?:1.8.0_342]
doris-fe     |  at java.net.Socket.connect(Socket.java:607) ~[?:1.8.0_342]
doris-fe     |  at org.apache.thrift.transport.TSocket.open(TSocket.java:221) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe     |  ... 13 more

同时我观察到有如下输出:

doris-be     | 2023-03-01T08:01:56+00:00 [Note] [Entrypoint]: MASTER FE is started!
doris-be     | 2023-03-01T08:01:56+00:00 [Note] [Entrypoint]: add priority_networks 172.20.80.0/24 to /opt/apache-doris/be/conf/be.conf
doris-be     | 2023-03-01T08:01:56+00:00 [Note] [Entrypoint]: Register BE to FE is failed. retry.

但是docker-compose.yaml 里声明的cidr是:172.20.80.0/16

How to Reproduce?

docker-compose up

docker-compose.yaml

version: '3'
services:
   docker-fe:
     image: "apache/doris:1.2.1-fe-x86_64"
     container_name: "doris-fe"
     hostname: "fe"
     environment:
       - FE_SERVERS=fe1:172.20.80.2:9010
       - FE_ID=1
     ports:
       - 8030:8030
       - 9030:9030
     volumes:
       - /data/fe/doris-meta:/opt/apache-doris/fe/doris-meta
       - /data/fe/conf:/opt/apache-doris/fe/conf
       - /data/fe/log:/opt/apache-doris/fe/log
     networks:
       doris_net:
         ipv4_address: 172.20.80.2
   docker-be:
     image: "apache/doris:1.2.1-be-x86_64"
     container_name: "doris-be"
     hostname: "be"
     depends_on:
       - docker-fe
     environment:
       - FE_SERVERS=fe1:172.20.80.2:9010
       - BE_ADDR=172.20.80.3:9050
     ports:
       - 8040:8040
     volumes:
       - /data/be/storage:/opt/apache-doris/be/storage
       - /data/be/conf:/opt/apache-doris/be/conf
       - /data/be/script:/docker-entrypoint-initdb.d
       - /data/be/log:/opt/apache-doris/be/log
     networks:
       doris_net:
         ipv4_address: 172.20.80.3
networks:
   doris_net:
     ipam:
       config:
         - subnet: 172.20.80.0/16

Anything Else?

No response

Are you willing to submit PR?

  • [ ] Yes I am willing to submit a PR!

Code of Conduct

mano233 avatar Mar 01 '23 08:03 mano233

加个微信好友我来给你看看 fl_manyi

FreeOnePlus avatar Mar 01 '23 11:03 FreeOnePlus

加个微信好友我来给你看看 fl_manyi

谢谢,已解决!原因是没有设置 sysctl -w vm.max_map_count=2000000

mano233 avatar Mar 02 '23 03:03 mano233

我有个类似的问题,环境在 macbook M1 上,用的镜像是 apache/doris:1.2.2-fe-arm 与 apache/doris:1.2.2-be-arm。docker-compose 脚本首次是能运行的,后面 docker-compose down 之后在 up,无法启动。但是,删掉宿主机挂载路径路径的文件后,再次运行就可以了。不过的话,这样数据没办法一直保留了。

wengzhenjie avatar Mar 20 '23 13:03 wengzhenjie

我这里有同样的问题,已经设置 vm.max_map_count 了。

FE 日志

doris-fe  | 2023-03-29 01:52:14,184 WARN (heartbeat mgr|21) [HeartbeatMgr.runAfterCatalogReady():139] get bad heartbeat response: type: BACKEND, status: BAD, msg: java.net.ConnectException: Connection refused (Connection refused), beId: 10003, beHost: 172.20.80.3, bePort: 0, httpPort: 0, brpcPort: 0
doris-fe  | 2023-03-29 01:52:19,227 WARN (heartbeat-mgr-pool-7|147) [HeartbeatMgr$BackendHeartbeatHandler.call():268] backend heartbeat got exception
doris-fe  | org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused (Connection refused)
doris-fe  |     at org.apache.thrift.transport.TSocket.open(TSocket.java:226) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:225) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:203) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_342]
doris-fe  |     at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_342]
doris-fe  | Caused by: java.net.ConnectException: Connection refused (Connection refused)
doris-fe  |     at java.net.PlainSocketImpl.socketConnect(Native Method) ~[?:1.8.0_342]
doris-fe  |     at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350) ~[?:1.8.0_342]
doris-fe  |     at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206) ~[?:1.8.0_342]
doris-fe  |     at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188) ~[?:1.8.0_342]
doris-fe  |     at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[?:1.8.0_342]
doris-fe  |     at java.net.Socket.connect(Socket.java:607) ~[?:1.8.0_342]
doris-fe  |     at org.apache.thrift.transport.TSocket.open(TSocket.java:221) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe  |     ... 13 more
doris-fe  | 2023-03-29 01:52:19,228 WARN (heartbeat mgr|21) [HeartbeatMgr.runAfterCatalogReady():139] get bad heartbeat response: type: BACKEND, status: BAD, msg: java.net.ConnectException: Connection refused (Connection refused), beId: 10003, beHost: 172.20.80.3, bePort: 0, httpPort: 0, brpcPort: 0
doris-fe  | 2023-03-29 01:52:21,546 INFO (tablet checker|28) [TabletChecker.checkTablets():331] finished to check tablets. unhealth/total/added/in_sched/not_ready: 0/0/0/0/0, cost: 0 ms
doris-fe  | 2023-03-29 01:52:21,550 INFO (leaderCheckpointer|86) [BDBJEJournal.getFinalizedJournalId():405] database names: 1 
doris-fe  | 2023-03-29 01:52:21,550 INFO (leaderCheckpointer|86) [Checkpoint.doCheckpoint():94] last checkpoint journal id: 0, current finalized journal id: 0
doris-fe  | 2023-03-29 01:52:21,668 WARN (ForkJoinPool-2-worker-2|159) [TabletStatMgr.lambda$null$0():70] task exec error. backend[10003]
doris-fe  | org.apache.thrift.transport.TTransportException: Invalid port -1
doris-fe  |     at org.apache.thrift.transport.TSocket.open(TSocket.java:213) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.catalog.TabletStatMgr.lambda$null$0(TabletStatMgr.java:63) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:183) ~[?:1.8.0_342]
doris-fe  |     at java.util.Iterator.forEachRemaining(Iterator.java:116) ~[?:1.8.0_342]
doris-fe  |     at java.util.Spliterators$IteratorSpliterator.forEachRemaining(Spliterators.java:1801) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ForEachOps$ForEachTask.compute(ForEachOps.java:290) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.CountedCompleter.exec(CountedCompleter.java:731) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.doInvoke(ForkJoinTask.java:401) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.invoke(ForkJoinTask.java:734) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ForEachOps$ForEachOp.evaluateParallel(ForEachOps.java:159) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateParallel(ForEachOps.java:173) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:233) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:485) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:650) ~[?:1.8.0_342]
doris-fe  |     at org.apache.doris.catalog.TabletStatMgr.lambda$runAfterCatalogReady$1(TabletStatMgr.java:57) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at java.util.concurrent.ForkJoinTask$AdaptedRunnableAction.exec(ForkJoinTask.java:1386) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175) ~[?:1.8.0_342]
doris-fe  | 2023-03-29 01:52:21,669 INFO (tablet stat mgr|23) [TabletStatMgr.runAfterCatalogReady():125] finished to update index row num of all databases. cost: 0 ms

BE 日志

doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 172.20.80.2_9010_1680054434346      172.20.80.2     9010    8030    9030    9020    FOLLOWER        true    578058187       true    true    25      2023-03-29 01:48:20     true            doris-1.2.1-rc01-Unknown        Yes
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: MASTER FE is started!
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: add priority_networks 172.20.80.0/24 to /opt/apache-doris/be/conf/be.conf
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Register BE to FE is failed. retry.
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: BE successfully registered to FE!
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: BE is not register. retry.
doris-be  | 10003       default_cluster 172.20.80.3     9050    -1      -1      -1      NULL    NULL    false   false   false   0       0.000   1.000 B 0.000   0.00 %  0.00 %  0.000   {"location" : "default"}                        {"lastSuccessReportTabletsTime":"N/A","lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false}       0
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Init Check - Verify that BE is registered to FE successfully
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Ready to start BE!
doris-be  | 2023-03-29T01:48:44+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:48:44+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:49:04+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:49:04+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:49:25+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:49:25+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:49:45+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:49:45+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:50:14+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:50:14+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | Instruction check fail. The CPU does not support AVX2 instruction set.
doris-be  | 2023-03-29T01:50:36+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | For example, if your CPU does not support AVX2, you need to rebuild the Doris BE with: USE_AVX2=0 sh build.sh --be2023-03-29T01:50:36+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:50:57+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:50:57+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:51:17+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:51:17+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:51:38+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:51:38+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:51:58+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:51:58+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:52:19+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:52:19+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:52:39+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:52:39+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:52:59+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:53:00+00:00 [Note] [Entrypoint]: register is failed, wait next~

hungtcs avatar Mar 29 '23 01:03 hungtcs

注意,fe/be的节点配置文件是以追加模式写入的(吐槽),需要手动清理

mano233 avatar Mar 31 '23 14:03 mano233

add vm.max_map_count didn't work for me ,and got error like:

sysctl: permission denied on key "vm.max_map_count"

dachn avatar Apr 02 '23 00:04 dachn

ermission denied on key "vm.max_map_c

current value:sudo sysctl -a|grep max_map_count

permanent modification:sudo vim /etc/sysctl.conf

vm.max_map_count=2000000

sysctl -p

mano233 avatar Apr 04 '23 13:04 mano233

你使用的是哪个版本?新版本不会有这个问题

datayesJava2 @.***> 于2023年4月17日周一 15:02写道:

注意,fe/be的节点配置文件是以追加模式写入的(吐槽),需要手动清理

这个怎么清理,第一次启动成功了,后面再启动时就报java.net.ConnectException: Connection refused (Connection refused)错误 `

              BackendId: 13003
                Cluster: default_cluster
                     IP: 172.20.80.3
               HostName: doris-be.doris_doris_net
          HeartbeatPort: 9050
                 BePort: -1
               HttpPort: -1
               BrpcPort: -1
          LastStartTime: NULL
          LastHeartbeat: NULL
                  Alive: false
   SystemDecommissioned: false
  ClusterDecommissioned: false
              TabletNum: 0
       DataUsedCapacity: 0.000
          AvailCapacity: 1.000 B
          TotalCapacity: 0.000
                UsedPct: 0.00 %
         MaxDiskUsedPct: 0.00 %
     RemoteUsedCapacity: 0.000
                    Tag: {"location" : "default"}
                 ErrMsg: java.net.ConnectException: Connection refused (Connection refused)
                Version:
                 Status: {"lastSuccessReportTabletsTime":"N/A","lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false}
HeartbeatFailureCounter: 1
               NodeRole: 1 row in set (0.01 sec)`

—
Reply to this email directly, view it on GitHub
<https://github.com/apache/doris/issues/17286#issuecomment-1510809591>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AM5HVUQZZ2PDOSWGUGJSAL3XBTTG5ANCNFSM6AAAAAAVLWU4VE>
.
You are receiving this because you commented.Message ID:
***@***.***>

FreeOnePlus avatar Apr 24 '23 10:04 FreeOnePlus

我有个类似的问题,环境在 macbook M1 上,用的镜像是 apache/doris:1.2.2-fe-arm 与 apache/doris:1.2.2-be-arm。docker-compose 脚本首次是能运行的,后面 docker-compose down 之后在 up,无法启动。但是,删掉宿主机挂载路径路径的文件后,再次运行就可以了。不过的话,这样数据没办法一直保留了。

1.2.2的x86版本遇到了同样的问题,请问有找到保留数据的办法吗

actcwlf avatar Apr 27 '23 06:04 actcwlf

我这里有同样的问题,已经设置 vm.max_map_count 了。

FE 日志

doris-fe  | 2023-03-29 01:52:14,184 WARN (heartbeat mgr|21) [HeartbeatMgr.runAfterCatalogReady():139] get bad heartbeat response: type: BACKEND, status: BAD, msg: java.net.ConnectException: Connection refused (Connection refused), beId: 10003, beHost: 172.20.80.3, bePort: 0, httpPort: 0, brpcPort: 0
doris-fe  | 2023-03-29 01:52:19,227 WARN (heartbeat-mgr-pool-7|147) [HeartbeatMgr$BackendHeartbeatHandler.call():268] backend heartbeat got exception
doris-fe  | org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused (Connection refused)
doris-fe  |     at org.apache.thrift.transport.TSocket.open(TSocket.java:226) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:225) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:203) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_342]
doris-fe  |     at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_342]
doris-fe  | Caused by: java.net.ConnectException: Connection refused (Connection refused)
doris-fe  |     at java.net.PlainSocketImpl.socketConnect(Native Method) ~[?:1.8.0_342]
doris-fe  |     at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350) ~[?:1.8.0_342]
doris-fe  |     at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206) ~[?:1.8.0_342]
doris-fe  |     at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188) ~[?:1.8.0_342]
doris-fe  |     at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[?:1.8.0_342]
doris-fe  |     at java.net.Socket.connect(Socket.java:607) ~[?:1.8.0_342]
doris-fe  |     at org.apache.thrift.transport.TSocket.open(TSocket.java:221) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe  |     ... 13 more
doris-fe  | 2023-03-29 01:52:19,228 WARN (heartbeat mgr|21) [HeartbeatMgr.runAfterCatalogReady():139] get bad heartbeat response: type: BACKEND, status: BAD, msg: java.net.ConnectException: Connection refused (Connection refused), beId: 10003, beHost: 172.20.80.3, bePort: 0, httpPort: 0, brpcPort: 0
doris-fe  | 2023-03-29 01:52:21,546 INFO (tablet checker|28) [TabletChecker.checkTablets():331] finished to check tablets. unhealth/total/added/in_sched/not_ready: 0/0/0/0/0, cost: 0 ms
doris-fe  | 2023-03-29 01:52:21,550 INFO (leaderCheckpointer|86) [BDBJEJournal.getFinalizedJournalId():405] database names: 1 
doris-fe  | 2023-03-29 01:52:21,550 INFO (leaderCheckpointer|86) [Checkpoint.doCheckpoint():94] last checkpoint journal id: 0, current finalized journal id: 0
doris-fe  | 2023-03-29 01:52:21,668 WARN (ForkJoinPool-2-worker-2|159) [TabletStatMgr.lambda$null$0():70] task exec error. backend[10003]
doris-fe  | org.apache.thrift.transport.TTransportException: Invalid port -1
doris-fe  |     at org.apache.thrift.transport.TSocket.open(TSocket.java:213) ~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) ~[commons-pool2-2.2.jar:2.2]
doris-fe  |     at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at org.apache.doris.catalog.TabletStatMgr.lambda$null$0(TabletStatMgr.java:63) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:183) ~[?:1.8.0_342]
doris-fe  |     at java.util.Iterator.forEachRemaining(Iterator.java:116) ~[?:1.8.0_342]
doris-fe  |     at java.util.Spliterators$IteratorSpliterator.forEachRemaining(Spliterators.java:1801) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ForEachOps$ForEachTask.compute(ForEachOps.java:290) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.CountedCompleter.exec(CountedCompleter.java:731) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.doInvoke(ForkJoinTask.java:401) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.invoke(ForkJoinTask.java:734) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ForEachOps$ForEachOp.evaluateParallel(ForEachOps.java:159) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateParallel(ForEachOps.java:173) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:233) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:485) ~[?:1.8.0_342]
doris-fe  |     at java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:650) ~[?:1.8.0_342]
doris-fe  |     at org.apache.doris.catalog.TabletStatMgr.lambda$runAfterCatalogReady$1(TabletStatMgr.java:57) ~[doris-fe.jar:1.0-SNAPSHOT]
doris-fe  |     at java.util.concurrent.ForkJoinTask$AdaptedRunnableAction.exec(ForkJoinTask.java:1386) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) ~[?:1.8.0_342]
doris-fe  |     at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175) ~[?:1.8.0_342]
doris-fe  | 2023-03-29 01:52:21,669 INFO (tablet stat mgr|23) [TabletStatMgr.runAfterCatalogReady():125] finished to update index row num of all databases. cost: 0 ms

BE 日志

doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 172.20.80.2_9010_1680054434346      172.20.80.2     9010    8030    9030    9020    FOLLOWER        true    578058187       true    true    25      2023-03-29 01:48:20     true            doris-1.2.1-rc01-Unknown        Yes
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: MASTER FE is started!
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: add priority_networks 172.20.80.0/24 to /opt/apache-doris/be/conf/be.conf
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Register BE to FE is failed. retry.
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: BE successfully registered to FE!
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: BE is not register. retry.
doris-be  | 10003       default_cluster 172.20.80.3     9050    -1      -1      -1      NULL    NULL    false   false   false   0       0.000   1.000 B 0.000   0.00 %  0.00 %  0.000   {"location" : "default"}                        {"lastSuccessReportTabletsTime":"N/A","lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false}       0
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Init Check - Verify that BE is registered to FE successfully
doris-be  | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Ready to start BE!
doris-be  | 2023-03-29T01:48:44+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:48:44+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:49:04+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:49:04+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:49:25+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:49:25+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:49:45+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:49:45+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:50:14+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:50:14+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | Instruction check fail. The CPU does not support AVX2 instruction set.
doris-be  | 2023-03-29T01:50:36+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | For example, if your CPU does not support AVX2, you need to rebuild the Doris BE with: USE_AVX2=0 sh build.sh --be2023-03-29T01:50:36+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:50:57+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:50:57+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:51:17+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:51:17+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:51:38+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:51:38+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:51:58+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:51:58+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:52:19+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:52:19+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:52:39+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:52:39+00:00 [Note] [Entrypoint]: register is failed, wait next~
doris-be  | 2023-03-29T01:52:59+00:00 [Warn] [Entrypoint]: start check be status~
doris-be  | 2023-03-29T01:53:00+00:00 [Note] [Entrypoint]: register is failed, wait next~

我也遇到了一样的问题,部署完几天后就出现了这个问题。

modyuan avatar May 25 '23 03:05 modyuan

https://github.com/apache/doris/discussions/19160 @FreeOnePlus 上面这个连接中提到了docker镜像里不能改密码,真的是这样吗?那这样的话,是不是官方给的docker-compose就不能在生产中使用了?

modyuan avatar May 25 '23 03:05 modyuan