GreatRiver comments

Results 92 comments of


                                            GreatRiver

[Bug]: [customer regression test] load data 'lost connection'

bvt的峰值内存大概11G左右，跑完结束等待几分钟，大概res是8G。load tpch 期间增加4G左右的内存，宿主机15G，docker limit也是15G，如果跑完bvt 内存释放不及时，马上load tpch 1G，mo-service很可能超过14G，这样宿主机就会因为oom kill mo-service。手动测试了一下，跑完bvt 7.7G，load tpch峰值能到11G，load完成后过段时间又回到7.7G。感觉可以增加内存到18G就可以解决，或者调整mem cache为默认的512M，现在是1.5G

[Bug]: [customer regression test] load data 'lost connection'

[mem.zip](https://github.com/user-attachments/files/21463475/mem.zip) 007 load 前 009 load 后 010 load 后等待几分钟

[Bug]: [customer regression test] load data 'lost connection'

[Bug]: 2.1.1-hotfix tke regression: tpch 1t 4cn test 'rpc timeout'

https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22Ykg%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-branch-nightly-34de694e7-20250609%5C%22,%20pod%3D%5C%22nightly-regression-dis-tp-cn-jw8wf%5C%22%7D%20%7C%3D%20%60slow%20event%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221749529440000%22,%22to%22:%221749529606000%22%7D%7D%7D&schemaVersion=1&orgId=1 在这个时间段有一个cn s3的读压力很大，延迟很高，导致rpc timeout

[Bug]: 2.1.1-hotfix tke regression: tpch 1t 4cn test 'rpc timeout'

https://grafana.ci.matrixorigin.cn/d/ae3ttwut16qdca/fileservice-metrics?orgId=1&from=1749529200000&to=1749531599000&var-interval=1m&var-namespace=mo-branch-nightly-34de694e7-20250609&var-pod=nightly-regression-dis-tp-cn-jw8wf

[Bug]: TestCheckpointChaos3 ut fail

https://github.com/matrixorigin/matrixone/actions/runs/13968520227/job/39104416001?pr=21572

[Enhancement]: add bloom filter and zonemap info in checkpoint

这是tn这边基于GC的checkpoint优化，待1.3 checkpoint merge 完成后再开始开发

[Bug]: TN keeps panic and restart

还没复现

[Bug]: TN keeps panic and restart

还没复现