YuweiXIAO comments

Results 39 comments of


                                            YuweiXIAO

如何将mp3文件转换成支持的语音文件

https://ffmpeg.org/ 安装一下命令汗工具，然后 ffmpeg -i xxx.mp4 xxx.wav

[HUDI-4373] Flink Consistent hashing bucket index write path code

> [4373.patch.zip](https://github.com/apache/hudi/files/9782092/4373.patch.zip) Thank, i have reviewed and applied a patch, let's move the clustering update strategy logic into sub-clazzs of `StreamWriteFunction` first. Thanks for the patch, Danny! Moving the update...

[HUDI-4824]Add new index RANGE_BUCKET , when primary key is auto-increment like most mysql table

> > Hey, thanks for the contribution. It is a great enhancement for bucket index. > > On high-level, could we use the current BucketIndex abstraction to unify the implementation...

stack alignment of matrial bindata causes segmentfault

Thanks for the reply! I tested the demo and tests. It works and no error reported. My project is just a toy example trying polyscope. I found the following setting...

[SUPPORT] Hudi table has duplicate data.

Could you share your write config, e.g., operation type and index type.

[SUPPORT] Hudi table has duplicate data.

> In this test, we did not change the index, we only used the bloom index Through the test, I saw a strange phenomenon. At the beginning, the data was...

[SUPPORT] Hudi table has duplicate data.

@jjtjiang Hey, another guess is the merging of the base file and log files is turned off. In your latest test, could you paste the full query result of the...

[SUPPORT] Hudi table has duplicate data.

Hi @jjtjiang , looking at the data you posted. I am wondering if you enable the de-duplication option during writing. Because there are records with the same key in a...

[SUPPORT] Hudi table has duplicate data.

There is an option `hoodie.datasource.merge.type` to control the combine logic during read, the default is merging records. The option is for spark read, not sure what is the behavior in...

[SUPPORT] Hudi table has duplicate data.

Yes, it should do dedup for log files. I'll test the default behavior when reading, to see if there is any potential bug.