YuweiXIAO
YuweiXIAO
https://ffmpeg.org/ 安装一下命令汗工具,然后 ffmpeg -i xxx.mp4 xxx.wav
> [4373.patch.zip](https://github.com/apache/hudi/files/9782092/4373.patch.zip) Thank, i have reviewed and applied a patch, let's move the clustering update strategy logic into sub-clazzs of `StreamWriteFunction` first. Thanks for the patch, Danny! Moving the update...
> > Hey, thanks for the contribution. It is a great enhancement for bucket index. > > On high-level, could we use the current BucketIndex abstraction to unify the implementation...
Thanks for the reply! I tested the demo and tests. It works and no error reported. My project is just a toy example trying polyscope. I found the following setting...
Could you share your write config, e.g., operation type and index type.
> In this test, we did not change the index, we only used the bloom index Through the test, I saw a strange phenomenon. At the beginning, the data was...
@jjtjiang Hey, another guess is the merging of the base file and log files is turned off. In your latest test, could you paste the full query result of the...
Hi @jjtjiang , looking at the data you posted. I am wondering if you enable the de-duplication option during writing. Because there are records with the same key in a...
There is an option `hoodie.datasource.merge.type` to control the combine logic during read, the default is merging records. The option is for spark read, not sure what is the behavior in...
Yes, it should do dedup for log files. I'll test the default behavior when reading, to see if there is any potential bug.