WilliamZhu comments

Results 115 comments of


                                            WilliamZhu

Paging does not work with aggregation query by command OFFSET ... LIMIT ...

Caching result query in memory may crash user's application ,or may even crash ES. Also this behavior can cause some confusion in people who lack of basic search engine concept.

Not implemented? SQLSubqueryTableSource cannot be cast to SQLJoinTableSource

If wanna sql supports join,subquery, you may try es-hadoop project which provide spark-sql support On Tue, Nov 29, 2016 at 10:51 PM, Lars wrote: > Okay, thats look really nice...

file number problem

check: https://github.com/allwefantasy/delta-plus/pull/9

compaction(delta plus) 在第三种方案里是不需要的。原因是因为在每次同步的时候，delta-plus会自动控制文件数目。如果你的hive满足要求的话，官方已经提供了hive 读delta 的[connector](https://github.com/delta-io/connectors)，并不需要再导入到hive， hive可以直接读取delta。所以可以实现非常低的延时。

[Engine][2.1.0][Feature] 查看某个Job涉及到的·executor所在Node节点的CPU/内存情况

目的：排查Yarn宿主系统对adhoc查询的影响

TfIdfInPlace 在 Byzer 3.0不能使用

是的。目前这两个类有三个问题： 1. 实现的不好，需要找时间重构 2. 依赖的分词包不在 Maven仓库中 3. 在 3.0 里有兼容性问题还没来得及修所以我们暂时从文档删除了他两。用户可以先用 Byzer-python 中引入 python库来完成相关工作。

[Engine][2.1.0][Enhance] 集群资源太少时，使用!ray 命令时应该报错

没有修正这个问题会导致如下ISSUE : https://github.com/allwefantasy/mlsql/issues/1423

使用mlsql中小细节梳理

是不是可以将这些点弄成一个小贴士集锦的文章？否则就一篇文章而言，信息量有点太少。

python ray运行报错

1. mlsql-engine 使用 2.1.0-SNAPSHOT版本 2. Ray 使用 1.3.0版本将代码： ```python ray.init(redis_address="192.168.6.180:17997", redis_password="123456") ray_context = RayContext.connect(globals(), None) ``` 修改成 ```python ray_context = RayContext.connect(globals(),"192.168.6.180:10001") ``` PS: ,如果你使用2.1.0版本，那么这个版本暂时锁定了 ray 0.8.0版本。版本需要配套。然后连接采用下面的方式： ``` ray_context...

python ray运行报错

driver 端必要的依赖： ``` pip install Cython pip install ray==1.3.0 pip install aiohttp psutil setproctitle grpcio pandas xlsxwriter pip install watchdog requests click uuid sfcli pip install pyjava ```