hustnn

Results 16 issues of hustnn

# Description Let rename can work even without s3 lock. It is helpful for single writer case or the concurrency can be done in application side.

I tried the grizzly data_cleaning demo in https://github.com/weld-project/weld/tree/master/examples/python/grizzly in my macbook. The pandas version is 0.19.2. The performance improvement actually is very close. Native: Total end-to-end time: 1.62 Grizzly: Total...

performance

I found that memory usage of grizzle is much larger than pandas. Then I go into it and find that it is may be caused by change the encoding type...

I am using https://shell.duckdb.org/ to read a parquet from S3. ``` SET s3_region='us-east-1'; SET s3_access_key_id='xxx'; SET s3_secret_access_key='xxx'; select count(*) from read_parquet('s3://xxx/part-1.gz.parquet'); ``` I also followed https://docs.aws.amazon.com/AmazonS3/latest/userguide/enabling-cors-examples.html to set CORS for...

https://pola-rs.github.io/polars/polars/docs/lazy/index.html#black-box-function I met the error `Error: SchemaMisMatch("Series of dtype: List(Float64) != Struct")` when running apply_multiples with version 0.23.2.

bug
good first issue

Compared with traditional SQL query. It seems that the query has status and the query result can be monitored continuously and the result is updated once some monitored values change....

## Enhancement By default, the `vw_schedule_algo` setting RM is using is `Unknown`, it is better to set `LocalRoundRobin` by default to achieve better vw resource usage.

enhancement
good first issue

Welcome to share your ideas on the roadmap. The updated roadmap for Q3 and Q4 are shown below. ## Storage - [x] Data cache preload - Q1 https://github.com/ByConity/ByConity/pull/189 - [x]...

enhancement

## Feature Request ### Describe the feature you'd like Since we already integrate the HIVE in ci with https://github.com/ByConity/ByConity/pull/426,now we can add hive related test cases to ci

CI