Tengjun Jin

Results 11 issues of Tengjun Jin

Check the limit result is a subset of the full result, current check may not work for floating point numbers due to weird serialization issues.

help wanted

https://github.com/ddkang/aidb/blob/bef78d2339b0f6f654467f58651f80a21ae1f2da/aidb/engine/approx_aggregate_join_engine.py#L215-L218 The program utilizes a column name as a key since the dataframe retrieved from the database use this column name instead of the format {table}.{column}. Consequently, when two dataframes...

We have many validity checks for different queries. It's better to put them together and check the validity first.

Raise an issue _Originally posted by @ddkang in https://github.com/ddkang/aidb/pull/145#discussion_r1434128359_ DuckDB enables SQL querying over a DataFrame. To remove the dependency of DuckDB, the same functionality can be implemented in a...

For queries that require multiple inference services, our system does not support using User Defined Functions (UDFs) as intermediate filter predicates for filtering inputs. Consider the following example query: ```sql...

'SELECT confidence FROM objects00' will raise an error.

For counts, what's wrong with setting it to the estimated count? I think for derived rows, we want to estimate the count and do proportional importance sampling or something actually...

Need add python version requirement in setup.py

help wanted

When testing the law dataset, an exception occurred during the HTTP inference process. Upon reviewing the dataset, I discovered that the presence of NULL values in some columns was the...

help wanted

When call Hugging Face API many times like 1000, in a short time. There will raise a HTTPError:429 Client Error: Too many Requests for url https://github.com/ddkang/aidb-new/blob/3049bd4f52f6cc1ae56770f1158c174b373052ce/aidb/engine/full_scan_engine.py#L22-L26 ![Screenshot 2023-11-06 at 11...