Virushade

Results 10 issues of Virushade

Fixes #16956. ### Description This PR introduces min() and max() aggregators for string columns: `StringMinAggregator` and `StringMaxAggregator`, and their respective buffer and vector aggregators. These aggregators compute the minimum and...

Area - Documentation
Area - Querying

### Description Currently, Kubernetes jobs will retry for a total of 10 times to create a peon pod for ingestion tasks. However, there are some pain points that surface during...

Area - Documentation
Kubernetes

### Description This PR introduces the `druid-exact-count` extension, providing a new aggregation function for computing the exact distinct count of values within a dimension. Unlike approximate estimators like HyperLogLog, this...

Area - Documentation
Area - Querying
Area - Extension
Area - Dependencies
GHA

### Description According to the [log4j2 javadocs](https://logging.apache.org/log4j/2.x/javadoc/log4j-core/org/apache/logging/log4j/core/config/AppenderRef.html), `Appender-ref` is an alias for `AppenderRef`. I think we should standardize to use `AppenderRef` in our examples, so as not to confuse the...

Area - Documentation

### Description #### Concurrent loading of cached segments during startup Previously, we are using 1 thread to run `SegmentLocalCacheManager#getCachedSegments`. We may take quite a bit of time if the number...

Performance
Design Review

Attempt at #18781 . ### Description #### Metrics Reporting via GroupByQueryMetrics - Remove method signatures in codebase which rely on using PerQueryStats to retrieve metrics, and retrieve query stats via...

Area - Querying

Fixes #17902 Huge thanks to @gianm for the implementation tip in the issue! ### Description #### Tracking merge buffer usage - Usage of a direct byte buffer is done under...

Area - Documentation

Possibly fixes #17944 ### Motivation Currently, we have GroupByStatsMonitor, which provide metrics for GroupBy queries by: Aggregating results of all GroupBy metrics in a certain time interval. Emit them regularly...

Design Review
Proposal

Fixes #18446 ### Description This PR belongs to a set of PR's that hope to optimize the start-up time of Historical. I came across this problem when I am running...

Area - Documentation
Performance
Design Review

### Description - Propose a configurable startup strategy that eagerly loads only recent (“hot”) segments, while leaving older (“cold”) segments to load lazily on first access. - Propose to deprecate...

Feature/Change Description