SimpleAggregateFunction

SimpleAggregateFunction(name, types_of_arguments...) data type stores current value (intermediate state) of the aggregate function, but not its full state as AggregateFunction does. This optimization can be applied to functions for which the following property holds: the result of applying a function f to a row set S1 UNION ALL S2 can be obtained by applying f to parts of the row set separately, and then again applying f to the results: f(S1 UNION ALL S2) = f(f(S1) UNION ALL f(S2)). This property guarantees that partial aggregation results are enough to compute the combined one, so you don't have to store and process additional data.

The common way to produce an aggregate function value is by calling the aggregate function with the -SimpleState suffix.

The following aggregate functions are supported:

  • any
  • anyLast
  • min
  • max
  • sum
  • sumWithOverflow
  • groupBitAnd
  • groupBitOr
  • groupBitXor
  • groupArrayArray
  • groupUniqArrayArray
  • sumMap
  • minMap
  • maxMap

Values of the SimpleAggregateFunction(func, Type) look and stored the same way as Type, so you do not need to apply functions with -Merge/-State suffixes.

SimpleAggregateFunction has better performance than AggregateFunction with same aggregation function.

### Parameters

  • Name of the aggregate function.
  • Types of the aggregate function arguments.
Was this page helpful?
Updated