docs-v2/content/v2.0/reference/flux/stdlib/built-in/transformations/histogram.md

title: histogram() function
description: The `histogram()` function approximates the cumulative distribution of a dataset by counting data frequencies for a list of bins.
aliases:
  - /v2.0/reference/flux/functions/transformations/histogram
  - /v2.0/reference/flux/functions/built-in/transformations/histogram/
menu:
  v2_0_ref:
    name: histogram
    parent: built-in-transformations
weight: 402
related:
  - /v2.0/query-data/flux/histograms/

The histogram() function approximates the cumulative distribution of a dataset by counting data frequencies for a list of bins. Each bin is defined by an upper bound, and every data point less than or equal to that bound is counted in the bin. Bin counts are cumulative, so each bin's count includes the counts of all bins with lower bounds.

Each input table is converted into a single output table representing a single histogram. The output table has the same group key as the input table. Columns not part of the group key are removed and an upper bound column and a count column are added.
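As a rough illustration of the cumulative counting described above, the sketch below builds a small in-memory table and passes it to histogram(). It assumes the `array` package and its `array.from()` function are available in your Flux version; the sample values are made up for the example.

```flux
import "array"

// Four hypothetical float values in the default "_value" column.
array.from(rows: [{_value: 10.0}, {_value: 60.0}, {_value: 80.0}, {_value: 95.0}])
    // Each bound counts every value less than or equal to it.
    |> histogram(bins: [50.0, 75.0, 90.0, 100.0])
```

Following the definition of a bin, the output should contain one row per bound, with `le` values of 50, 75, 90, and 100 and cumulative counts of 1, 2, 3, and 4.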

Function type: Transformation
Output data type: Object

histogram(column: "_value", upperBoundColumn: "le", countColumn: "_value", bins: [50.0, 75.0, 90.0], normalize: false)

Parameters

column

The name of a column containing input data values. The column type must be float. Defaults to "_value".

Data type: String

upperBoundColumn

The name of the column in which to store the histogram's upper bounds. Defaults to "le".

Data type: String

countColumn

The name of the column in which to store the histogram counts. Defaults to "_value".

Data type: String

bins

A list of upper bounds to use when computing the histogram frequencies. The list should include a bin whose bound is the maximum value of the data set. If the maximum is not known, set this bound to positive infinity.

Data type: Array of floats
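When the maximum value is not known, one way to supply a positive-infinity bound is `math.mInf()` from the `math` package. The sketch below is only an illustration: the bucket name, measurement, and field are placeholders, not values taken from this page.

```flux
import "math"

from(bucket: "example-bucket") // hypothetical bucket
    |> range(start: -5m)
    // hypothetical measurement and field
    |> filter(fn: (r) => r._measurement == "mem" and r._field == "used_percent")
    // The final bound is +Inf, so every value falls into some bin.
    |> histogram(bins: [50.0, 75.0, 90.0, math.mInf(sign: 1)])
```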

Bin helper functions

The following helper functions can be used to generate bins.

linearBins()
logarithmicBins()
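For instance, a sketch using logarithmicBins() might look like the following. It assumes the `array` package is available for building sample input, and it sets `infinity: true` explicitly so that a final positive-infinity bound is appended; the sample values are invented for illustration.

```flux
import "array"

array.from(rows: [{_value: 3.0}, {_value: 40.0}, {_value: 700.0}])
    |> histogram(
        // Upper bounds start at 1.0 and double each step (1, 2, 4, ...),
        // followed by a positive-infinity bound.
        bins: logarithmicBins(start: 1.0, factor: 2.0, count: 10, infinity: true)
    )
```

Logarithmic bins tend to suit data that spans several orders of magnitude, while linear bins suit data spread evenly over a known range.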

normalize

When true, converts the counts into frequency values between 0 and 1. Defaults to false.

Data type: Boolean

{{% note %}} Normalized histograms cannot be aggregated by summing their counts. {{% /note %}}
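A minimal sketch of a normalized histogram, assuming the `array` package is available and using made-up sample values:

```flux
import "array"

array.from(rows: [{_value: 10.0}, {_value: 60.0}, {_value: 80.0}, {_value: 95.0}])
    // With normalize: true, the count column holds cumulative
    // fractions of the input rather than raw counts.
    |> histogram(bins: [50.0, 75.0, 90.0, 100.0], normalize: true)
```

Because every sample value is at or below the last bound, the final bin should reach 1.0. Fractions like these depend on the size of each input table, which is why they cannot be combined across histograms by summing.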

Examples

Histogram with dynamically generated bins
// Dynamically generate 10 linear bins, each 10.0 wide, starting at 0.0
histogram(
  bins: linearBins(start: 0.0, width: 10.0, count: 10)
)