influxdb

History

Dom Dwyer 7d0e3637ed perf(ingester): projection pushdown to data source Prior to this change projection pushdown was implemented as a filter, which meant a query using it would take the following steps: * Query arrives * Find necessary partition data * Copy all the partition data into a RecordBatch * Filter that RecordBatch to apply the projection * Return results to caller This is far from ideal, as the underlying partition data is copied in its entirety and then the unneeded columns discarded - a pure waste! After this PR, the projection is pushed down to the point of RecordBatch generation: * Query arrives * Find necessary partition data * Copy only the projected columns to a RecordBatch * Return results to the caller This minimises the amount of data copying, which for large amounts of data should lead to a meaningful performance improvement when querying for a subset of columns. It also uses a slightly more efficient projection implementation by using a single pass over the columns (still O(n) but less constant overhead).	2023-07-05 13:44:11 +02:00
..
src	perf(ingester): projection pushdown to data source	2023-07-05 13:44:11 +02:00
Cargo.toml	chore(deps): Bump indexmap from 1.9.3 to 2.0.0 (#8073 )	2023-06-26 08:52:51 +00:00

perf(ingester): projection pushdown to data source

Prior to this change projection pushdown was implemented as a filter,
which meant a query using it would take the following steps:

    * Query arrives
    * Find necessary partition data
    * Copy all the partition data into a RecordBatch
    * Filter that RecordBatch to apply the projection
    * Return results to caller

This is far from ideal, as the underlying partition data is copied in
its entirety and then the unneeded columns discarded - a pure waste!

After this PR, the projection is pushed down to the point of RecordBatch
generation:

    * Query arrives
    * Find necessary partition data
    * Copy only the projected columns to a RecordBatch
    * Return results to the caller

This minimises the amount of data copying, which for large amounts of
data should lead to a meaningful performance improvement when querying
for a subset of columns. It also uses a slightly more efficient
projection implementation by using a single pass over the columns (still
O(n) but less constant overhead).

2023-07-05 13:44:11 +02:00

src

perf(ingester): projection pushdown to data source

2023-07-05 13:44:11 +02:00

Cargo.toml

chore(deps): Bump indexmap from 1.9.3 to 2.0.0 (#8073 )

2023-06-26 08:52:51 +00:00