influxdb/compactor
Carol (Nichols || Goulding) de74415cbe
feat: Gather parquet files for a partition compaction operation
Fixes #5118.

Given a partition ID, look up the non-deleted Parquet files for that
partition. Separate them into level 0 and level 1, and sort the level 0
files by max sequence number.

This is not called anywhere yet.
2022-07-13 16:53:21 -04:00
..
src feat: Gather parquet files for a partition compaction operation 2022-07-13 16:53:21 -04:00
Cargo.toml chore: Update datafusion + arrow/parquet/arrow-flight to `17.0.0` (#5021) 2022-07-07 19:22:15 +00:00
README.md docs: add consensus for the desired final output of the compactor (#5069) 2022-07-07 19:11:16 +00:00

README.md

After a partition of a table has not received any writes for some amount of time, the compactor will ensure it is stored in object store as N parquet files which:

  • have non overlapping time ranges
  • each does not exceed a size specified by config param max_desired_file_size_bytes.