influxdb/compactor
Nga Tran b8a80869d4
feat: introduce a new way of max_sequence_number for ingester, compactor and querier (#6692)
* feat: introduce a new way of max_sequence_number for ingester, compactor and querier

* chore: cleanup

* feat: new column max_l0_created_at to order files for deduplication

* chore: cleanup

* chore: debug info for chnaging cpu.parquet

* fix: update test parquet file

Co-authored-by: Marco Neumann <marco@crepererum.net>
2023-01-26 10:52:47 +00:00
..
src feat: introduce a new way of max_sequence_number for ingester, compactor and querier (#6692) 2023-01-26 10:52:47 +00:00
Cargo.toml chore(deps): Bump async-trait from 0.1.61 to 0.1.63 (#6660) 2023-01-23 08:41:27 +00:00
README.md docs: add consensus for the desired final output of the compactor (#5069) 2022-07-07 19:11:16 +00:00

README.md

After a partition of a table has not received any writes for some amount of time, the compactor will ensure it is stored in object store as N parquet files which:

  • have non overlapping time ranges
  • each does not exceed a size specified by config param max_desired_file_size_bytes.