influxdb/compactor at 13ed1c089aa82e7fa6731d24d00812e4eb4455a8 - influxdb - Gitea: ArmstrongLabs

History

Andrew Lamb 66dbb9541f chore: Update datafusion and `arrow`/`parquet`/`arrow-flight` to 23.0.0, `thrift` to 0.16.0 (#5694 ) * chore: Update datafusion and `arrow`/`parquet`/`arrow-flight` to 23.0.0 * chore: Update thrift / remove parquet_format * fix: Update APIs * chore: Update lock + Run cargo hakari tasks * fix: use patched version of arrow-rs to work around https://github.com/apache/arrow-rs/issues/2779 * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>		2022-09-27 12:50:54 +00:00
..
src	feat: instead of adding num_files and memory budget into the reason text column, let us create differnt columns for them. We will be able to filter them easily (#5742 )	2022-09-26 20:14:04 +00:00
Cargo.toml	chore: Update datafusion and `arrow`/`parquet`/`arrow-flight` to 23.0.0, `thrift` to 0.16.0 (#5694 )	2022-09-27 12:50:54 +00:00
README.md	docs: add consensus for the desired final output of the compactor (#5069 )	2022-07-07 19:11:16 +00:00

README.md

After a partition of a table has not received any writes for some amount of time, the compactor will ensure it is stored in object store as N parquet files which:

have non overlapping time ranges
each does not exceed a size specified by config param max_desired_file_size_bytes.