8955 Commits (439bcf08d98066c167b3b144d4caaecd2853babe)

Author SHA1 Message Date
Joe-Blount 439bcf08d9
Merge pull request #5583 from influxdata/jrb_1_skip_empty_splits
fix: avoid splitting compaction output for time ranges with no chunks
2022-09-08 10:43:31 -05:00
Joe-Blount 06982b80a4
Merge branch 'main' into jrb_1_skip_empty_splits 2022-09-08 10:31:05 -05:00
Marco Neumann c3b47dfe59
refactor: disable read buffer usage in querier (#5579)
* refactor: read querier parquet files from cache

* refactor: only use parquet files in querier (no RB)

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-08 13:18:22 +00:00
Marco Neumann a636b9ec1f
feat: instrument OS `HEAD` and `GET_RANGE` requests (#5587)
* feat: instrument OS `HEAD` requests

* feat: instrument OS `GET_RANGE` requests
2022-09-08 13:10:39 +00:00
Marco Neumann dab69e573f
refactor: avoid double-hash when updating addressable heap orders (#5577)
* feat: naive `AddressableHeap::update_order`

* refactor: use `update_order` within LRU policy

* test: add benchmark for `AddressableHeap::update_order`

* refactor: avoid double-hash when updating addressable heap orders

```text
update_order_existing_after_n_elements/1
                        time:   [25.483 ns 25.513 ns 25.547 ns]
                        change: [-42.490% -42.365% -42.247%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
  3 (3.00%) high severe
update_order_existing_after_n_elements/10
                        time:   [68.158 ns 68.211 ns 68.266 ns]
                        change: [-19.391% -19.131% -18.952%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high mild
update_order_existing_after_n_elements/100
                        time:   [128.10 ns 128.43 ns 128.83 ns]
                        change: [-17.732% -17.531% -17.255%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) high mild
  4 (4.00%) high severe
update_order_existing_after_n_elements/1000
                        time:   [223.08 ns 224.06 ns 225.30 ns]
                        change: [-9.0635% -8.5828% -7.9794%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe
update_order_existing_after_n_elements/10000
                        time:   [1.0032 µs 1.0216 µs 1.0402 µs]
                        change: [-6.0920% -3.7038% -1.0826%] (p = 0.01 < 0.05)
                        Performance has improved.

update_order_new_after_n_elements/0
                        time:   [35.898 ns 35.919 ns 35.943 ns]
                        change: [+183.39% +183.77% +184.12%] (p = 0.00 < 0.05)
                        Performance has regressed.
Found 4 outliers among 100 measurements (4.00%)
  3 (3.00%) high mild
  1 (1.00%) high severe
update_order_new_after_n_elements/1
                        time:   [13.273 ns 13.299 ns 13.344 ns]
                        change: [-6.6980% -5.9798% -5.2633%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 6 outliers among 100 measurements (6.00%)
  2 (2.00%) high mild
  4 (4.00%) high severe
update_order_new_after_n_elements/10
                        time:   [14.010 ns 14.084 ns 14.183 ns]
                        change: [-13.579% -13.117% -12.553%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  2 (2.00%) high mild
  9 (9.00%) high severe
update_order_new_after_n_elements/100
                        time:   [23.846 ns 23.883 ns 23.921 ns]
                        change: [-4.7412% -4.3738% -4.0715%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
  3 (3.00%) high mild
update_order_new_after_n_elements/1000
                        time:   [28.590 ns 28.646 ns 28.705 ns]
                        change: [-4.1597% -3.6132% -3.0701%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 2 outliers among 100 measurements (2.00%)
  2 (2.00%) high mild
update_order_new_after_n_elements/10000
                        time:   [31.459 ns 31.975 ns 32.601 ns]
                        change: [-32.153% -20.689% -11.961%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  5 (5.00%) high mild
  2 (2.00%) high severe
```

Improvements might be even bigger for more expensive hash functions
(e.g. for `K = Arc<str>`).

Note that there is one outlier: `update_order_new_after_n_elements/0`. I
suspect this is due to slightly different compiler decisions (there is
no technical difference for "update a key of an empty heap"). Since this
case is also pretty uncommon in practice (only ~once when the process
boots up), I deem this acceptable.

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-08 11:50:55 +00:00
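The single-hash idea behind the `update_order` commit above (dab69e573f) can be sketched roughly as follows. This is a minimal illustration, not the IOx implementation: `SimpleHeap`, its field layout (a hash map plus a vector sorted by `(order, key)`), and `insert_new` are assumptions made for the example. The point is that the order is swapped in place through one `get_mut` lookup, and the sorted queue is then fixed up by binary search, which needs no hashing.

```rust
use std::collections::hash_map::Entry;
use std::collections::HashMap;
use std::hash::Hash;

/// Hypothetical, simplified addressable heap: a hash map for key lookups
/// plus a vector of `(order, key)` pairs kept sorted.
pub struct SimpleHeap<K, V, O> {
    entries: HashMap<K, (V, O)>,
    queue: Vec<(O, K)>,
}

impl<K, V, O> SimpleHeap<K, V, O>
where
    K: Clone + Eq + Hash + Ord,
    O: Clone + Ord,
{
    pub fn new() -> Self {
        Self { entries: HashMap::new(), queue: Vec::new() }
    }

    /// Setup helper: add a key that is not present yet. Returns `false`
    /// (and changes nothing) if the key already exists.
    pub fn insert_new(&mut self, k: K, v: V, o: O) -> bool {
        match self.entries.entry(k.clone()) {
            Entry::Occupied(_) => false,
            Entry::Vacant(slot) => {
                let pos = self
                    .queue
                    .binary_search_by(|(qo, qk)| (qo, qk).cmp(&(&o, &k)))
                    .unwrap_or_else(|p| p);
                self.queue.insert(pos, (o.clone(), k));
                slot.insert((v, o));
                true
            }
        }
    }

    /// Change the order of an existing key, hashing the key only once.
    /// Returns the previous order, or `None` if the key is unknown.
    pub fn update_order(&mut self, k: &K, new: O) -> Option<O> {
        // The only hash-map access: swap the stored order in place.
        let slot = self.entries.get_mut(k)?;
        let old = std::mem::replace(&mut slot.1, new.clone());

        // The queue is sorted, so both positions are found by binary search.
        let from = self
            .queue
            .binary_search_by(|(qo, qk)| (qo, qk).cmp(&(&old, k)))
            .expect("queue and map are in sync");
        let (_, key) = self.queue.remove(from);
        let to = self
            .queue
            .binary_search_by(|(qo, qk)| (qo, qk).cmp(&(&new, k)))
            .unwrap_or_else(|p| p);
        self.queue.insert(to, (new, key));
        Some(old)
    }

    /// Key and order of the entry with the smallest order.
    pub fn peek(&self) -> Option<(&K, &O)> {
        self.queue.first().map(|(o, k)| (k, o))
    }
}
```

A naive version would instead remove the entry and re-insert it, hashing the key twice, which is the cost the commit message describes avoiding.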
Marco Neumann 267a53a9e8
chore: update `tracing-subscriber`, fix trogging, fix CLI test port allocation (#5581)
* test: use dedicated ports for CLI tests

* chore: update `tracing-subscriber`

* fix: work around tracing-subscriber weirdness

It seems that trogging with tracing-subscriber >= 0.3.14 does not
produce any output at all. I suspect we are hitting
<https://github.com/tokio-rs/tracing/issues/2265>. Let's change the
construct to not use multiple optional layers but a single dyn-dispatch
layer. Logging shouldn't have such a high throughput that this makes any
difference, esp. because the dyn-dispatch happens AFTER the filter.
2022-09-08 09:37:37 +00:00
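The "single dyn-dispatch layer" construct from the commit above (267a53a9e8) might look roughly like this. The sketch uses only the standard library: the `Layer` trait, the `Logfmt`/`Json` types, `Format`, `build_layer`, and `log` are all hypothetical stand-ins and are not the `tracing-subscriber` API. It only shows the shape of the change: pick one boxed layer at start-up instead of stacking several optional ones, and run the filter before the dynamic call.

```rust
/// Hypothetical stand-in for a logging layer (not the `tracing-subscriber` trait).
pub trait Layer {
    fn on_event(&self, msg: &str) -> String;
}

pub struct Logfmt;
pub struct Json;

impl Layer for Logfmt {
    fn on_event(&self, msg: &str) -> String {
        format!("msg=\"{msg}\"")
    }
}

impl Layer for Json {
    fn on_event(&self, msg: &str) -> String {
        format!("{{\"msg\":\"{msg}\"}}")
    }
}

/// Output format chosen at start-up (illustrative).
pub enum Format {
    Logfmt,
    Json,
}

/// Build exactly one layer behind a trait object, rather than keeping
/// one `Option<...>` per format and stacking them all.
pub fn build_layer(format: Format) -> Box<dyn Layer> {
    match format {
        Format::Logfmt => Box::new(Logfmt),
        Format::Json => Box::new(Json),
    }
}

/// Filter first, then dispatch dynamically: events below `min_level`
/// never reach the virtual call.
pub fn log(layer: &dyn Layer, min_level: u8, level: u8, msg: &str) -> Option<String> {
    if level < min_level {
        return None;
    }
    Some(layer.on_event(msg))
}
```

Because filtered-out events return before `on_event` is called, the cost of the dynamic dispatch is only paid for events that are actually emitted.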
dependabot[bot] 48ff036b86
chore(deps): Bump url from 2.2.2 to 2.3.0 (#5585)
* chore(deps): Bump url from 2.2.2 to 2.3.0

Bumps [url](https://github.com/servo/rust-url) from 2.2.2 to 2.3.0.
- [Release notes](https://github.com/servo/rust-url/releases)
- [Commits](https://github.com/servo/rust-url/compare/v2.2.2...v2.3.0)

---
updated-dependencies:
- dependency-name: url
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

* chore: Run cargo hakari tasks

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: CircleCI[bot] <circleci@influxdata.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-08 08:36:48 +00:00
dependabot[bot] 423b4f874a
chore(deps): Bump tonic from 0.8.0 to 0.8.1 (#5584)
Bumps [tonic](https://github.com/hyperium/tonic) from 0.8.0 to 0.8.1.
- [Release notes](https://github.com/hyperium/tonic/releases)
- [Changelog](https://github.com/hyperium/tonic/blob/master/CHANGELOG.md)
- [Commits](https://github.com/hyperium/tonic/compare/v0.8.0...v0.8.1)

---
updated-dependencies:
- dependency-name: tonic
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-08 08:29:41 +00:00
Dom 331d780f99
Merge pull request #5586 from influxdata/dependabot/cargo/tonic-health-0.7.1
chore(deps): Bump tonic-health from 0.7.0 to 0.7.1
2022-09-08 09:21:33 +01:00
dependabot[bot] 297ce7d198
chore(deps): Bump tonic-health from 0.7.0 to 0.7.1
Bumps [tonic-health](https://github.com/hyperium/tonic) from 0.7.0 to 0.7.1.
- [Release notes](https://github.com/hyperium/tonic/releases)
- [Changelog](https://github.com/hyperium/tonic/blob/master/CHANGELOG.md)
- [Commits](https://github.com/hyperium/tonic/compare/v0.7.0...v0.7.1)

---
updated-dependencies:
- dependency-name: tonic-health
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-09-08 01:20:18 +00:00
Joe-Blount 333cfa4f3c chore: address comments - use TimestampMinMax passed by reference 2022-09-07 16:36:39 -05:00
Joe-Blount 97ebad5adb chore: rustfmt changes 2022-09-07 13:22:36 -05:00
Joe-Blount 4188230694 fix: avoid splitting compaction output for time ranges with no chunks 2022-09-07 13:01:14 -05:00
kodiakhq[bot] e01b645696
Merge pull request #5536 from influxdata/cn/prepare-for-the-cold
refactor: Prepare for sharing some code between hot and cold compaction
2022-09-07 17:39:26 +00:00
Carol (Nichols || Goulding) b5ca99a3d5
refactor: Make CompactorConfig fields pub
I'm spending way too long with the wrong number of arguments to
CompactorConfig::new and not a lot of help from the compiler. If these
struct fields are pub, they can be set directly and destructured, etc,
which the compiler gives way more help on. This also reduces duplication
and boilerplate that has to be updated when the config fields change.
2022-09-07 13:28:19 -04:00
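To illustrate the argument in the commit above (b5ca99a3d5): with public fields, a config can be built with a struct literal and taken apart by destructuring, and the compiler names any missing or unknown field. The struct below is a hypothetical stand-in with illustrative field names and values, not the real `CompactorConfig` definition; `example_config` and `describe` are likewise made up for the example.

```rust
/// Hypothetical config; field names and values are illustrative only.
#[derive(Debug, Clone, PartialEq)]
pub struct CompactorConfig {
    pub max_desired_file_size_bytes: u64,
    pub percentage_max_file_size: u16,
    pub split_percentage: u16,
}

/// Struct-literal construction: every field is named at the call site,
/// so a missing or misspelled field is a compile error that says which one.
pub fn example_config() -> CompactorConfig {
    CompactorConfig {
        max_desired_file_size_bytes: 100 * 1024 * 1024,
        percentage_max_file_size: 30,
        split_percentage: 80,
    }
}

/// Destructuring without `..`: adding a field to the struct later makes
/// this function fail to compile until the new field is handled.
pub fn describe(config: &CompactorConfig) -> String {
    let CompactorConfig {
        max_desired_file_size_bytes,
        percentage_max_file_size,
        split_percentage,
    } = config;
    format!(
        "max={max_desired_file_size_bytes} pct={percentage_max_file_size} split={split_percentage}"
    )
}
```

Compare this with a positional `new(a, b, c)` constructor, where swapping two arguments of the same type compiles silently.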
Carol (Nichols || Goulding) 54eea79773
refactor: Make filtering the parquet files into a closure argument too
So that the cold compaction can use different filtering but still use
the memory budget function.

Not sure I'm happy with this yet, but it's a start.
2022-09-07 13:26:42 -04:00
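The closure-argument approach from the commit above (54eea79773) could be sketched like this. Everything here is an assumption for illustration: the `ParquetFile` fields, the `group_within_budget` name, and the "take files until the budget is exhausted" rule are not taken from the IOx code. The sketch only shows how hot and cold compaction could share one budget function while passing different filters.

```rust
/// Hypothetical, minimal file description.
#[derive(Debug, Clone, PartialEq)]
pub struct ParquetFile {
    pub id: u64,
    pub level: u8,
    pub size_bytes: u64,
}

/// Keep the files accepted by `filter`, in order, for as long as their
/// combined size stays within `budget_bytes`.
pub fn group_within_budget<F>(
    files: Vec<ParquetFile>,
    budget_bytes: u64,
    filter: F,
) -> Vec<ParquetFile>
where
    F: Fn(&ParquetFile) -> bool,
{
    let mut used = 0u64;
    let mut picked = Vec::new();
    for file in files.into_iter().filter(|f| filter(f)) {
        // Stop at the first file that would exceed the budget.
        if used.saturating_add(file.size_bytes) > budget_bytes {
            break;
        }
        used += file.size_bytes;
        picked.push(file);
    }
    picked
}
```

A hot-compaction caller might pass `|f| f.level == 0` and a cold-compaction caller a different predicate, while the budget logic stays in one place.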
Carol (Nichols || Goulding) 3e76a155f7
refactor: Make memory budget compaction group function more general
In preparation for using it for cold compaction too.
2022-09-07 13:26:42 -04:00
Carol (Nichols || Goulding) 1f69d11d46
refactor: Move hot compaction function into hot compaction module 2022-09-07 13:26:40 -04:00
Carol (Nichols || Goulding) 85fb0acea6
refactor: Extract read_parquet_file test helper function to iox_tests::utils 2022-09-07 13:21:28 -04:00
Marco Neumann eee7007c71
chore: `cargo update` (#5580)
* chore: update `rand`

* chore: update `crossbeam-utils`

* chore: update `tracing-core`

* chore: update `tracing`

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-07 16:07:35 +00:00
Marco Neumann d24fb0eae7
fix: support `u64` in `query_functions::selectors` (#5582)
This seems to be used for InfluxRPC and is currently panicking the prod
querier from time to time.
2022-09-07 15:59:16 +00:00
Marco Neumann 15d2490642
chore: `cargo update` (#5574)
* chore: update `android_system_properties` and `iana-time-zone`

* chore: update `anyhow`

* chore: update `prettyplease`

* chore: update `block-buffer`

* chore: update `lz4`

* chore: update `miniz_oxide`

* chore: update `socket2`

* chore: update `ucd-trie`

* chore: update `which`

* chore: update `aho-corasick`

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-07 13:28:46 +00:00
Marco Neumann f8bb1378ad
chore: update to pprof 0.10.1 (#5578) 2022-09-07 13:15:23 +00:00
kodiakhq[bot] 2f2d09a4b9
Merge pull request #5576 from influxdata/dom/rskafka-bump
perf: use rskafka producer optimisations
2022-09-07 12:10:34 +00:00
Dom Dwyer 33b78eb5d2 build: bump rskafka
Update rskafka to HEAD, picking up:

d7e14a8 test: increase timeouts, CircleCI is slow
4e92ed2 refactor: replace `time` w/ `chrono`
c0ba668 fix: never leak flusher background tasks
786d6e1 refactor: move batch into producer mod
82862df perf: use RwLock for BroadcastOnce
e12c812 perf: async batch flushing & lock contention
ad126c5 test: increase timeouts
6565321 test: improve testing config
3379959 refactor: also invalidate broker cache when erroring on "unknown topic/partition"
14ae812 refactor: clarify binding mechanism
b59d9ad docs: fix spelling
e73fef5 test: increase timeouts
0dd1bda feat: introduce bind mode for partition client
a3633c6 fix: disable topic auto creation in tests
72c6dd2 fix: make redpanda happy
ae6df2e ci: bump redpanda version
a1ff3e5 chore: update Rust to 1.63
1ca7c5f ci: shellcheck
01a648b ci: yamllint
3248dd6 ci: check that versions are in-sync
ebf87b5 ci: run doctests
32c34ec fix: address deprecation warnings
0f6ad6c chore: fix `cargo bench -- --save-baseline`
2022-09-07 14:00:19 +02:00
kodiakhq[bot] 48f32dd282
Merge pull request #5575 from influxdata/dom/ingester-shard-connect
fix(ingester): connect to assigned Kafka partitions
2022-09-07 11:38:00 +00:00
kodiakhq[bot] 2b673d2eac
Merge branch 'main' into dom/ingester-shard-connect 2022-09-07 11:30:55 +00:00
Marco Neumann d33ecb2ea5
refactor: avoid double-hash in `AddressableHeap::insert` (#5562)
Instead of a naive "remove + insert", use a proper insertion routine
that touches the hash map only once.

Note that in case of an override (i.e. the entry with this key already
existed) we need to touch the heap twice, because the sort order likely
changed (we don't optimize the "identical order" case here because it is
pretty unlikely that this will happen in practice).

**Perf results:**

```text
insert_n_elements/0     time:   [16.489 ns 16.497 ns 16.506 ns]
                        change: [-8.1154% -7.9967% -7.8990%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
  1 (1.00%) low mild
  1 (1.00%) high mild
  2 (2.00%) high severe
insert_n_elements/1     time:   [59.806 ns 59.839 ns 59.875 ns]
                        change: [-14.241% -14.160% -14.086%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 8 outliers among 100 measurements (8.00%)
  7 (7.00%) high mild
  1 (1.00%) high severe
insert_n_elements/10    time:   [601.58 ns 602.26 ns 603.09 ns]
                        change: [-20.870% -20.714% -20.565%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
  4 (4.00%) high mild
  5 (5.00%) high severe
insert_n_elements/100   time:   [6.9096 µs 6.9161 µs 6.9246 µs]
                        change: [-18.759% -18.667% -18.553%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  4 (4.00%) high mild
  3 (3.00%) high severe
insert_n_elements/1000  time:   [107.71 µs 107.76 µs 107.82 µs]
                        change: [-14.564% -14.427% -14.295%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
  3 (3.00%) high mild
  1 (1.00%) high severe
insert_n_elements/10000 time:   [2.8642 ms 2.8700 ms 2.8765 ms]
                        change: [-11.079% -10.860% -10.605%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 15 outliers among 100 measurements (15.00%)
  15 (15.00%) high severe
```

Note that the results are even better for keys with more expensive hash
functions (we have a few in the querier).

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-07 11:30:40 +00:00
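The insertion routine described in the commit above (d33ecb2ea5) can be approximated with the standard `HashMap` entry API. This is a hedged sketch, not the IOx code: `SketchHeap` and its layout (a hash map plus a vector sorted by `(order, key)`) are assumptions. The hash map is touched once through `entry`, and on an override the sorted queue is touched twice (remove the old position, insert the new one), as the message explains.

```rust
use std::collections::hash_map::Entry;
use std::collections::HashMap;
use std::hash::Hash;

/// Hypothetical, simplified addressable heap.
pub struct SketchHeap<K, V, O> {
    entries: HashMap<K, (V, O)>,
    queue: Vec<(O, K)>,
}

impl<K, V, O> SketchHeap<K, V, O>
where
    K: Clone + Eq + Hash + Ord,
    O: Clone + Ord,
{
    pub fn new() -> Self {
        Self { entries: HashMap::new(), queue: Vec::new() }
    }

    /// Insert or override `k`; returns the previous value and order, if any.
    pub fn insert(&mut self, k: K, v: V, o: O) -> Option<(V, O)> {
        // One hash-map access, regardless of whether the key existed.
        let previous = match self.entries.entry(k.clone()) {
            Entry::Occupied(mut slot) => Some(slot.insert((v, o.clone()))),
            Entry::Vacant(slot) => {
                slot.insert((v, o.clone()));
                None
            }
        };

        // Override case: the old queue position has to go first.
        if let Some((_, old)) = &previous {
            let from = self
                .queue
                .binary_search_by(|(qo, qk)| (qo, qk).cmp(&(old, &k)))
                .expect("queue and map are in sync");
            self.queue.remove(from);
        }

        let to = self
            .queue
            .binary_search_by(|(qo, qk)| (qo, qk).cmp(&(&o, &k)))
            .unwrap_or_else(|p| p);
        self.queue.insert(to, (o, k));
        previous
    }

    /// Key with the smallest order.
    pub fn peek(&self) -> Option<&K> {
        self.queue.first().map(|(_, k)| k)
    }

    pub fn len(&self) -> usize {
        self.entries.len()
    }
}
```

The "remove + insert" alternative would hash the key once in `remove` and again in `insert`; the entry API folds both into a single lookup.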
Dom a57748d741
Merge branch 'main' into dom/ingester-shard-connect 2022-09-07 12:25:40 +01:00
Dom Dwyer d1ca29c029 fix(ingester): connect to assigned Kafka partition
During initialisation, the ingester connects to the Kafka brokers - this
involves per-partition leadership discovery & connection establishment.
These connections are then retained for the lifetime of the process.

Prior to this commit, the ingester would establish a connection to all
partition leaders for a given topic. After this commit, the ingester
connects to only the partition leaders it is going to consume from
(for those shards that it is assigned.)
2022-09-07 13:21:06 +02:00
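The behaviour change in the commit above (d1ca29c029) amounts to narrowing the set of partitions before connecting. A minimal sketch, assuming partitions are identified by `i32` indexes and that the shard assignment is available as a set; `partitions_to_connect` is a hypothetical helper, not an ingester or rskafka function:

```rust
use std::collections::BTreeSet;

/// Return only those topic partitions that this ingester is assigned,
/// preserving the order in which the topic lists them.
pub fn partitions_to_connect(topic_partitions: &[i32], assigned: &BTreeSet<i32>) -> Vec<i32> {
    topic_partitions
        .iter()
        .copied()
        .filter(|p| assigned.contains(p))
        .collect()
}
```

Before the change, the equivalent would have been to connect to every entry of `topic_partitions`, whether or not this ingester consumes from it.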
YIXIAO SHI fa6c26b38d
chore: fix comment typo (#5550)
Co-authored-by: Dom <dom@itsallbroken.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-07 08:57:34 +00:00
YIXIAO SHI 52ae60bf2e
chore: fix comment typo (#5551)
Co-authored-by: Dom <dom@itsallbroken.com>
2022-09-07 08:49:29 +00:00
Luke Bond 16f1b40efd
Merge pull request #5549 from influxdata/alamb/guilio-python-main
chore: changed `iox_shared` to `iox-shared`
2022-09-06 17:16:40 +01:00
Luke Bond a280acb860
Merge branch 'main' into alamb/guilio-python-main 2022-09-06 16:57:00 +01:00
Marco Neumann adeacf416c
ci: fix (#5569)
* ci: use same feature set in `build_dev` and `build_release`

* ci: also enable unstable tokio for `build_dev`

* chore: update tokio to 1.21 (to fix console-subscriber 0.1.8)

* fix: "must use"
2022-09-06 14:13:28 +00:00
dependabot[bot] b494c73cb3
chore(deps): Bump console-subscriber from 0.1.7 to 0.1.8 (#5558)
Bumps [console-subscriber](https://github.com/tokio-rs/console) from 0.1.7 to 0.1.8.
- [Release notes](https://github.com/tokio-rs/console/releases)
- [Commits](https://github.com/tokio-rs/console/compare/tokio-console-v0.1.7...console-subscriber-v0.1.8)

---
updated-dependencies:
- dependency-name: console-subscriber
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-06 12:46:07 +00:00
dependabot[bot] 6c9e9481d6
chore(deps): Bump sha2 from 0.10.3 to 0.10.5 (#5557)
Bumps [sha2](https://github.com/RustCrypto/hashes) from 0.10.3 to 0.10.5.
- [Release notes](https://github.com/RustCrypto/hashes/releases)
- [Commits](https://github.com/RustCrypto/hashes/compare/sha2-v0.10.3...sha2-v0.10.5)

---
updated-dependencies:
- dependency-name: sha2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-06 12:17:43 +00:00
Marco Neumann eea8bc7e40
refactor: do not always box `FunctionEstimator` function (#5568)
This is similar to #5530. Do not box the function because the user will
likely use it as `Box<dyn ResourceEstimator<...>>` or
`Arc<dyn ResourceEstimator<...>>`.

The slightly larger code change is because of the clean-up around `TestSize`
which helps with the doctests (and also we had two copies of that type).
2022-09-06 12:09:50 +00:00
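The reasoning in the commit above (eea8bc7e40) can be illustrated with a generic wrapper. The trait below is a simplified, hypothetical version of `ResourceEstimator` (two type parameters and a `usize` result), and `string_len_estimator` is made up for the example. The wrapper stores the closure by value as a type parameter `F`, so the only allocation and dynamic dispatch happen where the caller erases the type with `Arc<dyn ...>` or `Box<dyn ...>`.

```rust
use std::sync::Arc;

/// Simplified, hypothetical estimator trait.
pub trait ResourceEstimator<K, V> {
    fn consumption(&self, k: &K, v: &V) -> usize;
}

/// Wraps a plain function or closure. `F` is stored inline, not as
/// `Box<dyn Fn(..)>`, so there is no extra indirection inside the wrapper.
pub struct FunctionEstimator<F> {
    f: F,
}

impl<F> FunctionEstimator<F> {
    pub fn new(f: F) -> Self {
        Self { f }
    }
}

impl<K, V, F> ResourceEstimator<K, V> for FunctionEstimator<F>
where
    F: Fn(&K, &V) -> usize,
{
    fn consumption(&self, k: &K, v: &V) -> usize {
        (self.f)(k, v)
    }
}

/// Type erasure happens once, at the point where the caller wants it.
pub fn string_len_estimator() -> Arc<dyn ResourceEstimator<String, String>> {
    Arc::new(FunctionEstimator::new(|k: &String, v: &String| k.len() + v.len()))
}
```

Had the wrapper boxed the function internally, a caller holding `Arc<dyn ResourceEstimator<..>>` would go through two pointer hops per call instead of one.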
Andrew Lamb d3278ea490
fix: Update service_grpc_catalog/src/lib.rs
Co-authored-by: Marco Neumann <marco@crepererum.net>
2022-09-06 07:44:08 -04:00
dependabot[bot] 9f0b0328f7
chore(deps): Bump thiserror from 1.0.33 to 1.0.34 (#5556)
Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.33 to 1.0.34.
- [Release notes](https://github.com/dtolnay/thiserror/releases)
- [Commits](https://github.com/dtolnay/thiserror/compare/1.0.33...1.0.34)

---
updated-dependencies:
- dependency-name: thiserror
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-06 09:18:41 +00:00
Michael Angerman 11877b7ef5
refactor: remove the write_buffer crate dependency from ioxd_querier (#5563)
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-06 09:09:40 +00:00
dependabot[bot] 366c4d9965
chore(deps): Bump once_cell from 1.13.1 to 1.14.0 (#5555)
Bumps [once_cell](https://github.com/matklad/once_cell) from 1.13.1 to 1.14.0.
- [Release notes](https://github.com/matklad/once_cell/releases)
- [Changelog](https://github.com/matklad/once_cell/blob/master/CHANGELOG.md)
- [Commits](https://github.com/matklad/once_cell/compare/v1.13.1...v1.14.0)

---
updated-dependencies:
- dependency-name: once_cell
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2022-09-06 09:02:28 +00:00
Stuart Carnie bde26e11ab
feat: Teach IOx to parse `SHOW MEASUREMENTS` statement (#5430)
* feat: Add measurement expression and statement terminator combinators

* feat: Begin parsing SHOW MEASUREMENTS statements

* chore: Placate clippy

* feat: parse LIMIT clause

* feat: parse OFFSET clause

* chore: Clippy

* chore: Fix doc comments

* feat: Parse ORDER BY clause

* feat: Parse WHERE clause

* feat: Add Call expression

* chore: Clippy

* feat: parse WITH MEASUREMENT regex; WHERE clause

* chore: Test to validate regex not-equal operator is not supported

* chore: No need to be public

* chore: Remove invalid comment

* chore: PR Feedback – use `tag_no_case`

Also added `cut` to force Err::Failure state, as `ORDER BY` must be
followed by one of the specified rules.
2022-09-06 01:58:41 +00:00
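The last point of the commit above (bde26e11ab), case-insensitive keywords plus a hard failure once `ORDER BY` has been seen, can be sketched without any parser library. This is a hand-rolled illustration, not nom's `tag_no_case` or `cut`: `ParseError`, `keyword_no_case`, `Order`, and `order_by` are hypothetical, the grammar is reduced to `ORDER BY ASC|DESC`, and word boundaries are ignored for brevity.

```rust
/// Loosely modelled on the recoverable/unrecoverable split in parser
/// combinator libraries.
#[derive(Debug, PartialEq)]
pub enum ParseError {
    /// Recoverable: another alternative may still match.
    Error,
    /// Unrecoverable: stop trying alternatives and report the problem.
    Failure(&'static str),
}

#[derive(Debug, PartialEq)]
pub enum Order {
    Asc,
    Desc,
}

/// Skip leading whitespace, then match `keyword` ignoring ASCII case.
/// Returns the remaining input on success.
pub fn keyword_no_case<'a>(input: &'a str, keyword: &str) -> Result<&'a str, ParseError> {
    let input = input.trim_start();
    match input.get(..keyword.len()) {
        Some(head) if head.eq_ignore_ascii_case(keyword) => Ok(&input[keyword.len()..]),
        _ => Err(ParseError::Error),
    }
}

/// Parse an optional `ORDER BY ASC|DESC` clause (simplified grammar).
pub fn order_by(input: &str) -> Result<(Option<Order>, &str), ParseError> {
    // No `ORDER` keyword at all: the clause is simply absent.
    let rest = match keyword_no_case(input, "ORDER") {
        Ok(rest) => rest,
        Err(_) => return Ok((None, input)),
    };
    // From here on, errors are upgraded to failures: once `ORDER` was
    // seen, no other clause can match at this position.
    let rest = keyword_no_case(rest, "BY").map_err(|_| ParseError::Failure("expected BY"))?;
    if let Ok(rest) = keyword_no_case(rest, "ASC") {
        return Ok((Some(Order::Asc), rest));
    }
    if let Ok(rest) = keyword_no_case(rest, "DESC") {
        return Ok((Some(Order::Desc), rest));
    }
    Err(ParseError::Failure("expected ASC or DESC after ORDER BY"))
}
```

Upgrading the error after the keyword means the user sees a message about the malformed `ORDER BY` clause rather than a generic failure from whichever alternative is tried next.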
dependabot[bot] 3fbff263ca
chore(deps): Bump md-5 from 0.10.2 to 0.10.4 (#5554)
Bumps [md-5](https://github.com/RustCrypto/hashes) from 0.10.2 to 0.10.4.
- [Release notes](https://github.com/RustCrypto/hashes/releases)
- [Commits](https://github.com/RustCrypto/hashes/compare/md-5-v0.10.2...md-5-v0.10.4)

---
updated-dependencies:
- dependency-name: md-5
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-05 17:19:19 +00:00
Marco Neumann 87772a6aec
refactor: debug log improvements (#5553)
* feat: extend log output for ingester responses

* feat: add debug log for parquet `read_filter` calls

* feat: add debug log to `get_write_info`

* feat: add debug log parquet cache invalidation
2022-09-05 13:54:13 +00:00
Marco Neumann e6ff1db5fc
chore: update `cpufeatures` to 0.2.5 (#5559)
Version 0.2.3 was yanked.
2022-09-05 13:01:18 +00:00
Marco Neumann 064f0e9b29
refactor: use DataFusion to read parquet files (#5531)
Remove our own hand-rolled logic and let DataFusion read the parquet
files.

As a bonus, this now supports predicate pushdown to the deserialization
step, so we can use parquets as an in-mem buffer.

Note that this currently uses some "nested" DataFusion hack due to the
way the `QueryChunk` interface works. Midterm I'll change the interface
so that the `ParquetExec` nodes are directly visible to DataFusion
instead of some opaque `SendableRecordBatchStream`.
2022-09-05 09:25:04 +00:00
Marco Neumann f45cbfb88d
refactor: fine-grained file size mocking (#5541)
* refactor: do not override parquet file size in querier

This is going to be an issue when we actually rely on the size for
reading, see #5531.

* refactor: use selected file size mocking in compactor

Do not blindly override parquet file sizes for all subsystems.

This is going to be an issue when we actually rely on the size for
reading, see #5531.

* refactor: remove ability to override file sizes in catalog

Blindly overriding data for all subsystems is dangerous, because some
parts of our stack actually rely on the actual file size. See #5531.

* docs: explain `size_overrides`
2022-09-05 08:50:04 +00:00
Andrew Lamb 1e1d964fdb fix: Some other stragglers 2022-09-04 07:59:07 -04:00
Juul Christiaens 8b419ecd84 refactor: changed iox_shared to iox-shared
Changed iox_shared to iox-shared in the following files: update_catalog.rs, partition.rs, lib.rs (in the service_grpc_catalog folder) and lib.rs (in the service_grpc_object_store folder).
2022-09-04 07:59:07 -04:00