This commit implements a PartitionCache decorator over the
PartitionProvider abstraction.
When an ingester starts up, the internal data structures are empty and
are lazily initialised for each namespace / table / partition as they
are observed in the stream of DML ops.
This lazy initialisation includes resolving the Partition ID and last
persisted sequence number offset value from the catalog for each
partition in each table in each namespace for which an op is observed -
this occurs in the hot path, while blocking ingest for a shard.
Resolving each partition causes a catalog query; this can cause a spike
in queries against the catalog, also resulting in unnecessarily slow
ingester recovery - we're effectively lazily warming a cache of
PartitionData in the hot path!
Instead this cache can be used to pre-warm the N most recently created
partitions (which are likely to have ongoing writes) at startup to
eliminate the hot-path overhead and associated catalog queries.
NOTE: unlike most of the other hot-path queries, partition persist
offset resolution cannot be eliminated by changes to the Kafka wire
format.
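As a rough sketch of the decorator shape - the trait, type names and
crate choices below are illustrative assumptions, not the actual IOx
definitions:

```rust
use std::collections::HashMap;
use std::sync::Arc;

use async_trait::async_trait;
use parking_lot::Mutex;

// Placeholder types standing in for the real catalog types.
type TableId = i64;
type PartitionKey = Arc<str>;
#[derive(Debug)]
struct PartitionData; // resolved partition ID, persist offset, etc.

#[async_trait]
trait PartitionProvider: Send + Sync {
    async fn get_partition(&self, table_id: TableId, key: PartitionKey) -> PartitionData;
}

/// Read-through cache decorating any inner provider (e.g. the
/// catalog-backed impl), pre-warmed with recently created partitions.
struct PartitionCache<T> {
    inner: T,
    // Entries are removed on first use - each partition is only ever
    // resolved once per ingester lifetime.
    warm: Mutex<HashMap<(TableId, PartitionKey), PartitionData>>,
}

#[async_trait]
impl<T: PartitionProvider> PartitionProvider for PartitionCache<T> {
    async fn get_partition(&self, table_id: TableId, key: PartitionKey) -> PartitionData {
        // Serve a pre-warmed entry without a catalog query, falling
        // back to the inner provider (and thus the catalog) on a miss.
        if let Some(p) = self.warm.lock().remove(&(table_id, key.clone())) {
            return p;
        }
        self.inner.get_partition(table_id, key).await
    }
}
```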
Lifts the PartitionProvider initialisation higher in the stack to a
point where a single instance can be used across all shards an ingester
manages.
This is a pre-requisite for sharing a cache of Partitions across all
shards.
Marks many internal data structures as non-pub.
Many remain as they're used across tests / from multiple callers
"peeking", but this limits the scope of false sharing in the future.
Move the initialisation of ShardData (an internal ingester data
structure) into the ingester itself.
Previously callers would initialise the ingester state, and pass it into
the IngesterData constructor.
Removes the "how" of initialising a per-partition buffer structure
(PartitionData) from the per-table buffer (TableData).
This is a cleaner separation of concerns - a table buffer is responsible
for addressing and initialising per-table partitions as necessary, and
buffering of ops for them. It does not have to be concerned with the
series of steps necessary to look up the various bits of data in order
to construct a PartitionData.
This abstract provider can be layered up to provide more complex
behaviours - I intend to add a read-through cache impl that decorates
the catalog impl in this commit, which should eliminate most partition
queries at ingester startup utilising the indirection added here.
* chore: Upgrade to Rust 1.64
* fix: Use iter find instead of a for loop, thanks clippy
* fix: Remove some needless borrows, thanks clippy
* fix: Use then_some rather than then with a closure, thanks clippy
* fix: Use iter retain rather than filter collect, thanks clippy
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* refactor: do not run de-dup in ingester for querier requests
This removes the entire de-dup logic from the ingester for querier
requests. Furthermore, it even removes the entire datafusion execution
from the ingester and just dumps the in-memory record batches as quickly
as possible. No filters are applied. Note that even prior to this PR,
we've never applied projections (tracked by #5624).
**Pros:**
- speed up query planning within the querier (since we need the ingester
response for state reconciling)
- lowered ingester CPU load
**Cons:**
- more querier<>ingester network traffic
Closes #5602.
* test: extend query test case
* fix: ingester tests
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* fix: retry ingester requests faster
The retries introduced in #5695 are too slow and block the entire
querier for minutes (until the very long gRPC timeout kicks in).
* fix: add error details on why the query planning failed
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Changes many pub fields / methods to be pub(super), or if necessary,
pub(crate).
This helps maintain an internal API boundary for code hygiene, and helps
identify functions that are unused / only used in tests (which I've
annotated with cfg(test) and intend to remove - we should be driving
code under test via the public API rather than using test-only state
mutation, otherwise we're just testing our tests!)
This commit changes the prepare_data_to_querier() tests to drive the
ingester state by applying DML ops, therefore driving the prod code
paths (and testing them!) rather than having the tests set up what the
tests believe is the correct internal ingester state, and then asserting
on that state.
This gives us much better coverage of prod code paths, decouples the
tests from the internal state/representation of ingesters (making the
tests less fragile), and removes a bunch of special-cased, test-only
functions that are functionally similar, but not the same as, the prod
functions.
Unblocks #5658, further clean-up to come.
Instead of passing the ShardId into each function for child nodes of the
Shard, store it. This avoids the possibility of mistakenly passing the
wrong value.
A partition belongs to a table - this commit stores the table name in
the PartitionData (which was readily available at construction time)
instead of redundantly passing it into various functions at the risk of
getting it wrong.
When we construct a PartitionData we have the ShardId and TableId. This
commit stores them in the PartitionData for later use, rather than
repeatedly passing them in again when constructing snapshots, at the
risk of passing the wrong IDs.
Partition::snapshot_to_persisting() passes the ID of the partition it is
calling `snapshot_to_persisting()` on. The partition already knows what
its ID is, so at best it's redundant, and at worst, inconsistent with
the actual ID.
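Illustratively - placeholder types and fields below, not the actual
ingester structs:

```rust
use std::sync::Arc;

// Placeholder ID types standing in for the real definitions.
type PartitionId = i64;
type ShardId = i32;
type TableId = i64;

struct PartitionData {
    id: PartitionId,
    shard_id: ShardId,
    table_id: TableId,
    table_name: Arc<str>,
}

impl PartitionData {
    fn new(id: PartitionId, shard_id: ShardId, table_id: TableId, table_name: Arc<str>) -> Self {
        // Captured once at construction time...
        Self { id, shard_id, table_id, table_name }
    }

    // ...and read from self thereafter, so callers can no longer pass
    // an inconsistent shard / table / partition ID into each call.
    fn snapshot_to_persisting(&mut self) -> (PartitionId, ShardId, TableId) {
        (self.id, self.shard_id, self.table_id)
    }
}
```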
Changes the ingest code path to eliminate scanning the parquet_files
table to discover the last persisted offset per partition, instead
utilising the new persisted_sequence_number field on the Partition
itself to read the same value.
This lookup blocks ingest for the shard, so removing the expensive query
from the ingest hot path should improve catch-up time after a
restart/deployment.
Changes the ingester to record the per-partition, maximum persisted
sequencer offsets to the catalog. This will enable quick O(1) lookup in
the future, but the currently persisted value is only used to assert the
per-partition monotonic persist ordering invariant.
Assert the per-shard / per-partition persistence watermarks
monotonically increase, and document the invariant.
NOTE: this is not a new invariant, just a new assertion to validate it.
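A minimal sketch of the assertion, with hypothetical names:

```rust
type SequenceNumber = i64;

// Sketch of the invariant check: persist watermarks must only move
// forward. Field and method names are illustrative.
struct PartitionPersistState {
    max_persisted: Option<SequenceNumber>,
}

impl PartitionPersistState {
    fn mark_persisted(&mut self, offset: SequenceNumber) {
        if let Some(prev) = self.max_persisted {
            // Validates a pre-existing invariant; it does not
            // introduce a new one.
            assert!(
                offset > prev,
                "persist offsets must increase monotonically ({prev} -> {offset})"
            );
        }
        self.max_persisted = Some(offset);
    }
}
```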
The maximum persisted sequence number is tracked to answer "up to where
has this partition been persisted", used for querying and skipping
writes that have already been applied (though I suspect this is
redundant).
This is a property of the partition, not the actual data buffer, so this
commit hoists it up out of the data buffer and onto the per-partition
data structure, internalising the field in the process (not pub).
I would like to remove de-dup from the querier<>ingester query path in #5602,
but the tombstone application and parquet write path can still do the full work.
Changes the ingester to log ALL reasons for a partition being marked for
persistence, rather than just one of the 4 previously. Fields are
consistent, but verbose / repetitive.
Also cleans up some misleading messages like "updating ..." to be logged
only when the update actually takes place.
In our data model, a chunk always belongs to a partition[^1], so let's
not make this attribute optional. The optional value only leads to
-- mostly surprising -- conditional behavior, ranging from "do not equalize
the partition sort key" (querier) to "always consider the chunk overlapping"
(iox_query when dealing with ingester chunks).
[^1]: This is even true when the chunk belongs to a parquet file that is not
yet added to the catalog, contrary to what a comment in the ingester
stated. The catalog and data model used by the querier are two totally
different things.
* refactor: remove dead code
* refactor: `Deduplicator::build_scan_plan` consumes `self`
There is no good reason to use the same `Deduplicator` twice. In
contrast I'm quite sure that this would lead to nasty bugs, because
`split_overlapped_chunks` exits early in some cases so the 2nd plan
would have old and new chunks mixed together.
* ci: use same feature set in `build_dev` and `build_release`
* ci: also enable unstable tokio for `build_dev`
* chore: update tokio to 1.21 (to fix console-subscriber 0.1.8)
* fix: "must use"
This limit restricts a single partition to containing at most N rows
before it is marked for persistence (note: being marked for persistence
does not currently prevent further ingest for that partition).
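Sketched (names assumed):

```rust
// Minimal sketch of the check: mark the partition for persistence once
// the buffered row count reaches the configured limit. Because marking
// does not block further writes, the buffer can exceed max_rows before
// persistence actually runs.
fn should_persist(buffered_rows: usize, max_rows: usize) -> bool {
    buffered_rows >= max_rows
}
```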
* feat: initial implementation of memory estimation for a compaction
* feat: estimate size of files and have the right actions for the needed budget
* feat: run candidates in parallel
* fix: have the right name for the column field of the output struct
* feat: add metrics for estimated budgets
* chore: cleanup
* chore: Apply suggestions from code review
Co-authored-by: Carol (Nichols || Goulding) <193874+carols10cents@users.noreply.github.com>
* fix: fix syntax after applying review's suggestions
* refactor: Convert a Vec to VecDeque to go well with pop and push
* chore: remove max_concurrent_size_bytes and input_size_threshold_bytes
* chore: remove input_file_count_threshold
* test: tests for estimate_arrow_bytes_for_file
Co-authored-by: Carol (Nichols || Goulding) <193874+carols10cents@users.noreply.github.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* fix: do not lose data when Kafka reports that offset is above watermark
This can happen in certain cluster rebalance settings.
This is also linked to https://github.com/influxdata/rskafka/issues/147
but for the upstream issue I currently have no idea how to fix it, so
let's at least harden IOx against it.
Fixes #5128.
* refactor: panic for `SequenceNumberAfterWatermark`
* chore: return a struct with named and documented fields from `compact_persisting_batch`
* docs: Remove extra 'the' and fix a typo
Co-authored-by: Carol (Nichols || Goulding) <carol.nichols@gmail.com>
* feat: `QueryChunk::as_any`
* feat: allow `ChunkPruner::prune_chunks` to fail
* feat: limit per-table chunk data for every query
Closes #5211.
* fix: address review comments
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* refactor: remove min_sequence_number
* fix: typos
* fix: remove min_sequencer_number from new files from merging main
* fix: add back throwing error if the compactor compacts files persisted by the ingester after the ingester sends max seq_num back to querier
* test: add test_compactor_collision back but modify the input to make it work with new changes
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
- remove `IOxSessionContext::default()` because untracked contexts
should only be created by tests
- remove `Option<IOxSessionContext>` because it is a typed workaround
for `IOxSessionContext::default`
Tests should use `IOxSessionContext::testing` and all _normal_ users
should create proper contexts.
I suspect this will help tracing or at least prevent silent regressions.
See #5129.
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* refactor: move parquet chunk ID to `ChunkMeta`
* refactor: return `Arc` from `QueryChunk::summary`
This is similar to how we handle other chunk data like schemas. This
allows a chunk to change/refine its "belief" over its own payload while
it is passed around in the query stack.
Helps w/ #5032.
* test: add test for timestamps in kafka write buffer
* refactor: move timestamp batching test to generic tests
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Instead of using some hand-rolled timestamp-based logic (or just
"unknown") all over the place, just use logic introduced in #5017.
This requires slightly improved table summaries within the querier that
at least have min/max for the timestamp column. For that, the former
`IngesterChunk`-specific `calculate_summary` method was extended to
`create_basic_summary` to include that data and is now also used by
`QuerierParquetChunk`.
Note: `QuerierRBChunk` already has detailed metrics that are provided
by the read buffer implementation.
Should we ever need even better pruning for `QuerierParquetChunk` (or
`IngesterChunk`) then we _only_ need to add extra data to the table
summaries.
Closes #4976.
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Went through and replaced all lazy_static uses with once_cell (while waiting for the project to compile). There are still dependencies using lazy_static so it is still in the crate graph, but at least there isn't an explicit dependency on it (and it is easier to update to `std::lazy::Lazy` once that is stable).
* refactor: change level 1 to level 2 preparing for next design changes
* fix: make level-2 consistent everywhere
* chore: remove unused comments
* refactor: change all the names from level_1 to level_2 to completely replace 1 with 2 and make everything consistent
* chore: add corresponding constants for the compaction levels in the comments
Co-authored-by: Dom <dom@itsallbroken.com>
* fix: avoid using min_time, which can be negative, for ChunkId; use the object store id, which is a UUID, instead
* chore: Apply suggestions from code review
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* chore: run fmt
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
The ingester maintains a rough "total memory in use" counter it uses to
try and limit the amount of memory the ingester is using overall.
When a partition is persisted, this total memory usage value is adjusted
to account for releasing the partition memory. Prior to this commit, the
ordering was:
* Writes increase the memory counter
* maybe_persist() is called to trigger persistence
* A partition is identified for persistence
* Partition memory usage is released back to the total memory counter
* Persistence starts
This meant that the partitions in the process of being persisted were
not accounted for in the ingester's total memory counter, and therefore
we could significantly overrun the configured memory limit.
After this commit, the ordering is:
* Writes increase the memory counter
* maybe_persist() is called to trigger persistence
* A partition is identified for persistence
* Persistence starts
* Persistence completes
* Partition memory usage is released back to the total memory counter
This ensures persisting partitions are still tracked in the total memory
counter, causing pauses to correctly fire.
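A sketch of the corrected ordering - the types and helpers here are
hypothetical stand-ins for the real ingester code:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

struct PartitionData;
impl PartitionData {
    fn memory_usage(&self) -> usize {
        0 // stand-in; the real type tracks its buffered bytes
    }
}

async fn persist(_partition: PartitionData) {
    // write parquet, update the catalog, etc.
}

// The partition's memory stays accounted for in the total counter
// until persistence has completed.
async fn persist_and_release(total_memory: &AtomicUsize, partition: PartitionData) {
    let freed = partition.memory_usage();

    persist(partition).await; // persistence starts AND completes first...

    // ...and only then is the memory released back to the counter.
    total_memory.fetch_sub(freed, Ordering::Relaxed);
}
```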
Changes the ingester to use the partition key derived in the router and
transmitted through the Kafka API boundary.
This should have no observable behavioural change, but be more resilient
as we're no longer assuming the partitioning algorithm produces the same
value in both the router (where data is partitioned) and the ingester
(where data is persisted, segregated by partition key).
This is a pre-requisite to allowing the user to specify partitioning
schemes.
* refactor: store per-file column set in catalog
Together with the table-wide schema and the partition-wide sort key, this should
be everything we need to read a parquet file directly into memory
without peeking any file-level metadata.
The querier will use this to directly load parquet files into the read
buffer.
**WARNING: This requires a catalog wipe!**
Ref #4124.
* refactor: use proper `ColumnSet` type
* refactor: stream partitions from ingester
Ref #4849.
* refactor: do not collect record batches on the ingester side
Ref #4849.
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* refactor: use new ingester<>querier wire protocol
Use and document the new and more flexible ingester<>querier wire
protocol.
Note that the ingester does NOT stream the response data yet, but the
internal data structures would allow that. A follow-up change will
adjust the ingester code to stream the data.
Ref #4849.
* fix: typos
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* refactor: clarify naming and public interface
* test: add schema assertion to `ingester_response_to_record_batches`
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* refactor: prepare new ingester<>querier protocol on the querier side
This changes the querier internals to work with the new protocol. The
wire protocol stays the same (for now). There's a (somewhat hackish)
adapter in place on the querier side that converts the old to the new
protocol on-the-fly. This is an intermediate step before we actually
change the wire protocol (and in a step after that also take advantage
of the new possibilities on the ingester side).
Ref #4849.
* docs: explain adapter
* chore: TEMP Update DataFusion to pre-release
* chore: update arrow et al to 16.0.0
* chore: Run cargo hakari tasks
* fix: update reader read_dictionary API
* chore: Update to real Datafusion release
* fix: Update parquet API
* fix: update test
Co-authored-by: CircleCI[bot] <circleci@influxdata.com>
This commit changes the code base to use a new reference-counted
PartitionKey type wrapper, instead of passing a bare String around.
This allows the compiler to type check & verify usage of the partition
key, instead of passing a bare string around. By reference counting the
underlying string, we reduce memory usage for some use cases.
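A minimal sketch of the wrapper, assuming an Arc<str>-backed newtype;
the real type may differ in detail:

```rust
use std::sync::Arc;

// Newtype so the compiler distinguishes partition keys from arbitrary
// strings; cloning is a cheap refcount bump, not a full string copy.
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
pub struct PartitionKey(Arc<str>);

impl From<&str> for PartitionKey {
    fn from(s: &str) -> Self {
        Self(Arc::from(s))
    }
}
```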
* feat: Change data type of catalog Postgres partition's sort_key from a string to an array of strings
* test: add column with comma
* fix: use new protobuf field to avoid incompatibility
* fix: ensure sort_key is an empty array rather than NULL
* refactor: address review comments
* refactor: address more comments
* chore: clearer comments
* chore: Update iox_catalog/migrations/20220607102200_change_sort_key_type_to_array.sql
* chore: Update iox_catalog/migrations/20220607102200_change_sort_key_type_to_array.sql
* fix: Rename migration so it will be applied after
Co-authored-by: Marko Mikulicic <mkm@influxdata.com>
* fix: do not return readable until a write is completely readable
* docs: Add diagram with partially buffered write
* refactor: account for actively buffering during update rather than fixup
* fix: fixup
* fix: use checked_sub
Co-authored-by: Marco Neumann <marco@crepererum.net>
* fix: checked_sub calculation
Co-authored-by: Marco Neumann <marco@crepererum.net>
Reduces memory usage in the ingester during persist operations by
streaming the results of the snapshot merge/sort/dedupe directly to
the parquet file.
Prior to this commit the output of the compact was buffered in memory
before being written to the parquet file.
* test: "optimize" ingesterrecord batches in query tests
It seems that I had the right idea in #4656 but wasn't able to trigger
https://github.com/influxdata/conductor/issues/955 because the query
tests do not "optimize" the record batches in the same way the actual
gRPC implementation does. If we apply the same transformation we indeed
end up with the same error.
* fix: all batches within the ingester flight response must have same schema
* refactor: simplify and reuse code
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* feat: enable debugging of failed querier->ingester requests
- extend `query-ingester` CLI to allow usage of predicates
- on failed requests: log all the information required for the CLI
- test the "ingester fails" scenario
* test: explain
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* docs: improve
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* refactor: move b64 pred. serde into a single crate
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Removes the min/max timestamp fields from the IoxMetadata proto
structure embedded within a Parquet file's metadata.
These values are redundant as they already exist within the Parquet
column statistics, and precluded streaming serialisation as these
removed min/max values were needed before serialising the file.
Remove the redundant row_count from the IoxMetadata structure that is
serialised into the Parquet file.
The reasoning is twofold:
* The Parquet file's native metadata already contains a row count
* Needing to know the number of rows up-front precludes streaming
Ok, so... this needed lots of... channels. Channels everywhere.
The stream method on TestWriteBufferStreamHandler previously assumed it
would only be called once. In a test where reset_to_earliest is called,
stream might be called again to get the reset stream.
We want to be able to control which of the streams gets which
operations, so that's why the macro now takes a vec of vec of
operations - one vec of operations per expected call to stream, and the
stream will send all the operations in its vec.
The test thread needs to wait for the handler stream to consume the last
item from the last receiver stream, so when the
TestWriteBufferStreamHandler has set up the last expected call to
stream, pass back the last transmitter and have it wait until it's at
full expected capacity (which means all operations have been consumed by
the receiver).
The default behavior of the ingester is to panic if the min unpersisted
sequence number in the catalog is unknown to the write buffer due to the
retention policies having evicted that sequence number.
Specifying `--skip-to-oldest-available` changes this behavior to skip to
the oldest sequence number the write buffer does have available and go
from there.
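Sketched below with an assumed trait shape and error type -
`reset_to_earliest` is the real operation being invoked:

```rust
use async_trait::async_trait;

#[async_trait]
trait WriteBufferStreamHandler {
    async fn seek(&mut self, sequence_number: u64) -> Result<(), String>;
    async fn reset_to_earliest(&mut self);
}

async fn seek_or_skip(
    handler: &mut dyn WriteBufferStreamHandler,
    min_unpersisted: u64,
    skip_to_oldest_available: bool,
) {
    match handler.seek(min_unpersisted).await {
        Ok(()) => {}
        // The write buffer evicted this offset under its retention policy.
        Err(e) if skip_to_oldest_available => {
            eprintln!("sequence number {min_unpersisted} unavailable ({e}), skipping to earliest");
            handler.reset_to_earliest().await;
        }
        Err(e) => panic!("cannot seek to {min_unpersisted}: {e}"),
    }
}
```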
Fixes #4624.
Implements an upload() method on the ParquetStorage type, consuming a
stream of RecordBatch, serialising the Parquet file, and uploading the
result to object storage. Returns the IOx-specific file metadata.
Currently while the upload() method accepts a stream of RecordBatch, the
actual resulting Parquet file is buffered in memory before uploading to
object store, due to lack of streaming upload functionality in the
ObjectStore abstraction - this isn't the end of the world, as the files
tend to be relatively small with our current usage.
This impl should be easily modified to be fully streaming once streaming
object store puts are implemented:
https://github.com/influxdata/object_store_rs/issues/9
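The serialisation step looks roughly like this - a sketch with
simplified names and error handling, not the actual IOx API:

```rust
use arrow::datatypes::SchemaRef;
use arrow::record_batch::RecordBatch;
use futures::{Stream, StreamExt};
use parquet::arrow::ArrowWriter;

// Drain the stream into an in-memory Parquet buffer; the caller hands
// the whole buffer to the object store in a single put, buffered until
// streaming puts land in the ObjectStore abstraction (see the linked
// object_store_rs issue).
async fn serialise_to_parquet(
    mut batches: impl Stream<Item = RecordBatch> + Unpin,
    schema: SchemaRef,
) -> Vec<u8> {
    let mut buf = Vec::new();
    let mut writer = ArrowWriter::try_new(&mut buf, schema, None).expect("invalid schema");
    while let Some(batch) = batches.next().await {
        writer.write(&batch).expect("write batch");
    }
    writer.close().expect("finalise parquet footer");
    buf
}
```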
Changes the code paths that interact with Parquet files in the object
store to reference the ParquetStorage directly (DRY refactor).
This change takes us from a dependency graph of:
┌─────────────────┐
│ │
▼ │
Parquet Consumer │
│ ┌──────────────┐
├────────▶│ParquetStorage│
▼ └──────────────┘
┌──────────────┐
│ ObjectStore │
└──────────────┘
│
┌────┴────┐
▼ ▼
File s3
System (etc)
to:
Parquet Consumer
│
▼
┌──────────────┐
│ParquetStorage│
└──────────────┘
│
▼
┌──────────────┐
│ ObjectStore │
└──────────────┘
│
┌────┴────┐
▼ ▼
File s3
System (etc)
With the ParquetStorage being solely responsible for managing
interactions with the object store when dealing with Parquet files.
Renames the Storage type so the context is clear in usage (i.e. fn
args), rather than having to rely on knowing the fully-qualified import
path to know what the type stores.
* ci: fix cargo deny
* chore: downgrade `socket2`, version 0.4.5 was yanked
* chore: rename `query` to `iox_query`
`query` is already taken on crates.io and yanked and I am getting tired
of working around that.
Emit a TRACE level log containing the op offset & other helpful fields.
This will allow us to identify which messages were last successfully
decoded, and which caused errors, so we can pull them out for analysis.
Adds a histogram metric "flight_query_duration_ms" that records the
duration a flight RPC query takes to complete. Broken down by query
result (success/error).
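Roughly (the metric recording call below is a stand-in, not the actual
IOx `metric` crate API):

```rust
use std::time::Instant;

// Stand-in for recording into "flight_query_duration_ms",
// labelled by query result.
fn record_query_duration(result: &str, millis: u64) {
    let _ = (result, millis);
}

async fn do_get_instrumented<T, E>(
    query: impl std::future::Future<Output = Result<T, E>>,
) -> Result<T, E> {
    let start = Instant::now();
    let res = query.await;
    let outcome = if res.is_ok() { "success" } else { "error" };
    record_query_duration(outcome, start.elapsed().as_millis() as u64);
    res
}
```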
These were found by iterating over all of the dependencies of each
Cargo.toml, then grepping that crate for the dependency's name. If it
didn't show up, I attempted to remove it.
I left a few dependencies that this process flagged:
* generated_types
- `pbjson`,`serde`. Apparently used by the generated code.
* grpc-router-test-gen
- `prost`. Apparently used by the generated code.
* influxdb_iox
- `heappy`. Doesn't appear used, but is behind enough feature
flags that I don't care to reason about and it's already optional.
- `tikv_jemalloc_sys`. Appears to be setting a feature flag of an
indirect dependency.
* iox_gitops_adapter
- `k8s_openapi`. Appears to be setting a feature flag of an indirect
dependency.
* chore: Tool for automating arrow version update
* chore: Update datafusion and arrow/parquet/arrow-flight
* fix: update for changes in Arrow API
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* feat: use stored sort key to deduplicate data
* refactor: verify if one is a super sort key of the other
* test: unit tests for scan and deduplication plans
* fix: typo
* refactor: refactor and add comments
* feat: cache partition sort key to read during planning as needed
* test: tests for query plans with different overlap groups
* chore: cleanup
* chore: resolve merge conflicts
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
* refactor: improve `IngesterData` public interface
* feat: impl `Debug` for `Test{Namespace,Sequencer}`
* refactor: trait interface for `LifecycleHandle`
This is required to mock the lifecycle for query tests.
* refactor: trait for partitioner
* feat: add per kafka partition durability reporting to write info response
* fix: buf lint + test cleanup
* fix: clean up protobuf
* refactor: pull out conversion of KafkaPartitionStatus into a function
* fix: fmt
* fix: typo
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Attaching the "batch => partition" mapping via per-batch schema KV
metadata does NOT work because flight will transmit the schema once for
all batches (even though on the Rust side we have a schema ref attached
to every batch, probably for convenience). Instead we now use the same
global protobuf metadata that we also use for the "partition => max
sequence number" information. This somewhat limits our ability to create
record batches lazily on the ingester side (since the global metadata is
sent before any actual payload) but I think we should not modify the
usage of the flight protocol too much right now (e.g. by sending more
schema messages). If this becomes an issue, we can always find a more
complex solution in the future.
* fix: return "not found" gRPC error instead of "internal" when ingester does not know table
* fix: properly handle "namespace not found" in ingester queries
* fix: make `initialize_db` work with async code
* test: add custom step for NG tests
* fix: handle "unknown table/namespace" resp. in querier
* docs: explain test setup
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* refactor: querier<>ingester flight protocol adjustments
This makes a few adjustments to the querier<>ingester flight protocol.
Query Scope
===========
The querier will request data for ALL sequencer IDs for now. There is
no reason to have a request per sequencer ID. We can add a range/set
filter later if we want, but this is not required for now.
Partition-level
===============
The only time when the querier cares about sequencer IDs (i.e. sharding)
at all is when it selects which ingesters to ask for unpersisted data
(this is currently not implemented, it just asks all ingesters).
Afterwards the querier only cares about partitions (which are bound to
specific sequencers anyways) because this is the level where parquet
file persistence and compaction as well as deduplication happen. So we
make partitions a first-class citizen in the ingester response.
Metadata VS RecordBatches
=========================
The global app-metadata will list all partitions and their max
persisted parquet files and tombstones (theoretically tombstones are at
table-level, but the ingester could in the future break them down to the
partition-level). Then it receives a stream of record batches. Each
record batch is tagged (via key-value metadata in its schema) so it can
be assigned to a partition. At the moment the ingester returns 0 or 1
batches per unpersisted partition (0 in case we've filtered out all the
data via the predicate), but in the future it is free to return multiple
batches. This setup gives the ingester more freedom over memory
management and (potentially parallel) query processing, while at the
same time keeps the set of duplicated information minimal and allows
easy extensions (since the global metadata is a full-blown protobuf
message).
Querier
=======
At the moment the querier ignores all the metadata. Follow-up PRs will
change that.
* docs: improve
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
* refactor: make code clearer
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Removes the old stream_in_sequenced_entries() write buffer handler,
replacing it with the SequencedStreamHandler introduced in #4203.
This change will affect the metrics emitted by an ingester as outlined
in #4243.
Removes the Sync bound from the SequencedStreamHandler input stream type, as the
BoxStream returned by the WriteBufferStreamHandler is not Sync.
This change means the SequencedStreamHandler is not Sync either, but is
still Send and therefore can be moved into tokio tasks.
This commit adds an adaptor (IngestSinkAdaptor) that provides a DmlSink
implementation for the existing write path (IngesterData). With this,
the existing write path becomes compatible with the new
op stream handler (SequencedStreamHandler).
This commit adds the SinkInstrumentation type that decorates an inner
DmlSink with call latency and write buffer metrics.
The write buffer / sink call metrics may be split apart into two
separate responsibilities in the future if there are multiple DmlSink
implementations that need instrumentation, but adding more types is
deferred until it is needed.
* feat: Add `SequencerProgress` reporting to ingester
* refactor: Use KafkaPartition in write_summary
* fix: Update docstrings
* refactor: Change ingester to use KafkaPartition everywhere
* refactor: add SequencerProgress::combine
* refactor: return new SequencerProgress rather than updating
* fix: distinguish between yes/no/unknown in WriteSummary
* docs: Update data_types2/src/lib.rs
Co-authored-by: Paul Dix <paul@pauldix.net>
Co-authored-by: Paul Dix <paul@pauldix.net>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Pass the sort key from the catalog through to compact_persisting_batch.
If the sort key is Some, use that. If the sort key is None, compute it
from the data's cardinality with compute_sort_key.
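Sketched, with `compute_sort_key` named in the text and the surrounding
types as placeholders:

```rust
struct SortKey(Vec<String>);
struct Batch; // stands in for the data being persisted

fn compute_sort_key(_batch: &Batch) -> SortKey {
    SortKey(vec![]) // derive from the data's column cardinalities
}

fn resolve_sort_key(catalog_sort_key: Option<SortKey>, batch: &Batch) -> SortKey {
    match catalog_sort_key {
        Some(key) => key,                // stored catalog sort key wins
        None => compute_sort_key(batch), // otherwise compute from the data
    }
}
```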
Connects to #4196.
Adds the PeriodicWatermarkFetcher type responsible for querying write
buffer / Kafka for the maximum sequence number / offset, surfacing any
errors via both logs & metrics.
This high watermark / max offset value is used within the ingest
instrumentation metrics. This use case is tolerant of caching / stale
values, and as such the value is periodically updated to minimise load
on the write buffer.
Instruments the SequencedStreamHandler with a series of new metrics that
record the various error classes observable in the stream handler.
These metrics are labelled with potential_data_loss=true where relevant
to surface potential data loss events for alerting & further review.
Refactors the stream_in_sequenced_entries() into a new impl in the
SequencedStreamHandler type, decoupling the reading / decoding of ops
from Kafka (and associated error handling) from the "what happens to
those ops" concern to ease testing, encapsulate the specifics of "how to
get an op" and improve flexibility.
This is intended to provide robust error handling within what is
reasonably possible (unexpected errors are always unexpected!) while
retaining the existing metrics and functionality. I've also separated
out code that exists in the current impl specifically to drive tests
from the prod code path, instead driving those behaviours through mocks.
As of this commit, the handler is not used - this commit simply adds the
new impl.
Fix the ingester to track the max persisted sequence number per partition.
Ensure replay takes in data from unpersisted partitions.
Simplify the table persist info to not return a max persisted sequence number for the table as that information isn't needed.
Min/max values and distinct counts are already optional, so let's make
the null counts optional as well. This will be helpful for NG to deal w/
partial statistics (e.g. we only populate stats for the time column).
Note that the total count is still mandatory, but we normally have the
chunk/file-level row count at hand.
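Illustratively (not the exact IOx statistics types):

```rust
// Min/max, distinct and null counts are all optional now; the total
// count stays mandatory because the chunk/file-level row count is
// normally known.
struct StatValues<T> {
    min: Option<T>,
    max: Option<T>,
    distinct_count: Option<u64>,
    null_count: Option<u64>, // previously mandatory
    total_count: u64,
}
```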
Set to_delete to the time the file was marked as deleted rather than
true.
Fixes #4059.
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
The sort key is optional and currently only produced by `iox_tests`.
Writing it within the ingester/compactor is tracked by #3968. The sort
key is read by the querier (and this will be verified by the query tests
and is required to merge #4103).
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Removed some unnecessary tests as they no longer apply with the new buffer structure. This will hopefully reduce the memory footprint of the ingesters significantly.
Closes #4072
This makes it way easier to dyn-type database implementations. The only
real change is that we make `QueryChunk::Error` opaque. Nobody is going
to inspect that anyways, it's just printed to the user.
This is a follow-up of #4053.
Ref #3934.
Emit a counter metric "ingest_paused_duration_ms_total" that records the
duration of time an ingester stream is paused with millisecond
granularity.
This metric will allow us to measure the frequency and severity of, and
alert on, an ingester stopping ingest due to memory limits enforced by
the LifecycleManager. This will help us tune these config params.
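A hypothetical sketch of how the paused duration is accumulated (the
real lifecycle manager and metric types differ):

```rust
use std::time::{Duration, Instant};

trait LifecycleHandle {
    fn can_resume(&self) -> bool;
}

struct PausedCounter; // stands in for "ingest_paused_duration_ms_total"
impl PausedCounter {
    fn inc(&self, _millis: u64) {}
}

async fn pause_until_resumed(lifecycle: &impl LifecycleHandle, paused_ms: &PausedCounter) {
    let started = Instant::now();
    while !lifecycle.can_resume() {
        tokio::time::sleep(Duration::from_millis(100)).await;
    }
    // A counter, so repeated pauses accumulate into a total duration.
    paused_ms.inc(started.elapsed().as_millis() as u64);
}
```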