influxdb

Commit Graph

Author	SHA1	Message	Date
Carol (Nichols \|\| Goulding)	1c7cbaf5ae	refactor: Use DurationHistogram in more places	2022-06-09 14:20:51 -04:00
Marco Neumann	4e5842dec7	feat: expose hit-miss metrics for querier caches (#4811 ) * feat: `MetricsCache` * feat: expose hit-miss metrics for querier caches * refactor: `MetricsCache` -> `CacheWithMetrics` Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-09 13:07:40 +00:00
Andrew Lamb	2ec7764fdd	refactor: rename builder like predicate methods to be `with_` (#4808 ) * refactor: rename builder like predicate methods to be `with_` * fix: merge conflict Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-09 11:26:03 +00:00
Andrew Lamb	5e4fcfaa4d	refactor: reduce mut usage in Predicate (#4807 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-09 10:46:01 +00:00
Andrew Lamb	afc1c12062	refactor: consolidate `PredicateBuilder` into `Predicate` (#4799 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-08 12:21:24 +00:00
Marco Neumann	317e9486df	refactor: make `Cache` a trait (#4802 ) * refactor: make `Cache` a trait To insert more high-level metrics (e.g. cache misses/hits) it would be helpful if we could easily instrument the layer right above the cache driver (that combines the backend and the loader). To do that without polluting the types too much, let's introduce a trait that describes the driver interface and that we could later wrap with intrumentation. This also pulls out the test into a generic setup, similar to how this is done for the cache storage backends. This does NOT include any functionality changes. * fix: typo Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-06-08 11:04:53 +00:00
Marco Neumann	4509e3db57	feat: wire up RB metrics for querier chunks	2022-06-07 15:31:49 +02:00
Andrew Lamb	8e96a2721d	chore: Update datafusion (again) (#4788 ) * chore: Update datafusion * chore: Update imports * refactor: update API usage * refactor: clean up some uses of binary_expr * fix: remove unused export * fix: update explain output * chore: update more explain tests Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-07 08:17:56 +00:00
dependabot[bot]	04c685b3b7	chore(deps): Bump tokio-util from 0.7.2 to 0.7.3 (#4784 ) Bumps [tokio-util](https://github.com/tokio-rs/tokio) from 0.7.2 to 0.7.3. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-util-0.7.2...tokio-util-0.7.3) --- updated-dependencies: - dependency-name: tokio-util dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-06-06 14:46:27 +00:00
dependabot[bot]	e03bf94420	chore(deps): Bump tokio from 1.18.2 to 1.19.1 (#4783 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.18.2 to 1.19.1. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.18.2...tokio-1.19.1) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-06 14:15:12 +00:00
Carol (Nichols \|\| Goulding)	5af0cc6acf	fix: Handle read buffer column not existing for column_names in QueryChunk impl	2022-06-03 12:45:16 -04:00
Carol (Nichols \|\| Goulding)	aa510ae4e6	fix: Remove test uses of parquet chunks and document as unused The querier is now using read buffer chunks only, but we're leaving the parquet chunk code around for the moment.	2022-06-03 09:16:04 -04:00
Carol (Nichols \|\| Goulding)	a4f51d99f6	feat: Use the read buffer chunk cache in the querier	2022-06-03 09:16:04 -04:00
Andrew Lamb	3592aa52d8	chore: Update datafusion + `arrow`/`parquet`/`arrow-flight` to `15.0.0` (#4743 ) * chore: Update datafusion + `arrow`/`parquet`/`arrow-flight` to `15.0.0` * chore: Update APIs * chore: Run cargo hakari tasks * feat: normalize parquet file metadata * chore: update size tests * chore: add docs on metadata stripping * chore: TEMP UPDATE TO DF BRANCH * chore: Update for new API * fix: Update to latest DF * fix: cargo hakari Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: Raphael Taylor-Davies <r.taylordavies@googlemail.com>	2022-06-03 10:32:26 +00:00
dependabot[bot]	9a21292db8	chore(deps): Bump async-trait from 0.1.53 to 0.1.56 (#4774 ) Bumps [async-trait](https://github.com/dtolnay/async-trait) from 0.1.53 to 0.1.56. - [Release notes](https://github.com/dtolnay/async-trait/releases) - [Commits](https://github.com/dtolnay/async-trait/compare/0.1.53...0.1.56) --- updated-dependencies: - dependency-name: async-trait dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-06-03 09:10:40 +00:00
Carol (Nichols \|\| Goulding)	9d9c5d3692	fix: Take backoff config as an argument to be consistent with the other caches	2022-06-02 09:50:48 -04:00
Carol (Nichols \|\| Goulding)	76b40ac6a1	refactor: Make the type alias into a struct	2022-06-02 09:26:11 -04:00
Carol (Nichols \|\| Goulding)	715c65dfef	docs: Clarify a comment about what is an Arc	2022-06-02 09:22:44 -04:00
Carol (Nichols \|\| Goulding)	879dd7cec4	test: LRU behavior of the read buffer chunk cache	2022-06-02 09:22:44 -04:00
Carol (Nichols \|\| Goulding)	9328ba8c45	feat: Use new extra loading info to load read buffer chunks into cache	2022-06-02 09:22:44 -04:00
Carol (Nichols \|\| Goulding)	054c25de50	refactor: Add more methods to DecodedParquetFile I'm tired of trying to remember which info is on which metadata.	2022-06-02 09:22:44 -04:00
Marco Neumann	9e30a3eb29	refactor: rework querier concurrency limiting (#4760 ) * refactor: rework querier concurrency limiting With #4752 we introduced a concurrency limit into the querier. It works by drawing permits from a central semaphore whenever we create a `QuerierNamespace`. This however only limits concurrency during query planning and not query execution, because the objects contained within the plan (chunks and some metadata) neither reference the permit nor the `QuerierNamespace`. Now one approach to fix that would be to wire up the permit all the down into all the query-related data structures. This however is very fiddly and potentially will get lost at some point, because as soon as we transform these data structures -- e.g. into streams -- the permit might get lost again. This will be potentially query-dependent and very hard to debug. So instead we reverse the approach and track the permits at the upper layer of the stack: the gRPC service entry points. There we also need to be careful -- e.g. when we return streams to tonic -- but it's way easier to review that then the deeply nested object hierarchy that is involved with queries. Also the separation of concerns is a bit clearer, because why would a "chunk" care about the "query concurrency" as a whole. * refactor: improve gRPC permit keeping and prepare tests	2022-06-02 09:49:58 +00:00
Carol (Nichols \|\| Goulding)	37347f2389	feat: Add an Extra type to Cacher Loader to specify extra information for loading entries	2022-06-01 08:58:19 -04:00
Andrew Lamb	2886149afc	chore: naming / comment cleanups from namespace semaphore (#4753 )	2022-06-01 12:46:38 +00:00
Marco Neumann	ebeccf037c	feat: limit querier concurrency by limiting number of active namespaces (#4752 ) This is a rather quick fix for prod. On the mid-term we probably wanna rethink our deployment strategy, e.g. by using "one query per pod" and by deploying queryd w/ IOx into the same pod.	2022-06-01 11:59:35 +00:00
Andrew Lamb	d0903b11bb	refactor: reduce test duplication in `querier/src/table/mod.rs` (#4698 ) * refactor: reduce test duplication in `querier/src/table/mod.rs` * fix: Apply suggestions from code review Co-authored-by: Jake Goulding <jake.goulding@integer32.com> * fix: Update querier/src/table/test_util.rs Co-authored-by: Jake Goulding <jake.goulding@integer32.com> * fix: use now_nanos() * refactor: Add TestQuerierTable * refactor: rename functions for explicitness Co-authored-by: Jake Goulding <jake.goulding@integer32.com>	2022-05-30 12:56:09 +00:00
Carol (Nichols \|\| Goulding)	55cd8d15be	fix: Update method name to specify the kind of chunk it makes	2022-05-27 13:04:24 -04:00
Carol (Nichols \|\| Goulding)	f0b4d71f47	docs: Update comment to reflect new implementation	2022-05-27 13:04:24 -04:00
Carol (Nichols \|\| Goulding)	5232594aab	docs: Fix grammar in a comment Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-05-27 13:04:13 -04:00
Carol (Nichols \|\| Goulding)	2cb351cd0d	feat: Make a QuerierRBChunk wrapper to handle traits and extra data This brings back a bunch of code from OG from read buffer backed DbChunks.	2022-05-26 16:52:14 -04:00
Carol (Nichols \|\| Goulding)	5fd3ffc17f	refactor: Rename ParquetChunkAdapter to only ChunkAdapter It might be creating chunks of different kinds other than ParquetChunks.	2022-05-26 16:52:14 -04:00
Andrew Lamb	633117e595	feat: avoid catalog access on each query (#4650 ) * feat: cache catalog access on query * fix: Apply suggestions from code review Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com> Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com>	2022-05-26 20:44:22 +00:00
kodiakhq[bot]	1043c98e17	Merge branch 'main' into cn/welcome-back-read-buffer	2022-05-26 13:47:27 +00:00
Andrew Lamb	2d5a327bf4	fix: expire empty parquet_files cache and empty tombstones cache (#4701 ) * fix: expire empty parquet_files cache * fix: expire empty tombstones cache Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-26 11:08:15 +00:00
Carol (Nichols \|\| Goulding)	cddcca1e05	feat: Implement a method to get a read buffer chunk from a stream of record batches	2022-05-25 17:24:35 -04:00
Carol (Nichols \|\| Goulding)	f7bc551d9a	feat: Sketch out skeleton methods for RBChunk cache	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	04531e77dd	feat: Implement get on ReadBufferCache	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	25b8260b72	feat: Implement ReadBufferCache::new	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	ab9010d9a6	refactor: Rename QuerierParquetChunk::new_parquet to new	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	df10452e2e	refactor: Rename methods from new_querier_chunk to new_querier_parquet_chunk	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	4a90d0af32	refactor: Remove ChunkStorage enum; inline into QuerierParquetChunk instead	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	b2c62c6808	refactor: Rename QuerierChunk to QuerierParquetChunk	2022-05-25 17:19:10 -04:00
Carol (Nichols \|\| Goulding)	66823522f3	docs: Fix comment wrapping while reading through	2022-05-25 17:19:10 -04:00
Marco Neumann	a08a91c5ba	fix: ensure querier cache is refreshed for partition sort key (#4660 ) * test: call `maybe_start_logging` in auto-generated cases * fix: ensure querier cache is refreshed for partition sort key Fixes #4631. * docs: explain querier sort key handling and test * test: test another version of issue 4631 * fix: correctly invalidate partition sort keys * fix: fix `table_not_found_on_ingester`	2022-05-25 10:44:42 +00:00
Andrew Lamb	935743b525	refactor: Implement `new_querier_chunk` and `new_querier_chunk_from_file_with_metadata` (#4685 )	2022-05-24 21:58:27 +00:00
Andrew Lamb	4d8ece5524	feat: Add `Tombstone` to querier cache (#4663 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-24 13:21:23 +00:00
Marco Neumann	a3dab68f3f	fix: actually log error (#4672 ) While logging all the helpful information to replicate failing querier->ingester requests via CLI, I totally forgot to log the error message itself. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-24 08:44:35 +00:00
Andrew Lamb	e877a64462	feat: Add `ParquetFiles` cache and memory size estimation for ParquetMetadata (#4661 ) * feat: Add `ParquetFiles` cache * fix: Apply suggestions from code review Co-authored-by: Marko Mikulicic <mkm@influxdata.com> * fix: remove commented out debugging println * refactor: Improve size calculation * fix: mark `ParquetFileCache::clear` test only * fix: assert on metric count Co-authored-by: Marko Mikulicic <mkm@influxdata.com>	2022-05-23 17:11:38 +00:00
Marco Neumann	2029bd16ba	feat: enable debugging of failed querier->ingester requests (#4659 ) * feat: enable debugging of failed querier->ingester requests - extend `query-ingester` CLI to allow usage of predicates - on failed requests: log all information that required for the CLI - test the "ingester fails" scenario * test: explain Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * docs: improve Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * refactor: move b64 pred. serde into a single crate Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-05-23 15:37:31 +00:00
Dom Dwyer	7df7c4844c	refactor: remove redundant ParquetChunk errors Eliminates unused / refactors away unnecessary errors for the parquet::chunk module.	2022-05-20 15:17:40 +01:00
Andrew Lamb	a18a49736d	refactor: Encapsulate reconciliation logic more (#4644 ) * refactor: extract code from state_reconciler * refactor: Encapsulate reconcilation logic more * fix: docs	2022-05-19 19:25:36 +00:00
Dom Dwyer	baa86d846f	refactor: use ParquetStore instead of ObjectStore Changes the code paths that interact with Parquet files in the object store to reference the ParquetStorage directly (DRY refactor). This change takes us from a dependency graph of: ┌─────────────────┐ │ │ ▼ │ Parquet Consumer │ │ ┌──────────────┐ ├────────▶│ParquetStorage│ ▼ └──────────────┘ ┌──────────────┐ │ ObjectStore │ └──────────────┘ │ ┌────┴────┐ ▼ ▼ File s3 System (etc) to: Parquet Consumer │ ▼ ┌──────────────┐ │ParquetStorage│ └──────────────┘ │ ▼ ┌──────────────┐ │ ObjectStore │ └──────────────┘ │ ┌────┴────┐ ▼ ▼ File s3 System (etc) With the ParquetStorage being solely responsible for managing interactions with the object store when dealing with Parquet files.	2022-05-19 13:52:51 +01:00
Dom Dwyer	e20b02b914	refactor: tidy ParquetChunk constructor Removes two unused constructors for a ParquetChunk, and moves the bare fn constructor that is actually used to be an associated method (a conventional constructor).	2022-05-19 13:51:07 +01:00
Andrew Lamb	ed41622593	chore: Remove dead code from QueryDatabase (#4637 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-19 10:29:54 +00:00
Marco Neumann	6577887440	feat: instrument querier cache loaders w/ metrics (#4635 ) * feat: `MetricsLoader` Add ability to instrument cache loaders w/ metrics. * feat: instrument querier cache loaders w/ metrics * fix: fix metric descriptions and names	2022-05-19 08:30:34 +00:00
Marco Neumann	770293a973	feat: add LRU cache metrics (#4632 ) * refactor: require `Resource`s to be convertible to `u64` * refactor: require `Resource`s to have a unit name * refactor: make LRU cache IDs static * feat: add LRU cache metrics * docs: improve type names in LRU doctest * docs: epxlain `MeasuredT` Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * docs: explain `test_metrics` Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-05-19 08:05:17 +00:00
Marco Neumann	4bd899369e	feat: check for overlapping ingester partititions in querier (#4633 ) Right now this would clearly indicate a bug and before I am trying to understand some prod issues, I wanna rule that one out.	2022-05-18 13:16:27 +00:00
Marco Neumann	23b37a1991	refactor: remove unused `TableCache::id`	2022-05-18 11:39:30 +02:00
Marco Neumann	7c20acb2e6	refactor: remove unused `NamespaceCache::name`	2022-05-18 11:39:30 +02:00
Marco Neumann	52346642a0	ci: fix cargo deny (#4629 ) * ci: fix cargo deny * chore: downgrade `socket2`, version 0.4.5 was yanked * chore: rename `query` to `iox_query` `query` is already taken on crates.io and yanked and I am getting tired of working around that.	2022-05-18 09:38:35 +00:00
Andrew Lamb	3a33e806c7	chore: Update datafusion + `arrow`/`parquet`/`arrow-flight` to `14.0.0` (#4619 ) * chore: Update datafusion deps * chore: update arrow/parquet/arrow flight deps * chore: Run cargo hakari tasks * chore: Update location of utils * chore: Update some more APIs Co-authored-by: CircleCI[bot] <circleci@influxdata.com>	2022-05-17 14:13:03 +00:00
Marco Neumann	779f0e9cdf	feat: querier RAM pool (#4593 ) * feat: `SortKey::size` * feat: `FunctionEstimator` * feat: querier RAM pool Let's put all the caches into a single RAM pool, so we can at least somewhat control RAM usage. Note that this does NOT limit the peak memory during query execution though, but should at least stop unlimited cache growth. A follow-up PR will add metrics. * refactor: improve some size calculations Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-17 13:11:20 +00:00
Carol (Nichols \|\| Goulding)	9eb21095e7	feat: Add more logging in particular situations to debug flaky test	2022-05-16 16:46:29 -04:00
dependabot[bot]	259d2486c1	chore(deps): Bump tokio-util from 0.7.1 to 0.7.2 (#4605 ) Bumps [tokio-util](https://github.com/tokio-rs/tokio) from 0.7.1 to 0.7.2. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-util-0.7.1...tokio-util-0.7.2) --- updated-dependencies: - dependency-name: tokio-util dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-05-16 11:42:31 +00:00
Andrew Lamb	22c1853f3e	chore: Update datafusion dependency (#4599 ) * chore: Update datafusion dependency * chore: Update for new API	2022-05-13 19:10:22 +00:00
Raphael Taylor-Davies	f2bb0fdf77	feat: update to crates.io object_store version (#4595 ) * feat: update to crates.io object_store version * chore: Run cargo hakari tasks * fix: tests * chore: remove object store integration test plumbing Co-authored-by: CircleCI[bot] <circleci@influxdata.com>	2022-05-13 16:26:07 +00:00
kodiakhq[bot]	0f8f294319	Merge branch 'main' into cn/remove-chunk-addr	2022-05-13 13:54:44 +00:00
kodiakhq[bot]	542ec97b66	Merge branch 'main' into cn/rename-no-ng	2022-05-13 13:47:48 +00:00
Marco Neumann	cb0a4176fd	refactor: move `querier::cache_system` into its own crate (#4592 )	2022-05-13 13:12:07 +00:00
Andrew Lamb	ff8241ea57	fix: Use one (shared) http2 connection per querier and ingester pair (#4583 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-13 11:15:10 +00:00
Marco Neumann	31f3d988ae	feat: LRU cache infrastructure (#4189 ) * feat: `AddressableHeap::is_empty` * feat: add type-safe size trait * feat: LRU cache infrastructure * fix: typos Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * fix: update after rebase * docs: explain test code * test: ensure that values are dropped from LRU cache * test: ensure that backends are dropped from LRU cache * docs: explain where LRU state is stored * docs: explain high-level LRU usage * refactor: "memory (size)" => "resource (consumption)" This should make the reasoning more generic and easier to understand. Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-13 07:34:59 +00:00
Carol (Nichols \|\| Goulding)	55313d290a	fix: Update or remove comments that mention NG or OG Connects to #4450.	2022-05-12 16:09:08 -04:00
Carol (Nichols \|\| Goulding)	07c7c75067	fix: Remove ng_chunk method Connects to #4450.	2022-05-12 16:09:08 -04:00
Carol (Nichols \|\| Goulding)	71d1f2c9bd	fix: Remove old_gen_partition_key	2022-05-12 16:05:03 -04:00
Carol (Nichols \|\| Goulding)	faba90d992	fix: Remove ChunkAddr	2022-05-12 15:50:41 -04:00
Carol (Nichols \|\| Goulding)	975dd288d4	refactor: Rearrange delegation to remove one usage of generated types (#4579 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-12 16:10:38 +00:00
Carol (Nichols \|\| Goulding)	10413a6e9a	docs: Explain that the querier gets the write info from the ingesters Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-05-12 09:48:46 -04:00
Carol (Nichols \|\| Goulding)	48b84b3bdf	feat: Querier can get write status from ingesters Connects to influxdata/influxdb-iox-client-go#27.	2022-05-11 14:12:10 -04:00
Carol (Nichols \|\| Goulding)	77205d9a8e	fix: Remove some unused error variants	2022-05-11 14:07:48 -04:00
Andrew Lamb	b8cb4c3f2b	feat: Interrogate schema from querier (as well as router) (#4557 ) * refactor: move SchemaService into `service_grpc_schema` * feat: implement schema gRPC for querier * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com>	2022-05-10 20:55:58 +00:00
Raphael Taylor-Davies	8b379c83cc	refactor: simplify object_store path handling (#4534 ) * refactor: simplify object_store path handling * fix: aws integration tests * chore: lint * fix: update gcs tests * refactor: move errors into submodules * chore: lint * chore: review feedback * refactor: replace provider with Display * fix: failing tests Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-09 18:43:22 +00:00
Carol (Nichols \|\| Goulding)	dfced5b21c	fix: Move top-level allow dead code in querier to specific items	2022-05-06 16:58:03 -04:00
Jake Goulding	e07bcd40c2	refactor: Remove unused dependencies These were found by iterating over all of the dependencies of each Cargo.toml, then grepping that crate for the dependency's name. If it didn't show up, I attempted to remove it. I left a few dependencies that this process flagged: * generated_types - `pbjson`,`serde`. Apparently used by the generated code. * grpc-router-test-gen - `prost`. Apparently used by the generated code. * influxdb_iox - `heappy`. Doesn't appear used, but is behind enough feature flags that I don't care to reason about and it's already optional. - `tikv_jemalloc_sys`. Appears to be setting a feature flag of an indirect dependency. * iox_gitops_adapter - `k8s_openapi`. Appears to be setting a feature flag of an indirect dependency.	2022-05-06 15:57:58 -04:00
Carol (Nichols \|\| Goulding)	068096e7e1	fix: Rename data_types2 to data_types	2022-05-06 14:45:39 -04:00
Carol (Nichols \|\| Goulding)	0541c6e40f	fix: Remove data_types crate where it's no longer used	2022-05-06 14:45:39 -04:00
Carol (Nichols \|\| Goulding)	2ef44f2024	fix: Move timestamp types to data_types2	2022-05-06 14:45:38 -04:00
Carol (Nichols \|\| Goulding)	d2671355c3	fix: Move partition metadata types to data_types2	2022-05-06 14:45:37 -04:00
Carol (Nichols \|\| Goulding)	485d6edb8f	refactor: Move IngesterQueryRequest to generated_types	2022-05-06 14:45:37 -04:00
Carol (Nichols \|\| Goulding)	ea46830954	fix: Remove iox_object_store crate; move ParquetFilePath to parquet_file	2022-05-06 14:45:36 -04:00
Andrew Lamb	37c7ce793c	chore: Update datafusion (again) (#4518 ) * chore: Update datafusion (again) * refactor: Update ExecutionPlan:execute to not be async	2022-05-05 15:43:41 +00:00
Andrew Lamb	02893e598c	chore: Update datafusion and upgrade arrow/parquet/arrow-flight to 13 (#4516 ) * chore: Tool for automating arrow version update * chore: Update datafusion and arrow/parquet/arrow-flight * fix: update for changes in Arrow API Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-05 00:21:02 +00:00
Nga Tran	4813cc8332	test: Added explain tests for querier. Found and fixed #4468 (#4469 ) * test: Added explain tests for querier. Found and fixed #4468 * chore: cleanup * chore: Apply suggestions from code review Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-29 14:15:30 +00:00
Marco Neumann	6eed09a926	test: use "real" ingester in query tests (#4455 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-28 14:39:31 +00:00
dependabot[bot]	420c306caa	chore(deps): Bump tokio from 1.17.0 to 1.18.0 (#4453 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.17.0 to 1.18.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.17.0...tokio-1.18.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-04-28 08:21:17 +00:00
Marco Neumann	bd0bae13ce	fix: extend + harden querier `ensure_schema` (#4429 ) - only convert dictionary types that we really want to convert (instead of blindly converting all types) - handle missing / NULL columns Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-27 12:49:59 +00:00
Nga Tran	fa2c1febf4	feat: use stored partition sort key to deduplicate data (#4360 ) * feat: use stored sort key to deduplicate data * refactor: verify if one is a super sort key of the other * test: unit tests for scan and deduplication plans * fix: typo * refactor: refactor and add comments * feat: cache partition sort key to read during planning as needed * test: tests for query plans with different overlap groups * chore: cleanup * chore: resolve merge conflicts Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-26 20:36:32 +00:00
Marco Neumann	2337935660	test: chunks in ingester stage (#4415 ) * refactor: document and improve `MockIngesterConnection` * refactor: split `OldOneMeasurementFourChunksWithDuplicates` for `EXPLAIN` queries * fix: mark "IngsterPartition" chunks as unsorted * fix: "group by" queries may require sorted comparison * refactor: re-export a few more types from querier * fix: ensure that test parquet files are de-duped * test: chunks in ingester stage * docs: explain test code	2022-04-26 07:55:19 +00:00
二手掉包工程师	4b47d723b1	refactor: Rename time to iox_time (#4416 ) Signed-off-by: hi-rustin <rustin.liu@gmail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-26 00:19:59 +00:00
Marco Neumann	86e8f05ed1	fix: make all catalog IDs 64bit (#4418 ) Closes #4365. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-25 16:49:34 +00:00
Marco Neumann	f5f80e879e	test: add benchmarks for addressable heap (#4201 )	2022-04-25 14:37:29 +00:00
Nga Tran	d963110842	feat: group chunk overlaps based on time range only (#4389 ) * feat: overlap for NG querier * chore: cleanup * refactor: address review comments * fix: typo Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-25 13:32:07 +00:00
Marco Neumann	99f6fb5f59	feat: calculate summaries for `IngesterPartition`	2022-04-22 10:21:14 +02:00
Andrew Lamb	e67cc9dbce	chore: Update datafusion again (#4385 ) * chore: Update datafusion * fix: Update imports Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-21 21:05:16 +00:00
Marco Neumann	c084785bc3	feat: fuse ingester and catalog states in querier (#4355 ) * feat: fuse ingester and catalog states in querier This now correctly combines the data we get from the ingester w/ the data we get from the catalog. Right now it bails out if during the very small time windows between asking the ingester and querying the catalog the compactor combines the newest files w/ "too new" files (see tests). * fix: improve error wording Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * fix: improve doc comment Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * fix: explain tests Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * refactor: improve tests, method naming and docs Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-20 14:37:19 +00:00
Andrew Lamb	73bed810da	chore: Update arrow, arrow-flight, parquet, tonic, prost, etc (#4357 ) * chore: Update datafusion * chore: Update arrow/arrow-flight/parquet to 12 * chore: update datafusion correctly * chore: Update prost, tonic, and dependents * fix: Fixup some api changes * fix: Update test output in db * fix: Update test output in parquet_file * fix: remove old pbjson types * fix: Add "--experimental_allow_proto3_optional" flag * chore: Run cargo hakari tasks * fix: compile error * chore: Update heappy Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-20 11:12:17 +00:00
Andrew Lamb	383c0b328d	feat: Issue queries from querier to ingester in parallel (#4359 ) * feat: Issue queries from querier to ingester in parallel * refactor: complete Arc-ification * refactor: use a named struct to pass the state	2022-04-20 10:55:14 +00:00
Marco Neumann	d711816548	feat: add sequencer ID and correct partition key to `IngesterPartition` (#4348 ) * feat: impl `Debug` for `TestCatalog` * feat: add sequencer ID and correct partition key to `IngesterPartition` - simplifies debugging (parquet chunks and ingester chunks now use the same partition key naming) - the sequencer ID is required to correctly reason about tombstones (to be implemented in a later PR) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-19 15:59:49 +00:00
Marco Neumann	3588a06647	feat: correctly dismantle ingester gRPC reponse in querier (#4323 ) This now correctly processes record batches for the different partitions. The actual code change is rather small but it requires some substantial test infrastructure.	2022-04-19 11:09:40 +00:00
Marco Neumann	de1241db85	test: mock gRPC ingester response for querier Add infrastructure to test how the querier processes ingester gRPC responses w/o performing a full query or end2end test.	2022-04-14 15:00:35 +02:00
Marco Neumann	351b0d0c15	fix: unknown namespace/table in querier<>ingester flight protocol (#4307 ) * fix: return "not found" gRPC error instead of "internal" when ingester does not know table * fix: properly handle "namespace not found" in ingester queries * fix: make `initialize_db` work with async code * test: add custom step for NG tests * fix: handle "unknown table/namespace" resp. in querier * docs: explain test setup Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-04-14 12:36:15 +00:00
Carol (Nichols \|\| Goulding)	8c9b7b501b	fix: Update a signature to use ParquetFileWithMetadata	2022-04-13 11:09:06 -04:00
Carol (Nichols \|\| Goulding)	94dcde4996	fix: Do fewer queries for metadata By adding another _with_metadata catalog function. Also introduce a new type rather than passing around tuples everywhere.	2022-04-13 10:43:20 -04:00
Carol (Nichols \|\| Goulding)	02fee3b84f	feat: Request parquet metadata from the catalog when needed only	2022-04-13 10:43:19 -04:00
Marco Neumann	f75d3b1f5d	fix: proper executor shutdown in querier This is not a huge issue but might drain resources like file descriptors during tests. The dedicated exuector also logs a warning.	2022-04-13 15:52:44 +02:00
Marco Neumann	83f77712b1	refactor: querier<>ingester flight protocol adjustments (#4286 ) * refactor: querier<>ingester flight protocol adjustments This makes a few adjustments to the querier<>ingester flight protocol. Query Scope =========== The querier will request data for ALL sequencer IDs for now. There is no reason to have a request per sequencer ID. We can add a range/set filter later if we want, but this is not required for now. Partition-level =============== The only time when the querier cares about sequencer IDs (i.e. sharding) at all is when it selects which ingesters to ask for unpersisted data (this is currently not implemented, it just asks all ingesters). Afterwards the querier only cares about partitions (which are bound to specific sequencers anyways) because this is the level where parquet file persistence and compaction as well as deduplication happen. So we make partitions a first-class citizen in the ingester response. Metadata VS RecordBatches ========================= The global app-metadata will list all partitions and their max persisted parquet files and tombstones (theoretically tombstones are at table-level, but the ingester could in the future break them down to the partition-level). Then it receives a stream of record batches. Each record batch is tagged (via key-value metadata in its schema) so it can be assigned to a partition. At the moment the ingester returns 0 or 1 batches per unpersisted partition (0 in case we've filtered out all the data via the predicate), but in the future it is free to return multiple batches. This setup gives the ingester more freedom over memory management and (potentially parallel) query processing, while at the same time keeps the set of duplicated information minimal and allows easy extensions (since the global metadata is a full-blown protobuf message). Querier ======= At the moment the querier ignores all the metdata. Follow-up PRs will change that. * docs: improve Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * refactor: make code clearer Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-04-12 16:48:40 +00:00
Andrew Lamb	d8de38cdb9	feat: MVP include un-persisted results from the ingester in query results (#4255 ) * feat: Return not-yet-persisted data in query results * fix: comments from code review * fix: update for logical merge conflict Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-12 11:03:19 +00:00
Marco Neumann	380cd9bbff	refactor: use a single flight client implementation (#4273 ) "end-user -> querier" and "querier -> ingester" should use a single Flight client implementation. The difference is just the request and response metadata. This changes our default Flight client to use protobuf instead of JSON for the ticket format.	2022-04-12 09:08:25 +00:00
Andrew Lamb	3f5eab7648	feat: allow the querier to talk with multiple ingesters (#4271 ) * refactor: Move querier config to clap_blocks * refactor: Add tests * refactor: allow multiple addresses * refactor: Update to use multiple addresses * fix: bow to clippy * fix: docstring * fix: error if address is repeated multiple times * chore: Add error enum, plumb through * fix: clippy * refactor: improve Rust API * fix: fix test	2022-04-11 18:49:49 +00:00
Andrew Lamb	941dcc8e80	fix: return error rather than panic in querier namspace access (#4270 )	2022-04-11 14:01:15 +00:00
Andrew Lamb	f6e6821276	feat: Add basic Querier <--> Ingester "Service Configuration" (#4259 ) * feat: Add basic Querier <--> Ingester "Service Configuration" * docs: update comments in test * refactor: cleanup tests a little * refactor: make trait more consistent * docs: improve comments in IngesterPartition	2022-04-11 11:50:22 +00:00
Andrew Lamb	bbbdcc75a8	feat: `QuerierDatabase::chunks` returns `Result` (#4260 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-08 18:54:17 +00:00
Andrew Lamb	eb7d41f7a1	test: Add schema validation to end to end querier test (#4258 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-08 18:11:00 +00:00
Andrew Lamb	34e65c23fa	fix: Update for signature change (#4252 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-08 11:21:07 +00:00
Marco Neumann	5bebc73e3f	fix: consider "in-between" tombstones as processed (#4187 ) Abstract ======== We need to be careful w/ tombstone that fall exactly in sequence number range of a parquet file. Current Bug =========== Imagine the following order of events: 1. Router creates write at sequence number 1: - `table,selector=1 payload=1 1` - `table,selector=2 payload=2 2` 2. Ingester pulls write, waits a bit and persists it to parquet file 1: - `table,selector=1 payload=1 1` - `table,selector=2 payload=2 2` 4. Router creates write at sequence number 2: - `table,selector=1 payload=3 3` - `table,selector=2 payload=4 4` 5. Ingester pulls write 6. Router create delete at sequencer number 3: full time range, `selector=1` 7. Ingeser pulls delete and creates tombstone 1 8. Router creates write at sequence number 4: - `table,selector=1 payload=5 5` - `table,selector=2 payload=6 6` 9. Ingester pulls write 10. Ingester persists parquet file 2: - `table,selector=2 payload=4 4` - `table,selector=1 payload=5 5` - `table,selector=2 payload=6 6` When reading parquet file 2, the tombstone MUST NOT be applied. Otherwise `table,selector=1 payload=5 5` will be deleted. Notes ===== Technically this issue also applies to files created by the compactor, however the compactor marks tombstones as processed that fall into the sequence number range. It even does that in a single transaction: `fc4635a334/compactor/src/compact.rs (L821-L861)` Alternative =========== An alternative solution would be if the ingester would mark tombstones that it materialized during persistence as "processed" (tombstone 1 for parquet file 2 in the example above). However "processed" markers are currently a mere optimization and don't affect correctness, which is nice for caching on the querier side as well as reasoning. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-31 15:09:58 +00:00
Andrew Lamb	de6505c801	fix: retry catalog list operation (#4188 )	2022-03-31 13:07:00 +00:00
Andrew Lamb	a1df864283	feat: Support 'SHOW NAMESPACES' in sql repl (#4164 ) * feat: Support `SHOW NAMESPACES` in sql repl * feat: add basic support to clients * fix: add get_namespaces service test * fix: proper error handling * test: end to end test for namespace client * refactor: Use QuerierDatabase rather than Catalog * refactor: remove unused function	2022-03-31 12:57:33 +00:00
Andrew Lamb	22b24bdab3	chore: Update datafusion again (#4148 ) * chore: update datafusoon * refactor: Update for DataFusion API changes * chore: TEMP TEMP change df to local copy * chore: Update to datafusion again * fix: Update Cargo.lock * fix: logical conflict	2022-03-30 16:51:48 +00:00
Marco Neumann	b1af5b3f44	feat: query log system table for querier (#4157 ) * feat: query log system table for querier Closes #4084. * fix: typo Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * docs: extend Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-30 15:38:11 +00:00
Marco Neumann	5c6583e09a	feat: `CacheBackend::is_empty` (#4174 ) * test: run generic tests for dual cache backend * feat: `CacheBackend::is_empty` Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-30 15:28:59 +00:00
Marco Neumann	20bbb88dc5	refactor: remove table name from `TableSummary` (#4170 ) This allows us to remove the table name from the low-level chunk representations (like `ParquetFile`, RUB, ...) since table names are already tracked by the higher-level data structures (e.g. catalog, catalog chunk) that manage the low-level chunk representations. This is similar to #4167. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-30 13:24:00 +00:00
Marco Neumann	036626a576	refactor: remove partition key from `ParquetChunk` (#4167 ) The parquet chunk is always wrapped into some higher-level data structure (e.g. a catalog chunk, a partition, ...) that knows exactly "where" the chunk is located. There is no need for the parquet chunk to back-reference container-level attributes. In the contrary: double-bookkeeping makes the code more complex and costs additional memory. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-30 09:24:56 +00:00
Marco Neumann	decd018a6a	refactor: remove querier sync loop (#4146 ) Namespaces are now created on demand and contain their full schema. Tombstones/chunks are created on demand during the query. Closes #4123. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-29 11:47:23 +00:00
dependabot[bot]	17af5fcbd1	chore(deps): Bump tokio-util from 0.7.0 to 0.7.1 (#4154 ) * chore(deps): Bump tokio-util from 0.7.0 to 0.7.1 Bumps [tokio-util](https://github.com/tokio-rs/tokio) from 0.7.0 to 0.7.1. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-util-0.7.0...tokio-util-0.7.1) --- updated-dependencies: - dependency-name: tokio-util dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * chore: Run cargo hakari tasks Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-29 08:39:02 +00:00
Marco Neumann	ec88df6cb7	feat: cache namespace schemas (#4142 ) * feat: `TestColumn` for IOx tests * feat: cache namespace schemas In preparation to remove the querier sync loop completely. Ref #4123.	2022-03-29 08:19:21 +00:00
Nga Tran	80b7e9cce1	feat: delete fully processed tombstones & integration tests for find_and_compact (#4116 ) * feat: remove fully processed tombstones * test: first few tests * fix: delete SQL * fix: test how IN (...) works in PG * fix: test how IN (?) works in PG * fix: test how IN (?) works in PG * fix: dynamically add IN (?, ?, ...) * fix: dynamically add IN (?, ?, ...) & its dynamic values * fix: add argument directly in the SQL * test: more tests for catalog read and update functions * chore: move a subfunction to make it easier to read) * test: first test for find_can_compact but disabled due to bug * test: integration tests and a bug fix for find_and_compact * chore: cleanup * refactor: address review comments * fix: put 2 delete processed tombstones and tombstones in a transaction	2022-03-28 18:35:54 +00:00
Marco Neumann	fb186c6733	refactor: `QueryDatabaseProvider::db` should be async (#4143 ) This is required to fetch querier namespaces on demand. Ref #4123.	2022-03-28 11:18:20 +00:00
dependabot[bot]	4f9515ffba	chore(deps): Bump async-trait from 0.1.52 to 0.1.53 (#4141 ) Bumps [async-trait](https://github.com/dtolnay/async-trait) from 0.1.52 to 0.1.53. - [Release notes](https://github.com/dtolnay/async-trait/releases) - [Commits](https://github.com/dtolnay/async-trait/compare/0.1.52...0.1.53) --- updated-dependencies: - dependency-name: async-trait dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-03-28 08:55:24 +00:00
Marco Neumann	9886ff42cc	refactor: clean up querier public interface	2022-03-25 11:54:52 +01:00
Marco Neumann	942c3198f6	refactor: pass through backoff config to querier table	2022-03-25 11:18:45 +01:00
Andrew Lamb	5c69a3f43b	chore: Update deps: datafusion, arrow/arrow-flight/parquet to 11, zstd to 0.11 (#4119 ) * chore: update datafusion * chore(deps): Bump arrow from 10.0.0 to 11.0.0 Bumps [arrow](https://github.com/apache/arrow-rs) from 10.0.0 to 11.0.0. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/apache/arrow-rs/compare/10.0.0...11.0.0) --- updated-dependencies: - dependency-name: arrow dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> * chore(deps): Bump arrow-flight from 10.0.0 to 11.0.0 Bumps [arrow-flight](https://github.com/apache/arrow-rs) from 10.0.0 to 11.0.0. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/apache/arrow-rs/compare/10.0.0...11.0.0) --- updated-dependencies: - dependency-name: arrow-flight dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> * chore: update parquet to 11.0.0 * fix: error on create schema, test for same * fix: upgrade zstd * chore: Run cargo hakari tasks * fix: fix logical merge conflict * fix: hakari * fix: hakari * fix: update newly introduced dep Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-24 15:27:36 +00:00
Andrew Lamb	29b89aaca7	refactor: extract influxrpc, flight and testing gRPC out of influxdb_ioxd (#4106 ) * refactor: extract grpc service implementations out of influxdb_ioxd * chore: Run cargo hakari tasks * refactor: rename server_common to service_common * refactor: rename server_grpc_influxrpc to service_grpc_influxrpc * refactor: rename server_grpc_flight to service_grpc_flight * refactor: rename server_grpc_testing to service_grpc_testing * fix: Cargo.toml Co-authored-by: CircleCI[bot] <circleci@influxdata.com>	2022-03-23 20:14:45 +00:00
Marco Neumann	51da6dd7fa	feat: store sort key in NG metadata (#4110 ) The sort key is optional and currently only produced by `iox_tests`. Writing it within the ingester/compactor is tracked by #3968. The sort key is read by the querier (and this will be verified by the query tests and is required to merge #4103). Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-23 18:24:46 +00:00
Marco Neumann	89206e013c	test: run SOME query tests for querier (#4098 ) This includes some type changes to dispatch between OG and NG and allows some tests to be run against the NG querier. This only contains parquet files though, so it's somewhat a limited scope. For #3934.	2022-03-22 17:39:19 +00:00
Nga Tran	886f9dc8c1	feat: split compacted data into 2 compacted sets (#4088 ) * feat: split compacted data into 2 compacted sets * chore: clean up * refactor: address review comments Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-22 13:28:32 +00:00
Andrew Lamb	b83b000590	chore: Update datafusion (#4071 ) * chore: update to datafusion 5936edc2a94d5fb20702a41eab2b80695961b9dc * chore: Update apis to match datafusion changes	2022-03-22 13:17:41 +00:00
Marco Neumann	c9908b260c	refactor: dyn-dispatch database in query subsystem (#4083 ) * refactor: dyn-dispatch database in query subsystem This is similar to #4080 but concerns the database itself. For #3934. * docs: improve wording Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-22 09:15:52 +00:00
Marco Neumann	55643945a1	refactor: `querier` w/o `db` (#4063 ) * feat: `TombstoneRepo::list_by_table` * feat: `ParquetFileRepo::list_by_table_not_to_delete` * refactor: `querier` w/o `db` Get the `querier` to work w/o relying on `db`. A few notes: - Testing is kinda shallow, we really need to get `query_tests` working w/ `querier` (see #3934). - We still run a sync loop for namespaces, tables and schemas. This will be a replaced by "update namespace incl. tables and schemas on demand". Note however that we cannot fetch single tables and schemas on demand at the moment, because DataFusion doesn't implement async schema inspection (only `scan` / "give me all the chunks" is async). I think that's OK for now and we can address this later. - There is NO cache for parquet files and tombstones at the moment. For correctness, they need to be fetched in a single transaction (or we need a kinda tricky sequence number / logical clock tracking) and I am not sure yet how this makes sense when we have the ingester data wired up and predicates pushed down to the catalog (see next point). So let's measure first and then decide on a caching strategy for this. - Predicates are currently NOT pushed down to the catalog. I'll need to figure out how to extract time range from generic DataFusion expressions to make that work (it's easier for InfluxRPC queries, but they are not tested at the moment, see first point). Sorry that this commit is kinda huge. I initially planned to only migrate the chunks away from `db` and leave the tables and schemas for a follow-up PR, but the DataFusion trait structure (chunks are bound to their tables) makes this kinda pointless. Closes #3974. * docs: explain what we're doing Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * docs: mention tracking issues * docs: explain what we're doing Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-03-21 16:58:00 +00:00
Marco Neumann	0779f81b6b	refactor: rework `TableCache (#4054 ) * feat: `TableRepo::get_by_namespace_and_name` * refactor: rework `TableCache` - dual cache that can also map table names to IDs - deal w/ missing tables w/o panics - set proper timeouts to missing data For #3974. * test: extend table cache tests	2022-03-21 13:40:06 +00:00
Marco Neumann	d1df95df87	refactor: dyn-dispatch chunks in query subsystem - this is what DataFusion is doing as well; it's also fast enough because the number of chunks in a query is not THAT massive (it's not like we are doing row-level dyn dispatching) - it simplifies abstracting over different databases - it allows us to drop our enum-based dispatching that we have for `DbChunk` and that we would also need for the querier (e.g. depending on if a chunk is backed by a parquet file or ingester data) - it likely speeds up compile times because the `query` is no longer contains massive amounts of generic code For #3934.	2022-03-21 12:47:54 +01:00
Marco Neumann	ca152e7934	refactor: avoid generics in `QueryDatabase` A step to make this trait object-safe. Ref #3934.	2022-03-21 10:45:05 +01:00
Marco Neumann	0071b85c22	refactor: make `ExecutionContextProvider` object-safe Ref #3934.	2022-03-21 10:40:53 +01:00
Marco Neumann	98c8475e3b	feat: add "dual" cache pattern (#4039 ) * feat: add "dual" cache pattern This will be useful for certain parts that are addressed internally via ID but where the user-facing APIs use names. For #3985. * refactor: rework "dual" cache construct to be backend based Pros: - easiser to reason about the locking and consistency, esp. in concurrent applications Cons: - we are not canceling running queries for the dual cache any longer Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-17 13:39:09 +00:00
Marco Neumann	0850a93f20	refactor: make `QueryDatabase::chunks` async (#4047 ) For OG we can determine the chunks w/o any IO, for NG however this might require a few catalog queries. This is likely not the last change of this sort, i.e. the whole schema handling is currently sync as well. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-17 12:55:25 +00:00
Marco Neumann	dc67570e1c	feat: `OptionalValueTtlProvider` (#4040 ) Quite a few caches will request data from the catalog w/o knowing if it exists (e.g. a table by name). We should have different TTLs for "exists" and "unknown" w/o writing much boilerplate code. For #3985. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-16 11:56:21 +00:00
Marco Neumann	31d6a7e6b3	fix: race condition in `Cache::set` (#4038 ) In theory on a multi-threaded tokio executor, the following could have happened: \| Thread 1 \| Thread 2 \| \| --------------------- \| ----------------------------------- \| \| \| Running query begin \| \| \| ... \| \| \| `loader.await` finished \| \| `Cache::set` begin \| \| \| state locked \| \| \| \| try state lock, blocking \| \| running query removed \| \| \| ... \| \| \| state unlocked \| \| \| `Cache::set` end \| \| \| \| state locked \| \| \| panic because running query is gone \| Another issue that could happen is if we: 1. issue a get request, loader takes a while, this results in task1 2. side-load data into the running query (task1 still running) 3. the underlying cache backend drops the result very quickly (task1 still running) 4. we request the same data again, resulting in yet another query task (task2), task1 is still running at this point In this case the original not-yet-finalized query task (task1) would remove the new query task (task2) from the active query set, even though task2 is actually not done. We fix this by the following measures: - task tagging: tasks are tagged so if two tasks for the same key are running, we can tell them apart - task->backend propagation: let the query task only write to the underlying backend if it is actually sure that it is running - prefer side-loaded results: restructure the query task to strongly prefer side-loaded data over whatever comes from the loader - async `Cache::set`: Let `Cache::set` wait until a running query task completes. This has NO correctness implications, it's probably just nicer for resource management. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-16 11:36:20 +00:00
Marco Neumann	1cbd878379	feat: do not attempt to store entries that will immediately expire (#4045 ) Let's keep the TTL cache as clean as possible. For #3985.	2022-03-16 11:16:46 +00:00
Dom Dwyer	5585dd3c21	refactor: switch to using DynObjectStore Changes all consumers of the object store to use the dynamically dispatched DynObjectStore type, instead of using a hardcoded concrete implementation type.	2022-03-15 16:32:52 +00:00
Dom Dwyer	1d5066c421	refactor: rename ObjectStore -> ObjectStoreImpl Frees up the name for so we can use `dyn ObjectStore` throughout the code instead of `ObjectStoreApi`.	2022-03-15 16:29:43 +00:00
Marco Neumann	97d595e4fb	feat: `Cache::set` (#4036 ) * feat: `Cache::set` This will be helpful to fill caches if we got the information from somewhere else. For #3985. * docs: improve Co-authored-by: Edd Robinson <me@edd.io> * docs: explain lock gap * feat: add debug log to `Cache` Co-authored-by: Edd Robinson <me@edd.io>	2022-03-15 12:12:26 +00:00
Marco Neumann	4b5cf6a70e	feat: cache processed tombstones (#4030 ) * test: allow to mock time in `iox_test` * feat: cache processed tombstones For #3974. * refactor: introduce `TTL_NOT_PROCESSED`	2022-03-15 10:28:08 +00:00
Marco Neumann	87e53f30d1	refactor: add TTL cache backend (#4027 ) * feat: `CacheBackend::as_any` * refactor: add TTL cache backend This is based on the new `AddressableHeap`, which simplifies the implementation quite a lot. For #3985. * refactor: `TtlBackend::{update->evict_expired}` * docs: exlain ttl cache eviction Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-14 15:35:27 +00:00
Marco Neumann	27efb66237	test: add proptest for `AddressableHeap` (#4025 ) * test: add proptest for `AddressableHeap` For #3985. * refactor: simplify code Co-authored-by: Edd Robinson <me@edd.io> Co-authored-by: Edd Robinson <me@edd.io>	2022-03-14 12:38:30 +00:00
Marco Neumann	632c4953b4	feat: add addressable heap for query cache (#4016 ) * feat: add addressable heap for query cache This will be used as a helper data structure for TTL and LRU. It's probably not the most performant implementation but it's good enough for now. This is for #3985. * fix: test + explain tie breaking in `AddressableHeap`	2022-03-14 09:35:38 +00:00
Marco Neumann	d46de98183	feat: extract "backend" from querier cache (#4015 ) * feat: extract "backend" from querier cache The backend will implement pruning policies like LRU and TTL as well as where/how the data is stored. Having a proper interface for that simplifies the implementation since we don't need to have one massive `Cache` object with a super complex mechanism. This is for #3985. * refactor: `Backend` -> `CacheBackend` Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-11 11:49:57 +00:00
Nga Tran	f03ebd79ab	refactor: move querier's test utils to a new crate to get reused by tests in other crates (#4013 ) * refactor: move querier's test utils to a new crate to be able resued by tests in other crates * chore: remove unused import	2022-03-10 18:17:58 +00:00
Paul Dix	27999ff72f	feat: add compaction_level and created_at to parquet_file (#3972 )	2022-03-10 15:56:57 +00:00
Andrew Lamb	2c3d30ca32	chore: Update datafusion, arrow, flight and parquet (#4000 ) * chore: Update datafusion, arrow, flight and parquet * fix: api change * fix: fmt * fix: update test metadata size * fix: Update sizes in parquet test * fix: more metadata size update	2022-03-10 12:24:47 +00:00
Marco Neumann	30d1c77d36	feat: querier test system, ground work (#3991 ) * feat: querier test system, ground work See #3985 for the motivation. This introduces a cache system for the querier which can later be extended to support the remaining features listed in #3985 (e.g. metrics, LRU/TTL). All current caches are wired up to go throw the new cache system. Once we move away from (ab)using `db`, the set of caches will be different but the system will remain. * test: explain it Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com> * refactor: simplify cache result broadcast * refactor: introduce `Loader` crate * fix: docs * docs: explain why we manually drop removed hashmap entries * docs: fix intra-doc link Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com>	2022-03-10 11:27:24 +00:00
Nga Tran	c6cab3538f	refactor: move parquet chunk's new and decode to parquet_file crate (#3987 )	2022-03-08 22:04:32 +00:00
Marco Neumann	77f6153f72	refactor: remove `QueryDatabase::chunk_summaries` (#3977 ) - This is not used by the query engine at all. - The query engine should not care about ALL chunks but only about the chunks it gets via `QueryDatabase::chunks` (which includes a table name and a predicate). - All other users of that API are NOT really query-related.	2022-03-08 11:34:26 +00:00
Marco Neumann	5cc1c697fc	refactor: remove `QueryDatabase::partition_addr` (#3976 ) - This was not actually used by the query engine. - The query engine doesn't have a concept of a "partition", it only cares about chunks. - Unbound access to all partitions in the database is quite expensive (esp. on NG).	2022-03-08 11:17:31 +00:00
Marco Neumann	a3e952847a	refactor: split `querier::namespace` into submodules (#3975 ) This makes it easier to see what's required to support the query interface.	2022-03-08 10:49:00 +00:00
Marco Neumann	db3f1e8db7	feat: wire up tombstones into querier (#3962 ) * feat: `TombstoneRepo::list_by_namespace` * test: model sequencer properly * feat: wire up tombstones into querier Closes #3932. * refactor: `override_delete_predicates` => `set_delete_predicates`	2022-03-08 10:06:22 +00:00
Carol (Nichols \|\| Goulding)	9961efd702	feat: Send parquet and tombstone seq nums with ingester query response (#3925 ) Fixes #3867.	2022-03-04 15:22:29 +00:00
Marco Neumann	8d00aaba90	feat: sync chunks in querier (#3911 ) * feat: `ParquetFileRepo::list_by_namespace_not_to_delete` * feat: `ChunkAddr: Clone` * test: ensure that querier keeps same partition objects * test: improve `create_parquet_file` flexibility * feat: sync chunks in querier * test: improve `test_parquet_file`	2022-03-04 08:53:39 +00:00
Raphael Taylor-Davies	e304613546	feat: include trace ID in query log (#3912 ) (#3923 ) * feat: include trace ID in query log (#3912) * chore: fmt * chore: lint	2022-03-03 17:50:49 +00:00
Marco Neumann	bbeba73345	feat: sync partitions in querier (#3900 ) * feat: sync partitions in querier * docs: explain per-table partition grouping	2022-03-03 09:28:44 +00:00
kodiakhq[bot]	caba3e9fd2	Merge branch 'main' into cn/querier-flight-request	2022-03-02 20:30:00 +00:00
Edd Robinson	3d047073b9	feat: add tracing down to the chunk level (#3804 ) * refactor: wire exectution context to Deduplicator * feat: example trace to chunk read_filter * refactor: make execution context required * refactor: expose metadata API * refactor: more span context for chunk read_filter * refactor: fix build * refactor: push context into result stream * refactor: make executor optional	2022-03-02 19:08:22 +00:00
Carol (Nichols \|\| Goulding)	3f2a58b47f	refactor: pub use data_types from data_types2 So it's clearer which parts of data_types the NG design is using, and which types can be cleaned up eventually.	2022-03-02 13:55:31 -05:00
Carol (Nichols \|\| Goulding)	2a90841715	refactor: Move IngesterQueryRequest to data_types2 So that querier doesn't need to depend on ingester.	2022-03-02 13:52:13 -05:00
Carol (Nichols \|\| Goulding)	8f3e44bf76	refactor: Extract a crate for shared data types in the new design	2022-03-02 12:16:15 -05:00
Carol (Nichols \|\| Goulding)	16d86ed05b	feat: Deserialize metadata to get max_sequencer_number And add an end-to-end test for the flight request to the ingester.	2022-03-02 11:50:47 -05:00
Carol (Nichols \|\| Goulding)	141a6087d0	feat: Querier able to send Flight queries to Ingester Fixes #3773.	2022-03-02 11:50:45 -05:00
Marco Neumann	2fd68ea75f	feat: sync tables and schemas in querier (#3895 ) * feat: convert `iox_catalog` schema to `schema::Schema` * fix: remove leftover println statements * feat: sync tables and schemas in querier * feat: `PartitionRepo::list_by_namespace` * docs: explain `QuerierNamespace` data structs a bit * refactor: improve variable naming * test: extend `test_sync_schemas * fix: do not block forever when namespace is gone	2022-03-02 15:32:03 +00:00
Andrew Lamb	286d5f7b2b	feat: add `success` column to system.queries (#3891 ) * feat: add `success` column to system.queries * refactor: Remove lifetime from QueryCompletedToken and thread through flight * test: update test to make incomplete query clearer * refactor: use better patter to set complete * fix: logical merge conflict	2022-03-02 15:05:06 +00:00
Marco Neumann	af57664f53	feat: wire up flight into the querier (#3889 )	2022-03-02 09:20:26 +00:00
Marco Neumann	936f51013d	test: querier shutdown and background task handling (#3881 ) This is similar to what we already have for the ingester.	2022-03-02 08:59:50 +00:00
Marco Neumann	daf14f6506	refactor: clean up `querier` a bit Before adding more and more features, here is a bit of a clean up and prep work: - Pull out caching into its own module and add proper tests for it. - Start to build a test infrastructure so tests are shorter and easier to read. This doesn't fully pay off just yet but gets more and more important when we actually sync tables and chunks.	2022-03-01 13:24:20 +01:00
Marco Neumann	48722783f9	feat: offer metrics for in-mem catalog (#3876 ) This can be quite helpful to test certain caching behavior w/o writing yet-another abstraction layer.	2022-03-01 11:33:54 +00:00
Marco Neumann	6e2470bf5f	feat: create CatalogChunk in querier (#3862 ) * feat: `NamespaceRepo::get_by_id` * feat: create `CatalogChunk` in querier	2022-02-28 17:20:38 +00:00
Marco Neumann	b213796c98	feat: sync namespaces in querier (#3865 ) * feat: `NamespaceRepo::list` * feat: sync namespaces in querier Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-28 15:01:28 +00:00
Marco Neumann	f966f4c7a4	feat: create `ParquetChunk` in querier (#3857 ) Adds a small adapter that is able to produce `ParquetChunk`s for NG. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-25 08:54:16 +00:00
Luke Bond	4731913c44	feat: skeleton of querier CLI (#3843 ) * feat: skeleton of querier CLI * chore: wrap metrics in opt&arc in querier to satisfy new api * chore: derive debug in querier handler * chore: add join handles and their shutdown to nascent querier server * chore: querier server http unimpl -> 404 * fix: join/shutdown fix in querier; removed unused delegates	2022-02-24 15:42:56 +00:00
dependabot[bot]	5a79b3a68b	chore(deps): Bump arrow-flight from 9.0.2 to 9.1.0 (#3829 ) Bumps [arrow-flight](https://github.com/apache/arrow-rs) from 9.0.2 to 9.1.0. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/apache/arrow-rs/compare/9.0.2...9.1.0) --- updated-dependencies: - dependency-name: arrow-flight dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-23 11:03:22 +00:00
Andrew Lamb	a30803e692	chore: Update datafusion, update `arrow`/`parquet`/`arrow-flight` to 9.0 (#3733 ) * chore: Update datafusion * chore: Update arrow * fix: missing updates * chore: Update cargo.lock * fix: update for smaller parquet size * fix: update test for smaller parquet files * test: ensure parquet_file tests write multiple row groups * fix: update callsite * fix: Update for tests * fix: harkari * fix: use IoxObjectStore::existing Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-15 12:10:24 +00:00
Carol (Nichols \|\| Goulding)	73828323ac	feat: Ingester Flight gRPC API (#3623 ) * feat: Add a way to run ingester with an in-memory catalog from the CLI If you set the --catalog-dsn string to "mem", rather than using that as a Postgres connection URL, create an in-memory catalog. Planning on using this in tests, so not documenting. * fix: Set default topic to the same value as SHARED_KAFKA_TOPIC Namely, both should use an underscore. I don't think there's a way to directly share these values between a constant and an annotation. * feat: Add a flight API (handshake only) to ingester * fix: Create partitions if using file-based write buffer * fix: Change the server fixture to handle ingester server type For now, the ingester doesn't implement the deployment API. Not sure if it should or not. * feat: Start implementing ingester do_get, namely decoding the query Skip serialization of the predicate for the moment. * refactor: Rename ingest protos to ingester to match crate name * refactor: Rename QueryResults to QueryData * feat: Move ingester flight client to new querier crate * fix: Off by one error, different starting indexes in sequencers * fix: Create new CLI argument to pick the catalog type * fix: Create a CLI option to set the number of topics to auto-create in the write buffer * fix: Check the arrow flight service's health to tell that the ingester gRPC is up * fix: Set postgres as the default catalog type * fix: Return an error rather than panicking if CLI args aren't right	2022-02-09 19:07:44 +00:00

... 2 3 4 5 6 ...

347 Commits (7202dddab6d9ede46c74664c0675fe349da2fd13)