influxdb

Commit Graph

Author	SHA1	Message	Date
Dom Dwyer	9eafa9dbed	style: consistent import ordering Reorder all imports in the ingester to match a consistent order: * stdlib * external crates * intra-crate imports This helps prevent merge conflicts & keeps everything tidy.	2022-11-22 14:11:10 +01:00
Dom Dwyer	ee8b728c32	refactor: decouple Shard & BufferTree Splits out the nested tree of namespace -> tables -> partitions (referred to as the "buffer tree") from the Shard which previously held the namespace map. This allows the BufferTree to exist without a shard, or many trees to exist within a shard, etc.	2022-11-22 14:11:10 +01:00
Marco Neumann	e4c12fa6a5	fix: slice flight response batches (#6205 ) * fix: slice flight response batches Same as #6094 but for the Apache Flight interface. Ref https://github.com/influxdata/idpe/issues/16073. * refactor: use `RecordBatch::slice` Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-22 12:25:23 +00:00
dependabot[bot]	04c00bbb62	chore(deps): Bump bytes from 1.2.1 to 1.3.0 (#6199 ) Bumps [bytes](https://github.com/tokio-rs/bytes) from 1.2.1 to 1.3.0. - [Release notes](https://github.com/tokio-rs/bytes/releases) - [Changelog](https://github.com/tokio-rs/bytes/blob/master/CHANGELOG.md) - [Commits](https://github.com/tokio-rs/bytes/commits) --- updated-dependencies: - dependency-name: bytes dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-22 08:23:24 +00:00
Dom Dwyer	097f0acb85	refactor: move SequenceNumberRange Moves the SequenceNumberRange type out of "data" and into the root to be reused outside of the data module. This construct is universally useful across all the ingester code.	2022-11-21 16:11:55 +01:00
Dom Dwyer	1938c18c50	refactor: decouple DmlSink error type Allows different DmlSink implementations to return different error types. This allows for small, concise errors that are local to the DmlSink implementation and specific to it. This helps avoid bloated "kitchen sink" error types.	2022-11-21 15:29:13 +01:00
Dom Dwyer	64c9d87b9b	refactor: move DmlSink Extracts the DmlSink trait into its own module - it is independent of the Kafka handler and will be reused.	2022-11-21 15:02:24 +01:00
dependabot[bot]	a9db7581cd	chore(deps): Bump tokio from 1.21.2 to 1.22.0 (#6183 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.21.2 to 1.22.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.21.2...tokio-1.22.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-21 10:21:24 +00:00
Dom Dwyer	85c8d16680	refactor: add a message to unreachable!() Adds a message to say an impossible thing is impossible.	2022-11-18 17:33:58 +01:00
Dom Dwyer	9dc32f1c16	refactor: remove names from DML init Fixes conflicts introduced by #6170.	2022-11-18 17:31:56 +01:00
Dom	59b3c793d3	Merge branch 'main' into dom/ingester-rpc-write	2022-11-18 16:21:07 +00:00
Dom Dwyer	9351e01068	refactor: log dml apply errors Ensures DML apply errors are recorded in the ingester logs.	2022-11-18 16:48:31 +01:00
Dom Dwyer	16eed699fd	refactor: avoid needless partition key clone Moves the trace! invocation to before the DmlWrite init to avoid having to clone the partition key.	2022-11-18 16:46:14 +01:00
Carol (Nichols \|\| Goulding)	9751512d44	fix: Insert columns in schema in ingester tests where we have table names	2022-11-18 10:40:40 -05:00
Carol (Nichols \|\| Goulding)	02c3083192	fix: Remove table names from Dml operations	2022-11-18 10:40:38 -05:00
Dom Dwyer	90dd9906f6	feat(ingester): rpc write endpoint Adds a handler implementation of the gRPC WriteService to receive direct RPC writes from a router. This code is currently unused.	2022-11-18 16:36:19 +01:00
Dom Dwyer	229e2adbb1	refactor: split gRPC services into modules Splits the everything-grpc-in-one-file into smaller, per-service modules.	2022-11-18 15:51:54 +01:00
Nga Tran	49a9565240	feat: gRPC that creates namespace (#6103 ) * feat: create namespace API call in router Co-authored-by: Nga Tran <nga-tran@live.com> * chore: treat retention as ns except in CLI * fix: overflow in nanosecond calc * fix: retention test after changing it from hours to ns * chore: comment clarification in cli; better response type for error in ns API * fix: correct some rebase mistakes * chore: merge namespace create & create_with_retention; renamed ns create test helper fn & const * fix: ns autocreation test was wrong after rebase * fix: mem catalog has default 1hr retention, accidently removed in rebase * chore: remove mem catalogs default 1hr retention; make it settable in sets & router Co-authored-by: Luke Bond <luke.n.bond@gmail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-18 13:02:12 +00:00
Nga Tran	6f7b1e2e26	feat: reject writes that are outside the retention period (#6148 ) * feat: reject writes that are outside the retention period * feat: add retention validator into handler stack * chore: Apply suggestions from code review Co-authored-by: Dom <dom@itsallbroken.com> * refactor: address review comments * test: unit tests fot retention validation * chore: address review comments * test: more unit tests and integration tests * refactor: make time inside retention period for emphemeral_mode test * fix: 2 hours Co-authored-by: Dom <dom@itsallbroken.com>	2022-11-17 20:55:58 +00:00
kodiakhq[bot]	1a49fa4864	Merge branch 'main' into cn/test-refactor	2022-11-17 14:01:36 +00:00
Dom Dwyer	5afe58d4d2	refactor: remove unused errors These error states are no longer possible after several refactors, but do not cause a "not used" lint because of macro magic.	2022-11-17 13:53:54 +01:00
Carol (Nichols \|\| Goulding)	d4715a9fde	fix: Simplify tests by using and creating more test helpers The most important part of this is creating the DmlWrites in one spot.	2022-11-16 21:48:43 -05:00
Carol (Nichols \|\| Goulding)	4e2b68a7c5	fix: Simplify test by not actually creating a catalog namespace This isn't actually needed for what this test is testing.	2022-11-16 21:06:44 -05:00
Carol (Nichols \|\| Goulding)	b6286767b0	fix: Validating the schema in ingester tests isn't necessary The router validates schemas; schema validation shouldn't be tested in the ingester	2022-11-16 21:05:51 -05:00
Carol (Nichols \|\| Goulding)	c7b9866483	feat: Have make_write_op take the table name as an argument to be more flexible	2022-11-16 21:05:46 -05:00
Carol (Nichols \|\| Goulding)	d0218fb025	refactor: Simplify tests by using make_write_op helper function	2022-11-16 21:00:10 -05:00
Carol (Nichols \|\| Goulding)	cac241b7ad	refactor: Extract shared test setup for ingester data tests	2022-11-16 21:00:10 -05:00
Carol (Nichols \|\| Goulding)	256ded7e00	fix: Move a NamespaceData test into its module	2022-11-16 21:00:10 -05:00
Marco Neumann	62851afc27	feat: add querier->ingester circuit breaker (#6147 ) * feat: add log ingester memory pressure persist * feat: add querier->ingester circuit breaker Closes #4608. * docs: explain high-level circuit breaker * docs: improve Co-authored-by: Andrew Lamb <alamb@influxdata.com> * test: add additional test assertion * refactor: upgrade info to warning log Co-authored-by: Andrew Lamb <alamb@influxdata.com>	2022-11-16 10:50:33 +00:00
Carol (Nichols \|\| Goulding)	c27d3a22d2	fix: Remove namespace argument from test helper function	2022-11-14 16:46:04 -05:00
Carol (Nichols \|\| Goulding)	3943faf998	fix: Remove namespace from DmlWrite and DmlDelete constructors	2022-11-14 16:46:04 -05:00
Carol (Nichols \|\| Goulding)	f78195f7c7	fix: Remove namespace name field from DmlWrite and DmlDelete But leave the argument in their constructors for now. Not all numbers in tests can be 42, Dom.	2022-11-14 16:46:04 -05:00
Carol (Nichols \|\| Goulding)	c203e8295f	test: Keep track of namespaces by ID in ingester TestContext	2022-11-14 16:46:04 -05:00
kodiakhq[bot]	6c1e9f04ef	Merge branch 'main' into dom/deferred-table-name	2022-11-14 18:22:46 +00:00
Carol (Nichols \|\| Goulding)	fd898cea2a	docs: Correct grammar and update outdated comment	2022-11-14 13:21:55 -05:00
dependabot[bot]	a969754819	chore(deps): Bump chrono from 0.4.22 to 0.4.23 (#6129 ) * chore(deps): Bump chrono from 0.4.22 to 0.4.23 Bumps [chrono](https://github.com/chronotope/chrono) from 0.4.22 to 0.4.23. - [Release notes](https://github.com/chronotope/chrono/releases) - [Changelog](https://github.com/chronotope/chrono/blob/main/CHANGELOG.md) - [Commits](https://github.com/chronotope/chrono/compare/v0.4.22...v0.4.23) --- updated-dependencies: - dependency-name: chrono dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * refactor: chrono future compat Integer->timstamp conversions should not silently panic. Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-14 13:34:09 +00:00
Dom Dwyer	413b7c8f4a	refactor: use table name from catalog Changes the TableData within the ingester to utilise a TableNameResolver to fetch the TableName via the catalog on demand / in the background, instead of using the table name sent over the write. This change causes the ingester to perform a catalog query in the background (or on demand) to resolve the table name. This is a pre-requisite for removing the table name from the write wire format.	2022-11-14 11:32:22 +01:00
Dom Dwyer	0df6c7877c	refactor: indirect DeferredLoad<TableName> init Like the NamespaceNameProvider, this commit adds a TableNameProvider to provide decoupled initialisation of a DeferredLoad<TableName> instead of hard-coding in a catalog instance / query code, and plumbs it into position to be used when initialising a TableName.	2022-11-14 11:32:21 +01:00
Dom Dwyer	8dae6d3994	perf(ingester): address tables by ID only Changes the buffer tree to address TableData by their ID only (removing support for addressing tables by their string names). This removes the double reference book keeping / twin indexes and associated overhead. As part of this change, the TableName is now wrapped in a DeferredLoad in preparation for removal of the names in the DmlOperation wire format. This commit also switches the map of TableData within the NamespaceData (the parent node) to use the ArcMap for faster lookups and DRY exactly-once initialisation.	2022-11-14 11:27:19 +01:00
Dom Dwyer	d8fc9ff258	test: fix testing deadlocks The MemCatalog suffers from deadlocks when attempting to obtain a second ref to RepoCollection: https://github.com/influxdata/influxdb_iox/issues/3859	2022-11-14 10:50:10 +01:00
Dom Dwyer	9e97866b48	refactor: internalise PartitionProvider Removes the need to leak the PartitionProvider outside of the ingester crate. This will allow the PartitionProvider to utilise a DeferredLoad<TableName> without having to make the DeferredLoad and TableName pub.	2022-11-14 10:50:05 +01:00
Marco Neumann	746032af0f	fix: compatibility after hashbrown upgrade - Some methods need explicit types - `hashbrown::HashMap` now takes 32 bytes, not 64	2022-11-11 13:25:39 -05:00
Jake Goulding	cc17e5a54b	refactor: use a workspace dependency for hashbrown	2022-11-11 13:25:39 -05:00
dependabot[bot]	5024523f00	chore(deps): Bump hashbrown from 0.12.3 to 0.13.1 Bumps [hashbrown](https://github.com/rust-lang/hashbrown) from 0.12.3 to 0.13.1. - [Release notes](https://github.com/rust-lang/hashbrown/releases) - [Changelog](https://github.com/rust-lang/hashbrown/blob/master/CHANGELOG.md) - [Commits](https://github.com/rust-lang/hashbrown/compare/v0.12.3...v0.13.1) --- updated-dependencies: - dependency-name: hashbrown dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2022-11-11 13:24:56 -05:00
Dom	2e7a1391f8	Merge branch 'main' into dom/deferred-namespace-name	2022-11-11 17:39:10 +00:00
Dom Dwyer	0f6470c390	refactor: use correct description for retries Use the correct description for namespace query retries.	2022-11-11 18:38:30 +01:00
Dom Dwyer	1e5d3f31af	docs: clearer code comments / docs Remove redundant comments & clarify returns.	2022-11-11 18:38:29 +01:00
Dom	18c86ca44f	refactor: named unused return Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com>	2022-11-11 17:32:42 +00:00
Nga Tran	9c4266c503	refactor: first step to remove unused retention_duration (#6113 ) * refactor: first step to remove unused retention_duration * refactor: remove retenion_duration from update catalog Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-11 15:21:06 +00:00
Dom Dwyer	2521aedb6a	perf(ingester): address namespaces by ID only Removes reliance on string name identifiers for namespaces in the ingester buffer tree, reducing the memory usage of the namespace index and associated overhead. The namespace name is required (though unused by IOx) in the IoxMetadata embedded within a parquet file, and therefore the name is necessary at persist time. For this reason, a DeferredLoad is used to query the catalog (by ID) for the name, at some uniformly random duration of time after initialisation of the NamespaceData, up to a maximum of 1 minute later. This ensures the query remains off the hot ingest path, and the jitter prevents spikes in catalog load during replay/ingester startup. As an additional / easy optimisation, the persist code causes a pre-fetch of the name in the background while compacting, hiding the query latency should it not have already been resolved. In order to keep the the ingester buffer & catalog decoupled / easily testable, this commit uses a provider/factory trait NamespaceNameProvider and corresponding implementation (NamespaceNameResolver) in a similar fashion to the PartitionResolver, allowing easy mocking for tests, and composition for prod code, allowing future optimisations such as pre-fetching / caching the "hot" namespace names at startup. Internal string identifier removal is a pre-requisite for removing string identifiers from the write wire format (#4880).	2022-11-11 14:37:21 +01:00

1 2 3 4 5 ...

542 Commits (eb6abb5d670799e9e1d9a1a4784305aadf5a6914)