influxdb

Commit Graph

Author	SHA1	Message	Date
Dom	a7770f0f7a	Merge branch 'main' into dom/reduce-write-timeout	2023-01-30 09:59:37 +00:00
dependabot[bot]	ed7d02a225	chore(deps): Bump tokio from 1.24.2 to 1.25.0 Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.24.2 to 1.25.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/commits/tokio-1.25.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2023-01-30 01:57:27 +00:00
Dom Dwyer	353b1ad575	feat: configurable RPC write request timeout Allows the user to configure the timeout used for a single RPC write request, and changes the default to a more sensible value (30 -> 3 seconds).	2023-01-27 14:53:48 +01:00
Dom Dwyer	6797eab5fc	feat(router): configurable partition key Allows the partition key to be set at runtime, though it's probably best no one does so for now.	2023-01-27 14:26:18 +01:00
Dom Dwyer	3a9b5a4d29	fix: bind NamespaceService to gRPC server I forgot to bind the service!	2023-01-26 17:32:11 +01:00
Dom Dwyer	1a7679bcee	refactor: expose underlying gRPC implementations Changes the gRPC delegate to return the underlying service (type erased) implementations instead of the RPC service wrappers.	2023-01-26 17:32:11 +01:00
Dom Dwyer	c66f4a3d92	fix(router): restore NamespaceService This was removed in the RPC variant of the router - no idea why, we definitely should have it!	2023-01-26 15:10:22 +01:00
Dom Dwyer	9132343dac	feat(metrics): export RPC upstream health state Adds a metric with a per-ingester label recording the current health state of the upstream ingester from the perspective of the router instance. Also logs periodically when one or more ingesters are offline.	2023-01-24 19:27:15 +01:00
Dom Dwyer	87b553fe9d	feat: WARN logs w/ endpoint for unhealthy upstream Changes the DEBUG log event to a WARN now that it includes the endpoint to which the event applies.	2023-01-24 19:19:31 +01:00
Dom Dwyer	085de40127	feat: lazy-connect to ingester gRPC endpoints Lazily establish connections in the background, instead of using tonic's connect_lazy(). connect_lazy() causes error handling to take a different path in tonic compared to "normal" connections, and this stops reconnections from being performed when the endpoint goes away (likely a bug). It also means the first few write requests won't have to wait while the connection is dialed, which brings down the P99 as a nice side-effect.	2023-01-24 16:44:55 +01:00
Dom Dwyer	c6d6c50fbf	perf(router): circuit break ingester connections Adds on-path health checking / recording using the CircuitBreaker construct, stopping requests to unhealthy upstreams (minus the probe requests) until they recover. This removes the horrible gRPC balancer hack I added to get us deployed ASAP, and should eliminate latency spikes and elevated error responses observed during deployments as a result.	2023-01-24 15:30:01 +01:00
Dom Dwyer	107006c801	revert: influxdata/dom/rpc-balancer This reverts commit `a3805dbccf`, reversing changes made to `bcb1232c5d`.	2023-01-24 14:47:05 +01:00
Dom Dwyer	7596dc0826	perf(router): circuit break ingester connections Adds on-path health checking / recording using the CircuitBreaker construct, stopping requests to unhealthy upstreams (minus the probe requests) until they recover. This removes the horrible gRPC balancer hack I added to get us deployed ASAP, and should eliminate latency spikes and elevated error responses observed during deployments as a result.	2023-01-24 12:38:27 +01:00
Dom Dwyer	0d111c4672	refactor: delegate frontend shutdown to backend Prior to this commit, the (happy path) shutdown sequence of an IOx process was hard coded to: 1. Stop gRPC & HTTP servers 2. Stop backend server (i.e. ingester2) After this commit, the execution of step 1 is delegated to the handler for step 2; the server implementation (router / ingester / querier / etc) now chooses when to shut down the RPC & HTTP servers. This allows the server shutdown delegate to correctly sequence the shutdown of all components of the IOx server. This allows ingester2 to correctly sequence the shutdown of the query RPC server w.r.t the graceful stop & persist, ensuring queries continue to be serviced.	2023-01-12 14:59:50 +01:00
dependabot[bot]	b49cc2e35e	chore(deps): Bump tokio from 1.24.0 to 1.24.1 (#6545 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.24.0 to 1.24.1. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.24.0...tokio-1.24.1) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-01-10 09:48:44 +00:00
Dom Dwyer	a5a26f5efb	fix(router2): lazily connect to ingesters Allow the routers to start up without requiring full availability of all downstream ingesters. Previously a single unavailable ingester prevented the routers from starting up. This has downsides: * Lazily initialising a connection will cause the first writes to have higher latency as the connection is established. * The routers MAY come up in a state that will never work (i.e. bad ingester addresses) * Using the opaque gRPC load balancing mechanism restricts the visibility into which nodes are up/down (hindering useful log messages) and prevents us from implementing more advanced circuit breaking / probing logic / load-balancing strategies. This change is a quick fix - it leaves the round-robin handler in place, load-balancing over a single tonic Channel, which internally load-balances. This will need cleaning up.	2023-01-05 11:25:35 +01:00
Carol (Nichols || Goulding)	dfa70269cb	fix: Make multiple ingester addresses in the router work (#6440 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-19 16:27:57 +00:00
dependabot[bot]	299f0e99f9	chore(deps): Bump thiserror from 1.0.37 to 1.0.38 Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.37 to 1.0.38. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.37...1.0.38) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2022-12-19 10:33:32 +00:00
Carol (Nichols || Goulding)	7d216ba1fd	feat: Error if you run the wrong command with the wrong env var set Connects to #6402.	2022-12-15 14:06:59 -05:00
Carol (Nichols || Goulding)	aec98015d7	fix: Remove the rpc_write feature flag and use INFLUXDB_IOX_MODE env var instead And standardize on ingester2 and router2 for consistency. Connects to #6402.	2022-12-15 14:06:59 -05:00
Carol (Nichols || Goulding)	619a2d0856	fix: Remove conflicting arguments from the RouterRpcWriteConfig (#6355 ) These were added in https://github.com/influxdata/influxdb_iox/pull/6346. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-08 20:21:37 +00:00
Luke Bond	551bb0ef6a	feat: allow enabling/disabling ns autocreation in router (#6346 ) * feat: allow enabling/disabling ns autocreation in router * fix: missed an import for something behind router2 compile flag	2022-12-07 16:12:00 +00:00
dependabot[bot]	1d38d400f0	chore(deps): Bump object_store from 0.5.1 to 0.5.2 (#6339 ) * chore(deps): Bump object_store from 0.5.1 to 0.5.2 Bumps [object_store](https://github.com/apache/arrow-rs) from 0.5.1 to 0.5.2. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG-old.md) - [Commits](https://github.com/apache/arrow-rs/compare/object_store_0.5.1...object_store_0.5.2) --- updated-dependencies: - dependency-name: object_store dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * chore: Run cargo hakari tasks Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-06 07:53:54 +00:00
Carol (Nichols || Goulding)	a51848b361	fix: Use client_util GrpcConnection instead of tonic Channel (#6320 ) * fix: Use client_util GrpcConnection instead of tonic Channel * refactor: include server addr in error Co-authored-by: Dom <dom@itsallbroken.com>	2022-12-02 15:57:42 +00:00
Carol (Nichols || Goulding)	fef3bc02cd	refactor: Extract a clap block for the router RPC write path To be able to share it with the coming all-in-one2 command	2022-12-01 11:39:30 -05:00
Carol (Nichols || Goulding)	c008219692	feat: Add a feature flag to switch to the router RPC write path (#6247 ) * feat: Add a feature flag to switch to the router RPC write path Fixes #6242. * refactor: Remove a weird arc clone/rename that's not needed I'm sure this was needed at some point, but it doesn't make much sense. I wasn't going to change this, but I'm now trying to minimize the differences between this function and the write path init function, so make this one better too. * fix: Add the namespace autocreation to the RPC write path too The topic/query pool don't really apply to this case, but use them anyway to be able to use the existing catalog methods. Also add a bunch of comments pointing out where the RPC write path initializer and the old router's initializer are the same and where they're different, so that perhaps it'll be easier to keep them in sync while they both exist. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-01 11:05:39 +00:00
Luke Bond	d07658282c	feat: add router config parameter for retention (#6278 ) * chore: remove unused/moved ns_autocreation dml handler * feat(router): expose new ns retention as config * fix: forgot to set default value for router retention arg * chore: make new namespace retention param an option	2022-11-30 13:14:39 +00:00
dependabot[bot]	a9db7581cd	chore(deps): Bump tokio from 1.21.2 to 1.22.0 (#6183 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.21.2 to 1.22.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.21.2...tokio-1.22.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-21 10:21:24 +00:00
Nga Tran	49a9565240	feat: gRPC that creates namespace (#6103 ) * feat: create namespace API call in router Co-authored-by: Nga Tran <nga-tran@live.com> * chore: treat retention as ns except in CLI * fix: overflow in nanosecond calc * fix: retention test after changing it from hours to ns * chore: comment clarification in cli; better response type for error in ns API * fix: correct some rebase mistakes * chore: merge namespace create & create_with_retention; renamed ns create test helper fn & const * fix: ns autocreation test was wrong after rebase * fix: mem catalog has default 1hr retention, accidently removed in rebase * chore: remove mem catalogs default 1hr retention; make it settable in sets & router Co-authored-by: Luke Bond <luke.n.bond@gmail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-18 13:02:12 +00:00
Nga Tran	6f7b1e2e26	feat: reject writes that are outside the retention period (#6148 ) * feat: reject writes that are outside the retention period * feat: add retention validator into handler stack * chore: Apply suggestions from code review Co-authored-by: Dom <dom@itsallbroken.com> * refactor: address review comments * test: unit tests fot retention validation * chore: address review comments * test: more unit tests and integration tests * refactor: make time inside retention period for emphemeral_mode test * fix: 2 hours Co-authored-by: Dom <dom@itsallbroken.com>	2022-11-17 20:55:58 +00:00
Luke Bond	9365d933f1	chore: router namespace api (#6151 ) * chore: move ns api from querier to router * chore: add explanatory comment in querier about moved namespace API * fix: add namespace service to router * fix: querier returns unimplemented error for ns retention, not panic * chore: reuse namespace -> proto in router ns api * chore: grpc namespace - consume ns to avoid clone Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-16 15:25:49 +00:00
kodiakhq[bot]	05d7d1495e	Merge branch 'main' into dependabot/cargo/hashbrown-0.13.1	2022-11-11 21:26:40 +00:00
Carol (Nichols || Goulding)	bdff4e8848	fix: Consistently use 'namespace' instead of 'database' in comments and other internal text	2022-11-11 15:46:04 -05:00
Jake Goulding	cc17e5a54b	refactor: use a workspace dependency for hashbrown	2022-11-11 13:25:39 -05:00
dependabot[bot]	5024523f00	chore(deps): Bump hashbrown from 0.12.3 to 0.13.1 Bumps [hashbrown](https://github.com/rust-lang/hashbrown) from 0.12.3 to 0.13.1. - [Release notes](https://github.com/rust-lang/hashbrown/releases) - [Changelog](https://github.com/rust-lang/hashbrown/blob/master/CHANGELOG.md) - [Commits](https://github.com/rust-lang/hashbrown/compare/v0.12.3...v0.13.1) --- updated-dependencies: - dependency-name: hashbrown dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2022-11-11 13:24:56 -05:00
Nga Tran	9c4266c503	refactor: first step to remove unused retention_duration (#6113 ) * refactor: first step to remove unused retention_duration * refactor: remove retenion_duration from update catalog Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-11 15:21:06 +00:00
Andrew Lamb	694443bb87	chore: Rename DatabaseName to NamespaceName (#6100 ) * chore: Rename DatabaseName to NamespaceName * fix: fmt * chore: Updates some more references * chore: more cleanup * fix: adjust test Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-10 14:13:59 +00:00
Carol (Nichols || Goulding)	fa46951524	fix: Remove needless deref done by auto deref, thanks Clippy!	2022-11-09 10:54:18 -05:00
Dom Dwyer	2331aaac94	perf(router): remove routing indirection Removes an unnecessary Arc pointer indirection for routing to the HTTP handler delegate.	2022-11-02 11:21:33 +01:00
Dom Dwyer	d166de931d	refactor: resolve namespace before DML dispatch This commit introduces a new (composable) trait; a NamespaceResolver is an abstraction responsible for taking a string namespace from a user request, and mapping to its catalog ID. This allows the NamespaceId to be injected through the DmlHandler chain in addition to the namespace name. As part of this change, the NamespaceAutocreation layer was changed from an implementer of the DmlHandler trait, to a NamespaceResolver as it is a more appropriate abstraction for the functionality it provides.	2022-10-28 13:41:05 +02:00
Carol (Nichols || Goulding)	2e83e04eab	feat: Use workspace package metadata to reduce differences and repetition	2022-10-24 13:04:09 -04:00
dependabot[bot]	933493fab3	chore(deps): Bump object_store from 0.5.0 to 0.5.1 Bumps [object_store](https://github.com/apache/arrow-rs) from 0.5.0 to 0.5.1. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG-old.md) - [Commits](https://github.com/apache/arrow-rs/compare/object_store_0.5.0...object_store_0.5.1) --- updated-dependencies: - dependency-name: object_store dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2022-10-11 01:19:10 +00:00
Andrew Lamb	04ae0aee80	refactor: Remove protobuf based write service (#5750 ) * refactor: Remove grpc WriteService * fix: update end to end test * fix: Update generated_types/protos/influxdata/pbdata/v1/influxdb_pb_data_protocol.proto Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-30 10:55:03 +00:00
dependabot[bot]	227dde1dfc	chore(deps): Bump thiserror from 1.0.36 to 1.0.37 (#5753 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.36 to 1.0.37. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.36...1.0.37) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-29 10:37:14 +00:00
dependabot[bot]	b1740f45d6	chore(deps): Bump thiserror from 1.0.35 to 1.0.36 (#5737 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.35 to 1.0.36. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.35...1.0.36) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-09-26 14:44:36 +00:00
dependabot[bot]	b4a25fdb0e	chore(deps): Bump thiserror from 1.0.34 to 1.0.35 (#5629 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.34 to 1.0.35. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.34...1.0.35) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-14 12:54:12 +00:00
Andrew Lamb	f86d3e31da	chore: Update datafusion + object_store (#5619 ) * chore: Update datafusion pin * chore: update object_store to 0.5.0 * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-13 12:34:54 +00:00
dependabot[bot]	786ce75e26	chore(deps): Bump tokio-util from 0.7.3 to 0.7.4 (#5596 ) Bumps [tokio-util](https://github.com/tokio-rs/tokio) from 0.7.3 to 0.7.4. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-util-0.7.3...tokio-util-0.7.4) --- updated-dependencies: - dependency-name: tokio-util dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-09-09 07:40:16 +00:00
Dom	a57748d741	Merge branch 'main' into dom/ingester-shard-connect	2022-09-07 12:25:40 +01:00
Dom Dwyer	d1ca29c029	fix(ingester): connect to assigned Kafka partition During initialisation, the ingester connects to the Kafka brokers - this involves per-partition leadership discovery & connection establishment. These connections are then retained for the lifetime of the process. Prior to this commit, the ingester would establish a connection to all partition leaders for a given topic. After this commit, the ingester connects to only the partition leaders it is going to consume from (for those shards that it is assigned.)	2022-09-07 13:21:06 +02:00

78 Commits (363ce1629c8b1f2ea5f5e4dba9840bcccefbceff)