influxdb

Commit Graph

Author	SHA1	Message	Date
Marko Mikulicic	d26ad8e079	feat: Allow passing service protection limits in create db gRPC call (#7941) * feat: Allow passing service protection limits in create db gRPC call * fix: Move the impl into the catalog namespace trait --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-06-08 14:28:32 +00:00
Carol (Nichols \|\| Goulding)	d0db1194e2	feat: Validate custom partition templates on their creation Make sure custom partition templates have: - At least one part - No more than 8 parts - Only nonempty, valid strftime formats	2023-06-07 11:38:12 -04:00
Dom Dwyer	8e61dc5aef	refactor: remove InvalidStrftime value It's big, it's annoying, it's already available to the user.	2023-06-05 11:31:02 +02:00
Dom Dwyer	f0832818ee	test(router): invalid strftime partition template An integration test asserting that a router returns an error when attempting to partition a write with an invalid strftime partition formatter, rather than panicking.	2023-06-01 17:44:44 +02:00
Dom Dwyer	27bef292a3	feat: unambiguously reversible partition keys This commit changes the format of partition keys when generated with non-default partition key templates ONLY. A prior fixture test is unchanged by this commit, ensuring the default partition keys remain the same. When a custom partition key template is provided, it may specify one or more parts, with the TagValue template causing values extracted from tag columns to appear in the derived partition key. This commit changes the generated partition key in the following ways: * The delimiter of multi-part partition keys; the character used to delimit partition key parts is changed from "/" to "\|" (the pipe character) as it is less likely to occur in user-provided input, reducing the encoding overhead. * The format of the extracted TagValue values (see below). Building on the work of custom partition key overrides, where an immutable partition template is resolved and set at table creation time, the changes in this PR enable the derived partition key to be unambiguously reversed into the set of tag (column_name, column_value) tuples it was generated from for use in query pruning logic. This is implemented by the build_column_values() method in this commit, which requires both the template, and the derived partition key. Prior to this commit, a partition key value extracted from a tag column was in the form "tagname_x" where "x" is the value and "tagname" is the name of the tag column it was extracted from. After this commit, the partition key value is in the form "x"; the column name is removed from the derived string to reduce the catalog storage overhead (a key driver of COGS). In the case of a NULL tag value, the sentinel value "!" is inserted instead of the prior "tagname_" marker. In the case of an empty string tag value (""), the sentinel "^" value is inserted instead of the "tagname_-" marker, ensuring the distinction between an empty value and a not-present tag is preserved. Additionally tag values utilise percent encoding to encode reserved characters (part delimiter, empty sentinel character, % itself) to eliminate deserialisation ambiguity. Examples of how this has changed derived partition keys, for a template of [Time(YYYY-MM-DD), TagValue(region), TagValue(bananas)]: Write: time=1970-01-01,region=west,other=ignored Old: "1970-01-01-region_west-bananas" New: "1970-01-01\|west\|!" Write: time=1970-01-01,other=ignored Old: "1970-01-01-region-bananas" New: "1970-01-01\|!\|!"	2023-05-30 15:58:25 +02:00
Carol (Nichols \|\| Goulding)	de243ad823	test: Verify default template usage	2023-05-25 10:55:51 -04:00
Carol (Nichols \|\| Goulding)	fe07e34714	test: Add router tests that set templates and verify writes	2023-05-25 10:44:57 -04:00
Carol (Nichols \|\| Goulding)	17219d71fe	feat: Use the table service in the router	2023-05-25 10:44:57 -04:00
Carol (Nichols \|\| Goulding)	fb53faaa2f	refactor: Only use Partitioner::default and derive it	2023-05-24 10:34:31 -04:00
Carol (Nichols \|\| Goulding)	9c0faa66f0	feat: Set a table partition template explicitly or from the namespace And use the table partition template when partitioning writes to that table.	2023-05-24 10:34:30 -04:00
Carol (Nichols \|\| Goulding)	604bab9508	fix: Make Table create_or_get be only create	2023-05-24 10:34:30 -04:00
Carol (Nichols \|\| Goulding)	afb3838437	feat: Optionally supply the namespace partition template when creating a namespace	2023-05-24 10:10:34 -04:00
Carol (Nichols \|\| Goulding)	6f92bccc99	feat: Use protobuf for PartitionTemplate in CreateNamespace gRPC API The service implementation doesn't use this field yet.	2023-05-24 10:10:34 -04:00
Dom Dwyer	ec0d1375d4	feat(router): put timestamps in retention error Include the minimum acceptable timestamp (the retention bound) and the observed timestamp that exceeds this bound in the retention enforcement write error response.	2023-05-23 11:50:32 +02:00
Dom Dwyer	82500720e4	refactor(cli): update replication help text The replication flag defines the total number of copies of each write - slightly less confusing than the additional copies it was previously, and matches with the actual code.	2023-05-18 16:01:12 +02:00
Carol (Nichols \|\| Goulding)	57bedb1c2d	refactor: Extract a test helper function to create a basic namespace	2023-05-15 14:20:38 -04:00
Kaya Gökalp	5fe8affb18	refactor: accept NamespaceName with Namespace create (#7774) Co-authored-by: Dom <dom@itsallbroken.com>	2023-05-15 10:03:55 +00:00
Carol (Nichols \|\| Goulding)	23c0110b32	feat: Create newtypes for different partition templates So that the different kinds aren't mixed up. Also extracts the logic having to do with which template takes precedence onto the PartitionTemplate type itself.	2023-05-09 14:55:02 +02:00
Carol (Nichols \|\| Goulding)	70dca8f60b	fix: Pass the NamespaceSchema through the dml write traits	2023-05-09 14:55:02 +02:00
Carol (Nichols \|\| Goulding)	c1a8408572	fix: Consolidate the default partition template; remove --partition-key-pattern CLI option	2023-05-09 14:54:57 +02:00
Carol (Nichols \|\| Goulding)	b0959667d5	fix: Move topic and query pool within iox catalog (#7734) Still insert them into the database and associate them with namespaces, but don't ever query them back out. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-05-04 13:45:56 +00:00
wiedld	1d2003d385	feat(idpe-17265): cst write authorization (#7527) * feat(idpe-17265): authorization should occur as part of the single_tenant specific mod * authz service is accessed only through the single_tenant mod handler * authz service is wrapped in auth mod * move auth integration test into auth mod * push down the authorize() call into the query params parser call, in order to access query params in the extract_token * provide configuration error when authz or single_tenant mode are not co-presented * update authz e2e fixtures * feat(idpe-17265): extract tokens based upon preferred ordering in spec, and write tests to verify behavior. * chore(idpe-17265): update naming conventions for a unifying parser * test: make MockAuthorizer have default, and add a test_delegate_to_authz for CST * chore: record authz duration metric, and include in delegation test. * chore: use authz terminology instead of auth_service * chore: more explicit naming * Revert "chore: record authz duration metric, and include in delegation test." This reverts commit 05c36888ca7247b6953343d759a5185098fae679. * refactor: extract_header_token versus the else condition * refactor: make single_tenant mod and move auth within * chore: make unreachable explicitly panic in the build * test: make token values be const, to be consumed when MockAuthorizer is used * test: use locking for calls_counter in test * fix: add base64 encoding as expected for Basic header * fix: merge conflict resolution. The AuthorizationHeaderExtension is now under the authz::http mod, which is a required feature for router package. * chore: run rustfmt nightly with preferred import handling, on files with modified imports * chore: code cleanup, to have minimal code needed	2023-04-19 15:28:10 +00:00
Carol (Nichols \|\| Goulding)	d60e4d5823	feat: Delete delete parsing code from router (#7573) And return the "deletes unsupported" error sooner. Co-authored-by: Dom <dom@itsallbroken.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-04-18 09:57:02 +00:00
Carol (Nichols \|\| Goulding)	6387a9576a	fix: Remove the write_summary crate and write info service	2023-04-12 11:31:23 -04:00
Fraser Savage	728b7293b9	feat(router): Use read-through namespace cache for NamespaceResolver The NamespaceResolver was using its own very similar look-aside caching to the DML handlers, this commit leverages the read-through cache implementation to deduplicate more code and makes the read through behavioural expectation explicit for namespace autocreation.	2023-04-11 15:38:18 +01:00
Fraser Savage	d590d19e3b	feat(router): Use read-through NamespaceCache with DML handlers This removes the look-aside cache from the retention_validation and schema_validation DML handlers, instead setting up the new NamespaceCache decorator and using that to handle cache misses.	2023-04-11 15:38:17 +01:00
Dom Dwyer	7fed2ba456	feat(router): single tenancy operational mode Adds a single-tenant mode (CST) to the IOx routers. Single-tenancy mode differs in two main ways: * V1 write endpoint is partially supported * V2 write endpoint ignores "org" parameter The "normal" mode is "multi tenant" which is the default operational mode, and all existing behaviour remains unchanged. Single tenant mode can be enabled by specifying INFLUXDB_IOX_SINGLE_TENANCY=true. Request parsing is delegated to two implementations of the WriteParamExtractor trait, one each for CST and MT - the logic of each "mode" is defined within these files and all other functionality is common between the two. This commit also renames some of the error types for clarity (NoSpecified -> NoOrgBucketSpecified, other NotSpecified -> NoQueryParams, etc). Note: single tenant code requires testing	2023-04-10 12:59:20 -07:00
Fraser Savage	b53b8c7d76	refactor(namespace): Flatten service protection limits in Namespace proto definition This commit also cleans up the code formatting for the gRPC handler and simplifies some of the gRPC handler tests for the new update service limit API.	2023-04-05 14:46:30 +01:00
Fraser Savage	3ad4cbe7a9	feat(router): Add grpc integration tests for namespace limit update This adds additional testing coverage for updates to service protection limits to a namespace, and how they affect subsequent writes that exceed the limits.	2023-03-31 17:35:10 +01:00
Dom Dwyer	125fef388c	feat: MVP replication support This commit implements replication for the router's RpcWrite handler. The desired number of replica copies is specified at startup time, and each user write will be fanned-out with the specified replication factor (replicas + 1). A failure to write to any upstreams returns the write error, but a failure to obtain enough ACKs (enough successful writes) after at least 1 ACK will return a "partial write" error - this differentiation is important, as the user's write will be readable after a partial write error has occurred. This currently writes to upstreams serially; this is clearly an opportunity for improvement! A follow-on PR will parallelise writes across the desired number of replicas while maintaining the "at most one ack'd write to one host" invariant. Note that replication is currently hard-coded as disabled.	2023-03-23 17:48:41 +01:00
Fraser Savage	8e9d0e8e74	refactor(router): Leverage MissingNamespaceAction directly for TestContext This minimises addition of unnecessary types.	2023-03-21 15:28:00 +00:00
Fraser Savage	bf35b951ad	refactor(router): Allow test contexts to specify namespace autocreate retention using a Duration	2023-03-21 11:13:55 +00:00
Fraser Savage	392972a3ad	refactor(router): Simplify builder code patterns Use more fluent method names and leverage defaulting to remove the need for an option on the namespace autocreate policy.	2023-03-21 10:26:23 +00:00
Fraser Savage	acdeee62e0	refactor(router): Remove unused catalog field on TestContextBuilder	2023-03-20 17:11:55 +00:00
Fraser Savage	1a2565b45a	refactor(router): Split TestContextBuilder namespace parameters Most of the test cases set up a context with or without namespace autocreation, while few set a retention period. This removes the requirement to pass a retention period when enabling autocreation or vice versa.	2023-03-20 17:04:14 +00:00
Fraser Savage	32a3c81ce9	refactor(router): Use builder pattern to construct TestContext Using the builder pattern enables a more ergonomic workflow for setting optional namespace creation semantics when constructing the TestContext for the router.	2023-03-20 17:03:40 +00:00
Martin Hilton	13657d5bcc	feat(authz): authorization service client and write integration (#7216) * feat(authz): add authorization client. Add a new authz crate to provide the interface for making authorization checks from within IOx. This includes the default client that uses the influxdata.iox.authz.v1 gRPC protocol. This feature is not used by any IOx component yet. * feat: optional authorization on write path Support optionally enabling authorization checks on the /api/v2/write handler. If an authorizer is configured then the handler will attempt to retrieve a token from the request's Authorization header. If no such token exists then a response with a 401 error code is returned. If the token is not valid, or does not have write permission for the requested namespace then a response with a 403 error is returned. * chore: add unit test for authz in write handler Add unit tests that test the correct functioning of the /api/v2/write handler when an Authorizer is configured. * chore(authz): use lazy connection Change the initialization of the authz client to use a lazy connection. This allows the client to be initialised synchronously. * chore: Run cargo hakari tasks * fix(authz): protolint complaints * fix: authz tests * fix: benches and lint * chore: Update clap_blocks/src/authz.rs Co-authored-by: Marko Mikulicic <mkm@influxdata.com> * chore: Update authz/src/lib.rs Co-authored-by: Marko Mikulicic <mkm@influxdata.com> * chore: Update clap_blocks/src/authz.rs Co-authored-by: Marko Mikulicic <mkm@influxdata.com> * chore: review suggestions * chore: review suggestions Apply a number of suggestions from review comments. The main behavioural change is that if the authz service is configured applications will perform a probe request to ensure it can communicate before continuing startup. * chore: Update router/src/server/http.rs Co-authored-by: Dom <dom@itsallbroken.com> --------- Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: Marko Mikulicic <mkm@influxdata.com> Co-authored-by: Dom <dom@itsallbroken.com>	2023-03-17 15:20:14 +00:00
Dom Dwyer	5ca165b76e	docs: fix two typos Comments in test code.	2023-02-15 10:49:39 +01:00
Dom Dwyer	61fb92b85c	feat(router): soft-delete RPC handler This implements the RPC "delete_namespace" handler, allowing a namespace to be soft-deleted. Adds unit coverage for all handlers & e2e test coverage for the new handler (the rest were already covered). The tests also highlight the caching issue documented here: https://github.com/influxdata/influxdb_iox/issues/6175	2023-02-15 10:49:38 +01:00
Dom Dwyer	4a0149a418	test(router): retention change after cache reset Assert that the new retention period value is used once a router is restarted and the cache converged.	2023-02-13 14:40:14 +01:00
Dom Dwyer	ce12daa96e	refactor(e2e): simulate router restart Allow a router to be "restarted" within an e2e test.	2023-02-13 14:40:11 +01:00
Carol (Nichols \|\| Goulding)	0e10561ce0	fix: Port router unit tests to the RPC write path (#6956) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-02-13 12:28:06 +00:00
Dom Dwyer	2d46a364dc	feat: namespace soft-delete support This commit adds initial support for "soft" namespace deletion, where the actual records & data remain, but are no longer queryable / writeable. Soft deletion is eventually consistent - users can expect to continue writing to and reading from a bucket after issuing a soft delete call, until the various components either restart, or have their caches flushed. The components treat soft-deleted namespaces differently: * router: ignore soft deleted namespaces * ingester: accept soft deleted namespaces * compactor: accept soft deleted namespaces * querier: ignore soft deleted namespaces * various gRPC services: ignore soft deleted namespaces This ensures that the ingester & compactor do not see rows "vanishing" from the database, and continue to make forward progress. Writes for the deleted namespace that are buffered in the ingester will be persisted as normal, allowing us to support "un-delete" operations where the system is restored to the state at which the delete was issued (rather than losing the buffered data). Follow-on work is required to ensure GC drops the orphaned parquet files after the configured GC time, and optimisations such as not compacting parquet from soft-deleted namespaces seems like a trivial win.	2023-02-13 12:01:35 +01:00
Carol (Nichols \|\| Goulding)	30fea67701	fix: Move variables within format strings. Thanks clippy! Changes made automatically using `cargo clippy --fix`.	2023-02-03 13:06:17 -05:00
Dom Dwyer	7f363b55df	test(router): e2e namespace retention coverage Assert the correct handling of 0 and negative retention periods when interacting with the namespace create & update gRPC handlers.	2023-02-01 11:49:53 +01:00
Dom Dwyer	0aa5469ac6	test(e2e): explicit namespace creation Adds an end-to-end test of the router's gRPC NamespaceService covering creation and reading of new namespaces.	2023-01-26 17:32:12 +01:00
Dom Dwyer	ac8fa293cb	refactor(test): TestContext::write_lp() helper Adds a helper method to construct the HTTP write request.	2023-01-26 17:32:10 +01:00
Dom Dwyer	6f1869f9dc	test(router): initialise gRPC delegate in e2e Initialise the "rpc mode" gRPC handlers in the router e2e TestContext.	2023-01-26 17:32:10 +01:00
Dom Dwyer	3efc42baac	refactor(test): dedicated e2e TestContext module Moves the router's TestContext to its own file/module.	2023-01-26 17:32:10 +01:00
Luke Bond	551bb0ef6a	feat: allow enabling/disabling ns autocreation in router (#6346) * feat: allow enabling/disabling ns autocreation in router * fix: missed an import for something behind router2 compile flag	2022-12-07 16:12:00 +00:00
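
The partition key format described in commit 27bef292a3 can be illustrated with a small Python sketch. This is not the IOx implementation (which is Rust); the function names below are hypothetical, and escaping "!" in addition to the three reserved characters the commit message lists (the part delimiter, the empty sentinel, and % itself) is an assumption made here so that a literal "!" tag value cannot be mistaken for the NULL sentinel.

```python
from urllib.parse import unquote

DELIMITER = "|"        # separates partition key parts
NULL_SENTINEL = "!"    # tag column absent from the write
EMPTY_SENTINEL = "^"   # tag present with an empty string value
# Assumption: "!" is escaped too, so a literal "!" value is never read as NULL.
RESERVED = {DELIMITER, EMPTY_SENTINEL, NULL_SENTINEL, "%"}


def encode_tag_value(value):
    """Encode one tag value as a partition key part."""
    if value is None:
        return NULL_SENTINEL
    if value == "":
        return EMPTY_SENTINEL
    # Percent-encode only the reserved (ASCII) characters.
    return "".join(f"%{ord(c):02X}" if c in RESERVED else c for c in value)


def build_partition_key(day, tag_columns, row):
    """Join the time part and one part per templated tag column."""
    parts = [day] + [encode_tag_value(row.get(col)) for col in tag_columns]
    return DELIMITER.join(parts)


def decode_tag_values(key, tag_columns):
    """Reverse a key into (column_name, column_value) tuples, skipping NULLs."""
    encoded = key.split(DELIMITER)[1:]  # drop the leading time part
    pairs = []
    for col, part in zip(tag_columns, encoded):
        if part == NULL_SENTINEL:
            continue
        pairs.append((col, "" if part == EMPTY_SENTINEL else unquote(part)))
    return pairs
```

For the template [Time(YYYY-MM-DD), TagValue(region), TagValue(bananas)], calling build_partition_key("1970-01-01", ["region", "bananas"], {"region": "west", "other": "ignored"}) yields "1970-01-01|west|!", matching the "New" example in the commit message. As in the commit's description of build_column_values(), decoding needs both the template (here, the list of tag columns) and the derived key.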


78 Commits (fd8a89deea311a071e535eb192d9c7125705aeb7)
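
The error semantics described in commit 125fef388c (MVP replication support) can be sketched as follows. This is a hedged Python illustration rather than the router's Rust code: the names replicate and PartialWrite are hypothetical, each upstream is modelled as a plain callable, and copies is assumed to mean the total number of acknowledged copies required, as the later help-text commit 82500720e4 describes the flag.

```python
class PartialWrite(Exception):
    """Some, but not enough, upstreams acknowledged the write."""


def replicate(write, upstreams, copies):
    """Send `write` to upstreams one at a time until `copies` ACKs arrive."""
    acks = 0
    last_error = None
    for send in upstreams:  # serial; each upstream is tried at most once
        try:
            send(write)
        except Exception as err:
            last_error = err
            continue
        acks += 1
        if acks == copies:
            return acks
    if acks == 0:
        # Nothing was written anywhere: surface the underlying write error.
        raise last_error or RuntimeError("no upstreams available")
    # At least one ACK: the data is readable, but under-replicated.
    raise PartialWrite(f"{acks} of {copies} copies acknowledged")
```

Keeping the two failure cases distinct mirrors the commit's point that a write remains readable after a partial write error, so a caller may want to handle it differently from a total failure.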