Changes the proptest for the router's partitioner handler to use a
timestamp generation strategy that more accurately models the
distribution of timestamps in real-world requests.
Asserts that the partitioning code within the router (which drives the
low-level partitioning logic) generates partitions whose rows have
timestamps that belong in those partitions.
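For illustration, a strategy of roughly this shape biases generated
timestamps towards recent values (the ranges, weights, and test body
here are made up, not the ones in the actual proptest):

    use proptest::prelude::*;

    fn arbitrary_timestamp() -> impl Strategy<Value = i64> {
        prop_oneof![
            // Most rows carry timestamps near "now" (nanoseconds since epoch).
            9 => 1_600_000_000_000_000_000_i64..1_700_000_000_000_000_000_i64,
            // A small fraction are arbitrary outliers.
            1 => any::<i64>(),
        ]
    }

    proptest! {
        #[test]
        fn rows_land_in_matching_partitions(ts in arbitrary_timestamp()) {
            // Partition a row carrying `ts` and assert it lands in a
            // partition whose time range contains `ts` (partitioning elided).
            let _ = ts;
        }
    }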
* feat: Allow passing service protection limits in create db gRPC call
* fix: Move the impl into the catalog namespace trait
- Create data_types::partition_template::ValidationError
- Make creation of NamespacePartitionTemplateOverride and
TablePartitionTemplateOverride fallible
- Move SerializationWrapper into a module to make its inner field
private to force creation through one fallible constructor; this is
where the validation logic will go to be shared among all uses of
partition templates
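As a rough sketch of the pattern (the types and the validation rule here
are illustrative, not the real IOx definitions):

    mod wrapper {
        #[derive(Debug)]
        pub enum ValidationError {
            EmptyTemplate,
        }

        pub struct SerializationWrapper {
            // Private: code outside this module cannot construct the
            // wrapper directly and bypass validation.
            parts: Vec<String>,
        }

        impl SerializationWrapper {
            /// The single, shared place where validation happens.
            pub fn try_new(parts: Vec<String>) -> Result<Self, ValidationError> {
                if parts.is_empty() {
                    return Err(ValidationError::EmptyTemplate);
                }
                Ok(Self { parts })
            }

            pub fn parts(&self) -> &[String] {
                &self.parts
            }
        }
    }

    fn main() {
        // Construction is only possible through the fallible constructor.
        assert!(wrapper::SerializationWrapper::try_new(vec!["region".into()]).is_ok());
        assert!(wrapper::SerializationWrapper::try_new(vec![]).is_err());
    }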
Adds a benchmark that exercises the router's partitioning DmlHandler
implementation against a set of three files (very small, small, medium)
with 4 different partitioning schemes:
* Single tag, which occurs in all rows
* Single tag, which does not occur in any row
* Default strftime formatter (YYYY-MM-DD)
* Long and complicated strftime formatter
This covers the entire partitioning overhead - building the formatters,
evaluating each row, grouping the values into per-partition buckets, and
returning to the caller, where it normally would be passed to the next
handler in the pipeline.
Note that only one template part is evaluated in each case - this
measures the overhead of each type of formatter. In reality, we'd expect
partitioning with custom schemes to utilise more than one part,
increasing the cost of partitioning proportionally. This is a
lower-bound measurement!
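For reference, the benchmark follows the usual criterion shape - roughly
like the sketch below, where partition_lp and the inline line protocol
stand in for the real partitioning DmlHandler and fixture files:

    use criterion::{criterion_group, criterion_main, Criterion};

    // Stand-in for the partitioning DmlHandler; the real benchmark builds
    // the formatters, evaluates every row, and groups rows into
    // per-partition buckets.
    fn partition_lp(lp: &str, template: &str) -> usize {
        lp.lines().count() + template.len()
    }

    fn bench_partitioner(c: &mut Criterion) {
        let lp = "m,env=prod v=1i 1\nm,env=dev v=2i 2\n";
        for template in ["tag_value(env)", "time(%Y-%m-%d)"] {
            c.bench_function(&format!("partition/{template}"), |b| {
                b.iter(|| partition_lp(lp, template))
            });
        }
    }

    criterion_group!(benches, bench_partitioner);
    criterion_main!(benches);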
An integration test asserting that a router returns an error when
attempting to partition a write with an invalid strftime partition
formatter, rather than panicking.
Changes the partitioning logic to be fallible. This prevents an invalid
partition template from causing a panic, previously possible through two
known code paths:
* TagValue formatter referencing a non-tag column
* Time formatter using an invalid strftime format string
If either occurs, the write attempt is now aborted and an error returned
to the user with an HTTP 500 status code.
Additionally, unexpected partitioner errors now map to a catch-all error
instead of panicking.
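Roughly, the failure modes now look like this (a sketch; the variant
names are illustrative, not the actual error type):

    // Sketch of the fallible partitioner's error surface.
    #[derive(Debug)]
    enum PartitionError {
        /// A TagValue formatter referenced a column that is not a tag.
        NotATagColumn { column: String },
        /// A Time formatter was given an invalid strftime format string.
        InvalidStrftime { format: String },
        /// Catch-all for unexpected partitioner errors (previously a panic).
        Unexpected(String),
    }

    /// Every variant aborts the write and maps to an HTTP 500 for the caller.
    fn status_code(_: &PartitionError) -> u16 {
        500
    }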
This commit changes the format of partition keys when generated with
non-default partition key templates ONLY. A prior fixture test is
unchanged by this commit, ensuring the default partition keys remain
the same.
When a custom partition key template is provided, it may specify one or
more parts, with the TagValue template causing values extracted from tag
columns to appear in the derived partition key.
This commit changes the generated partition key in the following ways:
* The delimiter of multi-part partition keys; the character used to
delimit partition key parts is changed from "/" to "|" (the pipe
character) as it is less likely to occur in user-provided input,
reducing the encoding overhead.
* The format of the extracted TagValue values (see below).
Building on the work of custom partition key overrides, where an
immutable partition template is resolved and set at table creation time,
the changes in this PR enable the derived partition key to be
unambiguously reversed into the set of tag (column_name, column_value)
tuples it was generated from for use in query pruning logic. This is
implemented by the build_column_values() method in this commit, which
requires both the template, and the derived partition key.
Prior to this commit, a partition key value extracted from a tag column
was in the form "tagname_x" where "x" is the value and "tagname" is the
name of the tag column it was extracted from. After this commit, the
partition key value is in the form "x"; the column name is removed from
the derived string to reduce the catalog storage overhead (a key driver
of COGS). In the case of a NULL tag value, the sentinel value "!" is
inserted instead of the prior "tagname_" marker. In the case of an empty
string tag value (""), the sentinel "^" value is inserted instead of the
"tagname_-" marker, ensuring the distinction between an empty value and
a not-present tag is preserved.
Additionally, tag values utilise percent encoding to encode reserved
characters (the part delimiter, the empty-value sentinel, and "%" itself),
eliminating deserialisation ambiguity.
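For example, with the percent-encoding crate, something along these lines
covers the reserved set named above (the exact AsciiSet in the real code
may differ):

    use percent_encoding::{utf8_percent_encode, AsciiSet, CONTROLS};

    // Reserve the part delimiter, the empty-value sentinel, and '%' itself.
    const RESERVED: &AsciiSet = &CONTROLS.add(b'|').add(b'^').add(b'%');

    fn encode_tag_value(v: &str) -> String {
        utf8_percent_encode(v, RESERVED).to_string()
    }

    fn main() {
        // A literal pipe in a tag value can no longer be confused with the
        // partition key part delimiter.
        assert_eq!(encode_tag_value("us|west"), "us%7Cwest");
    }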
Examples of how this has changed derived partition keys, for a template
of [Time(YYYY-MM-DD), TagValue(region), TagValue(bananas)]:
Write: time=1970-01-01,region=west,other=ignored
Old: "1970-01-01-region_west-bananas"
New: "1970-01-01|west|!"
Write: time=1970-01-01,other=ignored
Old: "1970-01-01-region-bananas"
New: "1970-01-01|!|!"
Move the import into the submodule itself, rather than re-exporting it
at the crate level.
This will make it possible to link to the specific module/logic.
This commit fixes loads of crates (47!) that had unused dependencies, or
mis-configured dependencies (test deps as normal deps).
I added the "unused_crate_dependencies" lint to all crates to help prevent
this mess from growing again!
https://doc.rust-lang.org/beta/nightly-rustc/rustc_lint_defs/builtin/static.UNUSED_CRATE_DEPENDENCIES.html
This has the minor downside of false positives when specifying
dev-dependencies for test/bench binaries - these are files in tests/ or
benches/ (not normal tests). This commit includes a workaround,
importing them in lib.rs (gated by a feature flag). I think the
trade-off of better dependency management is worth it!
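The workaround looks roughly like this (the lint name is real; the
dev-dependency and feature name are just examples):

    // lib.rs
    #![warn(unused_crate_dependencies)]

    // Dev-dependencies used only by files under benches/ are invisible to
    // the lint, so reference them here behind a feature enabled for those
    // targets.
    #[cfg(feature = "benches")]
    use criterion as _;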
The old tests used partial error string matching rather than comparing
the whole error message! So when I added more to the error message, the
fixture tests didn't fail.
This changes the tests to match the full string, and validate the
timestamps are included.
Include the minimum acceptable timestamp (the retention bound) and the
observed timestamp that exceeds this bound in the retention enforcement
write error response.
The replication flag now defines the total number of copies of each
write - slightly less confusing than the number of additional copies it
defined previously, and matches the actual code.
Changes the UpstreamSnapshot to be suitable for concurrent use. This
type contains the core logic to enable a caller to uphold the
responsibility of ensuring replicated writes land on distinct ingesters
in the presence of concurrent replication.
The clients within the snapshot are returned to at most one concurrent
caller at a time, by tracking the state of each client as an FSM:
      ┌────────────────┐
   ┌─▶│   Available    │
   │  └────────────────┘
   │          │
 drop       next()
   │          │
   │          ▼
   │  ┌────────────────┐
   └──│    Yielded     │
      └────────────────┘
              │
           remove
              │
              ▼
      ┌────────────────┐
      │      Used      │
      └────────────────┘
Once a client has been yielded it will not be yielded again until it is
dropped (transitioning the FSM from "yielded" to "available" again,
returning it to the candidate pool of clients) or removed (transitioning
to "used", permanently preventing it from being yielded to another
caller).
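In sketch form (illustrative names, not the actual UpstreamSnapshot
internals):

    // Per-client state machine, as described above.
    #[derive(Debug, PartialEq)]
    enum ClientState {
        Available, // in the candidate pool, may be yielded
        Yielded,   // held by exactly one concurrent caller
        Used,      // consumed by a successful write, never yielded again
    }

    impl ClientState {
        /// next(): hand the client to a caller iff it is currently available.
        fn try_yield(&mut self) -> bool {
            if *self == Self::Available {
                *self = Self::Yielded;
                true
            } else {
                false
            }
        }

        /// drop: the caller released the client without using it.
        fn release(&mut self) {
            if *self == Self::Yielded {
                *self = Self::Available;
            }
        }

        /// remove: the caller used the client; permanently retire it.
        fn remove(&mut self) {
            *self = Self::Used;
        }
    }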
Changes the UpstreamSnapshot to return owned clients, instead of
references to those clients.
This will allow the snapshot to have a 'static lifetime, suitable for
use across tasks.
Because the number of candidate upstreams is checked to meet or exceed
the number of desired data copies before starting the write loop, and
because the parallelism of the write loop matches the number of desired
data copies, it's not possible for any thread to observe an empty
snapshot.
This commit removes the unreachable error condition for clarity.
Adds a property-based test of the RPC write handler's replication logic,
ensuring:
1. If the number of healthy upstreams is 0, NoHealthyUpstreams is
returned and no requests are attempted.
2. Given N healthy upstreams (> 0) and a replication factor of R:
if N < R, "not enough replicas" is returned and no requests are
attempted.
3. Upstreams that return an error are retried until the entire
write succeeds or times out.
4. Writes are replicated to R distinct upstreams successfully, or
an error is returned.
5. Once an upstream write is ack'd as successful, it is never
requested again.
6. An upstream reporting as unhealthy at the start of the write is
never requested (excluding probe requests).
These properties describe a mixture of invariants (don't replicate your
two copies of a write to the same ingester) and expected behaviour of
the replication logic (optimisations like "don't try writes when you
already know they'll fail").
This passes for the single-threaded replication logic used at the time
of this commit, and will be used to validate correctness of a concurrent
replication implementation - a concurrent approach should uphold these
properties the same way a single-threaded implementation does.
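Properties (1) and (2) boil down to a pure precondition check of roughly
this shape (hypothetical names, checked before any RPC is attempted):

    #[derive(Debug, PartialEq)]
    enum WriteError {
        NoHealthyUpstreams,
        NotEnoughReplicas,
    }

    // Runs before any request is issued, so both error cases return
    // without attempting a single upstream write.
    fn check_preconditions(healthy: usize, replicas: usize) -> Result<(), WriteError> {
        if healthy == 0 {
            return Err(WriteError::NoHealthyUpstreams);
        }
        if healthy < replicas {
            return Err(WriteError::NotEnoughReplicas);
        }
        Ok(())
    }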
Renames NoUpstreams -> NoHealthyUpstreams; the old name was confusing
because we also have "not enough replicas", which could also be read as
"no upstreams". The new name has a slightly clearer meaning.