influxdb

Commit Graph

Author	SHA1	Message	Date
Dom Dwyer	8ebea0df37	feat: table/namespace IDs in write protocol Expose the Table and Namespace IDs encoded within the serialised DML write (added in #6036). This makes the IDs available for use in the consumers, ending the transition period. This commit DOES NOT remove the strings sent over the wire.	2022-11-08 16:57:53 +01:00
Dom Dwyer	6fa48731aa	feat: NamespaceId in DmlDelete Changes the DmlDelete to contain the NamespaceId for which it should be applied, propagating this value over the wire. Like the existing IDs within the DmlWrite, these values are marked unsafe to use due to avoid the consumers utilising them accidentally during deployment. Unlike DmlWrite, the DmlDelete is completely unused, so this is less of an issue.	2022-11-03 13:57:40 +01:00
Andrew Lamb	4fb2843d05	refactor: Rename `schema::selection::Selection` to `schema::projection::Projection` (#6037 ) * chore: Rename `schema::selection::Selection` to `schema::projection::Projection` * fix: docs Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-02 18:15:04 +00:00
Dom Dwyer	ddd6ab0ba4	refactor(write_buffer): pass IDs in wire format This commit is part of a two-part change in order to add the table & namespace IDs to the write buffer wire format. This commit forms the first half; changing the producer to send the IDs. In this commit the new ID values are never read on the consumer side, ensuring there is no consumer dependency on them. This ensures they remain operational during a rollout, where the consumer may be updated to the latest code dependent on the IDs before the producer is updated to send them. This also ensures we have a window of time where where the consumers can be rolled back after being updated, and still handle replaying messages in Kafka.	2022-11-02 13:28:56 +01:00
Dom Dwyer	72a358e52f	refactor(dml): PartitionKey required for writes Changes the DmlWrite type to require a PartitionKey be specified, instead of accepting an Option. This requirement was already in place - the write buffer upheld an invariant that all writes contained a partition key value (was not "None") or it panicked at runtime when attempting to enqueue the write. It is now possible to encode this invariant in the type system, which is what this change does.	2022-10-28 10:57:30 +02:00
Carol (Nichols \|\| Goulding)	2e83e04eab	feat: Use workspace package metadata to reduce differences and repetition	2022-10-24 13:04:09 -04:00
Dom Dwyer	cd4087e00d	style: add no todo!() or dbg!() lints Some crates had theme, some not - lets be consistent and have the compiler spot dbg!() and todo!() macro calls - they should never be in prod code!	2022-09-29 13:10:07 +02:00
Dom Dwyer	6d6fc9a08b	test: reduce timestamp precision for comparisons Reduce the precision of timestamps in tests before comparing the DML metadata objects. This allows tests to accept different timestamp precisions, such as when ops pass "through" Kafka vs. files, etc.	2022-08-22 12:58:03 +02:00
Carol (Nichols \|\| Goulding)	549a267e3c	fix: Use Self instead of unnecessary structure name repetition As now caught by clippy. https://rust-lang.github.io/rust-clippy/master/index.html#use_self	2022-08-11 15:21:02 -04:00
Andrew Lamb	8f5210ea3e	test: add test for "duration since production" in kafka `write_buffer` implementation (#5043 ) * test: add test for timestamps in kafka write buffer * refactor: move timestamp batching test to generic tests Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-07 10:27:27 +00:00
Dom Dwyer	c1f7154031	feat: propagate partition key through kafka Changes the kafka message wire format to include the partition key for serialised DML writes on the wire. After this commit, the kafka messages will contain the partition key for each op, but this information will go unused in the ingester - this enables us to roll out the producer side, before making the value's presence necessary on the consumer side. A follow-up PR will change the ingester to utilise this embedded partition key. This has the unfortunate side effect of making the partition key part of the public gRPC write API: https://github.com/influxdata/influxdb_iox/issues/4866	2022-06-20 13:42:51 +01:00
Dom Dwyer	4df2964566	refactor: store PartitionKey in DmlWrite Carry the PartitionKey in the DmlWrite, allowing the batch to be associated with a specific partition key.	2022-06-15 15:48:54 +01:00
Jake Goulding	e07bcd40c2	refactor: Remove unused dependencies These were found by iterating over all of the dependencies of each Cargo.toml, then grepping that crate for the dependency's name. If it didn't show up, I attempted to remove it. I left a few dependencies that this process flagged: * generated_types - `pbjson`,`serde`. Apparently used by the generated code. * grpc-router-test-gen - `prost`. Apparently used by the generated code. * influxdb_iox - `heappy`. Doesn't appear used, but is behind enough feature flags that I don't care to reason about and it's already optional. - `tikv_jemalloc_sys`. Appears to be setting a feature flag of an indirect dependency. * iox_gitops_adapter - `k8s_openapi`. Appears to be setting a feature flag of an indirect dependency.	2022-05-06 15:57:58 -04:00
Carol (Nichols \|\| Goulding)	068096e7e1	fix: Rename data_types2 to data_types	2022-05-06 14:45:39 -04:00
Carol (Nichols \|\| Goulding)	0541c6e40f	fix: Remove data_types crate where it's no longer used	2022-05-06 14:45:39 -04:00
Carol (Nichols \|\| Goulding)	b76c1e1ad6	fix: Remove now-unused DML sharding and related types	2022-05-06 14:45:38 -04:00
Carol (Nichols \|\| Goulding)	2ef44f2024	fix: Move timestamp types to data_types2	2022-05-06 14:45:38 -04:00
Carol (Nichols \|\| Goulding)	236edb9181	fix: Move Sequence type to data_types2	2022-05-06 14:45:38 -04:00
Carol (Nichols \|\| Goulding)	d2671355c3	fix: Move partition metadata types to data_types2	2022-05-06 14:45:37 -04:00
Carol (Nichols \|\| Goulding)	1ea4a40b1f	fix: Move NonEmptyString to data_types2	2022-05-06 14:45:37 -04:00
Carol (Nichols \|\| Goulding)	3ab0788a94	fix: Move DeletePredicate types to data_types2	2022-05-06 14:45:37 -04:00
dependabot[bot]	912d73a6f3	chore(deps): Bump ordered-float from 2.10.0 to 3.0.0 (#4502 ) Bumps [ordered-float](https://github.com/reem/rust-ordered-float) from 2.10.0 to 3.0.0. - [Release notes](https://github.com/reem/rust-ordered-float/releases) - [Commits](https://github.com/reem/rust-ordered-float/compare/v2.10.0...v3.0.0) --- updated-dependencies: - dependency-name: ordered-float dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-02 14:27:25 +00:00
二手掉包工程师	4b47d723b1	refactor: Rename time to iox_time (#4416 ) Signed-off-by: hi-rustin <rustin.liu@gmail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-26 00:19:59 +00:00
Marco Neumann	33851be3a5	chore: upgrade Rust to 1.59 (#3875 ) Mostly a few new clippy crates around `flat_map`, `and_then`, and "underscore locks" (!!!): https://rust-lang.github.io/rust-clippy/master/index.html#let_underscore_lock Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-28 15:14:19 +00:00
Paul Dix	ce46bbaada	feat: wire up the write buffer to the ingester process (#3533 ) This adds the scaffolding for the ingester server to consume data from Kafka. This ingests data in an in memory structure while creating records in the catalog for any partitions that don't yet exist. I've removed catalog_update.rs in ingester for now. That was mostly a placeholder and will be going in a combination of handler.rs and data.rs on my next PR which will have some primitive lifecycle wired up. There's one ugly bit here where the DML write is cloned because it's getting borrowed to output spans and metrics. I'll need to follow up with a refactor to make it so that the DML write's tables can be consumed without it gumming up the metrics stuff. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-03 11:47:28 +00:00
Andrew Lamb	2062267d0f	chore: Update hashbrown (#3551 ) * chore: Update hashbrown * fix: hakari Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-01-27 15:34:10 +00:00
Paul Dix	16d584b2ff	feat: Add db_name/namespace to DmlWrite and DmlDelete (#3531 ) * feat: Add db_name/namespace to DmlWrite and DmlDelete This is required for the new ingester to be able to work with the write buffer. The protobuf that gets serialized over Kafka already includes the database name, it just wasn't getting carried through to the marshaled Dml operation. * fix: database != namespace, propagation through write buffer Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-01-27 14:12:20 +00:00
Andrew Lamb	9c19cd6cc4	fix: clamp start/end of TimestampRange to min/max valid timestamp values (#3487 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-01-20 16:08:00 +00:00
Marco Neumann	168afb63ad	feat: add `size` methods to DML-related types This will be helpful when we want to batch DML operations in memory (e.g. when using RSKafka). This also ensures that `MBChunk` accounts for the column names that are stored within `MutableBatch`.	2022-01-18 13:52:31 +01:00
Dom	80b12d417c	feat: abstract DML handler Defines the DmlHandler trait responsible for processing a request in some abstract way, decoupling the HTTP/gRPC request handlers from the underlying routing logic.	2022-01-17 11:56:04 +00:00
Carol (Nichols \|\| Goulding)	30c4da7ca7	fix: Be consistent with regex version range specification	2021-12-06 09:37:15 -05:00
Marco Neumann	332485d2c9	fix: use correct `bytes_read` in `DmlMeta` - for file-based write buffers: Use headers + payload - for Kafka-based write buffers: Use the estimation that we also use for other metrics - as a side effect we can now just use `PartialEq` for more types Fixes #3186.	2021-12-01 15:57:21 +01:00
Marco Neumann	ad65687cba	refactor: consolidate sharding implementation Move sharding implementation from router to `dml` (where the types can be consumed w/o cloning the data) and only support the new sharding configs. The old sharding configs will be removed soon.	2021-11-26 13:00:36 +01:00
Marco Neumann	0401f453de	feat: impl `PartialEq` for `SpanContext`	2021-11-23 15:39:53 +01:00
Marco Neumann	9706e4b480	fix: workaround #3186	2021-11-23 15:39:53 +01:00
Marco Neumann	07d1b9e56b	test: extend DML test utils to support deletes	2021-11-23 15:39:53 +01:00
Carol (Nichols \|\| Goulding)	9fd4a560f5	feat: Results of running cargo hakari manage-deps	2021-11-19 09:21:57 -05:00
Raphael Taylor-Davies	8155747735	feat: add write buffer delete encoding (#2731 ) (#3127 ) * feat: add write buffer delete encoding (#2731) * chore: fix doc * chore: review feedback * chore: review feedback * chore: fmt * chore: review feedback	2021-11-17 16:12:19 +00:00
Raphael Taylor-Davies	553e412226	refactor: DMLOperation write path (#2731 ) (#3121 ) * refactor: DMLOperation write path (#2731) * chore: fmt * chore: review feedback	2021-11-16 12:42:19 +00:00
Raphael Taylor-Davies	3d091208af	refactor: move delete predicate into data_types (#2731 ) (#3094 ) * refactor: move delete predicate into dml (#2731) * refactor: move DeletePredicate to data_types * chore: fix doc Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-11-15 10:28:58 +00:00
Raphael Taylor-Davies	a6d83a3026	feat: WriteBufferReader use DmlOperation (#2731 ) (#3096 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-11-15 10:19:54 +00:00
Raphael Taylor-Davies	6f268f8260	refactor: extract DML types (#2731 ) (#3084 ) * refactor: extract DML types (#2731) * chore: fmt	2021-11-11 12:34:07 +00:00

42 Commits (5398a1b986ceb42d76a62dff43c7b005159774f1)