influxdb

Commit Graph

Author	SHA1	Message	Date
Dom Dwyer	47214ec9a0	fix: prevent panics in partitioning logic Changes the partitioning logic to be fallible. This prevents an invalid partition template from causing a panic, previously possible through two known code paths: * TagValue formatter referencing a non-tag column * Time formatter using an invalid strftime format string If either occurs, the write attempt is now aborted and an error returned to the user with a HTTP 500 status code. Additionally unexpected partitioner errors now map to a catch-all error instead of panicking.	2023-06-01 17:44:44 +02:00
Carol (Nichols \|\| Goulding)	9c0faa66f0	feat: Set a table partition template explicitly or from the namespace And use the table partition template when partitioning writes to that table.	2023-05-24 10:34:30 -04:00
Carol (Nichols \|\| Goulding)	23c0110b32	feat: Create newtypes for different partition templates So that the different kinds aren't mixed up. Also extracts the logic having to do with which template takes precedence onto the PartitionTemplate type itself.	2023-05-09 14:55:02 +02:00
Carol (Nichols \|\| Goulding)	c1a8408572	fix: Consolidate the default partition template; remove --partition-key-pattern CLI option	2023-05-09 14:54:57 +02:00
Carol (Nichols \|\| Goulding)	2aa8713d1d	fix: Remove partition TemplatePart::Table; partitioning is already per-table	2023-05-09 14:54:57 +02:00
Andrew Lamb	f93baf7693	chore: Update DataFusion and `arrow` / `arrow-flight` / `parquet` to `33.0.0` (#7045 ) * chore: Update DataFusion and arrow/arrow-flight/parquet to 33.0.0 * fix: Update test output * fix: update more test output * fix: Update querier test output * chore: Run cargo hakari tasks * test: fix formatting Fix formatting of batch pretty printing. * test: fix formatting Fix formatting of batch pretty printing. * test: fix formatting for selector tests --------- Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: Dom Dwyer <dom@itsallbroken.com> Co-authored-by: Christopher Wolff <chris.wolff@influxdata.com>	2023-02-22 21:24:20 +00:00
Carol (Nichols \|\| Goulding)	02c3083192	fix: Remove table names from Dml operations	2022-11-18 10:40:38 -05:00
Marco Neumann	746032af0f	fix: compatibility after hashbrown upgrade - Some methods need explicit types - `hashbrown::HashMap` now takes 32 bytes, not 64	2022-11-11 13:25:39 -05:00
Andrew Lamb	4fb2843d05	refactor: Rename `schema::selection::Selection` to `schema::projection::Projection` (#6037 ) * chore: Rename `schema::selection::Selection` to `schema::projection::Projection` * fix: docs Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-02 18:15:04 +00:00
Dom Dwyer	ddd6ab0ba4	refactor(write_buffer): pass IDs in wire format This commit is part of a two-part change in order to add the table & namespace IDs to the write buffer wire format. This commit forms the first half; changing the producer to send the IDs. In this commit the new ID values are never read on the consumer side, ensuring there is no consumer dependency on them. This ensures they remain operational during a rollout, where the consumer may be updated to the latest code dependent on the IDs before the producer is updated to send them. This also ensures we have a window of time where where the consumers can be rolled back after being updated, and still handle replaying messages in Kafka.	2022-11-02 13:28:56 +01:00
Dom Dwyer	61182f506b	refactor: emit PartitionKey from partitioner Changes the partitioning code to emit a PartitionKey, instead of a bare String.	2022-06-15 15:38:02 +01:00
Andrew Lamb	dde3c3922c	refactor: use consistent spelling of serialize (#4717 )	2022-05-27 14:42:59 +00:00
Dom Dwyer	43300878bc	fix(pb): encoding entirely NULL columns (#4272 ) This commit changes the protobuf record batch encoding to skip entirely NULL columns when serialising. This prevents the deserialisation from erroring due to a column type inference failure. Prior to this commit, when the system was presented a record batch such as this: \| time \| A \| B \| \| ---------- \| ---- \| ---- \| \| 1970-01-01 \| 1 \| NULL \| \| 1970-07-05 \| NULL \| 1 \| Which would be partitioned by YMD into two separate partitions: \| time \| A \| B \| \| ---------- \| ---- \| ---- \| \| 1970-01-01 \| 1 \| NULL \| and: \| time \| A \| B \| \| ---------- \| ---- \| ---- \| \| 1970-07-05 \| NULL \| 1 \| Both partitions would contain an entirely NULL column. Both of these partitioned record batches would be successfully encoded, but decoding the partition fails due to the inability to infer a column type from the serialised format which contains no values, which on the wire, looks like: Column { column_name: "B", semantic_type: Field, values: Some( Values { i64_values: [], f64_values: [], u64_values: [], string_values: [], bool_values: [], bytes_values: [], packed_string_values: None, interned_string_values: None, }, ), null_mask: [ 1, ], }, In a column that is not entirely NULL, one of the "Values" fields would be non-empty, and the decoder would use this to infer the type of the column. Because we have chosen to not differentiate between "NULL" and "empty" in our proto encoding, the decoder cannot infer which field within the "Values" struct the column belongs to - all are valid, but empty. This commit prevents this type inference failure by skipping any columns that are entirely NULL during serialisation, preventing the deserialiser from having to process columns with ambiguous types.	2022-05-18 13:33:26 +01:00
Raphael Taylor-Davies	e444fa4cb2	feat: pbdata encode (#2724 ) (#3009 )	2021-11-02 18:31:53 +00:00

14 Commits (196c589ef64f73677eb3e89e60b219f862bde19a)