This commit adds initial support for "soft" namespace deletion, where
the actual records & data remain, but are no longer queryable /
writeable.
Soft deletion is eventually consistent: users can expect to continue
writing to and reading from a bucket after issuing a soft delete call,
until the various components either restart or have their caches
flushed.
The components treat soft-deleted namespaces differently:
* router: ignore soft deleted namespaces
* ingester: accept soft deleted namespaces
* compactor: accept soft deleted namespaces
* querier: ignore soft deleted namespaces
* various gRPC services: ignore soft deleted namespaces
This ensures that the ingester & compactor do not see rows "vanishing"
from the database, and continue to make forward progress.
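In practice, the split above amounts to each component choosing whether
soft-deleted namespaces are visible when it lists them from the catalog.
A minimal sketch of that policy - the `SoftDeletedRows` selector and the
`deleted_at` marker column are illustrative, not the exact catalog API:

```rust
/// Hypothetical visibility policy for namespace queries against the
/// catalog; the real interface may differ.
#[derive(Debug, Clone, Copy)]
enum SoftDeletedRows {
    /// Router / querier behaviour: hide soft-deleted namespaces.
    ExcludeDeleted,
    /// Ingester / compactor behaviour: see every row, so buffered
    /// data keeps making forward progress.
    AllRows,
}

struct Namespace {
    name: String,
    /// Set when a soft delete is issued; records & data remain.
    deleted_at: Option<i64>,
}

fn visible(namespaces: &[Namespace], policy: SoftDeletedRows) -> Vec<&Namespace> {
    namespaces
        .iter()
        .filter(|ns| match policy {
            SoftDeletedRows::ExcludeDeleted => ns.deleted_at.is_none(),
            SoftDeletedRows::AllRows => true,
        })
        .collect()
}
```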
Writes for the deleted namespace that are buffered in the ingester will
be persisted as normal, allowing us to support "un-delete" operations
where the system is restored to the state at which the delete was
issued (rather than losing the buffered data).
Follow-on work is required to ensure GC drops the orphaned parquet files
after the configured GC time, and optimisations such as not compacting
parquet from soft-deleted namespaces seem like trivial wins.
* feat: function to get partition candidates from partition table
* chore: cleanup
* fix: make new_file_at the same value as created_at
* chore: cleanup
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
Updating the sort key is not commutative and MUST be serialised. The
correctness of the current catalog interface relies on the caller
serialising updates globally, something it cannot reasonably assert in a
distributed system.
This change of the catalog interface pushes this responsibility to the
catalog itself where it can be effectively enforced, and allows a caller
to detect parallel updates to the sort key.
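A sketch of what the pushed-down interface could look like: a
compare-and-swap call that fails with the currently-stored value when
another writer wins the race, so the caller can re-read, merge, and
retry. The names here are illustrative, not the exact catalog trait:

```rust
/// Returned when a sort key compare-and-swap loses the race.
#[derive(Debug)]
enum CasFailure<T> {
    /// The catalog held a different value than `observed`; it is
    /// returned so the caller can merge it and retry.
    ValueMismatch(T),
    QueryError(String),
}

trait PartitionRepo {
    /// Atomically update the sort key iff it still equals `observed`,
    /// serialising concurrent updates inside the catalog itself.
    fn cas_sort_key(
        &mut self,
        partition_id: i64,
        observed: Option<Vec<String>>,
        new: Vec<String>,
    ) -> Result<(), CasFailure<Vec<String>>>;
}
```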
* feat: create namespace API call in router
Co-authored-by: Nga Tran <nga-tran@live.com>
* chore: treat retention as ns except in CLI
* fix: overflow in nanosecond calc (sketched after this list)
* fix: retention test after changing it from hours to ns
* chore: comment clarification in cli; better response type for error in ns API
* fix: correct some rebase mistakes
* chore: merge namespace create & create_with_retention; renamed ns create test helper fn & const
* fix: ns autocreation test was wrong after rebase
* fix: mem catalog has default 1hr retention, accidentally removed in rebase
* chore: remove mem catalog's default 1hr retention; make it settable in sets & router
Co-authored-by: Luke Bond <luke.n.bond@gmail.com>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
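On the nanosecond overflow fix noted above: converting an hours-based
retention period to nanoseconds with a bare multiplication can silently
wrap an i64. A minimal sketch of the checked conversion, assuming
retention is stored as i64 nanoseconds:

```rust
/// Convert a retention period in hours to nanoseconds, returning
/// None instead of wrapping on i64 overflow.
fn retention_hours_to_ns(hours: i64) -> Option<i64> {
    const NS_PER_HOUR: i64 = 60 * 60 * 1_000_000_000;
    hours.checked_mul(NS_PER_HOUR)
}
```

i64::MAX nanoseconds is only about 292 years, so large-but-plausible
hour values must be rejected rather than wrapped.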
* feat: flag partition for delete (see the sketch after this list)
* fix: compare the right date and time
* chore: Run cargo hakari tasks
* chore: cleanup
* fix: typos
* chore: rust style tidy ups in catalog
Co-authored-by: CircleCI[bot] <circleci@influxdata.com>
Co-authored-by: Luke Bond <luke.n.bond@gmail.com>
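One plausible reading of the two fixes above, sketched with assumed
names (`to_delete` and the cutoff semantics are not taken from the real
schema): a flagged partition keeps its data, and removal decisions must
compare the flag's own timestamp against the cutoff.

```rust
/// Illustrative shape only: a partition flagged for delete keeps its
/// data, and `to_delete` records when the flag was set.
struct Partition {
    created_at: i64,
    to_delete: Option<i64>,
}

/// Remove only partitions whose flag time (not `created_at`) is older
/// than the cutoff - comparing the wrong date and time here is the
/// kind of bug the fix above addresses.
fn removable(p: &Partition, cutoff_ns: i64) -> bool {
    matches!(p.to_delete, Some(flagged_at) if flagged_at < cutoff_ns)
}
```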
The checks for whether a column already exists with a different type
relied on the ordering of the input matching the ordering of the
columns returned from inserting the columns in Postgres.
Rather than trying to match the new ordering that is required to avoid
Postgres deadlocks, switch from a Vec to a HashMap and look up the
column type by name.
This also reduces some allocations that weren't really needed.
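A minimal sketch of the name-keyed lookup, with simplified column types
(the real IOx types differ):

```rust
use std::collections::HashMap;

#[derive(Debug, Clone, Copy, PartialEq)]
enum ColumnType {
    Tag,
    I64,
    Time,
}

/// Validate requested columns against what the catalog returned,
/// keyed by name so the row order Postgres happens to produce (now
/// reordered to avoid deadlocks) no longer matters.
fn check_column_types(
    wanted: &[(&str, ColumnType)],
    existing: &HashMap<String, ColumnType>,
) -> Result<(), String> {
    for (name, want) in wanted {
        if let Some(got) = existing.get(*name) {
            if got != want {
                return Err(format!(
                    "column {name} already exists as {got:?}, not {want:?}"
                ));
            }
        }
    }
    Ok(())
}
```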
* fix: Avoid some allocations by collecting instead of inserting into a vec
* refactor: Encode that adding columns is for one table at a time
* test: Add another test of column limits
* test: Add below/above limit tests for create_or_get_many
* fix: Explicitly DO NOT check column limits when inserting many columns
* feat: Cache the max_columns_per_table on the NamespaceSchema
* feat: Add a function to validate column limits in-memory (sketched after this list)
* fix: Provide more useful information when over column limits
* fix: Swap types to remove intermediate allocation
* docs: Explain the interactions of the cache and the column limits
* test: Actually set up test that showcases column limit race condition
* fix: Allow writing to existing columns even if table is over column limit
Co-authored-by: Dom <dom@itsallbroken.com>
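A sketch of the in-memory validation referenced above (simplified
shapes; `max_columns_per_table` cached on the namespace schema as
described). Note it only counts columns that would be newly added, so
writes touching only existing columns still pass on an over-limit table:

```rust
use std::collections::{HashMap, HashSet};

struct TableSchema {
    /// Column name -> column metadata (elided).
    columns: HashMap<String, ()>,
}

struct NamespaceSchema {
    /// Cached from the catalog so most writes never touch the database.
    max_columns_per_table: usize,
    tables: HashMap<String, TableSchema>,
}

/// Reject a write up-front if it would push a table over the cached
/// column limit; writes that only touch existing columns always pass.
fn validate_column_limits(
    schema: &NamespaceSchema,
    table: &str,
    write_columns: &HashSet<String>,
) -> Result<(), String> {
    let existing = schema.tables.get(table).map(|t| &t.columns);
    let new = write_columns
        .iter()
        .filter(|c| existing.map_or(true, |e| !e.contains_key(*c)))
        .count();
    if new > 0 {
        let total = existing.map_or(0, |e| e.len()) + new;
        if total > schema.max_columns_per_table {
            return Err(format!(
                "table {table} would have {total} columns, over the limit of {}",
                schema.max_columns_per_table
            ));
        }
    }
    Ok(())
}
```

As the bullets above note, validating against a cache admits a race
between concurrent writers; the sketch makes no attempt to close it.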
Changed io_shared to iox-shared in the following files: update_catalog.rs, partition.rs, lib.rs (in the service_grpc_catalog folder), and lib.rs (in the service_grpc_object_store folder).
* feat: upsert partition & update sort key for each day in bulk ingest
feat: import schema now supports earliest/latest time merging
chore: tests & tidying up for bulk ingest catalog update
* fix: always sort time last in PK in import schema update catalog (see the sketch at the end of this log)
* chore: additional test for computing sort key in bulk ingest
* chore: bulk import catalog update gets sequencer from sharder service
chore: import update schema tests refactor using sharder svc mock
* chore: dead code fix
* chore: import schema sequencer lookup test
* chore: clarifying comment in import schema catalog update
* chore: struct for overrides of import schema conflicts
* chore: import schema override shouldn't support tags
* feat: import schema merge can take an override schema
* fix: schema override in test had superfluous tag
* chore: test for batch schema merge with override in import schema
* feat: import schema merge now takes override schema
* feat: initial commit of schema merge bulk import tool
* chore: use observability deps instead of tracing-*
* chore: removed debug printlns
* chore: fix feature decls for cloud providers for import crate
* chore: use println instead of info in import (no need for a simple CLI)
* chore: tidy whitespace
* chore: remove unused dep in import
* chore: Run cargo hakari tasks
* chore: removed unimplemented import job subcommand
* chore: clarifying comment about custom serialisation code
* chore: clarifying comment about schema merge code in import
* chore: fix wrong comment in import command
* chore: bump object store dep to get bugfix
* chore: rename import schema struct for clarity
* chore: run `cargo hakari generate`
Co-authored-by: CircleCI[bot] <circleci@influxdata.com>
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
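Lastly, to make the "always sort time last" invariant from the
bulk-ingest fixes concrete, a hypothetical sketch (ordering tags by
ascending cardinality is an assumed heuristic; the fix itself only
guarantees that time is the final sort key column):

```rust
/// Build a partition sort key: tag columns first (here ordered by
/// ascending cardinality, an assumed heuristic), with the time
/// column unconditionally appended last.
fn compute_sort_key(mut tags: Vec<(String, u64)>) -> Vec<String> {
    tags.sort_by_key(|&(_, cardinality)| cardinality);
    let mut key: Vec<String> = tags.into_iter().map(|(name, _)| name).collect();
    key.push("time".to_string());
    key
}
```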