influxdb

Commit Graph

Author	SHA1	Message	Date
Carol (Nichols || Goulding)	3a2544a7eb	feat: Define a new gRPC service for ingester persist	2023-01-12 11:03:12 -05:00
Carol (Nichols || Goulding)	adc5c2bf06	feat: Add a gRPC API to the catalog service to get Parquet files by namespace Tests that write line protocol (that may contain writes to multiple tables) need to be able to see when new Parquet files are saved.	2023-01-11 11:41:09 -05:00
Paul Dix	828992c9c5	feat: Ingest replica skeleton (#6529) * feat: Update replication.proto * Remove the PartitionId in the replicate request as a single replicate request can have the data for many partitions. * Add namespace_id and table_id to persist complete request to make data easier to lookup in buffer. * feat: Initial ingest_replica skeleton A bunch of copy pasta here from ingester2, but this takes out a ton of stuff that isn't used in replicas. Also lays the groundwork for the simpler buffer structure to keep the data and a basic cache for catalog information that will be required. * feat: update replication.proto GetPartitionBufferResponse * chore: PR cleanup * chore: PR cleanup	2023-01-09 16:53:49 +00:00
Dom Dwyer	91680854ce	feat(replication): define replication RPC API Defines the rough outline of an replication RPC API. More details/docs to follow.	2023-01-04 17:37:32 +01:00
Luke Bond	3659be59c7	feat: delete namespace api mem impl chore: tests for delete namespace; use unique ptn names in tests	2022-12-16 10:23:50 +00:00
Paul Dix	d9c72bb93f	feat: optimize wal with batching (#6399) * feat: optimize wal with batching Simplified the wal writer so that it batches up write operations. Currently it waits 10ms between fsync calls. We can pull this out to a config variable later if we want, but I think this is good enough for now. Also updated the reader to be a more simple blocking reader without the extra tasks and channels as that wasn't really getting us anything that I know of. * chore: cleanup wal code for PR feedback	2022-12-14 16:07:20 +00:00
kodiakhq[bot]	66c610f7b1	Merge branch 'main' into cn/ingester-persisted-file-count	2022-12-14 14:58:31 +00:00
Andrew Lamb	47cd6821e1	feat: Document IOx Flight API and add convenience methods (#6392) * feat: Document IOx Flight API and add convenience methods * fix: InfluxQL handling Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-13 17:32:37 +00:00
Carol (Nichols || Goulding)	1c7f322a4e	feat: Keep track of and report number of Parquet files persisted Per partition and starting over each time the ingester restarts. Fixes #6334.	2022-12-12 11:45:00 -05:00
Carol (Nichols || Goulding)	2fd2d05ef6	feat: Identify each run of an ingester with a Uuid And send that UUID in the Flight response for queries to that ingester run. Fixes #6333.	2022-12-08 17:22:52 -05:00
Carol (Nichols || Goulding)	edd606aa3b	feat: Serialize using protobuf instead of json	2022-11-23 17:07:49 -05:00
Stuart Carnie	2306c383f3	feat: Introduce InfluxQL to Flight (#6166) * feat: Introduce InfluxQL to Flight All InfluxQL queries will fail with an error * chore: Temper protobuf lint * chore: Finalize flight.proto changes; fix tests * chore: Add tests for InfluxQL planner * chore: Update docs * chore: Update docs * chore: Rename back to original * chore: Use .into() rather than cast * chore: Use function rather than field * chore: Improved InfluxQL planner name * chore: Restore `impl Into<String>` argument * chore: Add a comment that Go clients are unable to execute InfluxQL * chore: Add a test for the `--lang` argument and InfluxQL	2022-11-23 00:33:49 +00:00
Luke Bond	7c813c170a	feat: reintroduce compactor first file in partition exception (#6176) * feat: compactor ignores max file count for first file chore: typo in comment in compactor * feat: restore special first file in partition compaction logic; add limit * fix: calculation in compaction max file count chore: clippy Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-18 15:58:59 +00:00
Carol (Nichols || Goulding)	02c3083192	fix: Remove table names from Dml operations	2022-11-18 10:40:38 -05:00
Nga Tran	49a9565240	feat: gRPC that creates namespace (#6103) * feat: create namespace API call in router Co-authored-by: Nga Tran <nga-tran@live.com> * chore: treat retention as ns except in CLI * fix: overflow in nanosecond calc * fix: retention test after changing it from hours to ns * chore: comment clarification in cli; better response type for error in ns API * fix: correct some rebase mistakes * chore: merge namespace create & create_with_retention; renamed ns create test helper fn & const * fix: ns autocreation test was wrong after rebase * fix: mem catalog has default 1hr retention, accidently removed in rebase * chore: remove mem catalogs default 1hr retention; make it settable in sets & router Co-authored-by: Luke Bond <luke.n.bond@gmail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-18 13:02:12 +00:00
Christopher M. Wolff	6d3dfa781e	chore: marshal InfluxDbError into status details (#6161) * chore: marshal InfluxDbError into status details * chore: address feedback and CI issues Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-17 19:51:01 +00:00
Luke Bond	9365d933f1	chore: router namespace api (#6151) * chore: move ns api from querier to router * chore: add explanatory comment in querier about moved namespace API * fix: add namespace service to router * fix: querier returns unimplemented error for ns retention, not panic * chore: reuse namespace -> proto in router ns api * chore: grpc namespace - consume ns to avoid clone Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-16 15:25:49 +00:00
Dom	685a0858dc	Merge branch 'main' into dom/rpc-write	2022-11-15 14:38:15 +00:00
Dom Dwyer	dcc7b10bcf	feat: define router -> ingester write protocol Specify a gRPC service and request/response message formats to push writes directly from a router to an ingester.	2022-11-15 15:00:38 +01:00
Carol (Nichols || Goulding)	8dbdab8754	fix: Remove db_name field from DeletePayload Doesn't seem to be used anywhere.	2022-11-14 16:46:04 -05:00
Carol (Nichols || Goulding)	4d4cdebbda	fix: Remove database_name field from DatabaseBatch It was only being used in one error message.	2022-11-14 16:46:03 -05:00
Carol (Nichols || Goulding)	bdff4e8848	fix: Consistently use 'namespace' instead of 'database' in comments and other internal text	2022-11-11 15:46:04 -05:00
Dom	d9c97795fc	feat: use IDs in ingester query API (#6093) * refactor: NS+table ID (instead of name) in querier<>ingester * feat(ingester): use IDs for query API Changes the ingester to utilise the ID fields (instead of names) sent over the query wire message wrapped within the Flight API. BREAKING: this changes the "query-ingester" CLI command arguments which now expects the namespace & table IDs, rather than their names. * refactor(ingester): add more query logging context Updates the log messages during query execution to include more context fields. * style: remove unused import Co-authored-by: Marco Neumann <marco@crepererum.net>	2022-11-09 11:25:13 +00:00
kodiakhq[bot]	df5ec013d1	Merge branch 'main' into dom/dml-delete-namespace-id	2022-11-07 09:07:38 +00:00
Nga Tran	9356f2a1b9	feat: grpc for updating namespace retention period (#6041) * refactor: make namespace folder for all namesapce's commands * feat: WIP for add command to set retention period * feat: more on updating retention period * feat: grpc for update namespace retention period * test: end to end test fpr namespace retention * fix: lint proto * chore: cleanup * chore: kick CI run again * fix: command hierachy * chore: fix comments	2022-11-04 20:58:11 +00:00
Dom Dwyer	6fa48731aa	feat: NamespaceId in DmlDelete Changes the DmlDelete to contain the NamespaceId for which it should be applied, propagating this value over the wire. Like the existing IDs within the DmlWrite, these values are marked unsafe to use due to avoid the consumers utilising them accidentally during deployment. Unlike DmlWrite, the DmlDelete is completely unused, so this is less of an issue.	2022-11-03 13:57:40 +01:00
Dom Dwyer	ddd6ab0ba4	refactor(write_buffer): pass IDs in wire format This commit is part of a two-part change in order to add the table & namespace IDs to the write buffer wire format. This commit forms the first half; changing the producer to send the IDs. In this commit the new ID values are never read on the consumer side, ensuring there is no consumer dependency on them. This ensures they remain operational during a rollout, where the consumer may be updated to the latest code dependent on the IDs before the producer is updated to send them. This also ensures we have a window of time where where the consumers can be rolled back after being updated, and still handle replaying messages in Kafka.	2022-11-02 13:28:56 +01:00
Carol (Nichols || Goulding)	ace497d47c	fix: Rename database to namespace in the commands I just added	2022-10-27 10:40:39 -04:00
Carol (Nichols || Goulding)	de2ae6f557	feat: MVP of remote store get-table command	2022-10-26 13:50:03 -04:00
Jake Goulding	fa7fe2e9cf	feat: Add a gRPC endpoint to delete a skipped compaction Also add a CLI usage of it for convenience	2022-10-21 15:12:20 -04:00
Carol (Nichols || Goulding)	b8a9fe4222	docs: Explain the meanings of skipped compaction's fields	2022-10-21 13:40:38 -04:00
Carol (Nichols || Goulding)	0132a33946	fix: Rename SkippedCompactionService to CompactionService To make a good place for other compactor-related gRPC actions in the future.	2022-10-21 13:40:37 -04:00
Carol (Nichols || Goulding)	ba25300b01	feat: Create compactor service to list skipped compactions	2022-10-21 13:40:31 -04:00
Dom Dwyer	c4f542bbe2	refactor(ingester): remove tombstone support This commit removes tombstone support from the ingester, and deletes associated code/helpers/tests. This commit does NOT remove tombstone support from any other service, but MAY include removing overlapping test coverage. This also removes the tombstone support from the Ingester -> Querier RPC response message. This has the nice side effect of removing a whole lot of thread spawning in the ingester tests for the Executor, speeding everything up!	2022-10-11 13:10:04 +02:00
Andrew Lamb	04ae0aee80	refactor: Remove protobuf based write service (#5750) * refactor: Remove grpc WriteService * fix: update end to end test * fix: Update generated_types/protos/influxdata/pbdata/v1/influxdb_pb_data_protocol.proto Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-30 10:55:03 +00:00
YIXIAO SHI	52ae60bf2e	chore: fix comment typo (#5551) Co-authored-by: Dom <dom@itsallbroken.com>	2022-09-07 08:49:29 +00:00
Carol (Nichols || Goulding)	1b49ad25f7	refactor: Rename KafkaTopicId to TopicId	2022-08-29 14:27:02 -04:00
Carol (Nichols || Goulding)	74c9529062	fix: Rename KafkaPartition to ShardIndex	2022-08-29 14:07:18 -04:00
Carol (Nichols || Goulding)	240946d8f5	fix: Deprecate proto sequencer_id fields; add shard_id fields	2022-08-29 14:06:44 -04:00
Dom Dwyer	f11af90c46	refactor(proto): simplify RPC messages & types Removes the input oneof - a shard caller MUST always provide a table/namespace, and MAY provide an optional payload (which in the future will enable sharding using column valuess/etc). As there is currently no payload-based sharding, this simplifies the RPC message. Changes the returned types to better reflect the types we use internally - this should avoid type juggling for both server & client.	2022-08-24 11:39:59 +02:00
Dom Dwyer	57bbe6b216	feat: sharder API definition This commit adds a gRPC endpoint for callers to map (table, namespace) tuples to Sequencer IDs, using the logic internal to the router. Reference: https://github.com/influxdata/influxdb_iox/pull/5447#pullrequestreview-1080574538	2022-08-23 13:21:59 +02:00
Andrew Lamb	16ddc5efc6	chore: Update datafusion / arrow/parquet/arrow-flight and prost/tonic ecosystem (#5360) * chore: Update datafusion and arrow * chore: Update Cargo.lock * chore: update to Decimal128 * chore: Update tonic/prost/pbjson/etc * chore: Run cargo hakari tasks * fix: doctest in generated types Co-authored-by: CircleCI[bot] <circleci@influxdata.com>	2022-08-09 17:30:44 +00:00
Marko Mikulicic	5a0af921c8	chore: Roll forward: Sync ReadWindowAggregate API: TagKeyMetaNames (#5186) This reverts commit 5d02c755687ef041f5f45dbfc3e633a833284edb.	2022-07-22 10:44:06 +00:00
Marko Mikulicic	07cdb99192	chore: Revert "Sync ReadWindowAggregate API: TagKeyMetaNames" (#5184) We're noticing a possible regression (OOMs) in our testing cluster that roughly correlates with this.	2022-07-22 09:26:42 +00:00
Nga Tran	69cb3f2b19	refactor: remove min_sequence_number from Compactor and Querier, add `count_by_overlaps_with_level_0` and `count_by_overlaps_with_level_1` to catalog (#5151) * refactor: remove min_sequnce_number * fix: typos * fix: remove min_sequencer_number from new files from merging main * fix: add back throwing error if the compactor compacts files persisted by the ingester after the ingester sends max seq_num back to querier * test: add test_compactor_collision back but modify the input to make it work woth new changes Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-21 13:51:54 +00:00
Marko Mikulicic	21d033eafd	fix: Sync ReadWindowAggregate API: TagKeyMetaNames The storage API has been updated in https://github.com/influxdata/idpe/pull/12868 in January, but since we forked the `.proto` files we never noticed.	2022-07-21 15:07:04 +02:00
Marco Neumann	1993448abf	refactor: remove `Predicat::partition_key` (#5016) There is no way a user can filter for partition keys (neither via InfluxRPC nor via SQL) and the query engine doesn't use this field at all. So let's remove it. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-01 17:17:29 +00:00
Marco Neumann	be53716e4d	refactor: use IDs for `parquet_file.column_set` (#4965) * feat: `ColumnRepo::list_by_table_id` * refactor: use IDs for `parquet_file.column_set` Closes #4959. * refactor: introduce `TableSchema::column_id_map`	2022-06-30 15:08:41 +00:00
Dom Dwyer	75c425f375	refactor(schema-api): column data type enum Previously the column data type was exposed using an internal i32 value. This commit changes the Schema API to use a self-descriptive proto enum for the column data type.	2022-06-27 16:14:49 +01:00
Marco Neumann	c3912e34e9	refactor: store per-file column set in catalog (#4908) * refactor: store per-file column set in catalog Together with the table-wide schema and the partition-wide sort key, this should be everything we need to read a parquet file directly into memory without peeking any file-level metadata. The querier will use this to directly load parquet files into the read buffer. WARNING: This requires a catalog wipe! Ref #4124. * refactor: use proper `ColumnSet` type	2022-06-21 10:26:12 +00:00

357 Commits (f7ff87758200ac3532b113f9a211eae6cbd79292)