influxdb

Commit Graph

Author	SHA1	Message	Date
Nga Tran	b20226797a	fix: make trigger midification in different file (#6526)	2023-01-06 20:34:48 +00:00
Nga Tran	b856edf826	feat: function to get parttion candidates from partition table (#6519) * feat: function to get parttion candidates from partition table * chore: cleanup * fix: make new_file_at the same value as created_at * chore: cleanup Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-01-06 16:20:45 +00:00
Nga Tran	23807df7a9	feat: trigger that updates partition table when a parquet file is created (#6514) * feat: trigger that update partition table when a parquet file is created * chore: simplify epoch of now	2023-01-05 19:57:23 +00:00
Nga Tran	1088baea3d	chore: index for selecting partitions with parquet files created after a given time (#6496) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-01-04 18:07:07 +00:00
Luke Bond	6263ca234a	chore: delete ns postgres impl, test improvements, fix to mem impl	2022-12-16 10:23:50 +00:00
Luke Bond	7c813c170a	feat: reintroduce compactor first file in partition exception (#6176) * feat: compactor ignores max file count for first file chore: typo in comment in compactor * feat: restore special first file in partition compaction logic; add limit * fix: calculation in compaction max file count chore: clippy Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-18 15:58:59 +00:00
Nga Tran	a3f2fe489c	refactor: remove retention_duration field from namespace catalog table (#6124)	2022-11-11 20:30:42 +00:00
NGA-TRAN	498851eaf5	feat: add catalog columns needed for retention policy	2022-11-01 15:35:15 -04:00
Dom Dwyer	46bbee5423	refactor: reduce default column limit Reduces the default number of columns allowed per-table, from 1,000 to 200.	2022-10-14 14:45:48 +02:00
Nga Tran	75ff805ee2	feat: instead of adding num_files and memory budget into the reason text column, let us create differnt columns for them. We will be able to filter them easily (#5742) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-26 20:14:04 +00:00
Dom Dwyer	66bf0ff272	refactor(db): NULLable persisted_sequence_number Makes the partition.persisted_sequence_number column in the catalog DB NULLable. 0 is a valid persisted sequence number.	2022-09-15 18:19:39 +02:00
Dom Dwyer	c5ac17399a	refactor(db): persist marker for partition table Adds a migration to add a column "persisted_sequence_number" that defines the inclusive upper-bound on sequencer writes materialised and uploaded to object store for the partition.	2022-09-15 16:10:35 +02:00
Luke Bond	ee3f172d45	chore: renamed DB migration for billing trigger	2022-09-13 16:29:14 +01:00
Luke Bond	c8b545134e	chore: add index to speed up billing_summary upsert	2022-09-13 16:22:44 +01:00
Luke Bond	feae712881	fix: parquet_file billing trigger respects to_delete	2022-09-13 16:22:44 +01:00
Luke Bond	cc93b2c275	chore: add catalog trigger for billing	2022-09-13 16:22:44 +01:00
Carol (Nichols || Goulding)	fbe3e360d2	feat: Record skipped compactions in memory Connects to #5458.	2022-09-09 15:31:07 -04:00
Nga Tran	cbfd37540a	feat: add index on parquet_file(shard_id, compaction_level, to_delete, created_at) (#5544)	2022-09-02 14:27:29 +00:00
Carol (Nichols || Goulding)	8a0fa616cf	fix: Rename columns, tables, indexes and constraints in postgres catalog	2022-09-01 10:00:54 -04:00
Nga Tran	a2c82a6f1c	chore: remove min sequence number from the catalog table as we no longer use it (#5178) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-21 20:47:55 +00:00
Marco Neumann	be53716e4d	refactor: use IDs for `parquet_file.column_set` (#4965) * feat: `ColumnRepo::list_by_table_id` * refactor: use IDs for `parquet_file.column_set` Closes #4959. * refactor: introduce `TableSchema::column_id_map`	2022-06-30 15:08:41 +00:00
Marco Neumann	215f297162	refactor: parquet file metadata from catalog (#4949) * refactor: remove `ParquetFileWithMetadata` * refactor: remove `ParquetFileRepo::parquet_metadata` * refactor: parquet file metadata from catalog Closes #4124.	2022-06-27 15:38:39 +00:00
Nga Tran	92eeb5b232	chore: remove unused sort_key_old from catalog partition (#4944) * chore: remove unused sort_key_old from catalog partition * chore: add new line at the end of the SQL file	2022-06-24 15:02:38 +00:00
Marco Neumann	994bc5fefd	refactor: ensure that SQL parquet file column sets are not NULL (#4937) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-24 14:26:18 +00:00
Marco Neumann	c3912e34e9	refactor: store per-file column set in catalog (#4908) * refactor: store per-file column set in catalog Together with the table-wide schema and the partition-wide sort key, this should be everything we need to read a parquet file directly into memory without peeking any file-level metadata. The querier will use this to directly load parquet files into the read buffer. WARNING: This requires a catalog wipe! Ref #4124. * refactor: use proper `ColumnSet` type	2022-06-21 10:26:12 +00:00
Nga Tran	13c57d524a	feat: Change data type of catalog partition's sort_key from a string to an array of string (#4801) * feat: Change data type of catalog Postgres partition's sort_key from a string to an array of string * test: add column with comma * fix: use new protonuf field to avoid incompactible * fix: ensure sort_key is an empty array rather than NULL * refactor: address review comments * refactor: address more comments * chore: clearer comments * chore: Update iox_catalog/migrations/20220607102200_change_sort_key_type_to_array.sql * chore: Update iox_catalog/migrations/20220607102200_change_sort_key_type_to_array.sql * fix: Rename migration so it will be applied after Co-authored-by: Marko Mikulicic <mkm@influxdata.com>	2022-06-10 13:31:31 +00:00
Marko Mikulicic	c09f6f6bc9	chore: Incrementally migrate sort_key to array type (#4826) This PR is the first step where we add a new column sort_key_arr whose content we'll manually migrate from sort_key. When we're done with this, we'll merge https://github.com/influxdata/influxdb_iox/pull/4801/ (whose migration script must be adapted slightly to rename the `sort_key_arr` column back to `sort_key`). All this must be done while we shut down the ingesters and the compactors. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-10 11:35:43 +00:00
Marco Neumann	86e8f05ed1	fix: make all catalog IDs 64bit (#4418) Closes #4365. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-25 16:49:34 +00:00
kodiakhq[bot]	e2439c0a4f	Merge branch 'main' into cn/sort-key-catalog	2022-04-04 16:54:48 +00:00
Dom Dwyer	61bc9c83ad	refactor: add table_id index on column_name After checking the postgres workload for the catalog in prod, this missing index was noted as the cause of unexpectedly expensive plans for simple queries.	2022-04-04 13:04:25 +01:00
Carol (Nichols || Goulding)	c9bc70f03a	feat: Add optional sort_key column to partition table Connects to #4195.	2022-04-01 15:45:51 -04:00
Paul Dix	6479e1fc8e	fix: add indexes to parquet_file (#4198) Add indexes so compactor can find candidate partitions and specific partition files quickly. Limit number of level 0 files returned for determining candidates. This should ensure that if comapction is very backed up, it will be able to work through the backlog without evaluating the entire world. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-01 09:59:39 +00:00
Marko Mikulicic	2c47d77a5b	fix: Backfill namespace_id in schema migration (#4177) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-30 16:31:26 +00:00
Carol (Nichols || Goulding)	5c8a80dca6	fix: Add an index to parquet_file to_delete	2022-03-29 08:15:26 -04:00
Carol (Nichols || Goulding)	f3f792fd08	feat: Add namespace_id to the parquet_files table; object store paths need it	2022-03-29 08:15:26 -04:00
Carol (Nichols || Goulding)	67e13a7c34	fix: Change to_delete column on parquet_files to be a time (#4117) Set to_delete to the time the file was marked as deleted rather than true. Fixes #4059. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-23 18:47:27 +00:00
Paul Dix	27999ff72f	feat: add compaction_level and created_at to parquet_file (#3972)	2022-03-10 15:56:57 +00:00
Dom Dwyer	d31576b90c	perf: get_table_persist_info indexes for joins Adds indexes to the JOINed fields to reduce execution cost, as the TableRepo::get_table_persist_info() is currently by far the most expensive catalog operation.	2022-03-08 12:12:47 +00:00
Carol (Nichols || Goulding)	252ced7adf	feat: Add row count to the parquet_file record in the catalog (#3847) Fixes #3842.	2022-02-24 15:20:50 +00:00
Marco Neumann	d62a052394	feat: extend catalog so we can recover `ParquetChunk`s from it (#3852) * refactor: less parquet data copying * feat: `PartitionRepo::get_by_id` * feat: `TableRepo::get_by_id` * feat: `ParquetFile::file_size_bytes` * feat: `ParquetFile::parquet_metadata` Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-24 13:16:15 +00:00
Luke Bond	e19609ab7b	feat: routing service protection (#3807) * chore: db migration for namespace table & column limits * feat: impl table & column limits in catalog * chore: improved comment in catalog Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-22 17:26:37 +00:00
Dom Dwyer	4d54f8b42c	refactor: remove migration create schema	2022-02-17 14:41:32 +00:00
Dom Dwyer	3b378418f7	refactor: do not specify schema in migrations Allow the caller to set the Postgres schema a migration should be applied to, rather than restricting the migration to a specific, hard-coded schema. BREAKING CHANGE: manually adds a new migration that precedes the existing migration to ensure the iox_catalog schema exists before applying the migration. You'll probably have to drop any existing databases and migrate from scratch: sqlx database drop; sqlx database create;	2022-02-17 14:15:58 +00:00
Marco Neumann	74c251febb	feat: allow IOx catalog to setup itself (no SQLx CLI required) (#3584) * feat: allow IOx catalog to setup itself (no SQLx CLI required) * refactor: use SQLx macro instead of hand-rolled build script	2022-01-31 15:07:38 +00:00
Paul Dix	41038721e1	feat: Add parquet file records to iox_catalog * Adds ParquetFile and scaffolding to IOx catalog * Changed the file_location in parquet_file to object_store_id which is a uuid	2022-01-19 14:14:54 -05:00
Paul Dix	f36d66deb7	feat: Add Tombstone to Catalog * Adds TombstoneId and Tombstone to the iox_catalog with associated interfaces * Adds SequenceNumber new type for use with Tombstone * Adds Timestamp new type for use with Tombstone * Adds constraint to the Postgres schema to enforce tombstone uniqueness by table_id, sequencer_id, and sequence_number	2022-01-18 18:17:21 -05:00
Paul Dix	b796d5e2d1	fix: query pool type and sequencer create	2022-01-17 10:00:33 -05:00
Dom	aa6f118487	feat: iox_catalog sequencers (#3465) * refactor: ensure sequencers are unique Adds a unique constraint to ensure only one sequencer record exists for each Kafka (topic, partition). * test: use DSN from env for integration tests Removes the hard-coded DSN, instead sourcing it from the DATABASE_URL environment variable. * docs: integration testing for iox_catalog Documents the required steps in order to run the Postgres integration tests for the iox_catalog crate. * feat(iox_catalog): create & list sequencers Adds support for interacting with the "sequencer" table. * chore: update lockfile Running cargo in iox_catalog generates a lockfile diff.	2022-01-17 10:00:31 -05:00
Paul Dix	8d6d9e679f	refactor: update iox_catalog Changed to use the iox_catalog schema in Postgres rather than public. Updated talbe names to be singular. Removed the connection_string from query_pool	2022-01-17 09:56:20 -05:00
Paul Dix	4764e71c54	feat: Add initial iox_catalog skeleton	2022-01-17 09:56:20 -05:00

50 Commits (23280d0489112d50ab3c7952f4dfb68f8663c971)