influxdb

Commit Graph

Author	SHA1	Message	Date
dependabot[bot]	7b37479cd1	chore(deps): Bump siphasher from 0.3.10 to 1.0.0 (#8561 ) Bumps [siphasher](https://github.com/jedisct1/rust-siphash) from 0.3.10 to 1.0.0. - [Commits](https://github.com/jedisct1/rust-siphash/compare/0.3.10...1.0.0) --- updated-dependencies: - dependency-name: siphasher dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Dom <dom@itsallbroken.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-24 09:13:14 +00:00
Marco Neumann	92c0e0c806	feat: re-land #8491 (#8543 ) Includes the whole migration set again: 1. create helper index: This may or may not exist in prod depending on when #8540 landed. This is OK, spend a bit of work here. 2. actual migration: The bit that originally failed, but now w/ an extra step to migrate empty keys (because the `unnest`+`array_agg` logic only works for non-empty keys and is a no-op for empty ones). This is idempotent (i.e. a no-op) if we already finished the migration. 3. remove helper index: Remove the helper index from (1). 4. enfore `NOT NULL`: This is a no-op if this was already applied. Closes #8542. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-24 08:14:32 +00:00
Marco Neumann	cf0185f2a3	revert: #8491 (#8540 ) This reverts #8491 by nullifying the problematic migration and the aftermath migrations.	2023-08-22 10:35:52 +00:00
Marco Neumann	d8447779a2	feat: migrate name-based sort keys to ID based on in PG (#8491 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-22 09:11:43 +00:00
Nga Tran	3e98f7ea5c	feat: fill sort_key_ids when partition is inserted and updated (#8517 ) * feat: read null sort_key_ids * chore: clearer explanation about test strategy * chore: Apply suggestions from code review Co-authored-by: Marco Neumann <marco@crepererum.net> * test: tests that add partition with NULL sort_key_ids * feat: set sort_key_ids to empty array {} during partition insertion * feat: initial step to update sort_key_ids * chore: address review comments * chore: remove unecessary comments and tests * fix: typos * chore: remove unecessary tests * feat: continue the work of updating sort_key_ids * fix: chec duplicates for SortedColumnSet * test: tests for sort ley ids * test: fix a test * chore: remove unused comments * chore: address first half of review comments and removing tests of tests * chore: address review commnets for fetching colums in ingester --------- Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-21 14:26:57 +00:00
Nga Tran	5d17a99dbb	feat: read null sort_key_ids (#8489 ) * feat: read null sort_key_ids * chore: clearer explanation about test strategy * chore: Apply suggestions from code review Co-authored-by: Marco Neumann <marco@crepererum.net> * test: tests that add partition with NULL sort_key_ids * chore: address review comments * chore: remove unecessary comments and tests * fix: typos * chore: remove unecessary tests * fix: chec duplicates for SortedColumnSet --------- Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-18 14:15:27 +00:00
Carol (Nichols \|\| Goulding)	755d461cca	docs: Clarify requirements of the `list_old_style` function Namely, that it must not have a `LIMIT`. Co-authored-by: Dom <dom@itsallbroken.com>	2023-08-18 09:21:55 -04:00
Carol (Nichols \|\| Goulding)	99d12ef600	feat: Add a catalog method to get all old-style partitions To be fed to the bloom filter.	2023-08-18 09:19:25 -04:00
dependabot[bot]	d2c71bfe67	chore(deps): Bump thiserror from 1.0.46 to 1.0.47 (#8519 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.46 to 1.0.47. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.46...1.0.47) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-18 09:02:48 +00:00
dependabot[bot]	7094189004	chore(deps): Bump tokio from 1.31.0 to 1.32.0 (#8507 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.31.0 to 1.32.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.31.0...tokio-1.32.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-17 08:06:29 +00:00
dependabot[bot]	fff313b80c	chore(deps): Bump thiserror from 1.0.44 to 1.0.46 (#8496 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.44 to 1.0.46. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.44...1.0.46) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-16 10:54:47 +00:00
dependabot[bot]	34b8585931	chore(deps): Bump tokio from 1.30.0 to 1.31.0 (#8482 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.30.0 to 1.31.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.30.0...tokio-1.31.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-14 06:32:34 +00:00
dependabot[bot]	4c63338354	chore(deps): Bump async-trait from 0.1.72 to 0.1.73 (#8481 ) Bumps [async-trait](https://github.com/dtolnay/async-trait) from 0.1.72 to 0.1.73. - [Release notes](https://github.com/dtolnay/async-trait/releases) - [Commits](https://github.com/dtolnay/async-trait/compare/0.1.72...0.1.73) --- updated-dependencies: - dependency-name: async-trait dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-14 06:25:33 +00:00
NGA-TRAN	9bf1c8c11c	chore: revert fill sort_key_ids	2023-08-11 11:36:27 -04:00
kodiakhq[bot]	d38882c746	Merge branch 'main' into cn/cleanups	2023-08-11 13:37:44 +00:00
Nga Tran	da92a5c9e1	feat: fill catalog `sort_key_ids` for partitions with coming data (#8462 ) * feat: fill catalog sort_key_ids for partition with coming data * test: sort_key_ids has empty array for newly create partition * test: name of non-existing column * chore: add comments to ask Andrew about the code * chore: make comments clearer * chore: fix a comment to avoid failure in doc * chore: add comment for the panic if column name of sort key not found * fix: during import files the partition has to be created with empty sort key first. Then after its files are created, the partition will be uodated with sort key * chore: remove no longer needed comments after the bug in build_catalog test is fixed * chore: address review comments * refactor: Use ColumnSet type * chore: Apply suggestions from code review Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com> * chore: fix a clippy --------- Co-authored-by: Carol (Nichols \|\| Goulding) <carol.nichols@gmail.com> Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com>	2023-08-10 18:12:40 +00:00
dependabot[bot]	3675043585	chore(deps): Bump tokio from 1.29.1 to 1.30.0 (#8464 ) Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.29.1 to 1.30.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.29.1...tokio-1.30.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-10 07:50:18 +00:00
Marco Neumann	f1fbcca23c	fix: make concurrent index creations idempotent (#8450 ) * fix: make concurrent index creations idempotent `CREATE INDEX CONCURRENTLY` may fail and leave an invalid index behind. See: <https://www.postgresql.org/docs/current/sql-createindex.html#SQL-CREATEINDEX-CONCURRENTLY> Invalid indexes will NOT be used for queries, however they is also useless and technically the migration wasn't successful. Since #8407 we detect this situation. #8394 allows us to fix these migrations. #8434 makes sure that we don't mess this up (esp. that the "other checksum" pragmas are correct). For #7897. * fix: multi-line processing in `check_linear_migrations.sh` --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-09 19:16:35 +00:00
Carol (Nichols \|\| Goulding)	2c66fc9652	fix: Break up long lines/strings	2023-08-09 14:03:20 -04:00
Carol (Nichols \|\| Goulding)	c6d607991c	docs: Copyedit migration docs Fix some typos, reword or add some words for clarity, add space after headings for scanability	2023-08-09 14:02:53 -04:00
Carol (Nichols \|\| Goulding)	f847463108	fix: Remove duplicated 'not' in an error message	2023-08-09 13:34:36 -04:00
Carol (Nichols \|\| Goulding)	21eb57953b	fix: Add missing word to an error message	2023-08-09 13:34:16 -04:00
Carol (Nichols \|\| Goulding)	71428b3274	docs: Correct adjective form in a comment	2023-08-09 13:33:49 -04:00
Carol (Nichols \|\| Goulding)	180e982c73	docs: Fix typo	2023-08-09 13:22:33 -04:00
Marco Neumann	530acf170c	test: more automatic tests for migrations (#8457 ) * test: test self-tests * test: more tests for migrations	2023-08-09 12:33:53 +00:00
Marco Neumann	2156bc7e05	feat: static migrations checks (#8434 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-07 15:13:28 +00:00
Marco Neumann	922a3cd292	feat: harden migration version checks (#8415 ) * refactor: clean up * feat: harden migration version checks --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-07 07:59:24 +00:00
Marco Neumann	614b64d3e4	feat: retry failed migrations (#8414 )	2023-08-04 11:15:05 +00:00
Marco Neumann	9803623543	feat: post-migration sanity checks (#8407 )	2023-08-04 08:56:22 +00:00
Marco Neumann	975d9a1ae5	refactor: simplify code, harden PG locking (#8406 ) * test: simplify code * feat: `IOxMigration::single_transaction` * refactor: unlock faster * refactor: harden PG locking Check that `unlock` is actually successful and add a bunch of tests.	2023-08-04 08:39:01 +00:00
Marco Neumann	0c9570889b	feat: allow fixing migrations (#8394 ) We need to fix our non-idempotent `CREATE INDEX CONCURRENTLY` migrations so they are self-healing. This is an essential part of #7897. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-04 08:05:45 +00:00
wiedld	81b5d80a91	feat(idpe-17935): move filtering of skipped partitions to the scheduler (#8358 ) * catalog.get_in_skipped_compaction() should handle for multiple partitions * add the ability to perform transformation on sets of partitions (rather than filtering one by one). Start with the transformation to remove skipped partitions, in the scheduler. * move the env var and cli flag setting, for when to ignore skipped partitions, to the scheduler config.	2023-08-03 11:43:09 -07:00
Marco Neumann	3fb173ec56	refactor: use errors instead of panic (#8389 ) * refactor: use errors instead of panic * fix: typo	2023-08-02 13:58:55 +00:00
Marco Neumann	9e4e205ffd	refactor: migration checksum type (#8388 ) * refactor: use `Box<[...]>` instead of `Vec<...>` We are not planning to modify the vector, so storing a capacity and a length is somewhat pointless. * feat: add printout test for PG migrations * refactor: use dedicated checksum type * feat: checksum string roundtrips --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-02 11:57:18 +00:00
Marco Neumann	65846e45a8	docs: explain migration transaction handling (#8387 ) Forgot to update the docs in #8373. This will be updated again after I've finished #7897. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-02 10:04:15 +00:00
Nga Tran	dac0db2196	feat: add sort_key_ids into sqlite catalog (#8384 )	2023-08-01 20:15:27 +00:00
Nga Tran	73f38077b6	feat: add sort_key_ids as array of bigints into catalog partition (#8375 ) * feat: add sort_key_ids as array of bigints into catalog partition * chore: add comments * chore: remove comments to avoid changing them in the future due to checksum requirement --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-01 14:28:30 +00:00
Marco Neumann	743a59aa64	feat: use single per-migration txn when possible (#8373 ) * test: improve `test_step_sql_statement_no_transaction` * feat: also print number of steps in "applying migration step" * feat: use single per-migration txn when possible If all steps can (and want) to run in a transaction block, then wrap the migration bookkeeping and the migration script into a single transaction. This way we avoid the dirty state altogether because its now an "all or nothing" migration. Note that we still guarantee that there is only a single migration running at the same time due to the locking mechanism. Otherwise we would potentially run into nasty transaction failures during schema modifications. This is related to #7897 but only fixes / self-heals the "dirty" state for transaction that can run in transactions. For concurrent index migrations (which we need in prod) we need to be a bit smarter and this will be done in a follow-up. However I feel that not leaving half-done migrations for the cases where it's technically possible (e.g. adding columns) is already a huge step forward. * test: make `test_migrator_uses_single_transaction_when_possible` harder * test: explain test --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-08-01 08:18:39 +00:00
Carol (Nichols \|\| Goulding)	4a9e76b8b7	feat: Make parquet_file.partition_id optional in the catalog (#8339 ) * feat: Make parquet_file.partition_id optional in the catalog This will acquire a short lock on the table in postgres, per: <https://stackoverflow.com/questions/52760971/will-making-column-nullable-lock-the-table-for-reads> This allows us to persist data for new partitions and associate the Parquet file catalog records with the partition records using only the partition hash ID, rather than both that are used now. * fix: Support transition partition ID in the catalog service * fix: Use transition partition ID in import/export This commit also removes support for the `--partition-id` flag of the `influxdb_iox remote store get-table` command, which Andrew approved. The `--partition-id` filter was getting the results of the catalog gRPC service's query for Parquet files of a table and then keeping only the files whose partition IDs matched. The gRPC query is no longer returning the partition ID from the Parquet file table, and really, this command should instead be using `GetParquetFilesByPartitionId` to only request what's needed rather than filtering. * feat: Support looking up Parquet files by either kind of Partition id Regardless of which is actually stored on the Parquet file record. That is, say there's a Partition in the catalog with: Partition { id: 3, hash_id: abcdefg, } and a Parquet file that has: ParquetFile { partition_hash_id: abcdefg, } calling `list_by_partition_not_to_delete(PartitionId(3))` should still return this Parquet file because it is associated with the partition that has ID 3. This is important for the compactor, which is currently only dealing in PartitionIds, and I'd like to keep it that way for now to avoid having to change Even More in this PR. * fix: Use and set new partition ID fields everywhere they want to be --------- Co-authored-by: Dom <dom@itsallbroken.com>	2023-07-31 12:40:56 +00:00
Marco Neumann	73339cfc57	fix: remove sqlx "used" metrics (#8336 ) PR #8327 introduced a bunch of metrics for the sqlx connection pool. One of the metrics was the "used" metrics that was supposed to count "currently in use" connection. In prod however this metric underflows to a very large integer. It seems that "acquire" callback is only used by sqlx for re-used connections (i.e. for the transition from "idle" to "used"). Now we could try to work around it but since there is no "close connection" callback, I doubt it it possible to do the accurately. Luckily though we don't really need that counter. sqlx already offers "active" (defined as idle + used) and "idle", so getting "used" is just the difference. I removed the "used" metric nevertheless because "active" and "idle" are read independently from each other (based on atomic integers) and are NOT guaranteed to be in-sync. Calculating the difference within IOx however would give the illusion that they are. So I leave this to the dashboard / alert / whatever, because there it is usually understood that metrics are samples and may be out of sync for a very short time. A nice side effect of this change is that it simplifies the code quite a bit. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-07-27 10:04:56 +00:00
Marco Neumann	b62e98cef1	feat: metrics for sqlx conn pools (#8327 ) To better gauge how many connections we use and especially if we hit the max connection limit, it would be helpful to actually have some metrics available for the pool usage. This change adds a few basic metrics.	2023-07-25 10:07:25 +00:00
dependabot[bot]	faa8d44492	chore(deps): Bump thiserror from 1.0.43 to 1.0.44 (#8315 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.43 to 1.0.44. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.43...1.0.44) --- updated-dependencies: - dependency-name: thiserror dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-07-24 10:18:44 +00:00
dependabot[bot]	cd31492e5b	chore(deps): Bump async-trait from 0.1.71 to 0.1.72 (#8317 ) Bumps [async-trait](https://github.com/dtolnay/async-trait) from 0.1.71 to 0.1.72. - [Release notes](https://github.com/dtolnay/async-trait/releases) - [Commits](https://github.com/dtolnay/async-trait/compare/0.1.71...0.1.72) --- updated-dependencies: - dependency-name: async-trait dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-07-24 10:07:18 +00:00
Joe-Blount	629f9d20db	fix: update new_file_at following all compactions	2023-07-20 13:27:54 -05:00
Fraser Savage	e894ea73f7	refactor(catalog): Allow kafka columns to be nullable	2023-07-20 11:18:02 +01:00
Marco Neumann	4e88571142	feat: add batch partition getters (#8268 )	2023-07-19 15:05:41 +00:00
Marco Neumann	004b401a05	chore: upgrade to sqlx 0.7.1 (#8266 ) There are a bunch of dependencies in `Cargo.lock` that are related to mysql. These are NOT compiled at all, and are also not part of `cargo tree`. The reason for the inclusion is a bug in cargo: https://github.com/rust-lang/cargo/issues/10801 Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-07-19 12:18:57 +00:00
dependabot[bot]	e33a078128	chore(deps): Bump paste from 1.0.13 to 1.0.14 (#8244 ) Bumps [paste](https://github.com/dtolnay/paste) from 1.0.13 to 1.0.14. - [Release notes](https://github.com/dtolnay/paste/releases) - [Commits](https://github.com/dtolnay/paste/compare/1.0.13...1.0.14) --- updated-dependencies: - dependency-name: paste dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-07-17 16:10:02 +00:00
Carol (Nichols \|\| Goulding)	f20e9e6368	fix: Add index on parquet_file.partition_hash_id for lookup perf	2023-07-10 13:40:03 -04:00
Carol (Nichols \|\| Goulding)	22c17fb970	feat: Abstract over which partition ID type we're using to list Parquet files	2023-07-10 13:40:01 -04:00

1 2 3 4 5 ...

478 Commits (196c589ef64f73677eb3e89e60b219f862bde19a)