influxdb

Commit Graph

Author	SHA1	Message	Date
Raphael Taylor-Davies	b39e01f7ba	feat: migrate PersistenceWindows to TimeProvider (#2722 ) (#2798 )	2021-10-11 20:40:00 +00:00
Raphael Taylor-Davies	afe34751e7	refactor: split out schema crate (#2781 ) * refactor: split out schema crate * chore: fix doc	2021-10-11 09:45:08 +00:00
Raphael Taylor-Davies	f35a49edd0	refactor: move Sequence to data_types (#2780 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-10-11 09:23:00 +00:00
Marco Neumann	d3de6bb6e4	refactor: `max_persisted_timestamp` => `flush_timestamp` There might be data left before this timestamp that wasn't persisted (e.g. incoming data while the persistence was running).	2021-10-08 12:36:23 +02:00
Marco Neumann	63a932fa37	refactor: "min unpersisted ts" => "max persisted ts" Store the "maximum persisted timestamp" instead of the "minimum unpersisted timestamp". This avoids the need to calculate the next timestamp from the current one (which was done via "max TS + 1ns"). The old calculation was prone to overflow panics. Since the timestamps in this calculation originate from user-provided data (and not the wall clock), this was an easy DoS vector that could be triggered via the following line protocol: ```text table_1 foo=1 <i64::MAX> ``` which is ```text table_1 foo=1 9223372036854775807 ``` Bonus points: the timestamp persisted in the partition checkpoints is now the very same that was used by the split query during persistence. Consistence FTW! Fixes #2225.	2021-10-08 11:52:49 +02:00
Raphael Taylor-Davies	ce5b24e65d	refactor: use DateTime<Utc> in PersistenceWindows (#2722 ) (#2743 ) * refactor: use DateTime<Utc> in PersistenceWindows (#2722) * chore: fix benchmark * chore: fmt * chore: review feedback	2021-10-06 09:39:32 +00:00
Raphael Taylor-Davies	d0929e3a34	feat: persist no chunks (#2712 ) (#2718 ) * feat: persist no chunks (#2712) * fix: persist partition * fix: chunk ordering test * chore: fix logical conflict Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-10-05 15:18:35 +00:00
Edd Robinson	b7b584b6f8	test: ignore test_summaries test on aarch64	2021-08-12 12:12:54 +01:00
Edd Robinson	54d7b8cb98	test: ignore persistence window test on aarch64	2021-08-12 12:00:41 +01:00
Dom	3de6b44e23	build: use new rustdoc lint name (#2261 ) * fix: nocache feature code rot The MBChunk::snapshot code when using the "nocache" option no longer compiles - this commit updates it to match the not(nocache) code. * build: use updated broken_intra_doc_links name The broken_intra_doc_links lint was renamed rustdoc::broken_intra_doc_links https://doc.rust-lang.org/rustdoc/lints.html	2021-08-11 19:48:51 +00:00
Marco Neumann	2eaf486eac	fix: always remember max seen sequ. numbers during replay Do not forget max seen sequence numbers for partition-sequencer combinations that can be skipped during replay. Fixes #2215.	2021-08-11 10:26:12 +02:00
Marco Neumann	9bb0bbbe8f	refactor: simplify new-min-time/flush-time handling in perstince windows a bit	2021-08-10 11:43:16 +02:00
Marco Neumann	0db3fdc05e	docs: document `new_min_time_from_persistable`	2021-08-10 11:31:32 +02:00
Marco Neumann	cd414f28ef	fix: incorrect speculation of post-persist sequence ranges This fixes an edge case where the speculated sequence ranges that can be obtained from flush handles do not account for overlapping windows. The symptom being that the resulting partition checkpoint marked sequence numbers as unpersisted that where already persisted. Fixes #2206.	2021-08-10 11:29:48 +02:00
kodiakhq[bot]	b75cb015d2	Merge branch 'main' into crepererum/fix_error_message_typo	2021-08-09 12:37:02 +00:00
Marco Neumann	c2a660b11c	fix: error message typo	2021-08-09 14:26:07 +02:00
Marco Neumann	28a50a5104	fix: typo Co-authored-by: Andrew Lamb <alamb@influxdata.com>	2021-08-09 14:24:53 +02:00
Marco Neumann	8235fb862b	test: extend db ckpt fold tests	2021-08-09 13:14:15 +02:00
Marco Neumann	a8ceb3f345	docs: expand docs on checkpoint ordering	2021-08-09 13:10:27 +02:00
Marco Neumann	8721c5fcd6	fix: improve error messages	2021-08-09 10:54:23 +02:00
Marco Neumann	950286e5b7	feat: make replay planning work w/ unordered checkpoints	2021-08-09 10:54:23 +02:00
Marco Neumann	0d49fb0d61	feat: database checkpoint folding	2021-08-09 10:54:23 +02:00
Marco Neumann	a435a15d19	feat: impl `PartialOrd` for `PartitionCheckpoint`	2021-08-09 10:54:23 +02:00
Marco Neumann	3b9ac330c2	feat: impl `PartialOrd` for `OptionalMinMaxSequence`	2021-08-09 10:54:23 +02:00
Marco Neumann	94ceef87c6	feat: ensure that min and max time in persistence windows are ordered	2021-08-09 10:33:24 +02:00
Marco Neumann	57bbae7e34	refactor: persistence windows row counts are non-zero	2021-08-09 10:33:24 +02:00
Marco Neumann	8f06652308	test: persistence window backwards clock panic	2021-08-09 10:33:24 +02:00
Raphael Taylor-Davies	c957d8154f	feat: blocking Freezable (#2224 ) * feat: blocking Freezable * chore: test	2021-08-08 19:26:11 +00:00
Marco Neumann	ef9d8aa736	fix: improve warning wording Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com>	2021-08-06 10:25:26 +02:00
Marco Neumann	882f89cecf	fix: only warn when partition ckpt and DB ckpt mins are out-of-sync There are currently a few bugs and semi-understood edge cases that can lead to this case. So instead of bailing out, just issue a warning.	2021-08-06 09:48:26 +02:00
Marco Neumann	a5e9bbc507	feat: impl `Display` for `OptionalMinMaxSequence`	2021-08-06 09:48:26 +02:00
Marco Neumann	015d858f88	test: add failing regression test for #2185 We need a partition that is partially persisted for this. This requires some rework for the time handling in `Db` to make it mockable. The remaining bits are test framework extensions.	2021-08-05 11:44:44 +02:00
Marco Neumann	748bbf6b3c	fix: check empty replay ranges as well during replay plan construction	2021-08-05 11:39:19 +02:00
Andrew Lamb	e2df0eeb00	docs: Add replay diagram and notes to checkpoints and ReplayPlanner (#2184 ) * docs: Add replay diagram and notes to persistence_windows * docs: Apply suggestions from code review Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com> * docs: review feedback Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-08-04 18:27:50 +00:00
Raphael Taylor-Davies	ac67caf7a1	refactor: rename ReadLock to Freezable (#2145 ) * refactor: rename ReadLock to Freezable and make exclusive * chore: review feedback	2021-08-02 16:21:22 +00:00
Marco Neumann	04e797c706	refactor: pass sequencer numbers directly to DB checkpoint First of all using a partition checkpoint as some kind of intermediate representation was kinda a hack because partition checkpoints should only created for to-be-persisted partitions, not for the others. API-wise it should only be possible to construct a partition checkpoint from a flush handle. Also we were only able to construct partition checkpoints for partitions that had unpersisted data, otherwise there was no sane way to fill the `min_unpersisted_timestamp`. We must however scan all partitions no matter if there is unpersisted data so that we can determine the maximum seen sequence numbers. This was caught by a replay test resulting in a catalog state where the last database checkpoint had lower maximum seen sequence numbers than some partition checkpoint, bailing out with an error. So overall it turns out that passing the sequencer numbers directly instead of wrapping them into a partition checkpoint is the better implementation.	2021-07-28 17:28:34 +02:00
Marco Neumann	cf21c9ec40	feat: impl `Clone` for `ReplayPlan`	2021-07-23 09:23:06 +02:00
Marco Neumann	47ad397918	fix: address review comments	2021-07-22 12:37:07 +02:00
Marco Neumann	57a9d5ade0	refactor: correctly track "seen" ranges in persistence checkpoints Now we can handle all these cases: There are two partitions w/ a single write each: 1. A reads sequence number 1 2. B reads sequence number 2 3. we persist A which only knows the sequences up until 1 => the DB checkpoint needs the global max, otherwise we forget sequences during replay (2 in this case, so B would be gone) 1. B reads sequence number 1 2. A reads sequence number 2 3. we persist A which (w/o this commit) would not track the sequencer at all in this checkpoint (since there is nothing to replay) => we MUST also remember that we already read up until 2, otherwise we'll re-read 2 after replay => the partition checkpoint needs the local seen max (no matter if there's something to to persist)	2021-07-21 19:19:49 +02:00
Marco Neumann	a5fc1c7d38	fix: collect min AND max in database checkpoints This is required to correctly handle the following case: 1. There are two partitions A and B w/ a single write each (from the same sequencer). 2. We persist A: - The partition checkpoint for A will be empty because after persistence there will be nothing to replay (the single write is persisted and we're ready). - The database checkpoint that contains the global minimum of all ranges recognizes that for the sequencer there is indeed something left (the minimum sequence number from B). 3. DB restart happens, replay starts 4. We scan all persisted files, figure out that we have a DB checkpoint with a sequence minimum but (w/o the change in this commit) there is no maximum. Only partition checkpoints contain maxima, and the only partition checkpoint that was persisted was the one for partition A and that one was empty (see above). 5. So now how do we recover partition B?	2021-07-21 14:48:29 +02:00
Raphael Taylor-Davies	ffe6e62aee	feat: add instant to datetime conversion (#2078 ) * feat: add instant to datetime conversion * chore: review feedback	2021-07-21 11:43:27 +00:00
Raphael Taylor-Davies	61da0fe4df	fix: update last_instant when rotating into persistable window (#2067 )	2021-07-20 16:38:28 +00:00
Raphael Taylor-Davies	e4d2c51e8b	fix: update PersistenceWindows on rules update (#2018 ) (#2060 ) * fix: update PersistenceWindows on rules update (#2018) * chore: review feedback Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-20 12:44:47 +00:00
Raphael Taylor-Davies	8e5d5928cf	feat: compute WriteSummary from PersistenceWindows (#2030 ) (#2054 ) * feat: compute WriteSummary from PersistenceWindows (#2030) * chore: review feedback Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-20 08:46:52 +00:00
Marco Neumann	81c90868be	docs: note that `MinMaxSequence` are inclusive start/end	2021-07-16 11:45:34 +02:00
Raphael Taylor-Davies	d71f38f27c	feat: compute PartitionCheckpoint from PersistenceWindows (#2011 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-15 12:17:23 +00:00
Raphael Taylor-Davies	cbeeb97cff	feat: flush open window on persist (#1985 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-14 16:58:20 +00:00
Raphael Taylor-Davies	f1c1620c84	feat: make persistence windows interface harder to use incorrectly (#1977 ) * feat: make persistence windows interface harder to use incorrectly * chore: review feedback * chore: update comment Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-14 13:03:18 +00:00
Marco Neumann	886a87f011	chore: enable linting for `persistence_windows` crate	2021-07-14 14:11:51 +02:00
Raphael Taylor-Davies	04e77f6b3c	feat: assert PersistenceWindows pre-conditions (#1976 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-13 14:04:04 +00:00

1 2

53 Commits (c9ff8f0f9f206b54c8add4155a2eb9d65443958f)