The error type check for errBlockRead was incorrect,
and bad TSM files were not being moved aside when
that error was encountered. Use errors.Join,
errors.Is, and errors.As to correctly unwrap multiple
errors.
Closes https://github.com/influxdata/influxdb/issues/25838
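A minimal sketch of the unwrapping pattern using only the standard library; `errBlockRead` here is a stand-in sentinel, not the engine's actual definition:

```go
package main

import (
	"errors"
	"fmt"
)

// errBlockRead stands in for the real sentinel error in the engine.
var errBlockRead = errors.New("block read error")

func main() {
	// errors.Join combines several errors into one value.
	err := errors.Join(
		fmt.Errorf("compacting file: %w", errBlockRead),
		errors.New("unrelated failure"),
	)

	// errors.Is inspects every branch of the joined error, so the
	// sentinel is found even though it is both wrapped and joined.
	// errors.As works the same way for typed errors.
	if errors.Is(err, errBlockRead) {
		fmt.Println("bad TSM file detected; move it aside")
	}
}
```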
The tsmBatchKeyIterator discards excess errors to avoid
out-of-memory crashes when compacting very corrupt files.
Any error beyond DefaultMaxSavedErrors (100) is
discarded instead of appended to the error slice.
Closes https://github.com/influxdata/influxdb/issues/22328
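A hedged sketch of the capping pattern; the collector type is illustrative, not the real tsmBatchKeyIterator internals:

```go
// DefaultMaxSavedErrors mirrors the cap described above.
const DefaultMaxSavedErrors = 100

type errCollector struct {
	errs      []error
	discarded int
}

// add appends until the cap is reached, then only counts, so a very
// corrupt file cannot grow the slice without bound.
func (c *errCollector) add(err error) {
	if len(c.errs) < DefaultMaxSavedErrors {
		c.errs = append(c.errs, err)
		return
	}
	c.discarded++
}
```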
The original version of verifyVersion() reads into a byte slice,
manually ensures its byte order, then converts it to a type comparable
with Version and MagicNumber.
This patch hides those details by calling binary.Read() and reading
values into properly typed variables.
This adds a bit of overhead, but this code isn't in the hot path, and the
patch greatly simplifies the code.
verifyVersion() originally accepted an io.ReadSeeker. It is only called
in one place, and that function immediately calls Seek() after
verifyVersion(), therefore it is probably safe to call Seek() BEFORE
verifyVersion().
The benefit is that verifyVersion() is easier to test, since we can pass
it a bytes.Buffer.
This patch adds a test for verifyVersion() as well as a benchmark.
benchmark                   old ns/op   new ns/op   delta
BenchmarkVerifyVersion-8    73.5        123         +67.35%
Finally, this commit moves verifyVersion() from writer.go to reader.go
which is where it is actually used.
fixes #17440
While encoding or decoding corrupt data, the current behaviour is to `panic`.
This commit replaces the `panic` with an `error` to be propagated up to the calling `iterator`.
To avoid overwriting earlier errors, iterators now wrap a `TSMErrors` value which contains ALL the encountered errors.
TSMErrors itself implements `Error()`; the returned string contains all the error messages, separated by a "," delimiter.
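A minimal sketch of such an aggregate type; the exact shape in the real patch is assumed here:

```go
import "strings"

// TSMErrors collects every error hit while iterating, rather than
// overwriting earlier ones.
type TSMErrors []error

// Error joins all collected messages with a "," delimiter.
func (e TSMErrors) Error() string {
	msgs := make([]string, 0, len(e))
	for _, err := range e {
		msgs = append(msgs, err.Error())
	}
	return strings.Join(msgs, ",")
}
```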
If multiple tombstones existed for a series that ended up causing the
full data to be deleted, the blocks were not removed from the offsets
in the index. This caused the TSMReader to report that a key exists
but does not have any data.
During a compaction, every key should have at least one value. Since
this invariant was broken, the compaction aborted early and ended up
dropping all series keys that are lexicographically greater than where
the breakage occurred. This would cause data to be dropped during the
compaction.
This fixes a potential bug where the BlockIterator would skip blocks
if the underlying TSMReader had deletes on it concurrently. This
could possibly occur due to changes in 91eb9de3 that now use the
existing TSMReaders from the FileStore instead of creating new ones
during compaction.
This switches all the interfaces that take a string series key to
take a []byte. This eliminates many small allocations where we
converted between the two repeatedly. Eventually, this change should
propagate further up the stack.
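A hypothetical illustration of the churn this removes; each string(key) conversion below allocates and copies the series key:

```go
// Before: a string API forces a per-key allocation.
func sumKeyLenBefore(keys [][]byte) int {
	n := 0
	for _, k := range keys {
		s := string(k) // allocates and copies just to cross the boundary
		n += len(s)
	}
	return n
}

// After: with []byte interfaces the keys flow through unconverted.
func sumKeyLenAfter(keys [][]byte) int {
	n := 0
	for _, k := range keys {
		n += len(k)
	}
	return n
}
```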
* introduced UnsignedValue type
* leveraged existing int64 compression algorithms (RLE, Simple8b); see the sketch after the note below
* TSM and WAL can read and write UnsignedValue
* compaction is aware of UnsignedValue
* unsigned support for models, cursors, and writing points
NOTE: there is no support for creating unsigned points, as the line
protocol has not been modified.
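A hedged sketch of how unsigned values can reuse the int64 encoders by reinterpreting the bits on the way in and back out; the function names are illustrative, not the actual tsm1 API:

```go
func unsignedToSigned(values []uint64) []int64 {
	out := make([]int64, len(values))
	for i, v := range values {
		out[i] = int64(v) // bit-for-bit reinterpretation, no range loss
	}
	return out
}

func signedToUnsigned(values []int64) []uint64 {
	out := make([]uint64, len(values))
	for i, v := range values {
		out[i] = uint64(v) // exact inverse of the cast above
	}
	return out
}
```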
The min key was not used in OverlapsKeyRange, which caused it to return
false when it should have returned true. This caused a bug where deletes
would not write tombstones for files that actually contained the data
they were supposed to delete.
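A sketch of the corrected test, assuming the index tracks its min and max keys; a file overlaps [min, max] only when BOTH bounds are consulted, which is what the missing min-key check broke:

```go
import "bytes"

func overlapsKeyRange(fileMin, fileMax, min, max []byte) bool {
	return bytes.Compare(fileMin, max) <= 0 &&
		bytes.Compare(fileMax, min) >= 0
}
```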
Tombstone files would be written to all TSM files even if the deleted
keys or time range did not exist in the TSM file. This had the side
effect of causing shards to get recompacted back to the same state. If
many shards or large numbers of TSM files existed, disk usage and CPU
utilization would spike, causing issues.
This prevents tombstones from being written for TSM files that could not
possibly contain the series keys being deleted, or when the deleted time
range is outside the range of the file.
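A hypothetical guard before writing a tombstone; the interface and method names are assumptions patterned on the description above:

```go
// tsmFile captures just the methods this guard needs (assumed names).
type tsmFile interface {
	OverlapsKeyRange(min, max []byte) bool
	OverlapsTimeRange(min, max int64) bool
	DeleteRange(keys [][]byte, min, max int64) error
}

func deleteFromFiles(files []tsmFile, keys [][]byte, minKey, maxKey []byte, minT, maxT int64) error {
	for _, f := range files {
		// Skip files that cannot possibly contain the deleted data;
		// no tombstone is written, so no needless recompaction follows.
		if !f.OverlapsKeyRange(minKey, maxKey) || !f.OverlapsTimeRange(minT, maxT) {
			continue
		}
		if err := f.DeleteRange(keys, minT, maxT); err != nil {
			return err
		}
	}
	return nil
}
```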
If a query was running against a file being compacted, we closed the file
and the query would end wherever it had read up to. This could result
in queries that randomly lost data, while running them again showed the
full results.
We now use a reference-counting approach and move the in-use files out
of the way in the filestore, allowing the queries to complete against
the old TSM files. The new files are installed, and new queries will
use them.
Fixes #5501
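A minimal sketch of the reference-counting idea, assuming Ref/Unref style methods; not the actual FileStore implementation:

```go
import "sync/atomic"

type tsmFileRef struct {
	refs int64
}

// Ref marks the file in use by a query.
func (f *tsmFileRef) Ref() { atomic.AddInt64(&f.refs, 1) }

// Unref releases one reader; once the last in-flight query finishes,
// the replaced file can safely be closed and removed.
func (f *tsmFileRef) Unref() {
	if atomic.AddInt64(&f.refs, -1) == 0 {
		// safe point: no query is still reading the old TSM file
	}
}
```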
There are two TSMIndex implementations, the directIndex and the
indirectIndex. Originally, we only had the directIndex and later
added the indirectIndex and NewTSMReaderWithOptions in order to
allow both indexes to be used in tests and code. This has created
a problem since we really only use the directIndex for writing and
always use the indirectIndex for reading.
This change removes the NewTSMReaderWithOptions func so that it is
no longer possible to create a TSMReader with a directIndex. This
will allow a lot of the block reading code used by the directIndex
to be removed and will simplify maintenance. It also gives better test
coverage of the code that is actually used by the TSM engine now.
Some data shapes would cause files to grow larger than the max size more
quickly, which resulted in them getting skipped by the full compaction planner
at times. Some datasets that could make this happen are very large keys or
very large numbers of keys (10M). When this happened, multiple max-sized
files would accumulate but the blocks would not be full. When the shard went
cold for writes, these files would get recompacted down to the optimal size, but
a lot of space would be wasted in the meantime.
The block count was a uint16, while the index location being incremented
was an int32. This caused the uint16 value to overflow
before the index location was incremented, causing the wrong location
to be read on the next iteration of the loop. This triggered the slice
out of range errors.
Added a test that recreates the panic seen in #5257, and possibly #5202, which
is older code.
Fixes #5257
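An illustration of the wraparound only, not the actual indexing code:

```go
package main

import "fmt"

func main() {
	var blockCount uint16 = 65535
	blockCount++ // overflows back to 0
	// Widening AFTER the overflow computes the wrong next location:
	offset := int32(blockCount) * 4 // 0 instead of 65536*4
	fmt.Println(offset)
}
```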
This has a few changes in it (unfortunately). The main change is to run compactions
concurrently. While implementing this, a few query and performance bugs showed up that
are also fixed by this commit.