Deduplicate is called from various places in the engine and can cause
a lot of garbage to get created. It first creates a map and then
adds each value to the map in order (1st alloc). It then creates a
new slice (2nd alloc) and appends everything from the map to the slice.
Finally, it sorts the new slice (3rd alloc).
This switches the algorithm to use a stable sort and reuse the existing
slice, avoiding those allocations.
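A minimal sketch of the new approach, using simplified stand-ins for the engine's value types (the names and fields here are illustrative, not the actual API):

```go
package tsm1

import "sort"

// Value and Values are simplified stand-ins for the engine's types.
type Value struct {
	UnixNano int64
	V        float64
}

type Values []Value

func (a Values) Len() int           { return len(a) }
func (a Values) Less(i, j int) bool { return a[i].UnixNano < a[j].UnixNano }
func (a Values) Swap(i, j int)      { a[i], a[j] = a[j], a[i] }

// Deduplicate stable-sorts the values by time and removes entries with
// duplicate timestamps in place, keeping the later occurrence. No map and
// no new slice are allocated; the input's backing array is reused.
func (a Values) Deduplicate() Values {
	if len(a) <= 1 {
		return a
	}

	// The stable sort preserves the relative order of equal timestamps, so
	// the most recently written duplicate still wins below.
	sort.Stable(a)

	out := a[:1]
	for _, v := range a[1:] {
		if v.UnixNano == out[len(out)-1].UnixNano {
			out[len(out)-1] = v // overwrite with the newer point
			continue
		}
		out = append(out, v) // stays within a's existing capacity
	}
	return out
}
```

Because `out` only overwrites positions that have already been read, the whole dedup stays within the input's backing array.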
The decoders were held on each iterator to avoid creating them all
the time. Some of them use quite a bit of memory, so they can
be expensive to create when querying across many series.
Instead, move them to a reusable pool where we create only the minimum
that could actively be in use. This reduces garbage and makes the iterators
less expensive to create.
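A minimal sketch of such a pool, sized to the number of decoders that can actually be in use at once (the names here are hypothetical, not the engine's exact pool type):

```go
package tsm1

import "sync"

// decoderPool is a fixed-capacity pool of reusable decoders. Get hands out
// an idle decoder or builds a new one; Put parks a decoder for reuse but
// drops it if the pool is already full, so we never hold more than the
// number that could actively be in use.
type decoderPool struct {
	mu    sync.Mutex
	free  []interface{}
	newFn func() interface{}
}

func newDecoderPool(capacity int, fn func() interface{}) *decoderPool {
	return &decoderPool{free: make([]interface{}, 0, capacity), newFn: fn}
}

func (p *decoderPool) Get() interface{} {
	p.mu.Lock()
	defer p.mu.Unlock()
	if n := len(p.free); n > 0 {
		d := p.free[n-1]
		p.free = p.free[:n-1]
		return d
	}
	return p.newFn()
}

func (p *decoderPool) Put(d interface{}) {
	p.mu.Lock()
	defer p.mu.Unlock()
	if len(p.free) < cap(p.free) {
		p.free = append(p.free, d)
	}
}
```

An iterator takes a decoder from the pool when it needs one and returns it when it is done, so the expensive decoder state is shared across iterators instead of being rebuilt for every series.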
This allows encoders to be re-used and maintained in a pool to
avoid allocating new ones on every compaction and write of an encoded
block. The pool used is not a sync.Pool, ensuring that the encoders
will not be garbage collected.
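A minimal sketch of the idea, with a hypothetical FloatEncoder standing in for the real encoders: a plain buffered channel keeps parked encoders reachable, so unlike a sync.Pool they are never reclaimed by the garbage collector between compactions.

```go
package tsm1

// FloatEncoder is a hypothetical stand-in; real encoders carry large
// internal buffers that are expensive to reallocate.
type FloatEncoder struct {
	buf []byte
}

func (e *FloatEncoder) Reset() { e.buf = e.buf[:0] }

// floatEncoderPool holds idle encoders. Because the channel keeps a
// reference to each parked encoder, the GC cannot collect them, unlike
// objects parked in a sync.Pool.
var floatEncoderPool = make(chan *FloatEncoder, 16)

func getFloatEncoder() *FloatEncoder {
	select {
	case e := <-floatEncoderPool:
		e.Reset()
		return e
	default:
		return &FloatEncoder{buf: make([]byte, 0, 1024)}
	}
}

func putFloatEncoder(e *FloatEncoder) {
	select {
	case floatEncoderPool <- e:
	default:
		// Pool is full; drop this encoder and let it be collected.
	}
}
```

A compaction or encoded-block write then brackets its encoding with getFloatEncoder / putFloatEncoder instead of constructing a new encoder each time.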
Due to a bug in compactions, it's possible some blocks may have duplicate
points stored. If those blocks are decoded and re-compacted, an assertion
panic could trigger.
We now dedup those blocks if necessary to remove the duplicate points
and avoid the panic.
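A minimal sketch of the check, reusing the simplified Value type from the earlier sketch; only blocks whose timestamps are not strictly increasing pay for a dedup pass before being re-compacted.

```go
package tsm1

// Value is the simplified stand-in used in the earlier sketches.
type Value struct {
	UnixNano int64
	V        float64
}

// needsDedup reports whether a decoded block contains out-of-order or
// duplicate timestamps, which can happen in blocks written by the buggy
// compactions. Blocks that fail this check are sorted and deduplicated
// before the merge, so its ordering assertion no longer panics.
func needsDedup(a []Value) bool {
	for i := 1; i < len(a); i++ {
		if a[i].UnixNano <= a[i-1].UnixNano {
			return true
		}
	}
	return false
}
```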
If a series contains a point that is overwritten, the compactor
would load the whole series into RAM during a full compaction. If
the series was large, this could cause very large RAM spikes and OOMs.
The change reworks the compactor to merge blocks more incrementally,
similar to the fix done in #6556.
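A minimal sketch of the incremental shape, under assumed names (nextBlock, writeBlock, and maxPointsPerBlock are hypothetical): instead of decoding the entire series before writing anything, the compactor keeps a small buffer and flushes a block as soon as it is full, so memory stays bounded by a couple of blocks rather than the whole series.

```go
package tsm1

// Value is the simplified stand-in used in the other sketches.
type Value struct {
	UnixNano int64
	V        float64
}

const maxPointsPerBlock = 1000

// compactKey merges the decoded, time-ordered blocks for one series key and
// writes finished blocks immediately. nextBlock yields one decoded block at
// a time (already sorted, with overlaps resolved upstream); writeBlock
// encodes and writes a completed block. Only a couple of blocks' worth of
// points are ever buffered in RAM.
func compactKey(nextBlock func() ([]Value, bool), writeBlock func([]Value) error) error {
	var buf []Value
	for {
		block, ok := nextBlock()
		if !ok {
			break
		}
		buf = append(buf, block...)

		// Flush full blocks as soon as we have them instead of accumulating
		// the entire series first.
		for len(buf) >= maxPointsPerBlock {
			if err := writeBlock(buf[:maxPointsPerBlock]); err != nil {
				return err
			}
			buf = append(buf[:0], buf[maxPointsPerBlock:]...)
		}
	}
	if len(buf) > 0 {
		return writeBlock(buf)
	}
	return nil
}
```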
In some query scenarios, if there are a lot of points on disk spread
across many blocks in TSM files and a point is overwritten near the
beginning of the shard's time range, the full series could be loaded
into RAM, triggering OOMs and huge allocations.
The issue was that the KeyCursor code that handles overwriting points
had a simple implementation that just deduped the whole series in this
case. This falls over when the series is quite large.
Instead, the KeyCursor has been changed to only decode blocks with
updated points. It then keeps track of which sections of the blocks
have already been read so they are not re-read when later points are
decoded.
Since the points in a block are always sorted, the code was also changed
to remove the Deduplicate calls, which end up reallocating the slice.
Instead, we do a sorted merge and re-use the slice as much as we can.
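A minimal sketch of that merge, with hypothetical names (keyCursor, mergeOverwrites): both the buffered points and the newly decoded overwrite block are already sorted, so duplicates are resolved in a single merge pass, the cursor remembers how far the caller has read so earlier sections are never revisited, and the two buffers are swapped so the old one is reused as the next merge destination instead of being reallocated.

```go
package tsm1

// Value is the simplified stand-in used in the other sketches.
type Value struct {
	UnixNano int64
	V        float64
}

// keyCursor is a hypothetical sketch of the cursor state.
type keyCursor struct {
	buf     []Value // decoded points for the key, sorted by time
	scratch []Value // reused merge destination, avoids reallocating
	pos     int     // next unread index; earlier entries are never re-read
}

// mergeOverwrites merges a newly decoded block of overwriting points into
// the unread portion of the buffer. Both inputs are sorted, so no
// Deduplicate call is needed: a duplicate timestamp is replaced as we go.
func (c *keyCursor) mergeOverwrites(updates []Value) {
	unread := c.buf[c.pos:]
	out := c.scratch[:0]

	i, j := 0, 0
	for i < len(unread) && j < len(updates) {
		switch {
		case unread[i].UnixNano < updates[j].UnixNano:
			out = append(out, unread[i])
			i++
		case unread[i].UnixNano > updates[j].UnixNano:
			out = append(out, updates[j])
			j++
		default: // same timestamp: the overwriting point wins
			out = append(out, updates[j])
			i++
			j++
		}
	}
	out = append(out, unread[i:]...)
	out = append(out, updates[j:]...)

	// Swap buffers so the old one becomes the scratch space for the next merge.
	c.buf, c.scratch = out, c.buf[:0]
	c.pos = 0
}
```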
This commit makes a number of performance improvements to
reduce allocations during query execution. Several objects
and buffers are now reused across the components to avoid
allocations.
Previously a simple `count(value)` query across 1M points
would require 26,000+ allocations. After the changes in
this commit that number has been reduced to 88.
MinTime is now in the index for each block, so storing it in the block
header is redundant. The encodings also store it in their header, so
we are actually storing it 3 times.
Removing it is an incompatible change to the current tsm1 file format.
There are a lot of allocations performed when decoding blocks. These
types can be re-used to reduce allocations in many cases. This change
allows a type-specific slice to be passed in to the decode funcs and re-used
if it is large enough.
The existing decode is left for backwards compatibility but is not
very efficient right now. It may be removed.
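A minimal sketch of the signature and the capacity check (the block layout here is a toy 16-bytes-per-point format, not the real tsm1 encoding; FloatValue and DecodeFloatBlock are illustrative names):

```go
package tsm1

import (
	"encoding/binary"
	"math"
)

// FloatValue is a stand-in for the engine's decoded float point.
type FloatValue struct {
	UnixNano int64
	V        float64
}

// DecodeFloatBlock decodes block into a. If the caller passes back the
// slice from its previous call and its capacity is large enough, it is
// reused and the decode allocates nothing; otherwise a larger slice is
// allocated once and can be reused on subsequent calls.
func DecodeFloatBlock(block []byte, a []FloatValue) []FloatValue {
	n := len(block) / 16

	if cap(a) < n {
		a = make([]FloatValue, n)
	} else {
		a = a[:n]
	}

	for i := 0; i < n; i++ {
		off := i * 16
		a[i].UnixNano = int64(binary.BigEndian.Uint64(block[off:]))
		a[i].V = math.Float64frombits(binary.BigEndian.Uint64(block[off+8:]))
	}
	return a
}
```

Callers hold on to the returned slice and pass it back on the next call, e.g. `values = DecodeFloatBlock(block, values)`.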
If DecodeSameTypeBlock is called on an empty Values slice, it would
panic with an index out of bounds error. This func can actually be removed
because DecodeBlock can already determine what type of values are encoded.
This will still panic if the block cannot be decoded for other reasons.
Fixes #4365
If similar float values were encoded, the number of leading bits would
overflow the 5 bits available to store them (e.g. storing 33 in 5 bits). When
decoding, the values after the overflowed value would spike to very large and
very small values.
To prevent the overflow, we clamp the value to 31, which is the maximum
number of leading zero bits we can encode.
Fixes #4357
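A minimal sketch of the clamp in a Gorilla-style XOR float encoder (the rest of the encoder is elided; only the bounds check matters here):

```go
package tsm1

import "math/bits"

// clampedLeadingZeros returns the number of leading zero bits in the XOR of
// two consecutive float values, clamped to 31 so it always fits in the
// 5-bit field of the encoding. Without the clamp, similar values can
// produce counts such as 33, which overflow the field and corrupt every
// value decoded after that point.
func clampedLeadingZeros(xor uint64) uint64 {
	lz := uint64(bits.LeadingZeros64(xor))
	if lz > 31 {
		lz = 31
	}
	return lz
}
```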