influxdb

Commit Graph

Author	SHA1	Message	Date
Stuart Carnie	dee8977d2c	chore: move v2/v1/tsdb → v2/tsdb	2020-08-26 10:46:47 -07:00
Mark Rushakoff	f2898d1992	Wipe out workspace in preparation for v2 merge "Knock knock." "Who's there?" "InfluxDB Veet." ...	2019-01-11 10:38:50 -08:00
Edd Robinson	42c3adeffc	simplify packages under tsdb	2018-01-21 09:41:27 -08:00
Stuart Carnie	f3d45ba301	influxdata/influxdb/influxql -> influxdata/influxql	2017-10-30 14:40:26 -07:00
Stuart Carnie	3e28323a10	Simplified DecodeBlock functions array has already been sized correctly * eliminates bounds checking for each element access * reduces decoding of 30,000,000 points via storage API from 584ms to 540ms on average	2017-10-25 13:38:07 -07:00
Jason Wilder	796de3dcea	Reduce encoder pool checkout contention With higher cardinalities, the encoder pools where become a bottleneck. This changes the snapshot compactions ot checkout one encoder of each type and re-use it while writing the snapshots as opposed to repeatedly checking it out and in.	2017-09-19 15:27:26 -06:00
Jason Wilder	3d12c62121	Avoid repeatedly growning decoded values slices	2017-07-28 11:00:56 -06:00
Stuart Carnie	eec80692c4	Taught tsm1 storage engine how to read and write uint64 values * introduced UnsignedValue type * leveraged existing int64 compression algorithms (RLE, Simple 8B) * tsm and WAL can read and write UnsignedValue * compaction is aware of UnsignedValue * unsigned support to model, cursors and write points NOTE: there is no support to create unsigned points, as the line protocol has not been modified.	2017-07-24 09:03:22 -07:00
Jason Wilder	28422f2fec	Use consistent receiver var name for Value types	2017-04-28 13:20:55 -06:00
Jason Wilder	ca9c67a877	Generate encode*Values funcs	2017-03-14 11:54:53 -06:00
Jason Wilder	2f7d4995b4	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-09 16:27:07 -07:00
Mark Rushakoff	41415cf2fb	Update godoc for tsm1 package	2017-01-02 07:30:18 -08:00
Jason Wilder	dea87703cd	Reduce UnixNano pointer call	2016-12-19 14:17:01 -07:00
Jason Wilder	3a5a01181b	Switch all Value types from pointers	2016-11-15 16:13:55 -07:00
Joe LeGasse	743946fafb	models: Add FieldIterator type The FieldIterator is used to scan over the fields of a point, providing information, and delaying parsing/decoding the value until it is needed. This change uses this new type to avoid the allocation of a map for the fields which is then thrown away as soon as the points get converted into columns within the datastore.	2016-10-03 16:30:21 -06:00
Jason Wilder	20f1fb3f7f	Replace gotos with anonymous functions	2016-10-03 12:08:53 -06:00
Jason Wilder	1b462312a9	Re-use decoder pools The decoders were held onto each iterator to avoid creating them all the time. Some of them have use quite a bit of memory so they can be expensive to create when querying across many series. Intead, more them to a re-usable pool where we create the minimum that could active be in use. This reduces garbage as well as makes the iterators less expensive to create.	2016-10-03 10:21:54 -06:00
rw	c3fc87b619	Remove dangling named return value.	2016-09-27 14:18:32 -07:00
rw	fcd425c8c6	Incorporate style feedback from Joe.	2016-09-27 14:07:06 -07:00
rw	9429a2f96a	Gotos to simplify uses of the new encoder pools. For maintainability.	2016-09-27 11:47:25 -07:00
rw	f131d3cc77	Fix off-by-one error that could panic.	2016-09-26 17:03:03 -07:00
Jason Wilder	139ef8062e	Simplify encoder buffer usage	2016-09-26 12:19:16 -06:00
Jason Wilder	7f96d78b79	Make encoder re-usable This allows encoders to be re-used and maintained in a pool to avoid allocating new ones on every compactions and write of an encoded block. The pool used is not a sync.Pool to ensure that the encoders will not be garbage collected.	2016-09-26 12:19:15 -06:00
Mark Rushakoff	5b549ffdfe	Handle bounds errors in UnpackBlock	2016-07-19 15:43:27 -07:00
Jason Wilder	0b481ff627	Fix pathalogical TSM query case This fixes a pathalogical query condition cause by and problematic structuring of TSM files based on how points were written. The condition can occur when there are multiple TSM files and a large number of points are written into the past. The earlier existing TSM files must also have points in the past and close to the present causing their time range to eclipse the later files. When this condition occurs, some queries can spend an excessive amount of time merge all the overlapping blocks. The fix was to constrain the window of overlapping blocks based on the first one we ran into. There was also a simple case in the Merge where we could skip the binary search path and just append the two inputs.	2016-05-25 09:14:17 -06:00
Jason Wilder	4f39cb2f97	Fix case where Merge return unsorted values	2016-05-09 15:40:34 -06:00
Jason Wilder	d99c5e26f6	Fix memory spike when compacting overwritten points If a large series contains a point that is overwritten, the compactor would load the whole series into RAM during a full compaction. If the series was large, it could cause very large RAM spikes and OOMs. The change reworks the compactor to merge blocks more incrementally similar to the fix done in #6556.	2016-05-05 22:31:30 -06:00
Jason Wilder	a0ac754802	Fix loading huge series into RAM when points are overwritten In some query scenarios, if there are a lot of points on disk spread across many blocks in TSM files and a point is overwritten near the begginning of the shard's timerange, the full series could be loaded into RAM triggering OOMs and huge allocations. The issue was that the KeyCursor code that handles overwriting points had a simple implementation that just deduped the whole series in this case. This falls over when the series is quite large. Instead, the KeyCursor has been changed to only decode blocks with updated points. It then keeps track of what section of the blocks have been read so they are not re-read when the later points are decoded. Since the points in a block are always sorted, the code was also changed to remove the Deduplicate calls since they end up reallocating the slice. Instead, we do a sorted merge and re-use the slice as much as we can.	2016-05-05 09:34:44 -06:00
Jason Wilder	97504a552c	Support time range tombstones in FileStore/KeyCursor	2016-04-27 13:09:52 -06:00
Ben Johnson	286072f65a	update dep: simple8b @ b421ab40	2016-04-22 09:46:05 -06:00
Jason Wilder	f841a90d35	Use int64 instead of time.Time in timestamp encoder/decoder	2016-04-19 10:25:27 -06:00
Ben Johnson	525e22c92b	tsm1 query engine alloc reduction This commit makes a number of performance improvements to reduce allocations during query execution. Several objects and buffers are now reused across the components to avoid allocations. Previously a simple `count(value)` query across 1M points would require 26,000+ allocations. After the changes in this commit that number has been reduced to 88.	2016-04-11 14:50:59 -06:00
Joe LeGasse	f10c300765	Update to conversion tool to work in current versions After adding type-switches to the tsm1 packages, the custom implementation found in the conversion tool broke. This change uses tsm1.NewValue() instead of a custom implementation. This change also ensures that the tsm1.Value interface can only be implemented internally to allow for the optimized type-switch based encoding	2016-03-30 13:26:46 -04:00
Joe LeGasse	344e5abd41	Changed type-switch a few places to reduce allocations. Slices of tsm1.Value interfaces are only ever used with all the same types, and the previous code would switch on the type returned from a call to Value(), which allocated and returned an interface{} object for the underlying value. This change instead type-switches on the tsm1.Value object itself, allowing it direct access to the underlying value field, eliminating the unecessary allocations.	2016-03-11 15:57:05 -05:00
Jason Wilder	8d70d65a82	Convert time.Time to int64	2016-02-25 15:15:01 -07:00
Ben Johnson	5a0d1ab7c1	rename influxdb/influxdb to influxdata/influxdb This commit changes all the import and URL references from: github.com/influxdb/influxdb to: github.com/influxdata/influxdb	2016-02-10 10:26:18 -07:00
Ben Johnson	b8918a780c	integer support	2016-02-10 09:40:25 -07:00
Ben Johnson	00806de9b8	refactor query engine	2016-02-10 09:40:25 -07:00
INADA Naoki	80a637904d	tsm1: Use unixnano instead of time.Time	2016-02-03 10:05:40 +09:00
INADA Naoki	771253256b	FloatValue uses unixnano instead of time.Time	2016-02-03 09:57:00 +09:00
Ben Johnson	98baf078d0	tsm1 query performance improvements	2016-01-27 13:42:32 -07:00
Jason Wilder	fd2a409ea3	Skip decoding blocks that are already full	2015-12-17 12:47:05 -07:00
Jason Wilder	cf341eaa6a	Remove MinTime from blocks MinTime is not in the index for each block so storing it in the block header is redundant. The encodings also store it in their header so we are actually storing it 3 times. Removing this is an incompatible change with the current tsm1 file format.	2015-12-07 11:26:58 -07:00
Paul Dix	9637446ba9	Merge pull request #4990 from influxdb/pd-loadmetadata-wal Update TSM engine, WAL and encoding	2015-12-04 18:21:47 -05:00
Paul Dix	b0f3dcc8cc	Update TSM metadata loading and write snapshot * Update WriteSnapshot to always call synchronously * Update LoadMetadataIndex to load WAL metadata from the cache	2015-12-04 16:03:17 -05:00
Jason Wilder	c7e37766e7	Avoid repetitive index searches when iterating over cursors First pass at TSM cursor iteration ended up searching the file indexes too frequently and hurt performance. This changes that to search it once and then have the cursor hold onto the block locations to seek to. Doubles the query performance from the first iteration, but still a lot of room for improvement.	2015-12-04 10:02:59 -07:00
Paul Dix	eafb703afc	Update TSM engine, WAL and encoding * Add InfluxQLType to Values to map the TSM type to InfluxQL * Fix bug in WAL where close wouldn't nil out the currentSegment after closing it * Export writeSnapshot to be used in tests, add argument to run it async or not * Update reloadCache to load temporary metadata information in the engine * Update LoadMetadataIndex to use the temp WAL metadata information	2015-12-04 11:09:39 -05:00
Paul Dix	b0fb8a0a27	Update TSM cache, compact, wal, encoding * Update cache to have a single slice of values for a key (removed checkpoints) * Changed compact.Plan to only worry about TSM files. * Updated Plan to not return an error since there was no case in which it would. * Update WAL to not keep stats since they're no longer needed. * Update engine to flush the Cache/WAL to a new TSM file when the min threshold is hit. * Split compact logic between TSM compacts and WAL/Cache writes. * Remove unnecessary merge iterator, wal segment iterator, and other no longer necessary stuff. * Remove the asending bool from the Dedupe method. Values should always be in ascending order. It's up to the cursor to iterate through values based on the direction. Giving the cursor responsibility makes it so we don't need to sort, dedupe or reallocate anything for different query orders. * Updated engine to use its locks to ensure writes and cache flushes don't cause a race. * Update all tests with new signatures. Removed a bunch of tests around TSM rewrites and WAL segment iteration that are no longer necessary.	2015-12-03 08:11:50 -05:00
Philip O'Toole	bad0f657de	Deduplicate supports requesting sort order	2015-11-30 16:21:44 -08:00
Jason Wilder	25206c729c	Add compactor type	2015-11-24 08:50:07 -07:00

1 2

67 Commits (dee8977d2c6598cb2d17e9334ea997c99853640a)