influxdb

Commit Graph

Author	SHA1	Message	Date
Jason Wilder	d99c5e26f6	Fix memory spike when compacting overwritten points If a large series contains a point that is overwritten, the compactor would load the whole series into RAM during a full compaction. If the series was large, it could cause very large RAM spikes and OOMs. The change reworks the compactor to merge blocks more incrementally similar to the fix done in #6556.	2016-05-05 22:31:30 -06:00
Jason Wilder	a0ac754802	Fix loading huge series into RAM when points are overwritten In some query scenarios, if there are a lot of points on disk spread across many blocks in TSM files and a point is overwritten near the begginning of the shard's timerange, the full series could be loaded into RAM triggering OOMs and huge allocations. The issue was that the KeyCursor code that handles overwriting points had a simple implementation that just deduped the whole series in this case. This falls over when the series is quite large. Instead, the KeyCursor has been changed to only decode blocks with updated points. It then keeps track of what section of the blocks have been read so they are not re-read when the later points are decoded. Since the points in a block are always sorted, the code was also changed to remove the Deduplicate calls since they end up reallocating the slice. Instead, we do a sorted merge and re-use the slice as much as we can.	2016-05-05 09:34:44 -06:00
Jason Wilder	abcb559b09	Remove index meta data when series and measurements are gone This remove the dropMeta param from the tsdb.Store.DeleteSeries and lets the shard determine when to remove the meta data from the index based on what series still have data in the shard. This uncovered a nasty bug in compactions where a fully deleted series would prematurely end the compactions and not carry forward the rest of the data in the TSM file. This is now fixed as well.	2016-04-29 16:31:57 -06:00
Jason Wilder	4e353867d5	Fix first block not getting purged when deleting series	2016-04-27 17:08:00 -06:00
Jason Wilder	6042e114a1	Remove tombstoned values during compaction This will skip blocks that are fully tombstoned as well as remove points that have been removed within a block.	2016-04-27 13:09:53 -06:00
Jason Wilder	a789e819a3	Remove NewTSMReaderWithOptions There are two TSMIndex implementations, the directIndex and the indirectIndex. Originally, we only had the directIndex and later added the indirectIndex and NewTSMReaderWithOptions in order to allow both indexes to be used in tests and code. This has created a problem since we really only use the directIndex for writing and always use the indirectIndex for reading. This changes removes the NewTSMReaderWithOptions func so that it is no longer possible to create a TSMReader with a directIndex. This will allow a lot of the block reading code used by the directIndex to be removed and simplify maintainence. It also gives better test coverage of the code that is actually used by the TSM engine now.	2016-04-27 13:09:52 -06:00
Seif Lotfy	c6e3c87e00	Add Block checksum validation and "influx_inspect verify" tool Fixes #5502	2016-04-19 22:33:03 +02:00
Jason Wilder	a4e5446ddd	Return error when TSM writer close returns one The TSM writer uses a bufio.Writer that needs to be flushed before it's closed. If the flush fails for some reason, the error is not handled by the defer and the compactor continues on as if all is good. This can create files with truncated indexes or zero-length TSM files. Fixes #5889	2016-03-21 15:00:36 -06:00
Jason Wilder	8d70d65a82	Convert time.Time to int64	2016-02-25 15:15:01 -07:00
Ben Johnson	5a0d1ab7c1	rename influxdb/influxdb to influxdata/influxdb This commit changes all the import and URL references from: github.com/influxdb/influxdb to: github.com/influxdata/influxdb	2016-02-10 10:26:18 -07:00
Jason Wilder	756421ec4a	Look for fully compacted block in addition to max size during compaction Some data shapes would cause files to grow larger than the max size more quickly which resulted in them getting skipped by the full compaction planner at times. Some datasets that could make this happen are very large keys or very large numbers of keys (10M). When this happened, multiple max sized files would accumulate but the blocks would not be full. When the shard went cold for writes, these files would get recompacted down to the optimal size, but a lot of space would be wasted in the mean time.	2016-01-07 15:18:42 -07:00
Jason Wilder	b6da176a4b	Fix direct index size not calculated	2015-12-23 18:01:11 -07:00
Jason Wilder	f9ae8077da	Allow compactions to run when files have tombstones	2015-12-23 18:01:11 -07:00
Jason Wilder	a38c95ec85	Update compactions to run concurrently This has a few changes in it (unfortuantely). The main change is to run compactions concurrently. While implementing this, a few query and performance bugs showed up that are also fixed by this commit.	2015-12-23 18:01:11 -07:00
Jason Wilder	48d4156eac	Fix blocks not sorted correctly when chunking	2015-12-23 18:01:11 -07:00
Jason Wilder	bb2562b2ab	Return CompactionGroups from planning	2015-12-23 18:01:11 -07:00
Jason Wilder	611017f4ed	Add comments	2015-12-18 10:00:07 -07:00
Jason Wilder	fd2a409ea3	Skip decoding blocks that are already full	2015-12-17 12:47:05 -07:00
Jason Wilder	825296ddd8	Add comments	2015-12-16 11:30:06 -07:00
Jason Wilder	3893bc60e1	Speed up TSM compactor Just keep the current block for each iterator in the buffers.	2015-12-16 11:16:17 -07:00
Jason Wilder	00f570441b	Convert TSMKeyIterator to return blocks	2015-12-16 11:16:17 -07:00
Jason Wilder	59a57d8f73	Convert CacheKeyIterator to return encoded blocks	2015-12-16 11:16:17 -07:00
Jason Wilder	0623648140	Add chunking support back to TSMKeyIterator Was removed when MergeIterator was deleted.	2015-12-16 11:16:17 -07:00
Jason Wilder	31b97c3fe0	Add max points per block back for CacheKeyIterator Was removed when MergeIterator was removeed.	2015-12-16 11:16:16 -07:00
Jason Wilder	992aea7bd3	Merge pull request #5060 from influxdb/jw-drop-db Cancel writing TSM files when engine closes	2015-12-08 16:16:07 -07:00
Paul Dix	b192136887	Merge pull request #5058 from influxdb/pd-update-compaction-logic Update TSM compaction logic	2015-12-08 18:14:15 -05:00
Paul Dix	27cc2ea0cc	Update compact.Plan	2015-12-08 18:01:31 -05:00
Jason Wilder	d7cff651d1	Cancel writing TSM files when engine closes If the engine is closed while a compaction is going on, the close call blocks until the goroutine exits. This could be several minutes because the control does not return back up to the channel selector while there is still data to write.	2015-12-08 15:41:53 -07:00
Paul Dix	96445a53a7	Update TSM compaction logic * Update compaction to look at newest files of the smallest step first * Update compaction to look at older files in larger steps if newer files don't have enough small steps to compact * Changed the TestDefaultCompactionPlanner_CombineSequence test to reflect what's possible now. We'd only have multiple files in the same generation if the all files but one were over the max allowable size. * Clean up the logic on when full compactions are run and when planning can be skipped	2015-12-08 17:33:38 -05:00
Jason Wilder	99c313ddae	Fix leaking TSM files when compacting The files being read were not closed after the compaction ran causing them to leak. Fixes #5046	2015-12-08 12:55:30 -07:00
Jason Wilder	f245b44afa	Set full compaction duration option on planner Was set on engine and not planner so it was always 0.	2015-12-08 09:56:36 -07:00
Paul Dix	8096c6b845	Update TSM, address PR #5011 comments * Moved TSM file extension to a constant * Fixed typos * Changed group.size() back to being a uint64 since it can have multiple files up to 4GB each.	2015-12-07 14:47:17 -05:00
Paul Dix	440a8a8a1f	Change all TSM file sizes to uint32	2015-12-07 10:12:24 -05:00
Paul Dix	937233d988	Update TSM compaction planning logic * Update Plan to do a full compaction if cold for writes * Remove MaxFileSize as a config variable from Compactor. Should be a set constant * Update Plan to keep track of if the last check was fully compacted so we can skip future planning calls * Update compact min file count to 3 so that compactions run more frequently	2015-12-07 08:26:30 -05:00
Paul Dix	1bee7d1512	Update TSM, remove old version, add config * remove rolloverTSMFileSize constant that is no longer used * remove the maxGenerationFileCount since it is no longer a limitation that's necessary with the new compaction scheme. We no longer read WAL segments as part of the compaction so memory is only used as we read in each individual key * remove minFileCount and switch to a user configurable variable * remove the mutex from WALSegmentWriter. There's never more than one open in the WAL at one time and it's not exported through any function so the lock on the WAL should be used. This simplified keeping track of the last write time and removed a bunch of unnecessary locks. * update WALSegmentWriter.Write to take the compressed bytes so that encoding and compression can occur before the call to write (while we don't hold the WAL lock) * remove a bunch of unnecessary locking in WAL.writeToLog * Add check for TSM file magic number and vesion * Remove old tsm, log, and unused cursor code * Remove references to tsm1dev everywhere except in the inspector * Clean up config options for compaction and snapshotting * Remove old TSM configuration options * Update the config.sample.toml with TSM options * Update WAL compact to force if it has been cold for writes for a configurable period of time (1h by default)	2015-12-06 18:50:39 -05:00
Jason Wilder	33a33e6a23	Fix 32bit int overflow of constant value	2015-12-05 13:09:18 -07:00
Jason Wilder	41b24995a7	Compcation fixes	2015-12-05 12:19:28 -07:00
Jason Wilder	6592615958	Updated compaction strategy This changes compacting files to merge sequences of files in lower generations up to later generations	2015-12-04 23:30:39 -07:00
Jason Wilder	357b88c439	Increment sequence of max generation when compaction files	2015-12-04 13:46:28 -07:00
Jason Wilder	52bec1f7f6	Change TSM file naming to generation-sequence.tsm	2015-12-04 11:51:33 -07:00
Jason Wilder	4b7cc6720a	Merge pull request #4983 from influxdb/jw-tsm-deletes2 Implement delete series/measurement	2015-12-04 10:02:11 -07:00
Jason Wilder	c54a3da0ca	Implement delete series/measurement	2015-12-04 09:10:26 -07:00
Jason Wilder	66c9ef862e	Fix regressions Something broke with writing to the WAL now that compactions are running concurrently. There was also a performance problem with Next/Prev doing twice as many searches as necessary.	2015-12-03 14:25:03 -07:00
Paul Dix	be4891c40b	Update TSM write snapshot, Compactor * Ensure that writing snapshots in engine are goroutine safe * Add Clone method to Compactor	2015-12-03 11:49:47 -05:00
Paul Dix	b0fb8a0a27	Update TSM cache, compact, wal, encoding * Update cache to have a single slice of values for a key (removed checkpoints) * Changed compact.Plan to only worry about TSM files. * Updated Plan to not return an error since there was no case in which it would. * Update WAL to not keep stats since they're no longer needed. * Update engine to flush the Cache/WAL to a new TSM file when the min threshold is hit. * Split compact logic between TSM compacts and WAL/Cache writes. * Remove unnecessary merge iterator, wal segment iterator, and other no longer necessary stuff. * Remove the asending bool from the Dedupe method. Values should always be in ascending order. It's up to the cursor to iterate through values based on the direction. Giving the cursor responsibility makes it so we don't need to sort, dedupe or reallocate anything for different query orders. * Updated engine to use its locks to ensure writes and cache flushes don't cause a race. * Update all tests with new signatures. Removed a bunch of tests around TSM rewrites and WAL segment iteration that are no longer necessary.	2015-12-03 08:11:50 -05:00
Jason Wilder	6847a6ba0c	Fix rebase	2015-12-02 09:47:16 -07:00
Jason Wilder	751d1dd467	Don't rewrite TSM files while WAL segments exist This approach is not working and needs to be reworked.	2015-12-02 09:45:24 -07:00
Jason Wilder	5744f5ba02	Add ability to filter values by time when writing TSM files	2015-12-02 09:45:24 -07:00
Jason Wilder	708266da69	Cache related compaction fixes	2015-12-02 09:45:24 -07:00
Jason Wilder	231c052003	Don't limit WAL segments during compaction Since they are already loaded in the cache, this limit is not really needed anymore.	2015-12-02 09:45:24 -07:00

1 2

61 Commits (10db0aafeb8b21459a537d455afe48f8b19e22c4)