influxdb

Commit Graph

Author	SHA1	Message	Date
Jon Seymour	7eabae68de	tsm: cache: add a test for the write sequence {6,1,snapshot,7,2} Consider the write sequence: 6,1,snapshot,7,2. The hot cache gets deduplicated, so is 2,7. Now consider the test if 1 >= 2, this is false, so needSort is not set to true. The problem is the implicit assumption that the snapshot is always sorted by the time that merged() runs, but this may not be true. Signed-off-by: Jon Seymour <jon@wildducktheories.com>	2016-02-26 07:43:50 +11:00
Jon Seymour	eb7eec078d	tsm: cache: introduce commit lock to Cache Currently two compactors can execute Engine.WriteSnapshot at once. This isn't thread safe since both threads want to make modifications to Cache.snapshot at the same time. This commit introduces a lock which is acquired during Snapshot() and released during ClearSnapshot(), ensuring that at most one thread executes within Engine.WriteSnapshot() at once. To ensure that we always release this lock, but only release the snapshot resources on a successful commit, we modify ClearSnapshot() to accept a boolean which indicates whether the write was successful or not and guarantee to call this function if Snapshot() has been called. Signed-off-by: Jon Seymour <jon@wildducktheories.com>	2016-02-25 12:10:37 +11:00
Jason Wilder	017c24c98e	Simplify cache snapshotting The Cache had support for taking multiple snapshots to support writing multiple snapshots to TSM files concurrently if that happened to be a bottleneck. In practice, this is never a bottleneck and we only run one snappshoting goroutine continously per shard which has worked well for all workloads. The multiple snapshot support introduces some unhandled failure scenarios where wal segments could be removed without writing them to TSM files. If a snapshot compaction fails to write due to transient disk errors, subsequent snapshots will continue, but the failed one will not be retried. When the subsequent ones succeeded, all closed wal segments are removed causing data loss. This change simplifies the snapshotting capability to ensure that there is only ever one snapshot. If one fails, the next snapshot will update the existing snapshot and retry all of old and new data. Fixes #5686	2016-02-23 09:38:51 -07:00
Mark Rushakoff	191de2670c	Fix non-compiling test	2016-02-22 13:49:11 -08:00
Mark Rushakoff	fc5c8597ab	Merge pull request #5758 from influxdata/mr-disk-stats Track cache, WAL, filestore stats within tsm1 engine	2016-02-22 13:01:55 -08:00
Jason Wilder	aa2e878019	Fix cache not deduplicating points in some cases The cache had some incorrect logic for determine when a series needed to be deduplicated. The logic was checking for unsorted points and not considering duplicate points. This would manifest itself as many points (duplicate) points being returned from the cache and after a snapshot compaction run, the points would disappear because snapshot compaction always deduplicates and sorts the points. Added a test that reproduces the issue. Fixes #5719	2016-02-22 13:24:42 -07:00
Mark Rushakoff	e76967efb6	Add stats to tsm1.Cache	2016-02-19 16:37:34 -08:00
INADA Naoki	898babf616	add float bench	2016-02-03 03:12:16 +09:00
Jason Wilder	631ecc23de	Fix growing destination buffer during WAL entry encoding The test to see if the destination buffer for encoding and decoding a WAL entry was broken and would cause a panic if there were large batches that would overflow the buffer size. Fixes #5075	2015-12-10 11:46:40 -08:00
Paul Dix	1bee7d1512	Update TSM, remove old version, add config * remove rolloverTSMFileSize constant that is no longer used * remove the maxGenerationFileCount since it is no longer a limitation that's necessary with the new compaction scheme. We no longer read WAL segments as part of the compaction so memory is only used as we read in each individual key * remove minFileCount and switch to a user configurable variable * remove the mutex from WALSegmentWriter. There's never more than one open in the WAL at one time and it's not exported through any function so the lock on the WAL should be used. This simplified keeping track of the last write time and removed a bunch of unnecessary locks. * update WALSegmentWriter.Write to take the compressed bytes so that encoding and compression can occur before the call to write (while we don't hold the WAL lock) * remove a bunch of unnecessary locking in WAL.writeToLog * Add check for TSM file magic number and vesion * Remove old tsm, log, and unused cursor code * Remove references to tsm1dev everywhere except in the inspector * Clean up config options for compaction and snapshotting * Remove old TSM configuration options * Update the config.sample.toml with TSM options * Update WAL compact to force if it has been cold for writes for a configurable period of time (1h by default)	2015-12-06 18:50:39 -05:00
Philip O'Toole	fe7b3ad134	Add CacheLoader The CacheLoader loads a given cache from a slice of segment files.	2015-12-05 22:13:57 -08:00
Philip O'Toole	789ab10658	Merge hot cache values with snapshots This change starts by building the sequence of entries, which also allows the required size of destination buffer to be calculated. Then the buffer is allocated up-front in 1 call. Each snapshot and hot value-set is appended to the buffer. If ordering is violated at anytime, set the 'needSort' flag. Sorting, if necessary, is performed just before returning the data.	2015-12-04 20:58:02 -08:00
Paul Dix	b0fb8a0a27	Update TSM cache, compact, wal, encoding * Update cache to have a single slice of values for a key (removed checkpoints) * Changed compact.Plan to only worry about TSM files. * Updated Plan to not return an error since there was no case in which it would. * Update WAL to not keep stats since they're no longer needed. * Update engine to flush the Cache/WAL to a new TSM file when the min threshold is hit. * Split compact logic between TSM compacts and WAL/Cache writes. * Remove unnecessary merge iterator, wal segment iterator, and other no longer necessary stuff. * Remove the asending bool from the Dedupe method. Values should always be in ascending order. It's up to the cursor to iterate through values based on the direction. Giving the cursor responsibility makes it so we don't need to sort, dedupe or reallocate anything for different query orders. * Updated engine to use its locks to ensure writes and cache flushes don't cause a race. * Update all tests with new signatures. Removed a bunch of tests around TSM rewrites and WAL segment iteration that are no longer necessary.	2015-12-03 08:11:50 -05:00
Philip O'Toole	fc83968e2e	Cache values supports sorting order	2015-12-01 13:24:25 -08:00
Philip O'Toole	1bca38bb84	Cache supports writing multiple keys This keeps the locking to a minimum if the data is available for multiple keys at once.	2015-11-26 06:07:16 -08:00
Philip O'Toole	19f53d8a75	Add some simple benchmarks	2015-11-20 21:09:44 -08:00
Philip O'Toole	5b573b9248	Move to simpler cache This cache simply evicts as much as possible whenever a checkpoint is set.	2015-11-20 21:09:24 -08:00
Philip O'Toole	d8ea132c53	Add WAL cache	2015-11-16 19:52:49 -08:00

18 Commits (55a503671d5074f8d6e3d49631de99812d544d47)