There exists a possibility for an in-flight read on a TSMReader to read
a stale reference to an mmapped TSM file index, which has become
unmapped.
This commit resolves that issue by simply renaming the file, leaving the
original file handle open and the data mapped. The path is updated so
that any callers needing the name of the TSM file after the rename will
see the new name.
The orphaned file handle will be closed when the TSM file is closed.
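A minimal sketch of the rename-instead-of-unmap idea (the type and helper names here are hypothetical, not the actual tsm1 code):

```go
package tsm1sketch

import "os"

// tsmReader stands in for the real TSMReader; only the field the
// sketch needs is shown.
type tsmReader struct {
	path string
}

// remove renames the file rather than unmapping it, so in-flight reads
// keep a valid view of the mmapped index.
func (r *tsmReader) remove() error {
	newPath := r.path + ".deleted" // illustrative suffix
	if err := os.Rename(r.path, newPath); err != nil {
		return err
	}
	r.path = newPath // later callers asking for the name see the new one
	return nil       // the orphaned handle and mmap are released in Close
}
```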
StringArrayEncodeAll will panic if the total length of strings
contained in the src slice is > 0xffffffff. This change adds a unit
test to replicate the issue and an associated fix to return an error.
This also surfaces an issue: compactions will be unable to make
progress under the following conditions:
* multiple string blocks are to be merged to a single block and
* the total length of all strings exceeds the maximum block size that
snappy will encode (0xffffffff)
The observable effect of this is errors in the logs indicating a
compaction failure.
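A hedged sketch of the fix's shape (function and constant names are illustrative, not the exact tsm1 code): sum the string lengths up front and return an error instead of letting the snappy size limit trigger a panic.

```go
package tsm1sketch

import "fmt"

// Largest total length snappy will encode, per the limit above.
const maxStringBlockSize = 0xffffffff

func stringArrayEncodeAll(src []string, b []byte) ([]byte, error) {
	var sz uint64
	for _, s := range src {
		sz += uint64(len(s))
	}
	if sz > maxStringBlockSize {
		// Return an error so callers can surface a compaction failure
		// instead of panicking mid-encode.
		return nil, fmt.Errorf("source length %d exceeds maximum %d", sz, uint64(maxStringBlockSize))
	}
	// ... snappy-encode src into b as before ...
	return b, nil
}
```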
Fixes #13687
This commit teaches the storage schema APIs how to track statistics
and make them available via the returned `cursors.StringIterator`.
Statistics are only tracked when decoding TSM blocks or when scanning
the in-memory cache.
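An illustrative consumer of those statistics; the iterator shape follows the `cursors` package, but treat the exact method set and import path here as assumptions:

```go
package schemasketch

import "github.com/influxdata/influxdb/tsdb/cursors"

// collect drains a StringIterator and returns the values alongside the
// statistics accumulated while producing them.
func collect(it cursors.StringIterator) ([]string, cursors.CursorStats) {
	var vals []string
	for it.Next() {
		vals = append(vals, it.Value())
	}
	// Stats reflect work done decoding TSM blocks and scanning the cache.
	return vals, it.Stats()
}
```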
Closes #13541
The TagKeys API will perform a linear scan if there is no predicate;
otherwise, it will use the index to find a list of candidate series
keys.
TagKeys expects the predicate to be transformed such that
`_measurement` and `_field` are remapped to `\x00` and `\xff`
respectively.
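A small sketch of that remapping; the `\x00` and `\xff` keys come from the description above, while the helper itself is illustrative:

```go
// remapTagKey rewrites the external tag keys to the reserved keys the
// storage engine uses internally.
func remapTagKey(key string) string {
	switch key {
	case "_measurement":
		return "\x00"
	case "_field":
		return "\xff"
	default:
		return key
	}
}
```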
There is one TODO marked to analyze the predicate for a
`\x00 = '<measurement>'` pattern. If found, the predicate can be
eliminated and the query can fall back to a linear prefix scan by
combining the org, bucket, and measurement. This is tracked by issue #13497.
When a tsi1 partition closes, it waits on the wait group for compactions
and then acquires the lock. Unfortunately, a compaction may start in the
meantime, holding on to some resources. Then, Close will attempt to
close those resources while holding the lock. That will block until
the compaction has finished, but it also needs to acquire the lock
in order to finish, leading to deadlock.
One cannot just move the wait group wait into the lock because, once
again, the compaction must acquire the lock before finishing. Compaction
can't finish before acquiring the lock because then it might be operating
on an invalid resource.
This change splits the locks into two: one to protect just against
concurrent Open and Close calls, and one to protect all of the other
state. We then just close the partition, acquire the lock, and free
the resources. Starting a compaction requires acquiring a reference
to the partition itself, so that no compaction can start after the
partition has begun closing.
This change also introduces a cancellation channel into the reference
to a resource; the channel is closed when the resource is being closed,
allowing processes that have acquired a reference to clean up more
quickly if something is trying to close the resource.
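A minimal sketch of the reference/cancellation idea, with all names hypothetical (the real implementation differs in detail):

```go
package lifecycle

import (
	"errors"
	"sync"
)

// Resource guards some underlying state (e.g. a partition's files).
type Resource struct {
	mu      sync.Mutex
	wg      sync.WaitGroup
	closing chan struct{}
	closed  bool
}

func NewResource() *Resource {
	return &Resource{closing: make(chan struct{})}
}

// Acquire hands out a reference; it fails once closing has begun, so a
// compaction cannot start after the partition has started to close.
func (r *Resource) Acquire() (*Reference, error) {
	r.mu.Lock()
	defer r.mu.Unlock()
	if r.closed {
		return nil, errors.New("resource closed")
	}
	r.wg.Add(1)
	return &Reference{res: r}, nil
}

// Close signals holders via the closing channel, then waits for every
// outstanding reference to be released.
func (r *Resource) Close() {
	r.mu.Lock()
	if !r.closed {
		r.closed = true
		close(r.closing)
	}
	r.mu.Unlock()
	r.wg.Wait()
}

// Reference is held by long-running work such as a compaction.
type Reference struct{ res *Resource }

// Closing lets the holder notice shutdown and clean up more quickly.
func (ref *Reference) Closing() <-chan struct{} { return ref.res.closing }

// Release must be called exactly once when the work finishes.
func (ref *Reference) Release() { ref.res.wg.Done() }
```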
The TagValues API will perform a linear scan if there is no predicate;
otherwise, it will use the index to find a list of candidate series
keys.
TagValues expects the predicate to be transformed such that
`_measurement` and `_field` are remapped to `\x00` and `\xff`
respectively.
There is one TODO marked to analyze the predicate for a
`\x00 = '<measurement>'` pattern. If found, the predicate can be
eliminated and the query can fall back to a linear prefix scan by
combining the org, bucket, and measurement.
The TimeRangeIterator permits linear or random index scans and
can answer whether the current key has data for the specified time
interval, considering any tombstones.
When there are no tombstones there are opportunities to skip decoding
blocks entirely: specifically, when the queried time interval overlaps
any of the block boundaries recorded in the TSM index entries.
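A sketch of why the boundary case lets a decode be skipped: an index entry's MinTime and MaxTime are timestamps of real points in the block, so if either falls inside the queried interval the key provably has data (names are illustrative):

```go
// entryProvesData reports whether an index entry alone proves the key
// has data in [qMin, qMax], without decoding the block.
func entryProvesData(entryMin, entryMax, qMin, qMax int64) bool {
	return (qMin <= entryMin && entryMin <= qMax) ||
		(qMin <= entryMax && entryMax <= qMax)
}
```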
Add a Contains API which is a peer to the TimestampArray.Contains
function. This is used by the schema APIs to determine if data exists
in the cache for a given key and time interval.
Permits random access of the iterator, correctly maintaining state,
so that Next may be called to iterate from a given key.
This API will be used by the schema APIs when a predicate is specified,
typically requiring random access.
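An illustrative use of random access followed by iteration; the interface below is an assumption about the iterator's shape, not the documented API:

```go
// timeRangeIterator is a stand-in for the iterator's assumed surface.
type timeRangeIterator interface {
	Seek(key []byte) // position the iterator at or after key
	Next() bool
	HasData() bool
}

// anyHasData seeks to a candidate series key and walks forward, which
// works because Seek correctly maintains the iterator's state.
func anyHasData(it timeRangeIterator, candidate []byte) bool {
	it.Seek(candidate)
	for it.Next() {
		if it.HasData() {
			return true
		}
	}
	return false
}
```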
The TimestampArray.Contains(min, max) API performs a binary search to
determine whether timestamps exist for the given time interval.
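A sketch of that search using the standard library, assuming the timestamps are sorted ascending as they are within a TSM block:

```go
package tsm1sketch

import "sort"

// contains reports whether any timestamp falls within [min, max].
func contains(ts []int64, min, max int64) bool {
	// Find the first timestamp >= min; data exists iff it is <= max.
	i := sort.Search(len(ts), func(i int) bool { return ts[i] >= min })
	return i < len(ts) && ts[i] <= max
}
```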
TimestampArray also implements Exclude to drop timestamps that have been tombstoned.
DecodeTimestampArrayBlock decodes only the timestamps of the provided
block.
Removes the `STATS` file generated during TSI compaction, as it had
the potential to become inconsistent with the index data. Instead,
stats are recalculated on startup and on each compaction on a
per-partition basis.
Computing stats for 10M series across 10K measurements takes
approximately 0.171s.
The storer interface isn't necessary once the init/Free logic is
removed, and that logic is unnecessary in a world with only one shard.
Additionally, there were some cases where an init/Free call could
race and cause data loss in the cache; not doing it at all fixes
all of those races.
This change fixes #10511, which manifests when a shard is considered cold
faster than its cache is snapshotted. Previously the code only looked at
the last modification of compacted tsm1 files. Instead the (restored)
Engine.lastModified() also takes the cache into account.
Ports #10522 to master, where engine.go has moved and Engine.LastModified()
was deleted because it was unused.
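A sketch of the restored check (the helper is illustrative): the engine counts as modified at the later of the newest TSM file and the newest cache write, so a shard is not considered cold while its cache still holds recent data.

```go
package enginesketch

import "time"

// lastModified returns the later of the newest compacted TSM file's
// modification time and the cache's last write time.
func lastModified(fileStoreTime, cacheWriteTime time.Time) time.Time {
	if cacheWriteTime.After(fileStoreTime) {
		return cacheWriteTime
	}
	return fileStoreTime
}
```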
This commit adds a reason label to the total compaction metric. For
snapshots, the reason will indicate why the cache was snapshotted. For
other compactions, the reason label will be blank.
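An illustrative shape for the labeled metric; the metric and label names here are assumptions, not the exact ones added by the commit:

```go
package metricsketch

import "github.com/prometheus/client_golang/prometheus"

// Registration with a registry is omitted for brevity.
var compactionsTotal = prometheus.NewCounterVec(prometheus.CounterOpts{
	Name: "storage_compactions_total",
	Help: "Total number of compactions, labeled by level and reason.",
}, []string{"level", "reason"})

func recordSnapshot(reason string) {
	// Snapshots record why the cache was written out...
	compactionsTotal.WithLabelValues("snapshot", reason).Inc()
}

func recordCompaction(level string) {
	// ...while other compaction levels leave the reason blank.
	compactionsTotal.WithLabelValues(level, "").Inc()
}
```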
This commit adds a new Cache option, via the
`tsm1.CacheConfig.SnapshotAgeDuration` field, which controls the maximum
age the cache can reach before it is snapshotted to a TSM file.
The default value for this option is `0`, which means the cache will
never be snapshotted based on age alone. Setting this value to, for
example, 10 seconds, would result in the cache being snapshotted every
10 seconds.
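A sketch of setting the option, assuming the field accepts a plain Go duration (the real field may use a TOML duration wrapper):

```go
cfg := tsm1.CacheConfig{
	// Snapshot the cache whenever it grows older than 10 seconds.
	SnapshotAgeDuration: 10 * time.Second,
}
```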
Snapshotting the cache more frequently can provide better durability
guarantees in some circumstances, though more, smaller TSM files mean
more work is needed to compact them down to larger, denser files.
When using InfluxDB with a WAL there isn't really a strong reason to
alter `tsm1.CacheConfig.SnapshotAgeDuration` from `0`.