Adds a total cursor counter and seek location counter to a new
`readMetrics` that is added to each `Engine`. Default labels group
by `engine_id` and `node_id`.
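A minimal sketch of how such counters could be wired up with the Prometheus client; the metric and label names below are illustrative rather than the exact ones added here.

```go
package readmetrics

import "github.com/prometheus/client_golang/prometheus"

// readMetrics is a sketch of per-engine read instrumentation: one counter for
// cursors created and one for seek calls, both labelled by engine and node.
type readMetrics struct {
	cursors *prometheus.CounterVec
	seeks   *prometheus.CounterVec
}

func newReadMetrics(reg prometheus.Registerer) *readMetrics {
	labels := []string{"engine_id", "node_id"}
	m := &readMetrics{
		cursors: prometheus.NewCounterVec(prometheus.CounterOpts{
			Namespace: "storage", Subsystem: "reads", Name: "cursors_total",
			Help: "Total number of cursors created.",
		}, labels),
		seeks: prometheus.NewCounterVec(prometheus.CounterOpts{
			Namespace: "storage", Subsystem: "reads", Name: "seeks_total",
			Help: "Total number of index seeks performed.",
		}, labels),
	}
	reg.MustRegister(m.cursors, m.seeks)
	return m
}
```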
Adds the ability to set the current generation to use when compacting
the cache only. Previously, we used the current generation for all files, but
this causes issues; the current generation should only be used for level 1
compaction.
There exists a possibility for an in-flight read on a TSMReader to read
a stale reference to an mmapped TSM file index, which has become
unmapped.
This commit resolves that issue by simply renaming the file, leaving the
original file handler open and the data mapped. The path is updated so
that if any callers need to refer to the name of the TSM file after it's
renamed, the new name will be reflected.
The orphaned file handler will be closed when the TSM file is closed.
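The core idea can be sketched as follows: keep the original descriptor open and rename the file in place, so in-flight readers continue to see a valid mapping while later callers observe the new path. This is a simplified stand-in, not the actual TSMReader code.

```go
package tsm1sketch

import (
	"os"
	"sync"
)

// fileRef is a simplified stand-in for a TSMReader's underlying file state.
type fileRef struct {
	mu   sync.RWMutex
	f    *os.File // stays open, so the mmap'd index remains valid
	path string
}

// rename moves the on-disk file to newPath without closing the descriptor.
// In-flight reads continue against the still-mapped data; only the recorded
// path changes so later callers see the new name.
func (r *fileRef) rename(newPath string) error {
	r.mu.Lock()
	defer r.mu.Unlock()
	if err := os.Rename(r.path, newPath); err != nil {
		return err
	}
	r.path = newPath
	return nil
}
```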
StringArrayEncodeAll will panic if the total length of strings
contained in the src slice is > 0xffffffff. This change adds a unit
test to replicate the issue and an associated fix to return an error.
This also raises an issue that compactions will be unable to make
progress under the following condition:
* multiple string blocks are to be merged to a single block and
* the total length of all strings exceeds the maximum block size that
snappy will encode (0xffffffff)
The observable effect of this is errors in the logs indicating a
compaction failure.
Fixes #13687
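A sketch of the kind of guard that turns the panic into an error; the names and error text are illustrative.

```go
package stringenc

import (
	"errors"
	"math"
)

// errStringBlockTooLarge is returned instead of panicking when the combined
// string data cannot be snappy-encoded as a single block.
var errStringBlockTooLarge = errors.New("string block too large to encode")

// checkStringBlockSize is a sketch of the guard added before encoding: sum the
// lengths of all strings in src and refuse blocks beyond snappy's 0xffffffff
// input limit rather than panicking inside the encoder.
func checkStringBlockSize(src []string) error {
	var total uint64
	for _, s := range src {
		total += uint64(len(s))
	}
	if total > math.MaxUint32 { // 0xffffffff
		return errStringBlockTooLarge
	}
	return nil
}
```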
This commit teaches the storage schema APIs how to track statistics
and make them available via the returned `cursors.StringIterator`.
Statistics are only tracked when decoding TSM blocks or when scanning
the in-memory cache.
Closes #13541
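A hedged sketch of what the tracking amounts to: an iterator that counts the values and bytes it scans and exposes them through a stats accessor. The types below are stand-ins, not the actual `cursors` package.

```go
package schemasketch

// cursorStats mirrors the idea of per-iterator statistics: how many values and
// bytes were scanned while producing results (field names are illustrative).
type cursorStats struct {
	ScannedValues int
	ScannedBytes  int
}

// countingStringIterator is a sketch of a string iterator that accumulates
// statistics as TSM blocks are decoded or the cache is scanned.
type countingStringIterator struct {
	values []string
	i      int
	stats  cursorStats
}

func (it *countingStringIterator) Next() bool {
	if it.i >= len(it.values) {
		return false
	}
	it.stats.ScannedValues++
	it.stats.ScannedBytes += len(it.values[it.i])
	it.i++
	return true
}

func (it *countingStringIterator) Value() string      { return it.values[it.i-1] }
func (it *countingStringIterator) Stats() cursorStats { return it.stats }
```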
The TagKeys API will perform a linear scan if there is no predicate;
otherwise, it will use the index to find a list of candidate series
keys.
TagKeys expects the predicate to be transformed such that
`_measurement` and `_field` are remapped to `\x00` and `\xff`
respectively.
There is one TODO marked to analyze the predicate for a
`\x00 = '<measurement>'` pattern. If found, the predicate can be
eliminated and fall back to a linear prefix scan by combining the org,
bucket and measurement. This is tracked by issue #13497.
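A sketch of the remapping the API expects callers to have applied to the predicate before handing it over; the constants correspond to the internal measurement and field tag keys.

```go
package predsketch

// Internal tag keys the storage engine uses for the measurement name and the
// field key.
const (
	measurementTagKey = "\x00"
	fieldTagKey       = "\xff"
)

// remapTagKey rewrites the user-facing schema keys into the internal tag keys
// stored in the index, which is the transformation TagKeys expects to have
// been applied to the predicate before it is evaluated.
func remapTagKey(key string) string {
	switch key {
	case "_measurement":
		return measurementTagKey
	case "_field":
		return fieldTagKey
	default:
		return key
	}
}
```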
The TagValues API will perform a linear scan if there is no predicate;
otherwise, it will use the index to find a list of candidate series
keys.
TagValues expects the predicate to be transformed such that
`_measurement` and `_field` are remapped to `\x00` and `\xff`
respectively.
There is one TODO marked to analyze the predicate for a
`\x00 = '<measurement>'` pattern. If found, the predicate can be
eliminated and fall back to a linear prefix scan by combining the org,
bucket and measurement.
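The TODO above amounts to folding the measurement into the scan prefix. A rough sketch of that idea, with the series-key encoding heavily simplified:

```go
package scansketch

// seriesKeyPrefix sketches how a predicate of the form `\x00 = '<measurement>'`
// could be folded into a prefix for a linear scan: the 16-byte org+bucket
// prefix followed by the internal measurement tag key and the measurement
// name. The real series-key format is more involved; this only illustrates
// the idea.
func seriesKeyPrefix(orgBucket [16]byte, measurement string) []byte {
	prefix := make([]byte, 0, len(orgBucket)+1+len(measurement))
	prefix = append(prefix, orgBucket[:]...)
	prefix = append(prefix, 0x00) // internal measurement tag key
	prefix = append(prefix, measurement...)
	return prefix
}
```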
The TimeRangeIterator permits linear or random index scans and
can answer whether the current key has data for the specified time
interval, considering any tombstones.
When there are no tombstones, there are opportunities to skip decoding blocks
entirely. Specifically, if the queried time interval covers the minimum or
maximum timestamp recorded in a TSM index entry, the block is known to contain
data in the interval without decoding it.
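A sketch of the boundary check behind that optimization, assuming no tombstones; the types and names are illustrative:

```go
package trsketch

// indexEntry mirrors the min/max timestamps a TSM index records per block.
type indexEntry struct {
	MinTime, MaxTime int64
}

// blockHasData reports whether a block is guaranteed to hold data within
// [min, max] without decoding it. If either recorded boundary timestamp lies
// inside the queried interval, that timestamp itself is a matching point.
// When the interval falls strictly between the boundaries, the block must be
// decoded to know for sure.
func blockHasData(e indexEntry, min, max int64) (hasData, mustDecode bool) {
	if (e.MinTime >= min && e.MinTime <= max) || (e.MaxTime >= min && e.MaxTime <= max) {
		return true, false
	}
	if e.MinTime < min && e.MaxTime > max {
		// The queried interval sits inside the block's range; only decoding
		// the timestamps can tell whether any point falls within it.
		return false, true
	}
	return false, false // block entirely outside the queried interval
}
```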
Add a Contains API which is a peer to the TimestampArray.Contains
function. This is used by the schema APIs to determine if data exists
in the cache for a given key and time interval.
Permits random access of the iterator, correctly maintaining state,
so that Next may be called to iterate from a given key.
This API will be used by the schema APIs when a predicate is specified,
typically requiring random access.
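A hypothetical usage pattern for that access style; the interface below only captures the shape of the calls, not the engine's exact API:

```go
package itersketch

// keyIterator captures the access pattern: seek to a candidate series key,
// then walk forward. Names are illustrative, not the engine's interface.
type keyIterator interface {
	Seek(key []byte) bool // position at the first key >= key; false if none
	Next() bool
	Key() []byte
}

// visitFrom shows the predicate-driven pattern: Seek positions the iterator at
// a candidate key, after which Next continues from that position with the
// iterator's state intact.
func visitFrom(it keyIterator, start []byte, visit func(key []byte)) {
	if !it.Seek(start) {
		return
	}
	visit(it.Key())
	for it.Next() {
		visit(it.Key())
	}
}
```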
The TimestampArray.Contains(min, max) API performs a binary search to
determine if timestamps exist for the given time interval.
It also implements Exclude to drop timestamps that have been tombstoned.
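A simplified version of the containment check over a sorted timestamp slice, using a standard binary search:

```go
package tssketch

import "sort"

// containsTimestamps reports whether any timestamp in the sorted slice ts
// falls within [min, max]. A binary search finds the first timestamp >= min;
// data exists for the interval if that timestamp is also <= max. This mirrors
// the Contains(min, max) behaviour described above in a simplified form.
func containsTimestamps(ts []int64, min, max int64) bool {
	i := sort.Search(len(ts), func(i int) bool { return ts[i] >= min })
	return i < len(ts) && ts[i] <= max
}
```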
DecodeTimestampArrayBlock decodes only the timestamps of the provided
block.
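A rough sketch of why timestamp-only decoding is cheap: the encoded timestamps and values are stored as separate segments, with the timestamp segment's length recorded up front, so the values can be skipped without decoding them. The block layout shown here is simplified.

```go
package blocksketch

import (
	"encoding/binary"
	"errors"
)

// timestampBytes extracts just the encoded-timestamp segment of a TSM data
// block. The layout sketched here is: a block-type byte, a uvarint giving the
// length of the timestamp segment, the timestamp segment, then the values
// segment, which is never touched.
func timestampBytes(block []byte) ([]byte, error) {
	if len(block) < 2 {
		return nil, errors.New("block too short")
	}
	body := block[1:] // skip the block-type byte
	tsLen, n := binary.Uvarint(body)
	if n <= 0 || uint64(len(body[n:])) < tsLen {
		return nil, errors.New("invalid timestamp block length")
	}
	return body[n : n+int(tsLen)], nil
}
```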
The storer interface isn't necessary once the init/Free logic is removed, and
that logic is unnecessary in a world with only one shard.
Additionally, there were some cases where an init/Free call could
race and cause data loss in the cache. Not doing it at all fixes
all of those races.
This change fixes #10511, which manifests when a shard is considered cold
faster than its cache is snapshotted. Previously the code only looked at
the last modification of compacted tsm1 files. Instead the (restored)
Engine.lastModified() also takes the cache into account.
Ports #10522 to master where engine.go has moved and Engine.LastModified()
was deleted because it was unused.
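A sketch of the restored check: the engine's last-modified time is the later of the newest TSM file modification and the most recent cache write, so a shard with unsnapshotted cache data is not treated as cold prematurely. Names are illustrative.

```go
package enginesketch

import "time"

// lastModified sketches the cold-shard check after the fix: consider both the
// newest compacted TSM file and the last write into the in-memory cache.
func lastModified(tsmFileModTimes []time.Time, cacheLastWrite time.Time) time.Time {
	last := cacheLastWrite
	for _, t := range tsmFileModTimes {
		if t.After(last) {
			last = t
		}
	}
	return last
}
```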
This commit adds a reason label to the total compaction metric. For
snapshots, the reason will indicate why the cache was snapshotted. For
other compactions, the reason label will be blank.
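A sketch of how such a label might be attached using a Prometheus counter vector; the snapshot path supplies a reason while other compactions pass an empty string. Metric names are illustrative and registration is elided.

```go
package compsketch

import "github.com/prometheus/client_golang/prometheus"

// compactionsTotal sketches a compaction counter carrying a reason label in
// addition to the level; full/level compactions record an empty reason.
// Registration with a registry is omitted for brevity.
var compactionsTotal = prometheus.NewCounterVec(prometheus.CounterOpts{
	Namespace: "storage", Subsystem: "compactions", Name: "total",
	Help: "Total number of compactions, by level and reason.",
}, []string{"level", "reason"})

// recordSnapshot increments the snapshot counter with the cause of the
// snapshot (for example "age" or "size"); other levels pass reason = "".
func recordSnapshot(reason string) {
	compactionsTotal.WithLabelValues("snapshot", reason).Inc()
}
```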
This commit adds a new Cache option, via the
`tsm1.CacheConfig.SnapshotAgeDuration` field, which controls the maximum
age the cache can reach before it is snapshotted to a TSM file.
The default value for this option is `0`, which means that the cache
will never be snapshotted based only on age. Setting this value to, for
example, 10 seconds, would result in the cache snapshotting every 10
seconds.
Snapshotting the cache more frequently can provide better durability
guarantees in some circumstances, though the additional, smaller TSM files mean
more work is needed to compact them down into larger, denser files.
When using InfluxDB with a WAL there isn't really a strong reason to
alter `tsm1.CacheConfig.SnapshotAgeDuration` from `0`.
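A stand-in sketch of the behaviour this option enables; the config type below is illustrative, not the actual `tsm1.CacheConfig`.

```go
package cachesketch

import "time"

// cacheConfig is a stand-in for the engine's cache configuration; only the
// field discussed above is shown.
type cacheConfig struct {
	// SnapshotAgeDuration is the maximum age the cache may reach before it is
	// snapshotted to a TSM file. Zero (the default) disables age-based
	// snapshotting.
	SnapshotAgeDuration time.Duration
}

// shouldSnapshotForAge sketches the check: snapshot when the option is set and
// the last snapshot is older than the configured age.
func shouldSnapshotForAge(cfg cacheConfig, lastSnapshot, now time.Time) bool {
	return cfg.SnapshotAgeDuration > 0 && now.Sub(lastSnapshot) >= cfg.SnapshotAgeDuration
}
```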