If the first block that needs to be read was partially deleted such
that the trailing end has no values, it was possible for the query
cursor to end early.
This was caused by KeyCursor.ReadFloatBlock returning no values instead
of checking the remaining blocks.
This adds the capability to the engine to force a full compaction
to be scheduled. When called, it snapshots any data in the cache,
aborts running compactions, and prevents the level planners from
returning plans.
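A minimal sketch of how such an entry point could be shaped; the names and the stubbed snapshot call below are illustrative, not the actual implementation:

```go
package tsm1

import "sync"

// Engine here is a minimal stand-in; ScheduleFullCompaction and the
// forceFull field are illustrative names for the described behaviour.
type Engine struct {
	mu        sync.Mutex
	forceFull bool
}

// WriteSnapshot flushes any cached data to a new TSM file (stubbed).
func (e *Engine) WriteSnapshot() error { return nil }

// ScheduleFullCompaction snapshots the cache, then flips a flag that
// aborts running compactions and stops the level planners from
// returning plans, so the full planner can claim every file.
func (e *Engine) ScheduleFullCompaction() error {
	if err := e.WriteSnapshot(); err != nil {
		return err
	}
	e.mu.Lock()
	e.forceFull = true
	e.mu.Unlock()
	return nil
}
```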
* did not handle cached values correctly
* sort shards by time in either ascending or descending
order, depending on the RPC request ordering, to ensure they
are traversed in the correct order (sketched below).
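A self-contained sketch of that ordering logic; the `shard` type and `ascending` flag stand in for the real shard metadata and RPC ordering:

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// shard carries just enough metadata for the example.
type shard struct {
	ID    int
	Start time.Time
}

// sortShards orders shards by start time; the direction follows the
// ordering requested by the RPC.
func sortShards(shards []shard, ascending bool) {
	sort.Slice(shards, func(i, j int) bool {
		if ascending {
			return shards[i].Start.Before(shards[j].Start)
		}
		return shards[j].Start.Before(shards[i].Start)
	})
}

func main() {
	shards := []shard{
		{ID: 2, Start: time.Unix(200, 0)},
		{ID: 1, Start: time.Unix(100, 0)},
	}
	sortShards(shards, false) // descending request: newest shard first
	fmt.Println(shards[0].ID) // 2
}
```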
The DropSeries code path ended up creating a MeasurementSeriesIterator
for each dropped series, which was too expensive just to check whether
a series exists.
This adds a HasSeries func and fixes an issue where TSI files were
compacted while an iterator was still in use causing a panic.
This removes the containsSeries func, which ends up creating a map
sized to the slice of keys passed in. This doesn't scale well to
high cardinalities and creates a lot of garbage.
The query language min and max times are slightly different than the
values used in the engine. This allows faster code paths to be used when
the whole time range is deleted.
This is a version of DeleteRange that takes a func predicate to determine
whether a series key should be deleted or not. This avoids the large
slice allocations with higher cardinalities.
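A sketch of what the predicate-driven shape looks like; the interface, names, and example predicate are illustrative, not the actual API:

```go
package tsm1

import "bytes"

// deleter sketches a predicate-driven delete: instead of passing a
// fully materialized slice of series keys, the caller supplies a func
// that decides per key, avoiding large allocations at high cardinality.
type deleter interface {
	// DeleteRangeWith removes values in [min, max] for every series
	// key that pred returns true for.
	DeleteRangeWith(pred func(key []byte) bool, min, max int64) error
}

// cpuOnly is an example predicate matching only series that belong to
// the (illustrative) measurement "cpu".
func cpuOnly(key []byte) bool {
	return bytes.HasPrefix(key, []byte("cpu,"))
}
```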
This adds a new v4 tombstone format that extends the v3 format
by allowing multiple batches of tombstones to be written without
having to re-read all the existing tombstones. This uses gzip
multistream support to append multiple v3 files together to create the
v4 format.
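Go's `compress/gzip` reader consumes concatenated (multistream) streams by default, which is what makes the append-only v4 layout cheap. A toy illustration with made-up tombstone payloads:

```go
package main

import (
	"bytes"
	"compress/gzip"
	"fmt"
	"io"
	"log"
)

func main() {
	var buf bytes.Buffer

	// Append two independent gzip streams, the way a v4 tombstone file
	// appends a new batch without rewriting the existing ones.
	for _, batch := range []string{"cpu,host=a\n", "cpu,host=b\n"} {
		w := gzip.NewWriter(&buf)
		if _, err := w.Write([]byte(batch)); err != nil {
			log.Fatal(err)
		}
		if err := w.Close(); err != nil {
			log.Fatal(err)
		}
	}

	// A gzip.Reader in its default multistream mode reads the
	// concatenated streams back as one contiguous file.
	r, err := gzip.NewReader(&buf)
	if err != nil {
		log.Fatal(err)
	}
	all, err := io.ReadAll(r)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Print(string(all)) // cpu,host=a\ncpu,host=b\n
}
```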
The previous SHA was taken from a revision on a devel branch that I
thought would remain in the tree after it was merged. That
revision was rebased away and the API was changed for the logger.
This updates the usage of the logger and adds a simple package for
constructing the base logger.
The 1.0 version of zap changed the format of the default console logger
so this change moves over to this new logger instead of attempting to
retain backwards compatibility with the old format.
This commit carries out the initial refactor of the tsi1.Index into
tsi1.Partition. We then create a new tsi1.Index that will be an
abstraction over a collection of Partitions.
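A rough sketch of the new arrangement; the hash-based routing shown is illustrative:

```go
package tsi1

import "hash/fnv"

// Partition stands in for the refactored tsi1.Partition.
type Partition struct{ /* ... */ }

// Index sketches the new abstraction over a set of Partitions.
type Index struct {
	partitions []*Partition
}

// partition routes a series key to its owning partition by hash, so
// each partition holds a disjoint subset of the series.
func (i *Index) partition(key []byte) *Partition {
	h := fnv.New64a()
	h.Write(key)
	return i.partitions[h.Sum64()%uint64(len(i.partitions))]
}
```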
There was a very small window where it was possible to deadlock during
the close of the Store. When closing, the Store waited on its WaitGroup
under a `Lock`. Naturally, all other goroutines must have been in a
position to call `Done` on the `WaitGroup` before the `Wait` call in
`Close` would return.
For the goroutine running the `monitorShards` method it was possible
that it would be unable to do this. Specifically, if the `monitorShards`
goroutine was jumping into the `t.C` case as the `Close()` goroutine was
acquiring the `Lock`, then the `monitorShards` goroutine would be unable
to acquire the `RLock`. Since it would also be unable to progress around
its loop to jump into the `s.closing` case, it would be unable to call
`Done` on the `WaitGroup` and we would have a deadlock.
This was identified during an AppVeyor CI run, though I was unable to
reproduce this locally.
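A condensed sketch of the shape that could deadlock; the types are stripped down to just the relevant pieces:

```go
package tsdb

import (
	"sync"
	"time"
)

type Store struct {
	mu      sync.RWMutex
	wg      sync.WaitGroup
	closing chan struct{}
}

func (s *Store) Close() error {
	s.mu.Lock() // a pending writer blocks all new RLock calls below
	defer s.mu.Unlock()
	close(s.closing)
	s.wg.Wait() // never returns if monitorShards is stuck on RLock
	return nil
}

func (s *Store) monitorShards() {
	defer s.wg.Done()
	t := time.NewTicker(10 * time.Second)
	defer t.Stop()
	for {
		select {
		case <-s.closing:
			return
		case <-t.C:
			// If Close has just taken mu.Lock, this RLock blocks, the
			// loop never reaches the s.closing case, Done is never
			// called, and Close waits forever.
			s.mu.RLock()
			// ... inspect shards ...
			s.mu.RUnlock()
		}
	}
}
```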
Previously we used the EngineOptions to determine which shard index
type we were using. However, these options are set once at runtime
initialisation. Therefore if you're running with TSI enabled but then
accessing a legacy database with the inmem index, TagValues would not
have taken advantage of the inmem index.
This change ensures we always check the actual index of the shard(s).
This commit adds time support to SHOW TAG VALUES. Time can be used as
both a lower and upper boundary. However, there are some caveats.
For the `inmem` index, filtering by time will still return all results
because the index data is shared across shards.
For the `tsi1` index, filtering by time will only work down to the shard
level. Specifically, when querying by time, all shards within that time
range will be used to generate the results.
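For example (the measurement and tag key are illustrative):

```sql
-- Bounded on both ends; under tsi1 this filters at shard granularity,
-- while inmem still returns all results.
SHOW TAG VALUES FROM "cpu" WITH KEY = "host"
  WHERE time >= now() - 1d AND time < now()
```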
When a meta query does not include a time component then it can be
answered exclusively by the index. This should result in a much faster
query execution than if the TSM engine were engaged.
This commit rewrites the following queries such that they make use
of the index where no time component is present (see the example after
the list):
- SHOW MEASUREMENTS
- SHOW SERIES
- SHOW TAG KEYS
- SHOW FIELD KEYS
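For example (illustrative queries):

```sql
-- No time component: answered exclusively by the index.
SHOW SERIES FROM "cpu"

-- Time component present: still engages the TSM engine.
SHOW SERIES FROM "cpu" WHERE time >= now() - 1h
```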
Update support in the `toml` package for parsing human-readable byte sizes.
Supported size suffixes are "k" or "K" for kibibytes, "m" or "M" for
mebibytes, and "g" or "G" for gibibytes. If a size suffix isn't specified
then bytes are assumed.
In the config, `cache-max-memory-size` and `cache-snapshot-memory-size` are
now typed as `toml.Size` and support the new syntax.
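For example (the values shown are illustrative):

```toml
[data]
  # Parsed by the `toml` package; suffixes k/K, m/M, g/G denote
  # kibi-, mebi-, and gibibytes, and a bare number means bytes.
  cache-max-memory-size = "1g"
  cache-snapshot-memory-size = "25m"
```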
* Fprint* functions
* No nakedness
* clarify panic messages
* spacing between case statements
* remove break in favor of return
* remove goto in favor of for { continue }
* batch cursors return slices of timestamps and values to reduce call
overhead (see the sketch below). Significantly improved iteration.
* added CreateCursor API to Shard, Engine
* moved build*Cursor to code gen
* array has already been sized correctly
* eliminates bounds checking for each element access
* reduces decoding of 30,000,000 points via storage API from
584ms to 540ms on average
Fixes #8989 and #8633.
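A minimal sketch of the batch cursor shape; the interface name and method signature are illustrative:

```go
package tsdb

// FloatBatchCursor sketches a batch cursor: each Next call returns a
// block of timestamps and values rather than a single point, cutting
// the per-point call overhead during iteration.
type FloatBatchCursor interface {
	// Next returns the next batch; both slices are empty when the
	// cursor is exhausted.
	Next() (timestamps []int64, values []float64)
	Close()
}
```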
Previously, when issuing commands involving a regex check, walking
through the tag keys/values on a measurement using the measurement's
index would be racy.
This commit adds a new `TagKeyValue` type that abstracts away the
multi-layer map we were using as an inverted index from tag keys and
values to series ids. With this abstraction we can also make concurrent
access to this inverted index goroutine safe.
Finally, this commit fixes a very old bug in the index which will affect
any query using a regex. Previously we would always check _every_ tag
against a regex for a measurement, even when we had found a match.
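A stripped-down sketch of the idea behind `TagKeyValue`, as an RWMutex-guarded inverted index from tag value to series IDs; the field layout and method are illustrative:

```go
package inmem

import "sync"

// TagKeyValue sketches a goroutine-safe inverted index from a tag
// value to the set of series IDs containing that value.
type TagKeyValue struct {
	mu      sync.RWMutex
	entries map[string]map[uint64]struct{} // tag value -> series IDs
}

// AddSeries records that series id carries the given tag value.
func (t *TagKeyValue) AddSeries(value string, id uint64) {
	t.mu.Lock()
	defer t.mu.Unlock()
	if t.entries == nil {
		t.entries = make(map[string]map[uint64]struct{})
	}
	set := t.entries[value]
	if set == nil {
		set = make(map[uint64]struct{})
		t.entries[value] = set
	}
	set[id] = struct{}{}
}
```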
* Introduces EXPLAIN ANALYZE command, which
produces a detailed tree of operations used to
execute the query (see the example after this list).
introduce context.Context to APIs
metrics package
* create groups of named measurements
* safe for concurrent access
tracing package
EXPLAIN ANALYZE implementation for OSS
Serialize EXPLAIN ANALYZE traces from remote nodes
use context.Background for tests
group with other stdlib packages
additional documentation and remove unused API
use influxdb/pkg/testing/assert
remove testify reference
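For example (illustrative query):

```sql
-- Executes the statement and reports the detailed tree of operations
-- used to execute it.
EXPLAIN ANALYZE SELECT mean("value") FROM "cpu"
  WHERE time >= now() - 1h GROUP BY time(10m)
```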
If multiple tombstones exist for a series and they ended up causing the
full data to be deleted, the blocks were not removed from the offsets
in the index. This causes the TSMReader to report that a key exists
but does not have any data.
During a compaction, every key should have at least one value. Since
this invariant was broken, the compaction aborted early and ended up
dropping all series keys that are lexicographically greater than where
the breakage occurred. This would cause data to be dropped during the
compaction.
This fixes a potential bug where the BlockIterator would skip blocks
if the underlying TSMReader had deletes on it concurrently. This
could possibly occur due to changes in 91eb9de3 that now use the
existing TSMReaders from the FileStore instead of creating new ones
during compaction.
TSI did not check the max-select-series limit during planning
the way that inmem did. This means that the limit could be
set but the planning of a high-cardinality query would still OOM
the server. This fixes that limit check and also makes the query
interruptible during planning.
There was a race on the WaitGroup where we could end up calling Add
while another goroutine was still waiting. The functions were confusing
so they have been simplified a bit, since the compaction goroutines
have been reworked a lot already.
The scheduling logic ended up favoring more backlogged shards
too much and would starve active, less backed-up shards. This
occurred because the scheduling kicks in once a second. When it
runs, it schedules as many compactions as it can. A backed-up shard
would end up having more compactions to run during the loop and would
generally get to schedule them more frequently.
This now allows each shard to try and schedule one compaction at a time
which provides a more balanced approach. At some point, we'll probably
want to more directly balance each shard's backlog vs letting it happen
somewhat randomly.
Some files seem to get orphaned behind higher levels. This causes
the compactions to get blocked, as the lower-level files will not
get picked up by their lower-level planners. This allows the full
plan to identify them and pull them into its plans.
This check doesn't make sense for high cardinality data as the files
typically get big and sparse very quickly. This causes a lot of extra
disk space to be used which is taken up by large indexes and sparse
data.
One shard might be able to run a compaction, but could fail due to
limits being hit. This loop would continue indefinitely as the
same task would continue to be rescheduled.
With higher cardinality or larger series keys, the files can roll
over early which causes them to take longer to be compacted by higher
levels. This causes larger disk usage and higher numbers of tsm files
at times.
This changes the compaction scheduling to better utilize the available
cores that are free. Previously, a level was planned in its own goroutine
and would kick off a number of compaction groups. The problem with this
model was that if there were 4 groups, and 3 completed quickly, the planning
would be blocked for that level until the last group finished. If the compactions
at the prior level are running more quickly, a large backlog could accumulate.
This now moves the planning to a single goroutine that plans each level in
succession and starts as many groups as it can. When one group finishes,
the planning will start the next group for the level.
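A compilable sketch of that planning loop, with the planner, limiter, and runner passed in as funcs; everything except the loop structure is illustrative:

```go
package tsm1

import "time"

// compactionGroup is a stand-in for a set of files planned together.
type compactionGroup []string

// compactLoop sketches the single planning goroutine: each tick it
// plans every level in succession and starts as many groups as the
// limiter allows, so one slow group no longer stalls its level.
func compactLoop(quit <-chan struct{}, plan func(level int) []compactionGroup,
	acquire func() bool, release func(), run func(compactionGroup)) {

	t := time.NewTicker(time.Second)
	defer t.Stop()
	for {
		select {
		case <-quit:
			return
		case <-t.C:
			for level := 1; level <= 4; level++ {
				for _, g := range plan(level) {
					if !acquire() {
						break // concurrency limit reached; retry next tick
					}
					go func(g compactionGroup) {
						defer release()
						run(g)
					}(g)
				}
			}
		}
	}
}
```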
The fsyncs due to large writes when writing to TSM files and the
WAL can eventually cause large pauses. Since we already buffer
writes, using synchronous IO reduces fsync latency by ensuring
the individual writes hit disk. This spreads out the latency
across multiple writes better.
This commit adds a basic TSI versioning scheme, by adding a Version field
to an index's MANIFEST file.
Existing TSI indexes will not have this field present in their MANIFEST
files, and thus will be deemed incompatible with the current version.
Users with existing TSI indexes will be able to remove them, and convert the
resulting inmem indexes to the current version of a TSI index using the
influx_inspect tooling.
With higher cardinalities, the encoder pools were becoming a bottleneck.
This changes the snapshot compactions to check out one encoder of each
type and re-use it while writing the snapshots, as opposed to repeatedly
checking encoders out and in.
This periodically re-allocates the cache store to avoid memory
fragmentation and gradual slow down of the store after repeated
deletes and inserts into the map.
This instructs the kernel that it can release memory used by mmap'd
TSM files when they are not actively being used. If the mappings are in
use, the kernel will fault the pages back in. On Linux, this causes
RES memory to drop immediately when run.
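One way to issue this hint from Go is `unix.Madvise` with `MADV_DONTNEED` from `golang.org/x/sys/unix`; a minimal example (the file name is illustrative):

```go
package main

import (
	"log"
	"os"

	"golang.org/x/sys/unix"
)

func main() {
	f, err := os.Open("example.tsm") // illustrative file name
	if err != nil {
		log.Fatal(err)
	}
	defer f.Close()

	fi, err := f.Stat()
	if err != nil {
		log.Fatal(err)
	}

	// Map the file read-only, the way TSM files are accessed.
	b, err := unix.Mmap(int(f.Fd()), 0, int(fi.Size()),
		unix.PROT_READ, unix.MAP_SHARED)
	if err != nil {
		log.Fatal(err)
	}
	defer unix.Munmap(b)

	// Tell the kernel the mapping is not actively needed; pages are
	// released and faulted back in from the file on the next access.
	if err := unix.Madvise(b, unix.MADV_DONTNEED); err != nil {
		log.Fatal(err)
	}
}
```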
A cold shard that suddenly receives a lot of writes could get a very
big cache that takes a long time to snapshot or causes the cache
max memory limit to be hit more quickly. This re-enables the compactions
if necessary during writes so we don't have to wait for the shard monitor
goroutine to re-enable them.
Compactions would create their own TSMReaders for simplicity. With
very high cardinality compactions, creating the reader and indirectIndex
can start to use a significant amount of memory.
This changes the compactions to use a reader that is already allocated
and managed by the FileStore.
These are already sorted during compaction, so switch to sorting lazily
to avoid the CPU and allocations. This would only occur when using the
writer directly.
The directIndex used by the TSMWriter maintained a map of series keys
to index entries. When the index is written to the TSM file, the keys
are sorted and then written out in order.
The reason for this is that directIndex used to be the only index
and it was optimized more for reading. The reading has been replaced
by the indirectIndex so the map of keys ends up wasting space.
During compactions, the series keys (and index entries) are already sorted
so this change uses the sorting to avoid the map and sort when writing the
index. This reduces allocations and CPU usage quite a bit for larger cardinality
TSM files.
This leaves the slower compactions that create full blocks to only
the full compaction. This helps reduce CPU usage and memory while shards
are hot, but increases disk usage (reduced compression) slightly.