influxdb

Commit Graph

Author	SHA1	Message	Date
Edd Robinson	a2ece0a49a	Pass series id in via Index API	2018-01-15 12:00:31 +00:00
Ben Johnson	d295f30686	Remove series id check during deletion.	2018-01-15 12:00:31 +00:00
Edd Robinson	bb6bfad5ea	Ensure inmem index updated properly	2018-01-15 12:00:30 +00:00
Edd Robinson	b9d0a39131	Skip empty series keys	2018-01-15 12:00:30 +00:00
Edd Robinson	a4bef3a4bc	Refactoring delete tests	2018-01-15 12:00:30 +00:00
Jason Wilder	ba9a5af7eb	Mark series deleted in series file This commit adds the ability to correctly mark a series as deleted in the global series file. Whenever a shard engine determines that a series should be deleted, it checks with each shard's bitset for series that are to be deleted and are no longer contained in any shard-local bitsets. These series are then removed from the series file.	2018-01-15 12:00:30 +00:00
Edd Robinson	286c8f4c09	Return to original DELETE/DROP SERIES semantics This reverts commit `59afd8cc90`.	2018-01-15 12:00:30 +00:00
Jason Wilder	c2cbd14e09	Fix TestEngine_DisableEnableCompactions_Concurrent hang This test could hang due to an existing race that is still not fixed. The snapshot and level compaction goroutines woule end up waiting on the wrong channel to be closed so whey would never exit.	2018-01-11 11:58:20 -07:00
Edd Robinson	ed8b9925c8	Comment update	2018-01-11 01:01:54 +00:00
Edd Robinson	e610e7c21d	Track undeleted series IDs per-shard with inmem This commit adds a bitset into each shard's in-memory index, to be used to track undeleted series ids. Currently tsi1 support is not implemented. When new series are added to the shard, the series id is added to the bitset. When series are deleted from the shard, the series ids are removed from the bitset. Becasue each shard shares the same inmem index reference, the bitset is stored in the `ShardIndex`, which is local to each shard, and then different references are passed into the shared `Index` object, depending on which shard is writing the series.	2018-01-11 01:01:54 +00:00
Adam	938db68198	Update restore functionality to run in online mode, consume Enterprise backup files. (#9207 ) * Live Restore + Enterprise data format compatability * Extended ImportData to import all DB's if no db name given * Added a new enterprise data test, and backup command now prints the backup file paths at conclusion * Added whole-system backup test * Update to use protobuf in all enterprise data cases * Update to test to do cross-testing with enterprise version * incremental enterprise backup format support	2018-01-10 13:59:18 -05:00
David Norton	1c452d83cb	fix #9286 : return digest size	2018-01-08 13:15:14 -05:00
Stuart Carnie	ed207b54c3	updates after TSI / series file merge	2017-12-29 10:58:25 -07:00
Stuart Carnie	5dfe3b2645	inmem startup improvments * only call ParseTags when necessary * remove dependency on inmem.Series in tsdb test package * Measurement and Series are no longer exported. Their use is restricted to the inmem package * improve Measurement and Series types by exporting immutable fields and removing unnecessary APIs and locks Reduced startup time from 28s to 17s. Overall improvement including #9162 reduces startup from 46s to 17s for 1MM series across 14 shards.	2017-12-29 07:58:52 -07:00
Ben Johnson	d8b1d208c0	rebase	2017-12-20 15:13:34 -07:00
Edd Robinson	c476a0b4a1	Merge branch 'master' into er-tsi-index-part	2017-12-15 18:31:24 +00:00
Jason Wilder	2d85ff1d09	Adjust compaction planning Increase level 1 min criteria, fix only fast compactions getting run, and fix very large generations getting included in optimize plans.	2017-12-14 22:41:34 -07:00
Jason Wilder	749c9d2483	Rate limit disk IO when writing TSM files This limits the disk IO for writing TSM files during compactions and snapshots. This helps reduce the spiky IO patterns on SSDs and when compactions run very quickly.	2017-12-14 22:02:32 -07:00
Edd Robinson	59afd8cc90	Return to original DELETE/DROP SERIES semantics Since possibly v0.9 DELETE SERIES has had the unwanted side effect of removing series from the index when the last traces of series data are removed from TSM. This occurred because the inmem index was rebuilt on startup, and if there was no TSM data for a series then there could be not series to add to the index. This commit returns to the original (documented) DROP/DETETE SERIES behaviour. As such, when issuing DROP SERIES all instances of matching series will be removed from both the TSM engine and the index. When issuing DELETE SERIES only TSM data will be removed. It is up to the operator to remove series from the index. NB, this commit does not address how to remove series data from the series file when a shard rolls over.	2017-12-15 00:02:06 +00:00
Edd Robinson	9e3b17fd09	Ensure deleted series are not returned via iterators	2017-12-14 21:29:35 +00:00
Jason Wilder	7dc5327a0a	Adjust snapshot concurrency by latency This changes the approach to adjusting the amount of concurrency used for snapshotting to be based on the snapshot latency vs cardinality. The cardinality approach could use too much concurrency and increase the number of level 1 TSM files too quickly which incurs more disk IO. The latency model seems to adjust better to different workloads.	2017-12-13 13:17:56 -07:00
David Norton	253ea7cc5e	feat #9212 : fix file in use bug on Windows	2017-12-13 09:29:07 -05:00
David Norton	4e13248d85	feat #9212 : add ability to generate shard digests	2017-12-13 09:28:34 -05:00
Edd Robinson	f1bcc97e89	Fix auth tests	2017-12-12 21:25:35 +00:00
Edd Robinson	7d13bf3262	merge master	2017-12-08 17:21:58 +00:00
Edd Robinson	f6835632e7	Merge master into branch	2017-12-08 17:11:07 +00:00
Edd Robinson	3318c94a2f	Clean up 🛁:	2017-12-08 11:38:53 +00:00
Adam	a0b2195d6b	Pulled in backup-relevant code for review (#9193 ) for issue #8879	2017-12-07 11:35:20 -05:00
Jason Wilder	0a85ce2b73	Schedule compactions less aggressively This runs the scheduler every 5s instead of every 1s as well as reduces the scope of a level 1 plan.	2017-12-06 13:45:43 -07:00
Ben Johnson	493c1ed0d1	inmem tests passing.	2017-12-05 10:49:58 -07:00
Jason Wilder	909a2fb6cc	Fix deletes removing index for invalid time ranges If a delete for a time that does not exist was run, we would not remove the series key from the slice of series to remove from the index. This could be triggered by running somethin like "delete from cpu where time = 0" and if there was no data at time 0, the series would still be removed from the index.	2017-11-30 15:01:01 -07:00
Andrew Hare	761a8f8bec	Schedule a full compaction after a successful import	2017-11-29 13:50:38 -07:00
Ben Johnson	ca09f18e65	intermediate: tsdb compile	2017-11-29 11:20:18 -07:00
Jason Wilder	8633e38549	Fix removing series from index The loop to check if a series still exists in a TSM file was wrong in that it 1) exited early after one iteration and 2) had an off by one error that causes the wrong series to be marked as existing. This fixes both of these cases which can cause the index to become inconsistent with the data store on disk.	2017-11-29 10:45:04 -07:00
Edd Robinson	81976bca59	Refactor based on new design	2017-11-28 17:54:29 +00:00
Edd Robinson	b10249a9b3	Fix rebase	2017-11-28 15:58:35 +00:00
Edd Robinson	041a3837be	Ensure index can track fields	2017-11-28 15:57:03 +00:00
Edd Robinson	38e0dd695f	Allow concurrent access to Engine Index	2017-11-28 15:57:03 +00:00
Edd Robinson	12a2ff7fac	Add support for TSI shard streaming and shard size This commit firstly ensures that a shard's size on disk is accurately reported when using the tsi1 index, by including the on-disk size of the tsi1 index in the calculation. Secondly, this commit add support for shard streaming/copying when using the tsi1 index. Prior to this, a tsi1 index would not be correctly restored when streaming shards.	2017-11-28 15:57:02 +00:00
Jason Wilder	b59858e529	Ensure series keys are sorted before searching The Cache.ApplyEntryFn iterates keys according to the partitions and hashed values. This can cause the deleteKeys slice to contain unsorted keys when deleting series. The code uses a binary search on this slice later on and this can fail to detect that the series should still exists. The series is then removed from the index even though it has data still. Fixes #9116	2017-11-27 17:06:03 -07:00
Edd Robinson	e6b7140d65	Merge pull request #9143 from influxdata/er-show-tag-key-perf SHOW TAG KEYS with high cardinality and many shards	2017-11-27 15:04:15 +00:00
Stuart Carnie	7cdfd95966	initial opentrace implementation for ifql interface NOTE: does not include a default tracer until configuration across projects is standardized	2017-11-22 14:42:26 -07:00
Jason Wilder	cacb55fac4	Fix typos	2017-11-22 11:17:34 -07:00
Jason Wilder	dd1c030815	Remove limit count param on fields It's not used anymore.	2017-11-22 11:17:34 -07:00
Jason Wilder	c14b0e81b7	Save field types to speed up startup This persists the field types in a shard to avoid having to scan all the TSM files at startup.	2017-11-22 11:17:34 -07:00
Edd Robinson	68dd5e27c8	Improve performance of TagKeys	2017-11-21 17:16:47 +00:00
Jason Wilder	50b6ace75f	Fix wait reused while disabling compactions	2017-11-20 14:55:47 -07:00
Edd Robinson	6851db3fc9	Add FGA support to SHOW MEASUREMENTS	2017-11-17 11:06:43 +00:00
Ben Johnson	ede3fcf98e	intermediate	2017-11-15 16:09:25 -07:00
Jason Wilder	97e0d496a6	Add capability to force a full compaction This adds the capability to the engine to force a full compaction to be scheduled. When called, it snapshots any data in the cache, aborts running compactions and prevents level plans from returning level plans.	2017-11-15 07:14:27 -07:00
Ben Johnson	ba4c9e0317	Merge remote-tracking branch 'upstream/master' into er-tsi-index-part	2017-11-14 16:14:13 -07:00
Jason Wilder	8b18cc4456	Optimize deletes in tsi The DropSeries code path ended up creating a MeasurementSeriesIterator for each dropped series, this was too expensive just to see if a series exists. This adds a HasSeries func and fixes and issue where TSI files were compacted while an iterator was still in use causing a panic.	2017-11-13 12:35:38 -07:00
Jason Wilder	13692639cb	Fix create/delete series race This fixes a race where writes and deletes to the same series and measurements could sometimes leave the index in an inconsistent state.	2017-11-13 09:02:10 -07:00
Jason Wilder	80cd5e63af	Optimize DeleteSeriesRange This removes more allocations and speeds up some critical sections.	2017-11-13 09:02:10 -07:00
Jason Wilder	aee395d3bd	Make DeleteSeriesRange take SeriesIterator	2017-11-13 09:02:10 -07:00
Jason Wilder	f893beb6d8	Use MeasurementSeriesKeysByExprIterator for deletes	2017-11-13 09:02:10 -07:00
Jason Wilder	88c48ec78b	Rework Engine.DeleteSeriesRange to avoid allocations This removes the containsSeries func which ends up creating a map sized to the slice of keys passed in. This doesn't scale well to high cardinalities and creates a lot of garbage.	2017-11-13 08:50:07 -07:00
Jason Wilder	b0c7a44eaa	Adjust min/max time to work in the engine The query language min and max times are slighly different than the values used in the engine. This allows faster codes to be used when the whole time range is deleted.	2017-11-13 08:50:07 -07:00
Jonathan A. Sternberg	0b7c56bcd8	Update the zap logger dependency The previous sha was taken from a revision on a devel branch that I thought would continue staying in the tree after it was merged. That revision was rebased away and the API was changed for the logger. This updates the usage of the logger and adds a simple package for constructing the base logger. The 1.0 version of zap changed the format of the default console logger so this change moves over to this new logger instead of attempting to retain backwards compatibility with the old format.	2017-11-10 16:27:16 -06:00
Ben Johnson	9ad2b53881	intermediate	2017-11-09 09:18:33 -07:00
Andrew Hare	ecb3952fa9	Allow human-readable byte sizes in config Update support in the `toml` package for parsing human-readble byte sizes. Supported size suffixes are "k" or "K" for kibibytes, "m" or "M" for mebibytes, and "g" or "G" for gibibytes. If a size suffix isn't specified then bytes are assumed. In the config, `cache-max-memory-size` and `cache-snapshot-memory-size` are now typed as `toml.Size` and support the new syntax.	2017-11-01 11:09:09 -05:00
Stuart Carnie	9a43c14653	Merge pull request #9041 from influxdata/sgc-influxql influxdata/influxdb/influxql -> influxdata/influxql	2017-10-31 07:31:31 -07:00
Stuart Carnie	f3d45ba301	influxdata/influxdb/influxql -> influxdata/influxql	2017-10-30 14:40:26 -07:00
Jason Wilder	48ebc53154	Revert "Fix race in disableLevelCompactions" This reverts commit `4f8580fbaa`.	2017-10-30 14:14:50 -06:00
Stuart Carnie	c39f1ad748	Add batch cursor support to tsdb and tsm1 * batch cursors return slices of timestamps and values to reduce call overhead. Significantly improved iteration. * added CreateCursor API to Shard, Engine * moved build*Cursor to code gen	2017-10-25 13:38:07 -07:00
Stuart Carnie	b7579340fe	return query.ErrQueryInterrupted for read on InterruptCh	2017-10-24 14:10:28 -07:00
Stuart Carnie	e9313876ab	EXPLAIN ANALYZE * Introduces EXPLAIN ANALYZE command, which produces a detailed tree of operations used to execute the query. introduce context.Context to APIs metrics package * create groups of named measurements * safe for concurrent access tracing package EXPLAIN ANALYZE implementation for OSS Serialize EXPLAIN ANALYZE traces from remote nodes use context.Background for tests group with other stdlib packages additional documentation and remove unused API use influxdb/pkg/testing/assert remove testify reference	2017-10-20 08:01:37 -07:00
Jason Wilder	4f8580fbaa	Fix race in disableLevelCompactions There was a race on the WaitGroup where we could end up calling Add while another goroutine was still waiting. The functions were confusing so they have been simplified a bit since the compactions goroutines have been reworked a lot already.	2017-10-16 10:50:16 -06:00
Jason Wilder	1401950b10	Only schedule one compaction per shard at a time The scheduling logic ended up favoring more backlogged shards too much and would starved active, less backed up shards. This occurred because the scheduling kicks in once a second. When it runs, it schedules as many compactions as it can. A backed up shard would end up having more compactions to run during the loop an would generally get to schedule them more frequently. This now allows each shard to try and schedule one compaction at a time which provides a more balanced approach. At some point, we'll probably want to more directly balanc the each shards backlog vs letting it happen somewhat randomly.	2017-10-09 11:40:32 -06:00
Jason Wilder	6b6ccf1a40	Wait for compaction gorotuines to finish	2017-10-04 10:01:44 -06:00
Jason Wilder	90df803802	Prevent infinite scheduling loop One shard might be able to run a compaction, but could fail to limits being hit. This loop would continue indefinitely as the same task would continue to be rescheduled.	2017-10-03 10:48:14 -06:00
Jason Wilder	71071ed67a	Add compaction backlog stat This gives an indication as to whether compactions are backed up or not.	2017-10-03 10:48:14 -06:00
Jason Wilder	ae821f4e2d	Rework compaction scheduling This changes the compaction scheduling to better utilize the available cores that are free. Previously, a level was planned in its own goroutine and would kick off a number of compactions groups. The problem with this model was that if there were 4 groups, and 3 completed quickly, the planning would be blocked for that level until the last group finished. If the compactions at the prior level are running more quickly, a large backlog could accumlate. This now moves the planning to a single goroutine that plans each level in succession and starts as many groups as it can. When one group finishes, the planning will start the next group for the level.	2017-10-03 10:48:13 -06:00
Joe LeGasse	1443b22379	auth: add series auth to 'show tag values'	2017-09-27 20:01:18 -04:00
Edd Robinson	d0b81c1e6c	Fix race on Cache entry	2017-09-27 18:10:23 +01:00
Jason Wilder	db204f3eb7	Default concurrent compactions to 50% of available cores	2017-09-21 12:48:11 -06:00
Jason Wilder	0d52b060df	Skip onFileStoreReplace with tsi	2017-09-19 15:27:25 -06:00
Jason Wilder	31646aae3a	Release mmap pages when shard is cold This instructs the kernel that it can release memory used by mmap'd TSM files when they are not actively being used. It the mappings are use, the kernel will fault the pages back in. On linux, this causes RES memory to drop immediately when run.	2017-09-18 11:51:51 -06:00
Jason Wilder	2a0d7935d7	Switch level 3 compactions to use fast compaction strategy This leaves the slower compactions that create full blocks to only the full compaction. This helps reduce CPU usage and memory while shards are hot, but increases disk usage (reduced compression) slightly.	2017-09-11 15:26:24 -06:00
Jason Wilder	94e229ff59	Merge branch 'master' into jw-drop-series	2017-09-08 15:34:32 -06:00
Jason Wilder	b9b648e2a0	Dynamically allocate cache store The cache store can be memory intensive with many shards. This lazyily allocates it when needed and frees it when the cache is empty and cold.	2017-09-07 16:35:08 -06:00
Jason Wilder	a8d9eeef36	Reduce lock contention when deleting high cardinality series Deleting high cardinality series could take a very long time, cause write timeouts as well as dead lock the process. This fixes these issue to by changing the approach for cleaning up the indexes and reducing lock contention. The prior approach delete each series and updated every index (inmem) during the delete. This was very slow and cause the index to be locked while it items in a slice were removed one by one. This has been changed to mark series as deleted and then rebuild the index asynchronously which speeds up the process. There was also a dead lock that could occur when deleing the field set. Deleting the field set held a write lock and the function it invoked under the lock could try to take a read lock on the field set. This would then deadlock. This approach was also very slow and caused time out for writes. It now uses faster approach that checks for the existing of the measurment in the cache and filestore which does not take write locks.	2017-09-07 11:36:02 -06:00
Jonathan A. Sternberg	590be193e5	Include the number of scanned cached values in the iterator cost	2017-09-06 15:41:07 -05:00
Jonathan A. Sternberg	50d404e690	Initial implementation of explain plan It prints the statistics of each iterator that will access the storage engine. For each access of the storage engine, it will print the number of shards that will potentially be accessed, the number of files that may be accessed, the number of series that will be created, the number of blocks, and the size of those blocks.	2017-09-01 09:01:10 -05:00
Jonathan A. Sternberg	466fc9026e	Reduce how long it takes to walk the varrefs in an expression This is used quite a bit to determine which fields are needed in a condition. When the condition gets large, the memory usage begins to slow it down considerably and it doesn't take care of duplicates.	2017-08-31 09:33:45 -05:00
Ben Johnson	1dbe0662d8	Use system cursors for measurement, series, and tag key meta queries.	2017-08-30 08:35:20 -06:00
Stuart Carnie	d189621d07	log message when iterator closed by finalizer	2017-08-21 16:46:24 -07:00
Stuart Carnie	25edd7bfdf	naming	2017-08-17 15:47:47 -07:00
Stuart Carnie	c86dc0d103	redundant allocation is overwritten by line 1769	2017-08-17 11:12:41 -07:00
Stuart Carnie	823f903cc6	inputs are closed if Merge returns error and use <type>FinalizerIterator * <type>FinalizerIterator sets a runtime finalizer and calls Close when garbage collected. This will ensure any associated cursors are closed and the associated TSM files released * `query.Iterators#Merge` call could return an error and the inputs would not be closed, causing a cursor leak	2017-08-17 11:12:18 -07:00
Jason Wilder	85842503be	Fix deadlock in engine/measurement fields The OnReplace func ends up trying to acquire locks on MeasurementFields. When its called via snapshotting, this can deadlock because the snapshotting goroutine also holds an RLock on the engine. If a delete measurement calls is run at the right time, it will lock the MeasurementFields and try to acquire a lock on the engine to disable compactions. This creates a deadlock. To fix this, the OnReplace callback is moved to a function param to allow only Replace calls as part of a compaction to invoke it as opposed to both snapshotting and compactions. Fixes #8713	2017-08-16 16:43:40 -06:00
Jonathan A. Sternberg	697759613c	Remove time comparisons from the inner sections of the storage engine	2017-08-16 16:51:13 -05:00
Jonathan A. Sternberg	9a2357c2c0	Separate the query engine into a separate package This change provides a clear separation between the query engine mechanics and the query language so that the language can be parsed and dealt with separate from the query engine itself.	2017-08-16 13:38:43 -05:00
Stuart Carnie	3caeee8a24	fix: cursor leak when cur == nil and aux or conds is not empty	2017-08-16 09:17:20 -07:00
Ben Johnson	e0d8cb0ef3	Cardinality AST, parser, & rewriter fixes.	2017-08-16 09:27:29 -06:00
Ben Johnson	60ab1282ea	Refactor system iterators. Previously pseudo iterators could be created for meta data such as series, measurement, and tag data. These iterators were created at a higher level and lacked a lot of the power of the query engine. This commit moves system iterators down to the series level and supports the following: - _name - _seriesKey - _tagKey - _tagValue - _fieldKey These can be used as normal fields such as: SELECT _seriesKey FROM cpu This will return all the series keys for `cpu`.	2017-08-16 09:27:29 -06:00
David Norton	1d8d739418	fix #8677 : check for snapshot size == 0	2017-08-16 09:43:56 -04:00
Jason Wilder	90e2cadeb6	Fix drop measurement not dropping all data If there were multiple shards, drop measurement could update the index and remove the measurement before the other shards ran their deletes. This causes the later shards to not see any series to delete. The fix is to all deleteSeries to handle the index delete which already accounts for removing the measurement when it is fully removed from the index.	2017-08-15 11:19:45 -06:00
Edd Robinson	aa7095be5a	Use a merge-based approach for TagValues	2017-08-02 14:10:52 +01:00
Jason Wilder	94a48774b7	Pull in new index filter	2017-08-02 14:10:52 +01:00
Stuart Carnie	ff65f0f24d	Reduce allocations using nil cursors and literal value cursors ``` benchmark old ns/op new ns/op delta BenchmarkIntegerIterator_Next-8 82.8 22.7 -72.58% benchmark old allocs new allocs delta BenchmarkIntegerIterator_Next-8 3 0 -100.00% benchmark old bytes new bytes delta BenchmarkIntegerIterator_Next-8 32 0 -100.00% ```	2017-07-30 09:15:34 -07:00
Jason Wilder	778000435a	Conver all keys from string to []byte in TSM engine This switches all the interfaces that take string series key to take a []byte. This eliminates many small allocations where we convert between to two repeatedly. Eventually, this change should propogate futher up the stack.	2017-07-28 11:00:50 -06:00
Jason Wilder	18a02d50d7	Interrupt in progress TSM compactions When snapshots and compactions are disabled, the check to see if the compaction should be aborted occurs in between writing to the next TSM file. If a large compaction is running, it might take a while for the file to be finished writing causing long delays. This now interrupts compactions while iterating over the blocks to write which allows them to abort immediately.	2017-07-27 15:58:56 -06:00
Stuart Carnie	eec80692c4	Taught tsm1 storage engine how to read and write uint64 values * introduced UnsignedValue type * leveraged existing int64 compression algorithms (RLE, Simple 8B) * tsm and WAL can read and write UnsignedValue * compaction is aware of UnsignedValue * unsigned support to model, cursors and write points NOTE: there is no support to create unsigned points, as the line protocol has not been modified.	2017-07-24 09:03:22 -07:00
Jason Wilder	839cddf6d5	Refresh index after compactions The in-memory index can get out of sync when deletes and writes to the same measurement are running concurrently. The index is updated independently from data on disk and it's possible for the index to unassign a shard when data still exists on disk. What happens is that there are TSM files on disk, but the index does not know that the series that exist in those files still are in the shard. Restarting the server reloads the index and the data is visible again. From and end user perspective, this can look like more data is deleted than should have been or that deleted data re-appears after a restart or writes to the shard occur again. There isn't an easy way to resolve this since the index and storage are not transactional resources and we cannot atomically commit or rollback changes to both at once. As a workaround, after new TSM files are installed, we refresh the index with series keys that exist in the new tsm files as well as any lingering data still in the cache. There is a small window of time when the index may be missing series, but it will re-appear after the refresh completes.	2017-07-07 12:19:30 -06:00
Jason Wilder	9ac042b5cd	Reduce lock contention when disabling compactions The monitor goroutine calls enable compactions every 10s to spin down (or start up) goroutines for cold shards. This frequent Lock may be causing lock contention for writes and queries which get blocked trying to acquire an RLock. The go RWMutex says that new RLock calls will block if there is a pending Lock call that is blocked. Switching the common path to use an RLock should avoid the Lock and reduce lock contention for writes and queries.	2017-07-05 15:42:21 -06:00
Stuart Carnie	46796d932f	add database to index, engine and shard; call AuthorizeSeriesRead	2017-05-26 13:21:50 -07:00
Joe LeGasse	815f740f4c	initial fga work wip wip fix tests / build	2017-05-26 13:16:27 -07:00
Jason Wilder	29e4287fd2	Preven masking root errors when compactions are in progress The root error when creating a tmp file when writing a snapshot was hidden making it difficult to determine why snapshots were failing.	2017-05-23 12:09:36 -06:00
Jason Wilder	bd6d0681e9	Ensure planned files are released The defer was never executed because the planning happens in a long running goroutine that loops. The plans need to be released immediately after applying them.	2017-05-23 12:08:25 -06:00
Jason Wilder	1833475c09	Fix TSM tmp files leaking TMP files could leak when compactions failed for various reasons. They were also being deleted inadvertently when compactions were disabled causing other errors to be reported in the logs.	2017-05-22 14:51:18 -06:00
Jason Wilder	4d002bb370	Limit concurrent compactions within a shard This changes full compactions within a shard to run sequentially instead of running all the compaction groups in parallel. Normally, there is only 1 full compaction group to run. At times, there could be several which causes instability if they are all running concurrently as they tie up a cpu for long periods of time. Level compactions are also capped to a max of 4 concurrently running for each level in a shard. This prevents sudden spikes in CPU and disk usage due to a large backlog of tsm files at a given level.	2017-05-12 14:05:24 -06:00
Jason Wilder	2cac46ebbc	Convert usage of strings to []byte Measurement name and field were converted between []byte and string repetively causing lots of garbage. This switches the code to use []byte in the write path.	2017-05-12 14:05:19 -06:00
Jason Wilder	c0c6ad6880	Don't disable snapshots when snapshot compactions are disabled Snapshot compactions can be disabled independently of snapshotting capability. This prevents taking backups of shards that have compactions disabled.	2017-05-05 14:15:45 -06:00
Jason Wilder	bc639c5982	Make disableLevelCompactions lighter weight Since this is called more frequently now, the cleanup func was invoked quite a bit which makes several syscalls per shard. This should only be called the first time compactions are disabled.	2017-05-04 09:56:15 -06:00
Jason Wilder	88848a9426	Remove per shard monitor goroutine The monitor goroutine ran for each shard and updated disk stats as well as logged cardinality warnings. This goroutine has been removed by making the disks stats more lightweight and callable direclty from Statisics and move the logging to the tsdb.Store. The latter allows one goroutine to handle all shards.	2017-05-03 16:31:57 -06:00
Jason Wilder	f87fd7c7ed	Stop background compaction goroutines when shard is cold Each shard has a number of goroutines for compacting different levels of TSM files. When a shard goes cold and is fully compacted, these goroutines are still running. This change will stop background shard goroutines when the shard goes cold and start them back up if new writes arrive.	2017-05-03 16:31:57 -06:00
Jason Wilder	3d1c0cd981	Don't return compaction plans for files already part of a plan The compactor prevents the same file from being compacted by different compaction runs, but it can result in warning errors in the logs that are confusing. This adds compaction plan tracking to the planner so that files are only part of one plan at a given time.	2017-05-03 16:31:57 -06:00
Jason Wilder	8fc9853ed8	Add max-concurrent-compactions limit This limit allows the number of concurrent level and full compactions to be throttled. Snapshot compactions are not affected by this limit as then need to run continously. This limit can be used to control how much CPU is consumed by compactions. The default is to limit to the number of CPU available.	2017-05-03 16:31:57 -06:00
Jason Wilder	141f0d71cd	Update index when import files	2017-04-28 14:00:45 -06:00
Jason Wilder	a76146e34a	Add Store.Import capability This allows the contents of a backup to be imported into a shard without requiring the whole shard to be replaced.	2017-04-28 13:30:46 -06:00
Jason Wilder	137d0c0d09	Rename WAL.WritePoints to WAL.WriteMulti To match Cache.WriteMulti	2017-04-28 13:20:55 -06:00
Jason Wilder	5c51ae7319	Merge branch '1.2' into jw-merge-123	2017-04-14 14:36:54 -06:00
Jason Wilder	ff1270dfeb	Fix dropping fields created data corruption The Point is intended to be immutable after being parsed since it is shared by several goroutines. When dropping a field (e.g. time), corrupted data can result if one goroutine is delete the field while another is marshaling the underlying byte slices. To avoid this, the shard will just skip invalid fields and series instead of trying to mutate them by deleting them.	2017-04-07 12:58:42 -06:00
Ben Johnson	9c97cd8601	Merge remote-tracking branch 'upstream/master' into tsi	2017-04-04 12:46:09 -06:00
Jason Wilder	5fa8073fc2	Merge branch '1.2' into jw-merge-123	2017-04-04 11:12:06 -06:00
Jason Wilder	8da84e6144	Merge branch 'master' into tsi	2017-04-03 11:21:02 -06:00
Jason Wilder	32c4d43952	Speed up drop measurement This reworks drop measurement to use a sorted list of series keys instead of creating an intermediate map. It remove allocations and some extra garbage that is created during drop measurement.	2017-04-03 08:57:53 -06:00
Edd Robinson	fddaff2cc8	Merge master in	2017-03-29 18:00:28 +01:00
Ben Johnson	2edfb1c92d	Ignore series limit on database load.	2017-03-24 16:27:16 -06:00
Ben Johnson	9fb8f1ec1d	Fix database and tag limits.	2017-03-24 09:48:10 -06:00
Jason Wilder	7119ef8f29	Merge pull request #8193 from influxdata/jw-123-backports 1.2.3 backports	2017-03-23 13:31:35 -06:00
Jason Wilder	ced953ae89	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-23 12:53:17 -06:00
Jason Wilder	2972a3f223	Remove MMAP derefencing code This code was added to address some slow startup issues. It is believed to be the cause of some segfault panic's that occur at query time when the underlying MMAP array has been unmapped. The current structure of code makes this change unnecessary now.	2017-03-23 12:46:23 -06:00
Jason Wilder	8f7b251afd	Merge branch 'master' into jw-tsi	2017-03-20 17:17:26 -06:00
Jason Wilder	8177df2dab	Simplify Measurement.TagSets signature	2017-03-17 16:19:10 -06:00
Jason Wilder	2d5d899ac2	Allow queries to be interrupted during planning If a bad query is run, kill query and limits would not kick in until after it started executing. Some bad queries that involve high cardinality can cause the server to OOM just from planning which defeats the purpose of the max-select-series limit. This change primarily fixes max-select-series limit so that the query is killed earlier and has the side effect that kill query now can kill a query while it's being planned.	2017-03-17 16:00:54 -06:00
Jason Wilder	bc4aeefbed	Check max-series-limit in shard iterator creation The limit waited until all the iterators had been created which still allows problem queries to be planned. This allows the queries to be aborted much earlier in some cases.	2017-03-17 16:00:25 -06:00
Jason Wilder	e9eb925170	Coalesce multiple WAL fsyncs Fsyncs to the WAL can cause higher IO with lots of small writes or slower disks. This reworks the previous wal fsyncing to remove the extra goroutine and remove the hard-coded 100ms delay. Writes to the wal still maintain the invariant that they do not return to the caller until the write is fsync'd. This also adds a new config options wal-fsync-delay (default 0s) which can be increased if a delay is desired. This is somewhat useful for system with slower disks, but the current default works well as is.	2017-03-15 16:31:03 -06:00
Ben Johnson	1807772388	Fix tsi tests.	2017-03-15 11:23:58 -06:00
Ben Johnson	358b1e0b05	Merge remote-tracking branch 'upstream/master' into tsi	2017-03-15 10:13:32 -06:00
Jason Wilder	2f7d4995b4	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-09 16:27:07 -07:00
Jason Wilder	b9e5375043	Merge branch '1.2' into jw-merge-12	2017-03-08 13:16:50 -07:00
Jason Wilder	37187cbe6d	Delete series under fields lock Still seeing the panic that switching this logic around was supposed to fix. We now delete the bulk of data outside of the fields lock and then again, under the write lock, to ensure that the field mapping is accurate. We don't do the full delete under the lock because it can block writes and queries that require a read lock.	2017-03-06 14:19:55 -07:00
Jason Wilder	675d7c9d65	Merge branch '1.2' into jw-merge12	2017-03-06 11:09:05 -07:00
Jason Wilder	3c70abf061	Delete series before remove from field index There is a race where the field type can be deleted while a new type is written and during a query. When this happens, an iterator for the new type is created but old data make still exist in the cache for TSM files causing a panic.	2017-03-06 09:38:27 -07:00
Jason Wilder	a024003f2c	Merge branch '1.2' into jw-merge-12	2017-02-22 12:13:29 -07:00
Ben Johnson	78a9bb2527	Remove Tags.shouldCopy, replace with forceCopy on series creation. Previously, tags had a `shouldCopy` flag to indicate if those tags referenced an underlying buffer and should be copied to allow GC. Unfortunately, this prevented tags from being copied that were created and referenced the mmap which caused segfaults. This change removes the `shouldCopy` flag and replaces it with a `forceCopy` argument in `CreateSeriesIfNotExists()`. This allows the write path to indicate that tags must be cloned on insert.	2017-02-21 11:13:35 -07:00
Mark Rushakoff	601cbcd084	Merge branch '1.2' into mr-merge-12	2017-02-17 16:14:22 -08:00
Jonathan A. Sternberg	2fe48d6781	Rename zap import back to github.com/uber-go/zap They rebased a revision we were previously relying upon that allowed us to use the vanity name so we are reverting back to an older version with the old import path.	2017-02-17 17:17:22 -06:00

1 2 3 4 5 ...

463 Commits (master)