influxdb

Commit Graph

Author	SHA1	Message	Date
Edd Robinson	bd8dd9a291	Sketches working	2017-01-05 09:54:04 -07:00
Edd Robinson	d19fbf5ab4	Wire in HLL estimator	2017-01-05 09:54:03 -07:00
Edd Robinson	2b8efefef4	Initial index interface	2017-01-05 09:51:43 -07:00
Edd Robinson	05bc4dec00	Refactor	2017-01-05 09:50:23 -07:00
Edd Robinson	c535e3899a	Remove in-memory index from Shard and Store	2017-01-05 09:47:09 -07:00
Mark Rushakoff	6a94d200c8	Merge remote-tracking branch 'influx/master' into mr-godoc	2017-01-04 13:27:36 -08:00
Cory LaNou	3c518f8927	panicing is bad -> error returns are good	2017-01-03 14:28:29 -06:00
Mark Rushakoff	41415cf2fb	Update godoc for tsm1 package	2017-01-02 07:30:18 -08:00
Gustav Westling	26b33307ae	Resolved PR comments on test files	2016-12-30 11:42:38 +01:00
Gustav Westling	56d98325da	Removed ineffective assignments, and added checks for errors that previsouly was not checked	2016-12-29 20:26:15 +01:00
Jason Wilder	2468347ffb	Fix comment	2016-12-19 14:17:49 -07:00
Jason Wilder	326557e539	Fix race in partition.reset	2016-12-19 14:17:01 -07:00
Jason Wilder	e91e45d71c	Fix panic in cache benchmark	2016-12-19 14:17:01 -07:00
Jason Wilder	0b6b9ea1cb	Use atomics for cache.snapshotSize stat	2016-12-19 14:17:01 -07:00
Jason Wilder	637a67ea35	Reduce lock contention on measurementFields	2016-12-19 14:17:01 -07:00
Jason Wilder	b7c1e625b0	Move needSort tracking to Deduplicate This eliminates some *UnixNano() calls and also simplifies the cache logic so that it does not need to worry about whether entries are sorted.	2016-12-19 14:17:01 -07:00
Jason Wilder	dea87703cd	Reduce UnixNano pointer call	2016-12-19 14:17:01 -07:00
Mark Rushakoff	722b6345fe	Fix unchecked error in templated Read${TYPE}Block	2016-12-19 09:31:26 -08:00
Jonathan A. Sternberg	ec57108520	Use proper uber-go/zap import path It looks like the real import path to the project is go.uber.org/zap instead of github.com/uber-go/zap since the example in the project references that path.	2016-12-15 08:54:14 -06:00
Edd Robinson	ec27c57127	Further optimisations and a race fix	2016-12-14 18:23:36 +00:00
Edd Robinson	05ec6ad9ad	Add to index safely	2016-12-14 18:23:36 +00:00
Edd Robinson	d78ca1a0f3	Fix some races	2016-12-14 18:23:36 +00:00
Edd Robinson	d2923c7bf9	Add hints as to how to pre-allocate entry values Currently, whenever a snapshot occurs the Cache is reset and so many allocations are repeated, as the same type of data is re-added to the Cache. This commit allows the stores to keep track of the number of values within an entry, and use that size as a hint when the same entry needs to be recreated after a snapshot. To avoid hints persisting over a long period of time they are deleting after every snapshot, and rebuilt using the most recent entries only.	2016-12-14 18:23:36 +00:00
Edd Robinson	f2b5c7f5be	Reduce contention when adding entries	2016-12-14 18:23:36 +00:00
Edd Robinson	98f0392ca6	Update size using atomic	2016-12-14 18:23:36 +00:00
Edd Robinson	66edb32182	Sharded Cache using a hash ring	2016-12-14 18:23:36 +00:00
Edd Robinson	d3e6d4e7ca	Add benchmarks	2016-12-14 18:21:50 +00:00
Jonathan A. Sternberg	21502a39e8	Switch logging to use structured logging everywhere The logging library has been switched to use uber-go/zap. While the logging has been changed to use structured logging, this commit does not change any of the logging statements to take advantage of the new structured log or new log levels. Those changes will come in future commits.	2016-12-14 10:45:15 -06:00
Jason Wilder	4f28c90b54	Optimize Value.Deduplicate Deduplicate is called from various places in the engine and can cause a lot of garbage to get created. It first creates a map and then adds each value to the map in order (1st alloc). It then creates a new slice (2nd alloc) and appends everything from the map to the slice. Finally, it sorted the new slice (3rd alloc). This switches the algorithm to use stable sorting and resuing the existing slice to avoid allocations.	2016-12-08 21:10:56 -07:00
Hrvoje Marjanovic	9483b8b409	gofmt	2016-12-03 22:06:38 +01:00
Hrvoje Marjanovic	6ed708e3fd	Reduce pool size, change WAL writers default Big pool can lead to huge memory usage in certain loads. See #7640 for detailed discussion.	2016-12-02 18:45:43 +01:00
Allen Petersen	31129ab0e9	Use slash separator for filenames in tar archives NO-OP on platforms with unix path separator. On Windows paths get converted to slashes before adding to archive and back to backslashes during restore.	2016-11-29 09:44:08 -08:00
Jason Wilder	e8a28cfbab	Expose Shard.LastModified This returns the LastModified time of the shard. The LastModified time is the wall time when a change to the shards state occurred. It uses the WAL or FileStore to determine the max mod time.	2016-11-23 10:04:07 -07:00
Jason Wilder	3a5a01181b	Switch all Value types from pointers	2016-11-15 16:13:55 -07:00
Jason Wilder	0ee58c208a	Switch time.Sleep to time.Ticker Avoids an allocation when calling time.Sleep	2016-11-15 16:13:55 -07:00
Jason Wilder	73b8f52ca0	Cache results onf findGenerations This allocates quite a bit and it's called multiple times per second per shard. The generations don't change until a compaction has occurred so most of the time is re-calculating the same thing and creating garbage.	2016-11-15 16:13:55 -07:00
Jason Wilder	0b6f5441b9	Add config option to messages when limits exceeded When a limit is exceeded, we return errors and sometimes log (if appropriate) that a limit was exceeded. The messages don't always provide an indication as to where or how they are configured. Instead, return the config option (easily searchable for) as well as the limit currently set and the value that exceeded it when possible.	2016-10-28 14:54:45 -06:00
Jason Wilder	b1ceb5e66d	Add cache write OK, Dropped, Error stats Adds a new dropped stat as well as fixes OK and error stats not actually get collected and stored.	2016-10-28 12:15:50 -06:00
Jason Wilder	873189e0c2	Fix panic: interface conversion: tsm1.Value is tsm1.FloatValue, not tsm1.StringValue If concurrent writes to the same shard occur, it's possible for different types to be added to the cache for the same series. The way the measurementFields map on the shard is updated is racy in this scenario which would normally prevent this from occurring. When this occurs, the snapshot compaction panics because it can't encode different types in the same series. To prevent this, we have the cache return an error a different type is added to existing values in the cache. Fixes #7498	2016-10-28 12:15:50 -06:00
Jason Wilder	e388912b6c	Fix race in findGenerations The file store stats slice is re-used which causes the race below: WARNING: DATA RACE Write at 0x00c42007e140 by goroutine 43: github.com/influxdata/influxdb/tsdb/engine/tsm1.(FileStore).Stats() /Users/jason/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/file_store.go:511 +0x22e github.com/influxdata/influxdb/tsdb/engine/tsm1.(DefaultPlanner).findGenerations() /Users/jason/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/compact.go:461 +0x6f github.com/influxdata/influxdb/tsdb/engine/tsm1.(DefaultPlanner).PlanLevel() Previous read at 0x00c42007e140 by goroutine 40: github.com/influxdata/influxdb/tsdb/engine/tsm1.(DefaultPlanner).findGenerations() /Users/jason/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/compact.go:463 +0x13d github.com/influxdata/influxdb/tsdb/engine/tsm1.(*DefaultPlanner).PlanOptimize()	2016-10-28 12:15:49 -06:00
Steven Hartland	3f16197243	Improve tsm1 cache performance Reduce the cache lock contention by widening the cache lock scope in WriteMulti, while this sounds counter intuitive it was: * 1 x Read Lock to read the size * 1 x Read Lock per values * 1 x Write Lock per values on race * 1 x Write Lock to update the size We now have: * 1 x Write Lock This also reduces contention on the entries Values lock too as we have the global cache lock. Move the calculation of the added size before taking the lock as it takes time and doesn't need the lock. This also fixes a race in WriteMulti due to the lock not being held across the entire operation, which could cause the cache size to have an invalid value if Snapshot has been run in the between the addition of the values and the size update. Fix the cache benchmark which where benchmarking the creation of the cache not its operation and add a parallel test for more real world scenario, however this could still be improved. Add a fast path newEntryValues values for the new case which avoids taking the values lock and all the other calculations. Drop the lock before performing the sort in Cache.Keys().	2016-10-25 15:24:51 -06:00
Jonathan A. Sternberg	a515aeda39	Optimize first/last when no group by interval is present The `first()` and `last()` functions response rate would increase linear to the number of points even though it seems like it shouldn't. This optimization greatly reduces the amount of time to return a response when no `GROUP BY time(...)` clause is present in a query.	2016-10-25 09:57:31 -05:00
Jason Wilder	686d1a7ba4	Remove unused config options	2016-10-24 15:32:38 -06:00
Edd Robinson	0ee093f1fb	Memoize output of FileStore.Stats	2016-10-24 10:23:20 -06:00
Jonathan A. Sternberg	3681bc8a43	Filter out series within shards that do not have data for that series Previously, we would return a full tag set for every shard and the tag set would include all series that existed in the database index including series that didn't physically exist within that shard. This led to the tag sets returned being incredibly huge when we had high cardinality but sparse data. Since the data was sparse, it was unexpected that it would cause such a large strain on the system by most people. Now we filter out the series ids that are not assigned to the current shard when computing a tag set for that shard. This lowers the memory usage for high cardinality sparse data drastically and allows queries on those to complete successfully. This does not resolve issues for high cardinality data in every shard that is also spread out over a long series of time. That situation isn't nearly as common as the above situation though.	2016-10-20 14:15:34 -05:00
Jason Wilder	b50d9558cf	Merge pull request #7479 from influxdata/jw-clean-err Skip cleanup if dir does not exist	2016-10-18 15:49:09 -06:00
Jason Wilder	f30b00c24f	Skip cleanup if dir does not exist	2016-10-18 15:33:39 -06:00
Mark Rushakoff	377c40f122	Add stats for active compactions Unify logic around compaction execution to a single place. Also report on the error stats that we track. Previously they were not emitted in the stats output.	2016-10-18 14:12:21 -07:00
Joe LeGasse	de9c743004	TSM: update comments for disabling level compactions	2016-10-18 14:14:59 -06:00
Joe LeGasse	eda8f70372	TSM: Handle concurrent deletes for compaction	2016-10-18 14:14:59 -06:00

1 2 3 4 5 ...

597 Commits (bd8dd9a29107e1cb5e7b5674779f332efa6eb3d6)