Some files can become orphaned behind higher levels. This causes
compactions to become blocked, since the lower-level files will never
be picked up by their level planners. This change allows the full plan
to identify those files and pull them into its plan.
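As a rough sketch of the idea (the types and names below are
hypothetical, not the actual planner code), the full plan can detect
files stuck at a lower compaction level than newer files around them,
which is exactly the case the per-level planners never group:

    // Hypothetical sketch, not the actual planner. Files are assumed
    // ordered oldest to newest.
    package main

    type tsmFile struct {
        Path  string
        Level int
    }

    // orphans returns files stuck at a lower compaction level than some
    // newer file; their own level planner will never form a group around
    // them, so the full plan must pick them up.
    func orphans(files []tsmFile) []tsmFile {
        var out []tsmFile
        maxNewer := 0
        // Walk newest to oldest, tracking the highest level seen so far.
        for i := len(files) - 1; i >= 0; i-- {
            switch {
            case files[i].Level < maxNewer:
                out = append(out, files[i])
            case files[i].Level > maxNewer:
                maxNewer = files[i].Level
            }
        }
        return out
    }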
This check doesn't make sense for high-cardinality data, as the files
typically become large and sparse very quickly. That wastes a lot of
extra disk space on large indexes and sparse data.
One shard might be able to run a compaction, but it could fail due to
limits being hit. This loop would continue indefinitely, as the same
task would keep being rescheduled.
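One way to express the fix as a sketch (errLimit and the retry helper
are hypothetical names, not the real scheduler API): treat limit errors
as transient and back off instead of rescheduling the same task
immediately:

    // Hypothetical sketch of backing off when limits are hit.
    package main

    import (
        "errors"
        "time"
    )

    var errLimit = errors.New("compaction limit reached")

    // runWithBackoff retries a compaction, sleeping between attempts
    // that fail only because a limit was hit, so the same task is not
    // rescheduled in a tight loop.
    func runWithBackoff(compact func() error) error {
        backoff := time.Second
        for {
            err := compact()
            if !errors.Is(err, errLimit) {
                return err // nil, or a real failure
            }
            time.Sleep(backoff)
            if backoff < time.Minute {
                backoff *= 2
            }
        }
    }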
With higher cardinality or larger series keys, files can roll over
early, which causes them to take longer to be compacted by higher
levels. This leads to higher disk usage and, at times, larger numbers
of TSM files.
This changes the compaction scheduling to make better use of the cores
that are free. Previously, each level was planned in its own goroutine
and would kick off a number of compaction groups. The problem with this
model was that if there were 4 groups and 3 completed quickly, planning
for that level was blocked until the last group finished. If the
compactions at the prior level were running more quickly, a large
backlog could accumulate.
This moves the planning to a single goroutine that plans each level in
succession and starts as many groups as it can. When one group
finishes, the planner starts the next group for that level.
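A rough sketch of the new shape (all names are hypothetical; a buffered
channel stands in for the free-core limit): a single goroutine plans
level after level and launches each group as soon as a slot frees up,
so one slow group no longer stalls planning for its whole level:

    // Hypothetical sketch of the scheduling shape, not the real code.
    package main

    import "sync"

    type group struct{ level int }

    func plan(level int) []group { return nil } // stand-in planner
    func compact(g group)        {}             // stand-in compaction

    func schedule(levels []int, maxConcurrent int) {
        slots := make(chan struct{}, maxConcurrent) // limits running groups
        var wg sync.WaitGroup
        // A single goroutine plans each level in succession...
        for _, lvl := range levels {
            for _, g := range plan(lvl) {
                slots <- struct{}{} // blocks only when every slot is busy
                wg.Add(1)
                go func(g group) {
                    defer wg.Done()
                    defer func() { <-slots }() // finished group frees a slot
                    compact(g)
                }(g)
            }
        }
        wg.Wait()
    }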
The fsyncs caused by large writes to TSM files and the WAL can
eventually cause long pauses. Since we already buffer writes, using
synchronous IO reduces fsync latency by ensuring the individual writes
hit disk, spreading the latency across multiple writes.
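For illustration, in Go this amounts to opening the segment with O_SYNC
so each buffered, batched write is durable when it returns, instead of
deferring the cost to one large fsync (a sketch, not the exact WAL
code; the path and flag set are assumptions):

    // Sketch: open a segment with O_SYNC so each write is flushed as
    // part of the write call itself, avoiding one large pause-inducing
    // fsync later.
    package main

    import (
        "os"
        "syscall"
    )

    func openSync(path string) (*os.File, error) {
        return os.OpenFile(path,
            os.O_CREATE|os.O_WRONLY|os.O_APPEND|syscall.O_SYNC, 0666)
    }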
This commit adds a basic TSI versioning scheme by adding a Version
field to an index's MANIFEST file.
Existing TSI indexes will not have this field present in their MANIFEST
files, and thus will be deemed incompatible with the current version.
Users with existing TSI indexes will be able to remove them and convert
the resulting inmem indexes to the current version of a TSI index using
the influx_inspect tooling.
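Conceptually, the compatibility check looks like this (a sketch; the
struct and constant names are illustrative, not the exact tsi1 code):

    // Sketch of a MANIFEST version check; names are illustrative.
    package main

    import (
        "encoding/json"
        "fmt"
    )

    const manifestVersion = 1 // current TSI index version

    type manifest struct {
        // Older indexes have no "version" key, so Version decodes to 0
        // and fails the check below.
        Version int `json:"version"`
    }

    func readManifest(data []byte) (*manifest, error) {
        var m manifest
        if err := json.Unmarshal(data, &m); err != nil {
            return nil, err
        }
        if m.Version != manifestVersion {
            return nil, fmt.Errorf("incompatible TSI index MANIFEST version: %d", m.Version)
        }
        return &m, nil
    }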
With higher cardinalities, the encoder pools can become a bottleneck.
This changes snapshot compactions to check out one encoder of each type
and reuse it while writing the snapshot, as opposed to repeatedly
checking encoders out and back in.
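Roughly, the pooling pattern changes like this (a sketch using
sync.Pool with hypothetical encoder types): check an encoder out once
per snapshot and Reset it between blocks, rather than doing a Get/Put
per block:

    // Hypothetical sketch of reusing a pooled encoder across blocks.
    package main

    import "sync"

    type floatEncoder struct{ buf []byte }

    func (e *floatEncoder) Reset()          { e.buf = e.buf[:0] }
    func (e *floatEncoder) Write(v float64) { /* encode v into buf */ }
    func (e *floatEncoder) Bytes() []byte   { return e.buf }

    var floatPool = sync.Pool{New: func() interface{} { return &floatEncoder{} }}

    // writeSnapshot checks the encoder out once and reuses it for every
    // block, instead of hitting the pool once per block, which becomes
    // a point of contention at high cardinalities.
    func writeSnapshot(blocks [][]float64) {
        enc := floatPool.Get().(*floatEncoder)
        defer floatPool.Put(enc)
        for _, blk := range blocks {
            enc.Reset() // same encoder, fresh state per block
            for _, v := range blk {
                enc.Write(v)
            }
            _ = enc.Bytes() // write the encoded block out
        }
    }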
This periodically re-allocates the cache store to avoid memory
fragmentation and gradual slowdown of the store after repeated deletes
and inserts into the map.
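The technique relies on the fact that deleting keys from a Go map never
shrinks its backing storage; rebuilding into a fresh map releases that
memory (a sketch with hypothetical names):

    // Sketch: copy live entries into a freshly allocated map so storage
    // held for long-deleted entries is released.
    package main

    type entry struct{ values []byte }

    func reallocStore(old map[string]*entry) map[string]*entry {
        fresh := make(map[string]*entry, len(old))
        for k, v := range old {
            fresh[k] = v
        }
        return fresh
    }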