influxdb

Commit Graph

Author	SHA1	Message	Date
Jason Wilder	4436e65fb9	Apply deletes to TSM files concurrently	2016-07-28 20:25:36 -06:00
Jason Wilder	fb5a143b08	Fix typos	2016-07-21 12:13:04 -06:00
Jason Wilder	822f409b31	Allow queries to complete before closing TSM files If a query was running against a file being compacted, we close the file and the query would end wherever it had read up to. This could result in queries that randomly lost data, but running them again showed the full results. We now use a reference counting approach and move the in-use files out of the way in the filestore and allow the queries to complete against the old tsm files. The new files are installed and new queries will use them. Fixes #5501	2016-07-21 12:13:04 -06:00
Edd Robinson	f37e726869	Add trace logging statements to tsdb	2016-07-21 11:14:29 +01:00
Edd Robinson	44231abcbd	Add trace logger controlled via DataLoggingEnabled	2016-07-21 11:14:29 +01:00
Edd Robinson	83cc580ff8	Tidy up logging	2016-07-21 11:14:29 +01:00
Jonathan A. Sternberg	12a33fe0d3	Add stats and diagnostics to the TSM engine Track the number of TSM files in the file store and keep engine statistics related to the number of TSM compactions.	2016-07-07 19:35:55 -05:00
Jonathan A. Sternberg	837a9804cf	Refactoring the monitor service to avoid expvar Truncate the time interval output of the monitor service to be on even time intervals rather than on every minute based on the start time. This normalizes the output from the monitor service.	2016-07-07 11:13:58 -05:00
Jason Wilder	ca6bfac01a	Fix out of order blocks returned during query If there were blocks in later TSM files that were for overwritten points or writes into the past, they could be returned more than once or out of order causing the cursor values to be unsorted. One effect of this is that graphs in graphana would render with the line going all over the place in spots. This might also cause duplicate data to be returned. Fixes #6738	2016-06-22 17:34:44 -06:00
Jason Wilder	a74ea4cbf4	Allow creating shards in a disable state For restoring a shard, we need to be able to have the shard open, but disabled. It was racy to open it and then disable it separately since writes/queries could occur in between that time.	2016-06-01 16:17:18 -06:00
Jason Wilder	0b481ff627	Fix pathalogical TSM query case This fixes a pathalogical query condition cause by and problematic structuring of TSM files based on how points were written. The condition can occur when there are multiple TSM files and a large number of points are written into the past. The earlier existing TSM files must also have points in the past and close to the present causing their time range to eclipse the later files. When this condition occurs, some queries can spend an excessive amount of time merge all the overlapping blocks. The fix was to constrain the window of overlapping blocks based on the first one we ran into. There was also a simple case in the Merge where we could skip the binary search path and just append the two inputs.	2016-05-25 09:14:17 -06:00
Jason Wilder	7fb7faaaca	Fix points already read from being returned more than once If there were duplicate points in multiple blocks, we would correctly dedup the points and mark the regions of the blocks we've read. Unfortunately, we were not excluding the already points as the cursor moved to points in the later blocks which could cause points to be return twice incorrectly. Fixes #6611	2016-05-18 17:21:10 -06:00
Cory LaNou	f415cf89ad	wip	2016-05-10 11:01:03 -05:00
Jason Wilder	d99c5e26f6	Fix memory spike when compacting overwritten points If a large series contains a point that is overwritten, the compactor would load the whole series into RAM during a full compaction. If the series was large, it could cause very large RAM spikes and OOMs. The change reworks the compactor to merge blocks more incrementally similar to the fix done in #6556.	2016-05-05 22:31:30 -06:00
Jason Wilder	a0ac754802	Fix loading huge series into RAM when points are overwritten In some query scenarios, if there are a lot of points on disk spread across many blocks in TSM files and a point is overwritten near the begginning of the shard's timerange, the full series could be loaded into RAM triggering OOMs and huge allocations. The issue was that the KeyCursor code that handles overwriting points had a simple implementation that just deduped the whole series in this case. This falls over when the series is quite large. Instead, the KeyCursor has been changed to only decode blocks with updated points. It then keeps track of what section of the blocks have been read so they are not re-read when the later points are decoded. Since the points in a block are always sorted, the code was also changed to remove the Deduplicate calls since they end up reallocating the slice. Instead, we do a sorted merge and re-use the slice as much as we can.	2016-05-05 09:34:44 -06:00
Jason Wilder	c8bd41c2d8	Remove TSM reader Keys func It's very inneficient and should never be used.	2016-04-27 13:09:52 -06:00
Jason Wilder	97504a552c	Support time range tombstones in FileStore/KeyCursor	2016-04-27 13:09:52 -06:00
Jason Wilder	a789e819a3	Remove NewTSMReaderWithOptions There are two TSMIndex implementations, the directIndex and the indirectIndex. Originally, we only had the directIndex and later added the indirectIndex and NewTSMReaderWithOptions in order to allow both indexes to be used in tests and code. This has created a problem since we really only use the directIndex for writing and always use the indirectIndex for reading. This changes removes the NewTSMReaderWithOptions func so that it is no longer possible to create a TSMReader with a directIndex. This will allow a lot of the block reading code used by the directIndex to be removed and simplify maintainence. It also gives better test coverage of the code that is actually used by the TSM engine now.	2016-04-27 13:09:52 -06:00
Ben Johnson	286072f65a	update dep: simple8b @ b421ab40	2016-04-22 09:46:05 -06:00
Stephen Gutekanst	9dc09c5257	Make logging output location more programmatically configurable (#6213 ) This has various benefits: - Users embedding InfluxDB within other Go programs can specify a different logger / prefix easily. - More consistent with code used elsewhere in InfluxDB (e.g. services, other `run.Server.` fields, etc). - This is also more efficient, because it means `executeQuery` no longer allocates a single `log.Logger` each time it is called.	2016-04-20 21:07:08 +01:00
Seif Lotfy	c6e3c87e00	Add Block checksum validation and "influx_inspect verify" tool Fixes #5502	2016-04-19 22:33:03 +02:00
Pierre Fersing	29b19a2293	Fix deadlock in tsm1/file_store	2016-04-12 09:39:21 +02:00
Ben Johnson	525e22c92b	tsm1 query engine alloc reduction This commit makes a number of performance improvements to reduce allocations during query execution. Several objects and buffers are now reused across the components to avoid allocations. Previously a simple `count(value)` query across 1M points would require 26,000+ allocations. After the changes in this commit that number has been reduced to 88.	2016-04-11 14:50:59 -06:00
Jason Wilder	1b08e2dd55	Use walk func to load all tsm keys to index Avoids allocating a big map or all keys.	2016-03-29 12:59:26 -06:00
Jason Wilder	03ced4cc90	Load shards concurrently	2016-03-29 12:58:52 -06:00
Ben Johnson	6e1c1da25b	reduce allocations in query execution This commit removes some heap objects by converting them from pointer references to non-pointers or by reusing buffers.	2016-03-22 09:51:39 -06:00
Jason Wilder	7567453c9a	Ensure TSM files are fsync'd Make sure TSM files are fsync'd when closed and also that the parent dir is fsync'd when they are renamed.	2016-03-21 15:03:52 -06:00
Jason Wilder	9984cd5d6d	Fix skipping blocks at query time when overlaps exist Depending on how data is written across TSM files, it was possible to skip over some blocks at query time making it looks like data was missing.	2016-03-14 13:11:11 -06:00
Mark Rushakoff	cdcb079769	Tag TSM stats with database, retention policy ... by extracting the db/rp from the given path. Now that the code has "standardized" on extracting db/rp this way, the ShardLocation struct is no longer necessary and thus has been removed. We're back on the previous style of passing the path and walPath to NewShard.	2016-02-29 09:17:34 -08:00
Jason Wilder	8d70d65a82	Convert time.Time to int64	2016-02-25 15:15:01 -07:00
Mark Rushakoff	602043e11b	Add disk stats for FileStore	2016-02-19 16:37:34 -08:00
Ben Johnson	b8918a780c	integer support	2016-02-10 09:40:25 -07:00
Ben Johnson	00806de9b8	refactor query engine	2016-02-10 09:40:25 -07:00
Jason Wilder	756421ec4a	Look for fully compacted block in addition to max size during compaction Some data shapes would cause files to grow larger than the max size more quickly which resulted in them getting skipped by the full compaction planner at times. Some datasets that could make this happen are very large keys or very large numbers of keys (10M). When this happened, multiple max sized files would accumulate but the blocks would not be full. When the shard went cold for writes, these files would get recompacted down to the optimal size, but a lot of space would be wasted in the mean time.	2016-01-07 15:18:42 -07:00
Jason Wilder	faf8ee17fa	Fix typo	2016-01-06 12:53:04 -07:00
Jason Wilder	2f7a0090c1	Don't allocate a pre-sized buffer for each cursor This is contributing to some of the high memory usage on queries and possibly some OOMs. This is slightly slower, but removing it allows some fairly large count queries over 5M series to complete instead of crashing the process using tsm1 engine.	2016-01-06 10:50:38 -07:00
Paul Dix	59fbd371fc	Implement backup/restore for TSM. This changes backup and restore to work for TSM. It breaks it for b1 and bz1, but since those are getting removed it's ok. The backup runs against any host that is specified and can backup either the metasstore, a database, specific retention policy, or a specific shard. It can also take incremental backups with the `since` flag, which will only backup TSM files that have been created since that timestamp. The backup is safe to run online. However, for shards that are still hot for writes, they won't be able to create new TSM files while the backup for that single shard runs. If the backup isn't too large and the write throughput isn't too high this shouldn't be a problem since the writes will just go into the WAL cache.	2015-12-30 18:06:50 -05:00
Jason Wilder	a38c95ec85	Update compactions to run concurrently This has a few changes in it (unfortuantely). The main change is to run compactions concurrently. While implementing this, a few query and performance bugs showed up that are also fixed by this commit.	2015-12-23 18:01:11 -07:00
Jason Wilder	8c7e11f4cf	Aggressively clean up KeyCursor resources	2015-12-17 12:51:51 -07:00
Jason Wilder	825296ddd8	Add comments	2015-12-16 11:30:06 -07:00
Jason Wilder	70d1f45058	Load TSM files concurrently	2015-12-16 11:28:12 -07:00
Philip O'Toole	01ac0b3f23	Tweak compaction log messages	2015-12-15 10:33:13 -08:00
Philip O'Toole	a6cdb5229d	Log tsm initialization	2015-12-14 15:50:56 -08:00
Jason Wilder	9d82e24ca0	Fix performance of dropping large number of keys	2015-12-08 10:47:06 -07:00
Jason Wilder	87892d79da	Dedupe points at query time if there are overlapping blocks	2015-12-07 21:10:10 -07:00
Jason Wilder	a2583d2be1	Reduce lock contention when planning TSM queries	2015-12-07 15:42:36 -07:00
Jason Wilder	4da20c49e9	Optimize TSM file scanning for time queries Move the index locations planning to be lazily created after the first seek when we know what time and direction we're searching for. This allows files and blocks to be skip before having to scan the files index. This improves queries times with time filters wherne there are many TSM files on disk.	2015-12-07 15:42:36 -07:00
Paul Dix	8096c6b845	Update TSM, address PR #5011 comments * Moved TSM file extension to a constant * Fixed typos * Changed group.size() back to being a uint64 since it can have multiple files up to 4GB each.	2015-12-07 14:47:17 -05:00
Paul Dix	440a8a8a1f	Change all TSM file sizes to uint32	2015-12-07 10:12:24 -05:00
Paul Dix	937233d988	Update TSM compaction planning logic * Update Plan to do a full compaction if cold for writes * Remove MaxFileSize as a config variable from Compactor. Should be a set constant * Update Plan to keep track of if the last check was fully compacted so we can skip future planning calls * Update compact min file count to 3 so that compactions run more frequently	2015-12-07 08:26:30 -05:00
Paul Dix	1bee7d1512	Update TSM, remove old version, add config * remove rolloverTSMFileSize constant that is no longer used * remove the maxGenerationFileCount since it is no longer a limitation that's necessary with the new compaction scheme. We no longer read WAL segments as part of the compaction so memory is only used as we read in each individual key * remove minFileCount and switch to a user configurable variable * remove the mutex from WALSegmentWriter. There's never more than one open in the WAL at one time and it's not exported through any function so the lock on the WAL should be used. This simplified keeping track of the last write time and removed a bunch of unnecessary locks. * update WALSegmentWriter.Write to take the compressed bytes so that encoding and compression can occur before the call to write (while we don't hold the WAL lock) * remove a bunch of unnecessary locking in WAL.writeToLog * Add check for TSM file magic number and vesion * Remove old tsm, log, and unused cursor code * Remove references to tsm1dev everywhere except in the inspector * Clean up config options for compaction and snapshotting * Remove old TSM configuration options * Update the config.sample.toml with TSM options * Update WAL compact to force if it has been cold for writes for a configurable period of time (1h by default)	2015-12-06 18:50:39 -05:00
Jason Wilder	41b24995a7	Compcation fixes	2015-12-05 12:19:28 -07:00
Jason Wilder	6592615958	Updated compaction strategy This changes compacting files to merge sequences of files in lower generations up to later generations	2015-12-04 23:30:39 -07:00
Jason Wilder	357b88c439	Increment sequence of max generation when compaction files	2015-12-04 13:46:28 -07:00
Jason Wilder	52bec1f7f6	Change TSM file naming to generation-sequence.tsm	2015-12-04 11:51:33 -07:00
Jason Wilder	479469994a	Optimize FileStats calls FileStats called frequently during compaction planning was too expensive because they were cleared out every time a file replaced causing them all to be reloaded. Insted, we grab the stats that are already maintained by the files themselves from the files when needed.	2015-12-04 11:16:39 -07:00
Jason Wilder	70710df910	Fix typo	2015-12-04 10:02:59 -07:00
Jason Wilder	c7e37766e7	Avoid repetitive index searches when iterating over cursors First pass at TSM cursor iteration ended up searching the file indexes too frequently and hurt performance. This changes that to search it once and then have the cursor hold onto the block locations to seek to. Doubles the query performance from the first iteration, but still a lot of room for improvement.	2015-12-04 10:02:59 -07:00
Jason Wilder	4b7cc6720a	Merge pull request #4983 from influxdb/jw-tsm-deletes2 Implement delete series/measurement	2015-12-04 10:02:11 -07:00
Jason Wilder	c54a3da0ca	Implement delete series/measurement	2015-12-04 09:10:26 -07:00
Jason Wilder	66c9ef862e	Fix regressions Something broke with writing to the WAL now that compactions are running concurrently. There was also a performance problem with Next/Prev doing twice as many searches as necessary.	2015-12-03 14:25:03 -07:00
Jason Wilder	adf5c5b223	Replace Next/Prev with Scan	2015-12-03 12:39:13 -07:00
Jason Wilder	be59ba3455	Add Prev support to FileStore Allows read the previous block of values given a timestamp and key.	2015-12-03 12:39:12 -07:00
Jason Wilder	6fba01df89	Implement single field TSM queries	2015-12-03 12:35:36 -07:00
Jason Wilder	3a8a19a99d	Implement LoadMetaDataIndex for tsm1dev engine	2015-12-02 13:38:06 -07:00
Jason Wilder	4a03469662	Integrate TSM compaction into dev engine	2015-12-02 09:45:23 -07:00
Jason Wilder	d4b1c25f8e	Add CompactionPlanner type CompactionPlanner is used to determine which files (WAL Segments, TSM Files) to include in a given compaction run.	2015-12-02 09:45:23 -07:00
Jason Wilder	7c7a68d783	Small cleanups	2015-11-17 11:30:29 -07:00
Jason Wilder	9c2be12b65	Add FileStore.Remove func Allows a TSMFile to be removed from the active set of files managed by the FileStore.	2015-11-16 09:16:10 -07:00
Jason Wilder	ef18f8afb2	Handle TSM key deletions This writes a tombstone file containing a line per deleted key. This file is read when a TSMReader is created and any keys listed in the file are removed from the index.	2015-11-16 08:44:52 -07:00
Jason Wilder	0ab423c7ff	Initial FileStore implementation Provides functionality to load a directory of TSM files (or add them manually) as well as reading blocks of values for individual key and times.	2015-11-16 08:44:52 -07:00

1 2 3 4

171 Commits (db/wait-timeout-utility)