Commit Graph

89 Commits (7def8bc0c98c04080ed649d3eb31fb00baf68482)

Author SHA1 Message Date
Jonathan A. Sternberg 93745d9693 Merge pull request #6391 from influxdata/js-5553-limit-queries-slow-with-group-by
Propagate the limit option to the low level iterators
2016-04-16 09:39:25 -04:00
Jonathan A. Sternberg bd5fdd797d Propagate the limit option to the low level iterators
When a GROUP BY or multiple sources are used, the top level limit
iterator requires reading the entire iterator stream so it can find all
of the tag groups it needs to return. For large data series, this ends
up with the limit iterator discarding a lot of output.

This change adds a new lower level limit iterator on each series itself
so that there are fewer data points that have to be thrown away by the
top level iterator.

Fixes #5553.
2016-04-15 18:23:54 -04:00
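
A minimal Go sketch of the per-series limit pushdown described in the
commit above. The names (`FloatPoint`, `limitIterator`, `sliceIterator`)
are hypothetical simplifications; the actual influxql iterator interfaces
differ:

    package main

    import "fmt"

    // FloatPoint and FloatIterator are simplified stand-ins for the
    // influxql point and iterator types.
    type FloatPoint struct {
        Time  int64
        Value float64
    }

    type FloatIterator interface {
        Next() *FloatPoint // returns nil when the series is exhausted
    }

    // limitIterator wraps the iterator of a single series and stops
    // emitting after limit points, so the top level iterator never has to
    // read and discard the remainder of the series.
    type limitIterator struct {
        input FloatIterator
        limit int
        n     int
    }

    func (itr *limitIterator) Next() *FloatPoint {
        if itr.n >= itr.limit {
            return nil // cut the series off at the source
        }
        itr.n++
        return itr.input.Next()
    }

    // sliceIterator yields points from a slice, for demonstration only.
    type sliceIterator struct{ points []FloatPoint }

    func (itr *sliceIterator) Next() *FloatPoint {
        if len(itr.points) == 0 {
            return nil
        }
        p := &itr.points[0]
        itr.points = itr.points[1:]
        return p
    }

    func main() {
        series := &sliceIterator{points: []FloatPoint{{1, 1}, {2, 2}, {3, 3}}}
        itr := &limitIterator{input: series, limit: 2}
        for p := itr.Next(); p != nil; p = itr.Next() {
            fmt.Println(p.Time, p.Value)
        }
    }

Only the first two points are printed; the third is never pulled past the
per-series iterator.
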
Jonathan A. Sternberg 835d08591e Do not filter out empty tags from series keys 2016-04-13 09:15:57 -04:00
Jonathan A. Sternberg ea6262b712 Enhance comparing tags and fields in the where clause
It is now possible to compare tags with fields, and to compare tags with
other tags. Previously, it was only possible to compare fields with
fields, or a tag with a string or a regex.

Fixes #3371.
2016-04-11 18:10:08 -04:00
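
For example (with hypothetical measurement and tag/field names), a query
such as `SELECT value FROM requests WHERE datacenter = region` compares a
tag with another tag, and `SELECT value FROM requests WHERE dc = location`
compares a tag with a string field; both are now accepted.
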
Jonathan A. Sternberg 028fdaff81 Merge pull request #6222 from influxdata/js-6206-descending-tsm1-iterators
Handle nil values from the tsm1 cursor correctly
2016-04-06 10:05:20 -04:00
Jonathan A. Sternberg 94ec92d669 Handle nil values from the tsm1 cursor correctly
Send nil values from the tsm1 cursor at the end of the cursor. Previously,
after the cursor reached the end of the tsm1 data, the `nextAt()` call
would always return the default value rather than a nil value.

Descending also didn't work correctly because the seeking functionality
for tsm1 iterators would always act like they were ascending instead of
descending when choosing which value to select. This resulted in very
strange output from the emitter since it couldn't figure out if it was
ascending or descending.

Fixes #6206.
2016-04-06 09:27:02 -04:00
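
A sketch of the intended end-of-cursor behavior, using hypothetical types;
the real tsm1 cursor and `nextAt()` signatures differ:

    package sketch

    // cursor is a simplified stand-in for a tsm1 cursor over time-ordered
    // (time, value) pairs.
    type cursor struct {
        times  []int64
        values []float64
        pos    int
    }

    // nextAt returns the value at timestamp t, or nil when there is none.
    // Returning nil at the end of the cursor -- rather than a default
    // value -- lets callers distinguish "no data" from a real zero.
    func (c *cursor) nextAt(t int64) interface{} {
        for c.pos < len(c.times) {
            switch {
            case c.times[c.pos] == t:
                v := c.values[c.pos]
                c.pos++
                return v
            case c.times[c.pos] > t:
                return nil // no value at exactly t
            default:
                c.pos++
            }
        }
        return nil // end of cursor: signal exhaustion, not a default value
    }
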
Jason Wilder 3f4c5a5585 Fix race on measurementFields
Both Shard and Engine had the same reference to the measurementField map,
but they each protected it with their own locks. This causes a race when
writes and queries are occurring, because writes can add new fields to the
map while queries are reading from it.

The fix moves ownership to the Engine and provides protected accessors
that the Shard now uses. For the most part, the accesses on Shard were
old dead code.

Fixing the measurementFields map race created a new race on the internal
fields map.  This is now unexported and protected via MeasurementFields
exported funcs.

Fixes #6188
2016-04-01 18:57:01 -06:00
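
A sketch of the ownership pattern described above, using hypothetical
names: the Engine owns the map and its lock, and the Shard goes through
protected accessors instead of sharing an unsynchronized reference:

    package sketch

    import "sync"

    // MeasurementFields would hold the fields of one measurement.
    type MeasurementFields struct{}

    // Engine owns the measurement-fields map and the lock that guards it.
    type Engine struct {
        mu                sync.RWMutex
        measurementFields map[string]*MeasurementFields
    }

    func NewEngine() *Engine {
        return &Engine{measurementFields: make(map[string]*MeasurementFields)}
    }

    // MeasurementFields is a protected read accessor.
    func (e *Engine) MeasurementFields(name string) *MeasurementFields {
        e.mu.RLock()
        defer e.mu.RUnlock()
        return e.measurementFields[name]
    }

    // CreateFieldsIfNotExists is a protected writer.
    func (e *Engine) CreateFieldsIfNotExists(name string) *MeasurementFields {
        e.mu.Lock()
        defer e.mu.Unlock()
        m := e.measurementFields[name]
        if m == nil {
            m = &MeasurementFields{}
            e.measurementFields[name] = m
        }
        return m
    }

    // Shard no longer holds its own reference to the map; it delegates.
    type Shard struct{ engine *Engine }

    func (s *Shard) MeasurementFields(name string) *MeasurementFields {
        return s.engine.MeasurementFields(name)
    }
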
Jason Wilder 1b08e2dd55 Use walk func to load all tsm keys to index
Avoids allocating a big map of all keys.
2016-03-29 12:59:26 -06:00
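
A sketch of the walk-callback idea, using hypothetical names; each key is
visited through a function instead of being collected into one large
allocation first:

    package sketch

    // index is a simplified stand-in for a TSM file index.
    type index struct {
        keys []string // in the real index these live in the file, not a slice
    }

    // walkKeys visits every key without building an intermediate map or
    // slice of all keys, so memory stays proportional to a single key.
    func (ix *index) walkKeys(fn func(key string) error) error {
        for _, k := range ix.keys {
            if err := fn(k); err != nil {
                return err
            }
        }
        return nil
    }

    // loadIndex registers each key with the in-memory shard index as it
    // is visited.
    func loadIndex(ix *index, register func(key string)) error {
        return ix.walkKeys(func(key string) error {
            register(key)
            return nil
        })
    }
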
Jason Wilder 03ced4cc90 Load shards concurrently 2016-03-29 12:58:52 -06:00
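
A sketch of concurrent shard loading using the standard Go fan-out
pattern (names hypothetical; the actual store code differs):

    package sketch

    import "sync"

    // Shard is a stand-in for a tsdb shard.
    type Shard struct{ Path string }

    func (s *Shard) Open() error { return nil } // placeholder

    // openShards opens every shard in its own goroutine and waits for all
    // of them, so one slow shard no longer serializes store startup.
    func openShards(shards []*Shard) error {
        var (
            wg       sync.WaitGroup
            mu       sync.Mutex
            firstErr error
        )
        for _, sh := range shards {
            wg.Add(1)
            go func(sh *Shard) {
                defer wg.Done()
                if err := sh.Open(); err != nil {
                    mu.Lock()
                    if firstErr == nil {
                        firstErr = err
                    }
                    mu.Unlock()
                }
            }(sh)
        }
        wg.Wait()
        return firstErr
    }
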
Jonathan A. Sternberg a35d9602cd Fix where filters when an OR is used and when a tag does not exist
If an OR was used, merging filters between different expressions would
not work correctly. If one of the sides had a set of series ids with a
condition and the other side had no series ids associated with the
expression, all of the series from the side with a condition would have
the condition ignored. Instead of defaulting a non-existent series
filter to true, it should just be false and the evaluation of the one
side that does exist should take care of determining if the series id
should be included or not. The AND condition used false correctly so did
not have to be changed.

If a tag did not exist and `!=` or `!~` were used, it would return false
even though neither a field nor a tag equaled those values. This has
now been modified to return the correct series ids and the correct
condition.

Also fixed a panic that would occur when a tag caused a field access to
become unnecessary. The filter using the field access still got created
and used even though it was unnecessary, resulting in an attempted
access to a non-initialized map.

Fixes #5152 and a bunch of other miscellaneous issues.
2016-03-22 12:19:06 -04:00
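
A sketch of the corrected merge rule under OR, using hypothetical types
for the series-id filter maps; a series missing from one side contributes
false for that side, so the other side's condition is preserved:

    package sketch

    // filters maps a series id to the condition (shown here as a string)
    // that must hold for that series to be included. A missing entry means
    // the side matched nothing for that series, i.e. it contributes false.
    type filters map[uint64]string

    // mergeOR merges the two sides of an OR. A series present on only one
    // side keeps that side's condition, since X OR false == X. Defaulting
    // the missing side to true would discard the existing condition.
    func mergeOR(lhs, rhs filters) filters {
        out := make(filters, len(lhs)+len(rhs))
        for id, f := range lhs {
            if g, ok := rhs[id]; ok {
                out[id] = "(" + f + ") OR (" + g + ")"
            } else {
                out[id] = f // rhs contributes false
            }
        }
        for id, g := range rhs {
            if _, ok := lhs[id]; !ok {
                out[id] = g // lhs contributes false
            }
        }
        return out
    }
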
Jonathan A. Sternberg 6655ca7769 Create a new interrupt iterator that will stop emitting points after an interrupt
Use of the iterator is spread across both `IteratorCreators` and the
iterators themselves. Part of the interrupt must be handled inside of the
engine so it stops trying to emit points when an interrupt is found, and
another part has to happen when combining the iterators so the query
doesn't just start reading the next shard.
2016-03-21 12:07:07 -04:00
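
A sketch of such an interrupt iterator, with simplified types; the real
implementation checks a closing channel as points are emitted:

    package sketch

    // FloatPoint and FloatIterator are simplified stand-ins for the
    // influxql point and iterator types.
    type FloatPoint struct {
        Time  int64
        Value float64
    }

    type FloatIterator interface {
        Next() *FloatPoint
    }

    // interruptIterator stops emitting points once closing is closed, so a
    // cancelled query stops consuming its input mid-stream instead of
    // reading on into the next shard.
    type interruptIterator struct {
        input   FloatIterator
        closing <-chan struct{}
    }

    func (itr *interruptIterator) Next() *FloatPoint {
        select {
        case <-itr.closing:
            return nil // interrupted: behave as if the stream ended
        default:
        }
        return itr.input.Next()
    }
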
Jason Wilder 000459e350 Fix deadlock when running backup
A deadlock occurs under write load if a backup is run between the time
when a snapshot compaction has snapshotted the cache and when it has
successfully written it to disk. The issue is that the second snapshot call
will block on the commit lock while it is holding the engine write lock.
This causes all writes to block and prevents the currently running snapshot
compaction from completing, because it needs to acquire a read-lock.

This PR removes the commit lock and just returns an error if a snapshot is
in progress, allowing any locks being held to be released. The caller can
determine whether to retry or give up.
2016-03-14 12:36:48 -06:00
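
A sketch of the non-blocking approach, with simplified types; the error
name below is illustrative:

    package sketch

    import (
        "errors"
        "sync"
    )

    var ErrSnapshotInProgress = errors.New("snapshot in progress")

    // Cache is a stand-in for the tsm1 cache.
    type Cache struct {
        mu           sync.Mutex
        snapshotting bool
    }

    // Snapshot returns an error instead of blocking when another snapshot
    // is running, so a caller holding higher-level locks can release them
    // and retry (or give up) rather than deadlock.
    func (c *Cache) Snapshot() error {
        c.mu.Lock()
        defer c.mu.Unlock()
        if c.snapshotting {
            return ErrSnapshotInProgress
        }
        c.snapshotting = true
        return nil
    }

    // ClearSnapshot marks the in-flight snapshot as finished.
    func (c *Cache) ClearSnapshot() {
        c.mu.Lock()
        defer c.mu.Unlock()
        c.snapshotting = false
    }
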
Jason Wilder 992c78ee22 Remove periodic shard maintenance goroutine
This is no longer used in tsm and now just periodically locks everything
for no reason.
2016-03-09 17:31:02 -07:00
Edd Robinson 58c03448aa Merge pull request #5514 from influxdata/er-engine-panic
Ensure shards and engine are safely closed
2016-03-09 18:56:36 +00:00
Jason Wilder 8d70d65a82 Convert time.Time to int64 2016-02-25 15:15:01 -07:00
Jon Seymour eb7eec078d tsm: cache: introduce commit lock to Cache
Currently two compactors can execute Engine.WriteSnapshot at once.

This isn't thread safe since both threads want to make modifications to
Cache.snapshot at the same time.

This commit introduces a lock which is acquired during Snapshot() and
released during ClearSnapshot(), ensuring that at most one thread
executes within Engine.WriteSnapshot() at once.

To ensure that we always release this lock, but only release the
snapshot resources on a successful commit, we modify ClearSnapshot() to
accept a boolean which indicates whether the write was successful, and we
guarantee that this function is called whenever Snapshot() has been called.

Signed-off-by: Jon Seymour <jon@wildducktheories.com>
2016-02-25 12:10:37 +11:00
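
A sketch of the lock contract described above, with simplified types:
Snapshot() acquires the commit lock and ClearSnapshot(success) always
releases it, but only discards the snapshot resources on success:

    package sketch

    import "sync"

    // Cache is a stand-in for the tsm1 cache guarded by a commit lock.
    type Cache struct {
        commit   sync.Mutex
        snapshot []byte // stand-in for the snapshotted entries
    }

    // Snapshot acquires the commit lock so at most one goroutine executes
    // within Engine.WriteSnapshot at once.
    func (c *Cache) Snapshot() []byte {
        c.commit.Lock()
        return c.snapshot
    }

    // ClearSnapshot always releases the commit lock, but only discards the
    // snapshot resources when the write succeeded; on failure the snapshot
    // is kept so the next attempt can retry it.
    func (c *Cache) ClearSnapshot(success bool) {
        if success {
            c.snapshot = nil
        }
        c.commit.Unlock()
    }
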
Jason Wilder 017c24c98e Simplify cache snapshotting
The Cache had support for taking multiple snapshots, so that snapshots
could be written to TSM files concurrently if that happened to be a
bottleneck. In practice, this was never a bottleneck, and we only run one
snapshotting goroutine continuously per shard, which has worked well for
all workloads.

The multiple snapshot support introduces some unhandled failure scenarios
where wal segments could be removed without writing them to TSM files.  If
a snapshot compaction fails to write due to transient disk errors, subsequent
snapshots will continue, but the failed one will not be retried.  When the
subsequent ones succeed, all closed wal segments are removed, causing data
loss.

This change simplifies the snapshotting capability to ensure that there is only
ever one snapshot.  If one fails, the next snapshot will update the existing
snapshot and retry writing all of the old and new data.

Fixes #5686
2016-02-23 09:38:51 -07:00
Jonathan A. Sternberg 50753de032 Merge pull request #5782 from influxdata/js-5777-audit-panics-in-influxql
Remove the non-unreachable panics in the new query engine
2016-02-22 17:18:57 -05:00
Jonathan A. Sternberg 7a03df2af1 Remove the non-unreachable panics in the new query engine
The only panics left are ones that should be unreachable unless there is
a bug.

Fixes #5777.
2016-02-22 12:52:43 -05:00
Jon Seymour 6697c721fb tsm: cache: add cache throughput related statistics.
Complementing and extending the changes in #5758.

Add 2 level statistics:

  * snapshotCount
  * cacheAgeMs

Add 2 counter statistics:

  * cachedBytes
  * WALCompactionTimeMs

snapshotCount can be used to measure transient write errors that are
causing snapshots to accumulate.

cacheAgeMs can be used to gauge the level of write activity into the cache.

The differences between cachedBytes stats sampled at different times can be
used to calculate cache throughput rates.

The ratio (cachedBytes-diskBytes)/WALCompactionTimeMs can be used to
calculate WAL compaction throughput.

The ratio of the difference between the first and last WAL compaction times
to the interval length is an estimate of the percentage of cache throughput
consumed.

Signed-off-by: Jon Seymour <jon@wildducktheories.com>
2016-02-20 22:18:57 +11:00
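
A worked sketch of the throughput arithmetic above, using two hypothetical
samples of the counters taken 60 seconds apart:

    package main

    import "fmt"

    func main() {
        // Two samples of the counters, 60s apart (values are made up).
        cachedBytes0, cachedBytes1 := int64(10_000_000), int64(70_000_000)
        walMs0, walMs1 := int64(1_000), int64(13_000)
        intervalMs := int64(60_000)
        diskBytes := int64(0) // bytes already on disk over the interval

        // Cache write throughput over the interval: 1 MB/s here.
        cacheBps := float64(cachedBytes1-cachedBytes0) / 60.0

        // WAL compaction throughput: bytes compacted per second of
        // compaction time, per the (cachedBytes-diskBytes)/WALCompactionTimeMs
        // ratio above. 5 MB/s here.
        walBps := float64((cachedBytes1-cachedBytes0)-diskBytes) /
            float64(walMs1-walMs0) * 1000.0

        // Fraction of the interval spent compacting: an estimate of the
        // percentage of cache throughput consumed. 0.2 here.
        busy := float64(walMs1-walMs0) / float64(intervalMs)

        fmt.Println(cacheBps, walBps, busy)
    }
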
Mark Rushakoff e76967efb6 Add stats to tsm1.Cache 2016-02-19 16:37:34 -08:00
Ben Johnson d9a6a7340f add canonical paths 2016-02-10 11:30:52 -07:00
Ben Johnson 5a0d1ab7c1 rename influxdb/influxdb to influxdata/influxdb
This commit changes all the import and URL references from:

    github.com/influxdb/influxdb

to:

    github.com/influxdata/influxdb
2016-02-10 10:26:18 -07:00
Jonathan A. Sternberg d1f7c445e7 Modify iterators to work across shards
Aux iterators now ask the iterator creator what series will be returned
and determine which aux fields to create based on the results.

The `tsdb.Shards` struct also creates a call iterator around the
iterators returned from each shard.
2016-02-10 09:40:29 -07:00
Jonathan A. Sternberg c2d1206177 Implement the fill iterator
Fill requires an additional function for IteratorCreator to retrieve the
series that will be returned from the iterator. When fill is required
for an aggregate, the IteratorCreator will be asked what series will be
returned by the created iterator.
2016-02-10 09:40:29 -07:00
Ben Johnson 6204350d65 fix math operations 2016-02-10 09:40:27 -07:00
Ben Johnson b8918a780c integer support 2016-02-10 09:40:25 -07:00
Jonathan A. Sternberg 34f14424dd Filter tags from the condition when building cursors on tsm1 2016-02-10 09:40:25 -07:00
Ben Johnson 00806de9b8 refactor query engine 2016-02-10 09:40:25 -07:00
Edd Robinson 1bcb1d033f Allow Close to be called multiple times safely 2016-02-03 10:20:22 +00:00
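
A sketch of the idempotent-Close pattern this implies, with hypothetical
names:

    package sketch

    import "sync"

    // Engine is a stand-in for a component whose Close may be reached from
    // several paths (shutdown, error handling, tests).
    type Engine struct {
        mu     sync.Mutex
        closed bool
        done   chan struct{}
    }

    func NewEngine() *Engine {
        return &Engine{done: make(chan struct{})}
    }

    // Close is safe to call multiple times: only the first call tears the
    // engine down, later calls return immediately.
    func (e *Engine) Close() error {
        e.mu.Lock()
        defer e.mu.Unlock()
        if e.closed {
            return nil
        }
        e.closed = true
        // close(done) would panic if it ran twice, which is why the closed
        // flag guards it.
        close(e.done)
        return nil
    }
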
Jason Wilder 5bee8880db Reduce lock content in engine.WritePoints
Writing the snapshot would deduplicate the snapshot points
while still holding the engine write-lock.  This can be expensive
under high load and cause writes to back up and OOM the server.

Instead, grab the snapshot under the lock and dedup it after releasing
the lock.

Possible fix for #5442
2016-01-25 15:37:34 -07:00
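
A sketch of the hold-the-lock-briefly pattern, with simplified types:
swap the cache out under the lock, then sort and deduplicate after
releasing it:

    package sketch

    import (
        "sort"
        "sync"
    )

    type point struct {
        key  string
        time int64
    }

    type engine struct {
        mu    sync.RWMutex
        cache []point
    }

    // snapshot copies and resets the cache under the lock but defers the
    // expensive sort/deduplication until after the lock is released, so
    // writers are blocked only for the cheap copy.
    func (e *engine) snapshot() []point {
        e.mu.Lock()
        snap := make([]point, len(e.cache))
        copy(snap, e.cache)
        e.cache = e.cache[:0]
        e.mu.Unlock()

        // Dedup after releasing the lock; under high load this is the
        // expensive part and must not back up writes.
        sort.Slice(snap, func(i, j int) bool {
            if snap[i].key != snap[j].key {
                return snap[i].key < snap[j].key
            }
            return snap[i].time < snap[j].time
        })
        out := snap[:0] // dedup in place, reusing snap's backing array
        for _, p := range snap {
            if len(out) == 0 || p != out[len(out)-1] {
                out = append(out, p)
            }
        }
        return out
    }
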
Jason Wilder 24f1bcfd20 Remove Dev prefix from tsm engine/tx 2016-01-10 16:43:36 -07:00
Jason Wilder 5b179113fc Don't close tsm cursor prematurely
We were closing the cursor when we read the last block, which caused
the internal state to be cleared. In a group by query, we seeked multiple
times, so depending on the group by interval and how the data was laid out
in the blocks, we would close the cursor and the last block would get
skipped.

Fixes #5193
2016-01-10 15:26:01 -07:00
Jason Wilder d2b7c03175 Re-use the series key
Avoid allocating the string twice.
2016-01-06 12:52:13 -07:00
Paul Dix 26e1c6464a Update backup to address PR comments 2015-12-30 18:06:51 -05:00
Paul Dix 59fbd371fc Implement backup/restore for TSM.
This changes backup and restore to work for TSM. It breaks it for b1 and bz1, but since those are getting removed it's ok.

The backup runs against any host that is specified and can back up either the metastore, a database, a specific retention policy, or a specific shard. It can also take incremental backups with the `since` flag, which will only back up TSM files that have been created since that timestamp.

The backup is safe to run online. However, shards that are still hot for writes won't be able to create new TSM files while the backup for that single shard runs. If the backup isn't too large and the write throughput isn't too high, this shouldn't be a problem, since the writes will just go into the WAL cache.
2015-12-30 18:06:50 -05:00
Jason Wilder a38c95ec85 Update compactions to run concurrently
This has a few changes in it (unfortunately).  The main change is to run compactions
concurrently.  While implementing this, a few query and performance bugs showed up that
are also fixed by this commit.
2015-12-23 18:01:11 -07:00
Jason Wilder bb2562b2ab Return CompactionGroups from planning 2015-12-23 18:01:11 -07:00
Jason Wilder 8c7e11f4cf Aggressively clean up KeyCursor resources 2015-12-17 12:51:51 -07:00
Jason Wilder 3893bc60e1 Speed up TSM compactor
Just keep the current block for each iterator in the buffers.
2015-12-16 11:16:17 -07:00
Alexandre Viau ad1044dde9 typo: unkown -> unknown 2015-12-15 18:10:47 -05:00
Philip O'Toole 01ac0b3f23 Tweak compaction log messages 2015-12-15 10:33:13 -08:00
Jason Wilder d7cff651d1 Cancel writing TSM files when engine closes
If the engine is closed while a compaction is going on, the close call
blocks until the goroutine exits. This could take several minutes because
control does not return to the channel selector while there is still data
to write.
2015-12-08 15:41:53 -07:00
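
A sketch of the cancellation check, with hypothetical names: the write
loop polls a closing channel between blocks so Close() can return
promptly:

    package sketch

    import "errors"

    var errCompactionAborted = errors.New("compaction aborted: engine closing")

    // writeBlocks is a stand-in for the TSM compaction write loop. Checking
    // the closing channel between blocks returns control to the caller
    // instead of writing for minutes while the engine waits to close.
    func writeBlocks(blocks [][]byte, write func([]byte) error, closing <-chan struct{}) error {
        for _, b := range blocks {
            select {
            case <-closing:
                return errCompactionAborted
            default:
            }
            if err := write(b); err != nil {
                return err
            }
        }
        return nil
    }
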
Jason Wilder 9d82e24ca0 Fix performance of dropping large number of keys 2015-12-08 10:47:06 -07:00
Jason Wilder f245b44afa Set full compaction duration option on planner
It was set on the engine and not the planner, so it was always 0.
2015-12-08 09:56:36 -07:00
Paul Dix 8096c6b845 Update TSM, address PR #5011 comments
* Moved TSM file extension to a constant
* Fixed typos
* Changed group.size() back to being a uint64 since it can have multiple files up to 4GB each.
2015-12-07 14:47:17 -05:00
Paul Dix 820b0d31d6 Update TSM to delete from the WAL/cache
* Update cache loader to delete entries from cache
* Add cache.Delete()
* Update delete to look at keys in the Cache in addition to the FileStore
* Update cache compaction to never happen if the cache is empty
2015-12-07 14:35:48 -05:00
Paul Dix 937233d988 Update TSM compaction planning logic
* Update Plan to do a full compaction if cold for writes
* Remove MaxFileSize as a config variable from Compactor. Should be a set constant
* Update Plan to keep track of if the last check was fully compacted so we can skip future planning calls
* Update compact min file count to 3 so that compactions run more frequently
2015-12-07 08:26:30 -05:00
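
A sketch of the cold-for-writes check from the first bullet above, with
hypothetical names; the real Plan logic also weighs generations and file
sizes:

    package sketch

    import "time"

    // planner is a simplified stand-in for the TSM compaction planner.
    type planner struct {
        lastWrite         time.Time
        coldDuration      time.Duration // how long "cold for writes" means
        lastPlanCompacted bool          // previous pass found nothing to do
    }

    // shouldFullCompact reports whether a full compaction should be
    // planned: only when the shard has been cold for writes long enough,
    // and never when the last pass already found it fully compacted.
    func (p *planner) shouldFullCompact(now time.Time) bool {
        if p.lastPlanCompacted {
            return false // skip planning entirely; nothing has changed
        }
        return now.Sub(p.lastWrite) >= p.coldDuration
    }
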
Paul Dix 1bee7d1512 Update TSM, remove old version, add config
* remove rolloverTSMFileSize constant that is no longer used
* remove the maxGenerationFileCount since it is no longer a limitation that's necessary with the new compaction scheme. We no longer read WAL segments as part of the compaction so memory is only used as we read in each individual key
* remove minFileCount and switch to a user configurable variable
* remove the mutex from WALSegmentWriter. There's never more than one open segment in the WAL at one time, and it's not exported through any function, so the lock on the WAL should be used. This simplified keeping track of the last write time and removed a bunch of unnecessary locks.
* update WALSegmentWriter.Write to take the compressed bytes so that encoding and compression can occur before the call to write (while we don't hold the WAL lock)
* remove a bunch of unnecessary locking in WAL.writeToLog
* Add check for TSM file magic number and version
* Remove old tsm, log, and unused cursor code
* Remove references to tsm1dev everywhere except in the inspector
* Clean up config options for compaction and snapshotting
* Remove old TSM configuration options
* Update the config.sample.toml with TSM options
* Update WAL compact to force a compaction if the WAL has been cold for writes for a configurable period of time (1h by default)
2015-12-06 18:50:39 -05:00
Philip O'Toole 6e88547a5e Support shutting down engine goroutines
Not shutting them down was causing races in the code when the cache was
being reloaded, because back-to-back opening and closing of the engine
during testing left goroutines running. With this change the engine is
completely shut down when Close() is called on it.
2015-12-06 09:16:38 -08:00
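
A sketch of the shutdown pattern, with hypothetical names: a done channel
signals each background goroutine, and a WaitGroup makes Close() wait for
them to exit:

    package sketch

    import (
        "sync"
        "time"
    )

    // Engine runs background goroutines that must all exit when Close is
    // called; otherwise back-to-back open/close cycles in tests leave
    // stale goroutines racing on shared state such as the cache.
    type Engine struct {
        wg   sync.WaitGroup
        done chan struct{}
    }

    func (e *Engine) Open() {
        e.done = make(chan struct{})
        e.wg.Add(1)
        go e.backgroundLoop()
    }

    func (e *Engine) backgroundLoop() {
        defer e.wg.Done()
        ticker := time.NewTicker(time.Second)
        defer ticker.Stop()
        for {
            select {
            case <-e.done:
                return // shutdown requested
            case <-ticker.C:
                // periodic maintenance work would go here
            }
        }
    }

    // Close signals every goroutine and waits for all of them to exit, so
    // the engine is completely shut down when it returns.
    func (e *Engine) Close() {
        close(e.done)
        e.wg.Wait()
    }
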