influxdb

Commit Graph

Author	SHA1	Message	Date
Jason Wilder	8082fc61ba	Fix parsing keys when loading database index The code for parsing a key our of the WAL or TSM files in the engine was naive and didn't account for measurements with escape chars. This uses the correct parsing code to parse and load them correctly. Fixes #6496	2016-04-30 14:47:19 -06:00
Todd Persen	9eb4c1ec57	Fix typo in comment.	2016-04-29 16:26:27 -07:00
Jason Wilder	abcb559b09	Remove index meta data when series and measurements are gone This remove the dropMeta param from the tsdb.Store.DeleteSeries and lets the shard determine when to remove the meta data from the index based on what series still have data in the shard. This uncovered a nasty bug in compactions where a fully deleted series would prematurely end the compactions and not carry forward the rest of the data in the TSM file. This is now fixed as well.	2016-04-29 16:31:57 -06:00
Edd Robinson	4d1cfa887c	Ensure measurement dropped when no more series	2016-04-29 00:05:42 +01:00
Jason Wilder	2bd5880d7a	Remove series from index when shard is closed When a shard is closed and removed due to retention policy enforcement, the series contained in the shard would still exists in the index causing a memory leak. Restarting the server would cause them not to be loaded. Fixes #6457	2016-04-28 12:34:46 -06:00
Jason Wilder	4e353867d5	Fix first block not getting purged when deleting series	2016-04-27 17:08:00 -06:00
Ben Johnson	f7af787aef	add DELETE query support This commit adds query language support for deleting series with a `DELETE` query.	2016-04-27 15:16:23 -06:00
Jason Wilder	aefd2ad08b	Add DeleteSeries and DeleteSeriesRange	2016-04-27 13:09:53 -06:00
Jason Wilder	c306090361	Fix tombstone rename on windows	2016-04-27 13:09:53 -06:00
Jason Wilder	86d37614e4	Remove debugging from test output	2016-04-27 13:09:53 -06:00
Jason Wilder	bf3aa5857d	Don't add tombstone for timerange not contained by file	2016-04-27 13:09:53 -06:00
Jason Wilder	6042e114a1	Remove tombstoned values during compaction This will skip blocks that are fully tombstoned as well as remove points that have been removed within a block.	2016-04-27 13:09:53 -06:00
Jason Wilder	23bbfb2192	Prevent truncated WAL entries from panicing	2016-04-27 13:09:53 -06:00
Jason Wilder	0de21ade40	Add delete range of values support to WAL and cache loader	2016-04-27 13:09:53 -06:00
Jason Wilder	d13d01b516	Allow deleting series by time on a shard	2016-04-27 13:09:53 -06:00
Jason Wilder	4d71d2b01f	Add support for deleting cache values using time range	2016-04-27 13:09:52 -06:00
Jason Wilder	c154cd4b4a	Remove TSMReaderOptions Not used	2016-04-27 13:09:52 -06:00
Jason Wilder	c8bd41c2d8	Remove TSM reader Keys func It's very inneficient and should never be used.	2016-04-27 13:09:52 -06:00
Jason Wilder	7e06d558d5	Update ContainsValue to handle tombstones	2016-04-27 13:09:52 -06:00
Jason Wilder	97504a552c	Support time range tombstones in FileStore/KeyCursor	2016-04-27 13:09:52 -06:00
Jason Wilder	27c2bc3f15	Sepearate IndexWriter from TSMIndex Allows for future versionion of the TSMIndex as well as removing a lot of unnecessary code.	2016-04-27 13:09:52 -06:00
Jason Wilder	bb82331db7	Move TSMIndex defn to reader.go	2016-04-27 13:09:52 -06:00
Jason Wilder	1ac0b01c5a	Remove fileAccessor No longer used	2016-04-27 13:09:52 -06:00
Jason Wilder	a789e819a3	Remove NewTSMReaderWithOptions There are two TSMIndex implementations, the directIndex and the indirectIndex. Originally, we only had the directIndex and later added the indirectIndex and NewTSMReaderWithOptions in order to allow both indexes to be used in tests and code. This has created a problem since we really only use the directIndex for writing and always use the indirectIndex for reading. This changes removes the NewTSMReaderWithOptions func so that it is no longer possible to create a TSMReader with a directIndex. This will allow a lot of the block reading code used by the directIndex to be removed and simplify maintainence. It also gives better test coverage of the code that is actually used by the TSM engine now.	2016-04-27 13:09:52 -06:00
Jason Wilder	bc6328d196	Add time range support to tombstone files This adds support for a time range to tombstone files to allow a subset of points to be deleted instead of the whole series. It changes the tombstone file format to a binary format and maintains backwards compatibility with the old text format tombstone files.	2016-04-27 13:09:52 -06:00
Tait Clarridge	df0e16a92f	Add safer unlock to CreateFieldIfNotExists A deadlock can occur if the field was created while we were waiting for the lock.	2016-04-25 12:44:58 -04:00
Ben Johnson	9c1fa76f3c	Merge pull request #6452 from benbjohnson/simple8b update dep: simple8b @ b421ab40	2016-04-22 11:05:42 -06:00
Ben Johnson	286072f65a	update dep: simple8b @ b421ab40	2016-04-22 09:46:05 -06:00
Jonathan A. Sternberg	d26e4e3650	Pass binary expressions to the underlying query Binary math inside of a where condition was previously disallowed. Now, these types of queries are just passed verbatim down to the underlying query engine which can handle it. We may want to revisit this when it comes to tags at some point as it prevents the more efficient filtering of tags that a simple expression allows, but it allows a query like this to be done: SELECT * FROM cpu WHERE value + 2 < 5 So while it can be better, this is a good initial implementation to provide this functionality. There are very rare situations where a tag may be used appropriately in one of these circumstances. Fixes #3558.	2016-04-22 11:30:36 -04:00
Ben Johnson	d204a8b683	optimize tsm1.FloatDecoder This commit changes the `FloatDecoder.val` from a `float64` type to a `uint64` to avoid an additional type conversion during read. Now the type gets converted to a `float64` only on call to `Values()`.	2016-04-21 08:49:12 -06:00
Jason Wilder	87ceb7426a	Don't lock the cache while adding entries Entries have their own locking so the cache doesn't need to be lock when adding to them.	2016-04-20 16:08:58 -06:00
Jason Wilder	89aeaafd50	Re-use the string point key	2016-04-20 16:08:58 -06:00
Jason Wilder	fbaa7db54f	Don't lock entry when scanning new values to add	2016-04-20 16:00:26 -06:00
Jason Wilder	bfa225f149	Merge pull request #6430 from influxdata/jw-cache-load-size Disable cache max memory size when reloading the cache	2016-04-20 14:35:23 -06:00
Stephen Gutekanst	9dc09c5257	Make logging output location more programmatically configurable (#6213 ) This has various benefits: - Users embedding InfluxDB within other Go programs can specify a different logger / prefix easily. - More consistent with code used elsewhere in InfluxDB (e.g. services, other `run.Server.` fields, etc). - This is also more efficient, because it means `executeQuery` no longer allocates a single `log.Logger` each time it is called.	2016-04-20 21:07:08 +01:00
Jason Wilder	f679787080	Disable cache max memory size when reloading the cache The cache max memory size is an approximate size and can prevent a shard from loading at startup. This change disable the max size at startup to prevent this problem and sets the limt back after reloading. Fixes #6109	2016-04-20 10:41:30 -06:00
Jonathan A. Sternberg	c8c38e15cd	Merge pull request #6386 from influxdata/js-iterator-next-error Modify all of the iterators to allow returning an error on Next()	2016-04-20 10:39:53 -04:00
Ben Johnson	54454e1e5b	Merge pull request #6424 from benbjohnson/optimize-bit-reader Optimize tsm1.BitReader	2016-04-20 08:28:24 -06:00
Seif Lotfy	c6e3c87e00	Add Block checksum validation and "influx_inspect verify" tool Fixes #5502	2016-04-19 22:33:03 +02:00
Jonathan A. Sternberg	493ef0e1ce	Merge pull request #6416 from influxdata/js-3166-deterministic-limit Sort the series keys inside of a tag set so the output is deterministic	2016-04-19 14:49:49 -04:00
Ben Johnson	1d2238c642	optimize tsm1.BitReader This commit rewrites the `tsm1.BitReader` to use an 8-byte buffer instead of a 1-byte buffer and provide an inlineable fast bit read.	2016-04-19 11:34:17 -06:00
Jason Wilder	f841a90d35	Use int64 instead of time.Time in timestamp encoder/decoder	2016-04-19 10:25:27 -06:00
Jason Wilder	61beeca426	Update timestamp benchmarks	2016-04-19 10:17:32 -06:00
Jonathan A. Sternberg	09c46a451a	Sort the series keys inside of a tag set so the output is deterministic The series keys within a tag set were previously not sorted which would cause the output to be non-deterministic. This sorts the output series by their keys so it has a consistent output especially when using limits. Fixes #3166.	2016-04-18 17:45:31 -04:00
Jonathan A. Sternberg	7ec2a991d5	Modify all of the iterators to allow returning an error on Next() This also switches the remaining iterators to be lazy so they can return errors properly. They needed to be converted to lazy initialization anyway, which has the side effect of making it much easier for us to propagate the underlying error during initialization. Updated the Emitter to return errors when it cannot read properly from the iterators.	2016-04-18 11:17:55 -04:00
Jonathan A. Sternberg	93745d9693	Merge pull request #6391 from influxdata/js-5553-limit-queries-slow-with-group-by Propagate the limit option to the low level iterators	2016-04-16 09:39:25 -04:00
Jonathan A. Sternberg	bd5fdd797d	Propagate the limit option to the low level iterators When a GROUP BY or multiple sources are used, the top level limit iterator requires reading the entire iterator stream so it can find all of the tag groups it needs to return. For large data series, this ends up with the limit iterator discarding a lot of output. This change adds a new lower level limit iterator on each series itself so that there are fewer data points that have to be thrown away by the top level iterator. Fixes #5553.	2016-04-15 18:23:54 -04:00
Jonathan A. Sternberg	835d08591e	Do not filter out empty tags from series keys	2016-04-13 09:15:57 -04:00
Jonathan A. Sternberg	60282cf52d	Merge pull request #6284 from influxdata/js-3371-where-clause-compare-tags-and-fields Enhance comparing tags and fields in the where clause	2016-04-12 11:45:54 -04:00
Pierre Fersing	29b19a2293	Fix deadlock in tsm1/file_store	2016-04-12 09:39:21 +02:00
Jonathan A. Sternberg	ea6262b712	Enhance comparing tags and fields in the where clause Now it is possible to compare tags and fields and it is also now possible to compare tags and tags. Previously, it was only possible to compare fields with fields and tags with a string or a regex. Fixes #3371.	2016-04-11 18:10:08 -04:00
Ben Johnson	525e22c92b	tsm1 query engine alloc reduction This commit makes a number of performance improvements to reduce allocations during query execution. Several objects and buffers are now reused across the components to avoid allocations. Previously a simple `count(value)` query across 1M points would require 26,000+ allocations. After the changes in this commit that number has been reduced to 88.	2016-04-11 14:50:59 -06:00
Jonathan A. Sternberg	5bdd61bde7	Support empty tags for all WHERE equality operations A missing tag on a point was sometimes treated as `""` and sometimes treated as a separate `null` entity. This change modifies the equality operations to always treat a missing tag as an empty string. Empty tags are not indexed and do not have the same performance as a tag that exists. Fixes #3773.	2016-04-11 12:01:35 -04:00
Edd Robinson	5327a75a6f	Merge pull request #6216 from influxdata/er-scope-proto Change protobuf package names to avoid clashes	2016-04-07 16:38:21 +01:00
Jonathan A. Sternberg	a58430bb60	Merge pull request #6217 from influxdata/js-tsdb-unused-code Remove unused code and increase some test coverage for the tsdb package	2016-04-06 10:07:43 -04:00
Jonathan A. Sternberg	028fdaff81	Merge pull request #6222 from influxdata/js-6206-descending-tsm1-iterators Handle nil values from the tsm1 cursor correctly	2016-04-06 10:05:20 -04:00
Jonathan A. Sternberg	94ec92d669	Handle nil values from the tsm1 cursor correctly Send nil values from the tsm1 cursor at the end of the cursor. After the cursor reached tsm1, the `nextAt()` call would always return the default value rather than a nil value. Descending also didn't work correctly because the seeking functionality for tsm1 iterators would always act like they were ascending instead of descending when choosing which value to select. This resulted in very strange output from the emitter since it couldn't figure out if it was ascending or descending. Fixes #6206.	2016-04-06 09:27:02 -04:00
Jonathan A. Sternberg	7a229c7e4e	Remove unused code and increase some test coverage for the tsdb package	2016-04-06 09:24:56 -04:00
joelegasse	84f8dd7c85	Merge pull request #6190 from influxdata/jw-race Fix race on measurementFields	2016-04-06 08:13:58 -04:00
Edd Robinson	184257a10d	Scope all internal protobuf packages	2016-04-05 13:54:21 +01:00
Jonathan A. Sternberg	37b63cedec	Cleanup QueryExecutor and split statement execution code The QueryExecutor had a lot of dead code made obsolete by the query engine refactor that has now been removed. The TSDBStore interface has also been cleaned up so we can have multiple implementations of this (such as a local and remote version). A StatementExecutor interface has been created for adding custom functionality to the QueryExecutor that may not be available in the open source version. The QueryExecutor delegate all statement execution to the StatementExecutor and the QueryExecutor will only keep track of housekeeping. Implementing additional queries is as simple as wrapping the cluster.StatementExecutor struct or replacing it with something completely different. The PointsWriter in the QueryExecutor has been changed to a simple interface that implements the one method needed by the query executor. This is to allow different PointsWriter implementations to be used by the QueryExecutor. It has also been moved into the StatementExecutor instead. The TSDBStore interface has now been modified to contain the code for creating an IteratorCreator. This is so the underlying TSDBStore can implement different ways of accessing the underlying shards rather than always having to access each shard individually (such as batch requests). Remove the show servers handling. This isn't a valid command in the open source version of InfluxDB anymore. The QueryManager interface is now built into QueryExecutor and is no longer necessary. The StatementExecutor and QueryExecutor split allows task management to much more easily be built into QueryExecutor rather than as a separate struct.	2016-04-04 13:27:17 -04:00
Jason Wilder	ca8b0ca143	Optimize locking in CreateFieldIfNotExists Also remove some dead code that is no longer relevant with tsm.	2016-04-01 20:44:40 -06:00
Jason Wilder	3f4c5a5585	Fix race on measurementFields Both Shard and Engine had the same reference to the measurementField map, but they each protected it with their own locks. This causes a race when write and queries are occurring because writes can add new fields to the map while queries are reading from it. The fix moves the ownership to the Engine and provides protected accessors to that Shard now users. For the most parts, the access on shard were old dead code. Fixing the measurementFields map race created a new race on the internal fields map. This is now unexported and protected via MeasurementFields exported funcs. Fixes #6188	2016-04-01 18:57:01 -06:00
Jason Wilder	07e3215d11	Remove ununsed Series.match func	2016-03-31 10:19:46 -06:00
Jason Wilder	40c4973423	Remove per measurement stats collection The stats setup ends up creating a lot of lock contention which signifcantly impacts write throughput when a large number of measurements are used. Fixes #6131	2016-03-31 10:19:27 -06:00
Jason Wilder	f1bb87d4f8	Convert index write lock to series lock	2016-03-31 10:19:27 -06:00
Edd Robinson	8e2d1e48c7	Check if engine closed. Fixes #6140	2016-03-31 15:59:04 +01:00
Edd Robinson	75a2218fa1	Ensure syncronised access to engine	2016-03-31 15:58:19 +01:00
Jason Wilder	873ac2715d	Fix panic: runtime error: slice bounds out of range Writing a key that exceeds the max key length could cause a panic when reading a tsm file because the 2 bytes used for the key length would not be enough to represent the actual key length. The writer will now return an error if when trying to write a key that is too large.	2016-03-30 23:44:17 -06:00
Jonathan A. Sternberg	711a6614e6	Implement the point limit monitor Fixes #6077.	2016-03-30 16:08:56 -04:00
Joe LeGasse	f10c300765	Update to conversion tool to work in current versions After adding type-switches to the tsm1 packages, the custom implementation found in the conversion tool broke. This change uses tsm1.NewValue() instead of a custom implementation. This change also ensures that the tsm1.Value interface can only be implemented internally to allow for the optimized type-switch based encoding	2016-03-30 13:26:46 -04:00
Jason Wilder	9f41acba2f	Move shard mapping logic into index	2016-03-29 12:59:27 -06:00
Jason Wilder	60c3898577	Add godoc for KeyAt func	2016-03-29 12:59:26 -06:00
Jason Wilder	1b08e2dd55	Use walk func to load all tsm keys to index Avoids allocating a big map or all keys.	2016-03-29 12:59:26 -06:00
Jason Wilder	96e076b6df	Limit how many shards are loaded concurrently Since loading a shard can allocate a lot of memory, running them all at once could OOM the process. This limits the number of shards loaded to 4. This will be changed to a config option provided the approach helps.	2016-03-29 12:59:26 -06:00
Jason Wilder	d4757ad040	Remove sync.Pool from wal UnmarshalBinary When loading many shards concurrently they block trying to acquire a write lock in the sync pool adding a new source of contention. Since this code flow always needs to allocate a buffer it's not really buying us much.	2016-03-29 12:59:26 -06:00
Jason Wilder	3f0e871425	Reduce lock content when loading database index	2016-03-29 12:59:26 -06:00
Jason Wilder	03ced4cc90	Load shards concurrently	2016-03-29 12:58:52 -06:00
Ben Johnson	45f1c28adb	add tsm iterator stats buffer This commit adds a buffer for stats to be updated without requiring a mutex lock/unlock on every point. The tradeoff is that stats are not exactly precise. This works for our use case because stats are only periodically checked.	2016-03-23 12:23:22 -06:00
Jonathan A. Sternberg	a35d9602cd	Fix where filters when a OR is used and when a tag does not exist If an OR was used, merging filters between different expressions would not work correctly. If one of the sides had a set of series ids with a condition and the other side had no series ids associated with the expression, all of the series from the side with a condition would have the condition ignored. Instead of defaulting a non-existant series filter to true, it should just be false and the evaluation of the one side that does exist should take care of determining if the series id should be included or not. The AND condition used false correctly so did not have to be changed. If a tag did not exist and `!=` or `!~` were used, it would return false even though the neither a field or a tag equaled those values. This has now been modified to correctly return the correct series ids and the correct condition. Also fixed a panic that would occur when a tag caused a field access to become unnecessary. The filter using the field access still got created and used even though it was unnecessary, resulting in an attempted access to a non-initialized map. Fixes #5152 and a bunch of other miscellaneous issues.	2016-03-22 12:19:06 -04:00
Ben Johnson	573dd0f96a	Merge pull request #6035 from benbjohnson/query-engine-reduce-alloc Reduce allocations in query execution	2016-03-22 10:11:14 -06:00
Ben Johnson	6e1c1da25b	reduce allocations in query execution This commit removes some heap objects by converting them from pointer references to non-pointers or by reusing buffers.	2016-03-22 09:51:39 -06:00
Jason Wilder	7857e07a1e	Merge pull request #6062 from influxdata/mr-prune-wal-config Remove unused WAL configuration variables/fields	2016-03-22 09:20:27 -06:00
Jonathan A. Sternberg	ad96207868	Fix ORDER BY desc so it doesn't skip values After reading the initial buffer, ORDER BY desc would read the next block into the buffer and only read the first element. It's because the code that was copied from the ascending cursor wasn't modified correctly to set the position to the last element in the buffer. The buffer size has also been lowered from 1000 to 10 to match with the ascending cursor for performance with limit queries. Fixes #6055.	2016-03-22 09:40:11 -04:00
Ben Johnson	7156c1f9bd	add IteratorStats This commit adds an `IteratorStats` that holds aggregate iterator processing information. A method is also added to `Iterator` to return the stats: Stats() influxql.IteratorStats The remote iterators will also emit their stats in the point stream upon first connection, on a given interval, and then finally once the last point has been sent.	2016-03-21 16:25:19 -06:00
Jason Wilder	ee2f21e76f	Merge pull request #6082 from influxdata/jw-tsm Fix partially written TSM files	2016-03-21 15:42:27 -06:00
Jason Wilder	7567453c9a	Ensure TSM files are fsync'd Make sure TSM files are fsync'd when closed and also that the parent dir is fsync'd when they are renamed.	2016-03-21 15:03:52 -06:00
Jason Wilder	a4e5446ddd	Return error when TSM writer close returns one The TSM writer uses a bufio.Writer that needs to be flushed before it's closed. If the flush fails for some reason, the error is not handled by the defer and the compactor continues on as if all is good. This can create files with truncated indexes or zero-length TSM files. Fixes #5889	2016-03-21 15:00:36 -06:00
Jonathan A. Sternberg	6655ca7769	Create a new interrupt iterator that will stop emitting points after an interrupt Use of the iterator is spread out into both `IteratorCreators` and inside of the iterators themselves. Part of the interrupt must be handled inside of the engine so it stops trying to emit points when an interrupt is found and another part of the interrupt has to happen when combining the iterators so it doesn't just start reading the next shard.	2016-03-21 12:07:07 -04:00
Mark Rushakoff	7a2adfcc5d	Remove unused WAL configuration variables/fields These were all b1/bz1 settings that no longer have any effect: - {Default,}MaxWALSize - {Default,}WALFlushInterval - {Default,}WALPartitionFlushDelay - {Default,WAL}ReadySeriesSize - {Default,WAL}CompactionThreshold - {Default,WAL}MaxSeriesSize - {Default,WAL}FlushColdInterval - {Default,WAL}PartitionSizeThreshold	2016-03-20 13:16:52 -07:00
Jonathan A. Sternberg	d75428f79f	Rename the special condition "name" to "_name" to reduce conflicts Fixes #6034.	2016-03-16 17:17:04 -04:00
Jonathan A. Sternberg	eb2d49dbe4	Merge pull request #6007 from benbjohnson/explicit-system-names Allow querying of system-like series	2016-03-15 16:15:17 -04:00
Cory LaNou	ba6a95e9bc	Merge pull request #5994 from influxdata/single-server-lite Single Server	2016-03-14 16:11:37 -05:00
Ben Johnson	f692621ef5	allow querying of system-like series Internal system series start with an underscore prefix but restricting this prevents users who already use an underscore prefix in their series names. Fixes #5870	2016-03-14 13:50:52 -06:00
Jason Wilder	3fd40d48a1	Merge pull request #6006 from influxdata/jw-deadlock Fix deadlock when running backup	2016-03-14 13:36:45 -06:00
Jason Wilder	9984cd5d6d	Fix skipping blocks at query time when overlaps exist Depending on how data is written across TSM files, it was possible to skip over some blocks at query time making it looks like data was missing.	2016-03-14 13:11:11 -06:00
Jason Wilder	000459e350	Fix deadlock when running backup A deadlock occurs under write load if a backup is run in between the time when a snapshot compactions has snapshotted the cache and successfully written it to disk. The issus is that the second snapshot call will block on the commit lock while it is holding the engine write lock. This causes all writes to block as well as prevents the currently runnign snapshot compaction from completing because it needs to acquire a read-lock. This PR removes the commit lock and just returns an error if a snapshot is in progress to all any locks being held to be released. The caller can determine whether to retry or giveup.	2016-03-14 12:36:48 -06:00
Cory LaNou	27cfaa4b7a	in memory meta, single node configs, etc.	2016-03-14 16:55:54 +00:00
Joe LeGasse	344e5abd41	Changed type-switch a few places to reduce allocations. Slices of tsm1.Value interfaces are only ever used with all the same types, and the previous code would switch on the type returned from a call to Value(), which allocated and returned an interface{} object for the underlying value. This change instead type-switches on the tsm1.Value object itself, allowing it direct access to the underlying value field, eliminating the unecessary allocations.	2016-03-11 15:57:05 -05:00
Ben Johnson	beda072426	add support for remote expansion of regex This commit moves the `tsdb.Store.ExpandSources()` function onto the `influxql.IteratorCreator` and provides support for issuing source expansion across a cluster.	2016-03-11 12:40:07 -07:00

1 2 3 4 5 ...

1104 Commits (8fda621d8b41b383e0c9895fdab77809285fdf34)