influxdb

Commit Graph

Author	SHA1	Message	Date
Jonathan A. Sternberg	ecba19eb27	Prevent a panic when a query simultaneously finishes and is killed at the same time There is a strange race condition where a query can be killed and finish at approximately the same time. If this happens, the query gets retrieved by the killing task, the query gets closed by the normal processing thread, and then the killing task attempts to kill the query afterwards. Since the close doesn't mark the query as already killed (since it's not killed, just merely unused), the killing thread attempts to close the channel again. Mark the query as killed whenever it is closed to prevent a double close from happening. This should never cause the status to be erroneously reported since the query status is removed from the query table within the same lock scope.	2018-01-02 11:04:01 -06:00
Edd Robinson	f6835632e7	Merge master into branch	2017-12-08 17:11:07 +00:00
Ben Johnson	e0df47d54f	Fixing up tests.	2017-12-02 16:52:34 -07:00
Jonathan A. Sternberg	db60a83d5a	Fix query compilation so multiple nested distinct calls is allowable When refactoring the query engine, I thought calling `count(distinct(value))` multiple times was disallowed and so the refactor made it so that wasn't possible. It turns out that this pattern is allowed because since the distinct is nested, it is aggregated anyway and can be combined with other aggregates. This removes the erroneously placed restriction.	2017-11-28 11:09:32 -06:00
Jonathan A. Sternberg	a73c3a1965	Fix race condition in the merge iterator close method If the close happens when next is being called, it can result in a race condition where the current iterator gets set to nil after the initial check. This also fixes the finalizer so it runs the close method in a goroutine instead of running it by itself. This is because all finalizers run on the same goroutine so a close that takes a long time can cause a backup for all finalizers. This also removes the redundant call to `runtime.SetFinalizer` from the finalizer itself because a finalizer, when called, has already cleared itself.	2017-11-27 16:55:41 -06:00
Ben Johnson	01491ca4f4	intermediate	2017-11-27 07:52:18 -07:00
Ben Johnson	fc966a1b67	Add series file backup/restore.	2017-11-22 08:55:54 -07:00
Edd Robinson	c098081c7d	Don't initialise a new Authorizer each query	2017-11-17 11:06:43 +00:00
Jonathan A. Sternberg	0b7c56bcd8	Update the zap logger dependency The previous sha was taken from a revision on a devel branch that I thought would continue staying in the tree after it was merged. That revision was rebased away and the API was changed for the logger. This updates the usage of the logger and adds a simple package for constructing the base logger. The 1.0 version of zap changed the format of the default console logger so this change moves over to this new logger instead of attempting to retain backwards compatibility with the old format.	2017-11-10 16:27:16 -06:00
Edd Robinson	6443355467	Pass through SystemIterator in PB	2017-11-08 19:57:16 +00:00
Jonathan A. Sternberg	8e6c208b55	Merge pull request #9080 from influxdata/js-influxql-data-type Update influxql so DataType constants are the type DataType	2017-11-07 20:05:23 -06:00
Jonathan A. Sternberg	b09396f977	Update influxql so DataType constants are the type DataType	2017-11-07 18:34:53 -06:00
Ben Johnson	156f25ac23	Improve SHOW TAG KEYS performance.	2017-11-07 10:59:19 -07:00
Edd Robinson	9f7a6ab044	refactor for external influxql package	2017-11-06 19:15:01 +00:00
Edd Robinson	fbcb299b8a	Support WHERE time clause in SHOW TAG VALUES This commit adds time support to SHOW TAG VALUES. Time can be used as both a lower and upper boundary. However, there are some caveats. For the `inmem` index, filtering by time will still return all results because the index data is shared across shards. For the `tsi1` index, filtering by time will only work down to the shard lever. Specifically, when querying by time all shards within that time range will be used to generate the results.	2017-11-06 19:15:01 +00:00
Edd Robinson	b2a3b8837f	Remove time from SHOW MEASUREMENTS	2017-11-06 19:15:00 +00:00
Edd Robinson	c61e215ded	Add time-based rewriter tests	2017-11-06 19:15:00 +00:00
Edd Robinson	98d584b63f	Use index for SHOW X meta queries When a meta query does not include a time component then it can be answered exclusively by the index. This should result in a much faster query execution that if the TSM engine was engaged. This commit rewrites the following queries such that they make use of the index where no time component is present: - SHOW MEASUREMENTS - SHOW SERIES - SHOW TAG KEYS - SHOW FIELD KEYS	2017-11-06 19:15:00 +00:00
Jonathan A. Sternberg	748fc4ae79	Update the influxql path inside of the template files	2017-11-03 10:57:17 -05:00
Stuart Carnie	f3d45ba301	influxdata/influxdb/influxql -> influxdata/influxql	2017-10-30 14:40:26 -07:00
Stuart Carnie	7435725e05	move RewriteStatement and neldermead to influxdb/query package	2017-10-30 10:24:15 -07:00
Jonathan A. Sternberg	11c62eb9ab	Merge pull request #9020 from influxdata/js-9018-panic-on-task-manager-close Do not panic when the task manager closes if a killed query is still running	2017-10-26 12:10:57 -05:00
Jonathan A. Sternberg	764e4e501c	Do not panic when the task manager closes if a killed query is still running This prevents a channel from being closed twice when the task manager is closed. That same check existed to make sure a panic didn't happen when detaching a killed query, but the check was forgotten when closing the task manager. This also adds a new error when attempting to kill an already killed query.	2017-10-26 11:31:18 -05:00
Stuart Carnie	c51ba16287	fixes #9007	2017-10-25 13:08:55 -07:00
Stuart Carnie	e9313876ab	EXPLAIN ANALYZE * Introduces EXPLAIN ANALYZE command, which produces a detailed tree of operations used to execute the query. introduce context.Context to APIs metrics package * create groups of named measurements * safe for concurrent access tracing package EXPLAIN ANALYZE implementation for OSS Serialize EXPLAIN ANALYZE traces from remote nodes use context.Background for tests group with other stdlib packages additional documentation and remove unused API use influxdb/pkg/testing/assert remove testify reference	2017-10-20 08:01:37 -07:00
Jonathan A. Sternberg	f20cab6e99	Implicitly decide on the lower limit for fill queries when none is present This allows the query: SELECT mean(value) FROM cpu GROUP BY time(1d) To function in some way that makes sense. The upper limit is implicitly the `now()` starting time and the lower limit will be whichever interval the lowest point falls into. When no lower bound is specified and `max-select-buckets` is specified, the query will only consider points that would satisfy `max-select-buckets`. So if you have one point written in 1970, have another point within the last minute, and then do the above query with `max-select-buckets` being equal to 10, the older point from 1970 will not be considered.	2017-10-05 15:56:44 -05:00
Jonathan A. Sternberg	18cc515345	Support marshaling unsigned points	2017-10-05 11:44:27 -05:00
Jonathan A. Sternberg	79092610c8	Support unsigned binary math in fields Field math works similar to condition evaluation, but not the exact same because we have more information to work with in field expressions than we do in conditional math because fields retain the information about their source while conditions do not. The main difference is that you cannot add an unsigned literal to the output of an integer iterator while you can inside of a condition. You can perform math on a positive integer literal to an unsigned iterator. Inside of the condition, we aren't sure if an integer is because of a literal or because of an iterator so we can't make that distinction.	2017-10-02 17:06:49 -05:00
Jonathan A. Sternberg	4e6075e819	Merge pull request #8873 from influxdata/js-emitter-unsigned-type Support the unsigned type in query.Emitter	2017-09-25 15:45:28 -05:00
Jonathan A. Sternberg	1d833cd920	Support the unsigned type in query.Emitter	2017-09-25 15:08:18 -05:00
Jonathan A. Sternberg	bcf2e8fca5	Prevent deadlock when doing math on the result of a subquery The `fill(none)` attribute got set on subqueries, but that can cause an issue with certain subqueries just like it caused a deadlock on outer queries.	2017-09-22 14:45:53 -05:00
Jonathan A. Sternberg	107feebb81	Report the task status for a query This more accurately shows whether or not a query has been killed. Instead of automatically removing it from the query table when it's killed even though goroutines and iterators may still be open, it now marks the process as killed. This should allow us to more accurately determine if a query has been stalled and is still using resources on the server. This is related to #8848, but not directly connected.	2017-09-19 14:48:56 -05:00
Jonathan A. Sternberg	0ef94e0cf0	Add unsigned iterators for all types This allows unsigned data to be queried from the storage engine. Binary math is not yet implemented for unsigned types.	2017-09-18 15:09:10 -05:00
Jonathan A. Sternberg	5a9553b2c4	Remove unused casting code from the query engine Originally, casting was performed inside of the query engine especially for call iterators. Currently, the engine takes care of all casting so we just need to normalize the iterators types for type safety reasons rather than actual functional reasons. Removing this code. Cover coverage showed that it was not hit when run against the actual server. I ran the tests package and got code coverage of the query package while running the tests in that package.	2017-09-18 12:33:34 -05:00
Jonathan A. Sternberg	aa7ae36880	Merge pull request #8826 from influxdata/js-iterator-options-marshaling Fix group by marshaling in the IteratorOptions	2017-09-14 16:46:11 -05:00
Jonathan A. Sternberg	5b3b7a5102	Fix group by marshaling in the IteratorOptions Ensure that the marshaling and unmarshaling an IteratorOptions returns the same thing.	2017-09-14 15:40:54 -05:00
Jonathan A. Sternberg	6e60edc4bd	Merge pull request #8771 from influxdata/js-restore-old-timerange-behavior This restores the old time range behavior	2017-09-14 12:00:50 -05:00
Jonathan A. Sternberg	c8440b4778	This restores the old time range behavior All time ranges are combined with AND regardless of context and regardless of whether it makes any logical sense. That was the previous behavior and, unfortunately, a lot of people rely on it.	2017-09-14 09:10:37 -05:00
Jonathan A. Sternberg	9cbd604603	Fix time constraints in subqueries from the refactor	2017-09-08 11:55:53 -05:00
Jonathan A. Sternberg	590be193e5	Include the number of scanned cached values in the iterator cost	2017-09-06 15:41:07 -05:00
Jonathan A. Sternberg	50d404e690	Initial implementation of explain plan It prints the statistics of each iterator that will access the storage engine. For each access of the storage engine, it will print the number of shards that will potentially be accessed, the number of files that may be accessed, the number of series that will be created, the number of blocks, and the size of those blocks.	2017-09-01 09:01:10 -05:00
Joe LeGasse	732a0c2eaa	Merge pull request #8769 from influxdata/jl-map-cleanup cleanup: remove poor usage of ',ok' with maps	2017-08-31 09:18:42 -04:00
Ben Johnson	1dbe0662d8	Use system cursors for measurement, series, and tag key meta queries.	2017-08-30 08:35:20 -06:00
Joe LeGasse	a95647b720	cleanup: remove poor usage of ',ok' with maps There are several places in the code where comma-ok map retrieval was being used poorly. Some were benign, like checking existence before issuing an unconditional delete with no cleanup. Others were potentially far more serious: assuming that if 'ok' was true, then the resulting pointer retrieved from the map would be non-nil. `nil` is a perfectly valid value to store in a map of pointers, and the comma-ok syntax is meant for when membership is distinct from having a non-zero value. There was only one or two cases that I saw that being used correctly for maps of pointers.	2017-08-30 09:49:31 -04:00
Jonathan A. Sternberg	1c7bafcd3e	Force subqueries to match the parent queries ordering Previously, subqueries would honor their own ordering. We never really supported that and I have no idea if it would work since most parts in the query engine assume that points are being delivered in only one ordering. Subqueries have now been modified so if a person tries to do different ordering, they get an error when running the query. If they specify an ordering in the top most query, that ordering gets propagated to all subqueries. Fixes #8699.	2017-08-28 15:57:40 -05:00
Jonathan A. Sternberg	5dbcd1b06f	Merge pull request #8745 from influxdata/js-refactor-validation Refactor validation code and move it to the compiler	2017-08-28 11:35:24 -05:00
Jonathan A. Sternberg	d2fcb893e1	Close the query shard group after the iterators are created Now, the prepared statement keeps the open resource and closing the open resource created from `Prepare` is the responsibility of the prepared statement. This also nils out the local shard mapping after it is closed to prevent it from being used after it is closed.	2017-08-28 09:46:11 -05:00
Jonathan A. Sternberg	5cdd1b1489	Fix the panic message for the new interval iterator	2017-08-26 17:36:32 -05:00
Jonathan A. Sternberg	905e7fe05e	Refactor validation code and move it to the compiler This refactors the validation code so it is more flexible and performs a small bit of work to make preparing and executing the query easier. The general idea is that compilation will eventually do more heavy lifting in creating the initial plan and prepare will construct an actual plan rather than just doing some basic field rewriting. This change at least sets us up for that change in the future and moves the validation code to the query execution instead of in the parser. This also frees up the parser to parse the complete AST without worrying if the query itself is valid. That could be useful for client code that wants to compile a partial query to an AST and then perform modifications on the AST for some reason.	2017-08-26 17:36:32 -05:00
Jonathan A. Sternberg	8738e72cf1	Refactor the select call into three separate phases The first call is to compile the query. This performs some initial processing that can be done before having any access to the shards. At the moment, it does very little, but it's intended to be changed to eventually perform initial validations of the query and create an internal graph structure for the execution of the query. The second call is to prepare the query. This step has access to the shard mapper. Right now, it just maps the shards and rewrites the fields of the query for any wildcards. In the future, it is intended to do the above, but also to prepare the final directed acyclical graph that will execute the query. The third call is to select the query. This step is intended to create all of the iterators for processing the query. At the moment, much of the work intended for the second step is performed in the third step.	2017-08-25 07:50:13 -05:00
Jonathan A. Sternberg	f7724b780a	Add benchmark tests back	2017-08-25 07:50:02 -05:00
Jonathan A. Sternberg	d46e8ffdae	Merge pull request #8740 from influxdata/js-map-shards-interface Pass the select options to the shard mapper again	2017-08-24 12:26:15 -05:00
Jonathan A. Sternberg	421a91d480	Pass the select options to the shard mapper again	2017-08-24 09:55:02 -05:00
Edd Robinson	8c4686fb1b	Ensure that sorted heaps are merged correctly When merging streams of system iterators we don't use tags or time. Instead we add series keys (in the case of, for example, `SHOW SERIES`) to the `Aux` field of the iterators' elements. This is because we only emit merged and sorted sets of series key to the client. We currently use `SortedMergeHeap`s to merge together multiple iterators, and the comparitor function did not consider `Aux` fields when determining which heap to pop the next item off during a merge. As such, `SHOW SERIES` and `SHOW TAG KEYS` (any meta query that gets converted into a special type of `SELECT`) were returning results in arbitrary order. This issue was never noticed on the `inmem` index because the streams are always duplicates of each other, and of course it doesn't matter if you arbitrarily merge together two idential, sorted streams... The issue first manifested itself on the `tsi1` index, but this fix will apply to both indexes.	2017-08-23 17:21:24 +01:00
Jonathan A. Sternberg	96689e661e	Move query engine code from the statement executor to the query engine The statement rewriting logic should be in the query engine as part of preparing a query. This creates a shard mapper interface that the query engine expects and then passes it to the query engine instead of requiring the query to be preprocessed before being input into the query engine. This interface is (mostly) the same as the old interface, just moved to a different package.	2017-08-23 10:07:30 -05:00
Jonathan A. Sternberg	966e845c03	Refactor the select tests to use subtests and fewer functions This will allow refactoring the query engine and the select statement more easily since fewer code locations will need to be changed. It also reduces the amount of code while still keeping individual tests that are filterable.	2017-08-23 08:21:49 -05:00
Jonathan A. Sternberg	8bd04ebe39	Remove TimeRange function and replace with a more accurate ConditionExpr function The ConditionExpr function is more accurate because it parses the condition and ensures that time conditions are actually used correctly. That means that attempting to combine conditions with OR will not result in the query silently pretending it's an AND and nested conditions work correctly so there is only one way to read the query. It also extracts the non-time conditions into a separate condition so we can stop attempting to parse around the time conditions in lower layers of the storage engine. This change does not remove those hacks, but a following commit should be able to sanitize the condition and remove them.	2017-08-16 16:45:35 -05:00
Jonathan A. Sternberg	9a2357c2c0	Separate the query engine into a separate package This change provides a clear separation between the query engine mechanics and the query language so that the language can be parsed and dealt with separate from the query engine itself.	2017-08-16 13:38:43 -05:00

1 2 3 4

158 Commits (933a14e16f0232c5ec883644b7363a59da7bcd90)