influxdb

Commit Graph

Author	SHA1	Message	Date
Sam Arnold	b7e7de24d6	refactor: separate coarse and fine permission interfaces (#20996 )	2021-03-22 09:52:33 -04:00
Sam Arnold	903b8cd0ea	feat(query): Hyper log log operators in influxql (#20603 ) * feat(query): hyper log log counting in query engine In addition to helping with normal queries, this can improve the 'SHOW CARDINALITY' meta-queries: time influx -database mydb -execute 'select count_hll(sum_hll(_seriesKey)) from big' name: big time count_hll ---- --------- 0 200767781 influx -database mydb -execute 0.06s user 0.12s system 0% cpu 8:49.99 total	2021-02-08 08:38:14 -05:00
Ben Johnson	5263070632	feat(query): Parallelize field iterator planning	2020-06-11 08:01:14 -06:00
Jonathan A. Sternberg	c381389f35	Subquery ordering with aggregates in descending mode was wrong The sort order of points when performing aggregates never took into account if they were ascending or descending so when multiple series were aggregated, it would ensure they were sorted in the correct order. But it wouldn't reverse this order when descending was used. Additionally, it seems that the iterator template and the iterator file itself became out of sync. It seems the template was not reverted correctly from a previously incorrect change and only the float type was changed to the correct version and the tests used the float version.	2019-07-17 09:55:38 -05:00
Ben Wells	e9bada090f	Fix misspelling identified by misspell	2019-02-03 20:27:43 +00:00
Patrick Hemmer	7dc7efd501	rename "triple_exponential_average" -> "triple_exponential_derivative"	2018-05-16 19:40:12 -04:00
Jonathan A. Sternberg	d42062def2	Add technical analysis algorithms This adds numerous technical analysis algorithms: * exponential_moving_average * double_exponential_moving_average * triple_exponential_moving_average * relative_strength_index * triple_exponential_average * kaufmans_efficiency_ratio (commonly referred to as just "Efficiency Ratio") * kaufmans_adaptive_moving_average * chande_momentum_oscillator (both the common 'smoothed' version, and the ta-lib version)	2018-04-23 22:27:21 -04:00
Jonathan A. Sternberg	8aeb0fa0c6	Update explain analyze to output data related to the iterator scanners	2018-04-02 14:49:22 -05:00
Jonathan A. Sternberg	0f304690c5	Enable casting values from a subquery This also fixes the cursor system to abandon iterators that will not produce meaningful results since the variables are all unknown types. This creates a weird behavior that existed in previous releases and we are keeping here for backwards compatibility. If a subquery referenced a field that didn't exist in the subquery, it will return nothing. But, if there are two subqueries and one of them has the field exist and the other doesn't, the second will return all null values.	2018-03-30 16:58:37 -05:00
Jonathan A. Sternberg	d4db76508f	Add some unit tests to subqueries This is not complete, but it is a starting point for more thorough tests of subqueries. This also reorders the use of `cmp.Diff` so the `want` is first and `got` is second. This way, the `want` shows up as a minus sign in the diff rather than, confusingly, as a plus sign.	2018-03-27 14:56:27 -05:00
Jonathan A. Sternberg	f8d60a881d	Refactor the math engine to compile the query and use eval This change makes it so that we simplify the math engine so it doesn't use a complicated set of nested iterators. That way, we have to change math in one fewer place. It also greatly simplifies the query engine as now we can create the necessary iterators, join them by time, name, and tags, and then use the cursor interface to read them and use eval to compute the result. It makes it so the auxiliary iterators and all of their complexity can be removed. This also makes use of the new eval functionality that was recently added to the influxql package. No math functions have been added, but the scaffolding has been included so things like trigonometry functions are just a single commit away. This also introduces a small breaking change. Because of the call optimization, it is now possible to use the same selector multiple times as a selector. So if you do this: SELECT max(value) * 2, max(value) / 2 FROM cpu This will now return the timestamp of the max value rather than zero since this query is considered to have only a single selector rather than multiple separate selectors. If any aspect of the selector is different, such as different selector functions or different arguments, it will consider the selectors to be aggregates like the old behavior.	2018-03-19 15:01:15 -05:00
Jonathan A. Sternberg	c8b0c6e166	Update influxql to include the function type evaluators in the query package	2018-03-14 15:42:28 -05:00
Jonathan A. Sternberg	df7a660fb3	Modify the Select call to return a Cursor The Cursor returned will be capable of scanning rows into a structure. It replaces part of the function for why the Emitter existed. The Emitter would both join the resulting rows and then transform the values into a models.Row so it could be returned to the results. In the future, we will be able to use the Cursor directly to write out values which should be more memory efficient.	2018-03-09 12:47:41 -06:00
Jonathan A. Sternberg	733d842812	Turn the ExecutionContext into a context.Context Along with modifying ExecutionContext to be a context and have the TaskManager return the context itself, this also creates a Monitor interface and exposes the Monitor through the Context. This way, we can access the monitor from within the query.Select method and keep all of the limits inside of the query package instead of leaking them into the statement executor. An eventual goal is to remove the InterruptCh from the IteratorOptions and use the Context instead, but for now, we'll just assign the done channel from the Context to the IteratorOptions so at least they refer to the same channel.	2018-03-08 14:03:20 -06:00
Jonathan A. Sternberg	9e122eb1a4	Fix the implicit time range in a subquery The implicit time range for an interval is supposed to be now when no end is specified. In a subquery though, the interval doesn't exist and so it doesn't set the end time to now, but to the max time. Since the subquery qualifies as something that should have the implicit end time apply, this results in a query that runs slowly because it is filling in a bunch of unasked for intervals if a fill is specified. This hack adds the implicit end time if it sees the parent query's end time is set to the maximum available time. This is a temporary fix for this problem. The query compilation should perform these time range calculations in the compilation stage and the subqueries should use the compilation stage during execution instead of ignoring it. That work takes a lot more effort though and is more prone to running into unforeseen bugs. This fix introduces a subtle, but likely rare to run into bug. If the top level query specifies the maximum time as the end time and the subquery has an interval, the subquery should use the end time rather than now as the time range. With this hack, it will interpret it as an implicit time rather than an explicit one. This is unlikely to matter though.	2018-02-27 17:10:10 -06:00
Edd Robinson	98d584b63f	Use index for SHOW X meta queries When a meta query does not include a time component then it can be answered exclusively by the index. This should result in a much faster query execution that if the TSM engine was engaged. This commit rewrites the following queries such that they make use of the index where no time component is present: - SHOW MEASUREMENTS - SHOW SERIES - SHOW TAG KEYS - SHOW FIELD KEYS	2017-11-06 19:15:00 +00:00
Stuart Carnie	f3d45ba301	influxdata/influxdb/influxql -> influxdata/influxql	2017-10-30 14:40:26 -07:00
Stuart Carnie	c51ba16287	fixes #9007	2017-10-25 13:08:55 -07:00
Stuart Carnie	e9313876ab	EXPLAIN ANALYZE * Introduces EXPLAIN ANALYZE command, which produces a detailed tree of operations used to execute the query. introduce context.Context to APIs metrics package * create groups of named measurements * safe for concurrent access tracing package EXPLAIN ANALYZE implementation for OSS Serialize EXPLAIN ANALYZE traces from remote nodes use context.Background for tests group with other stdlib packages additional documentation and remove unused API use influxdb/pkg/testing/assert remove testify reference	2017-10-20 08:01:37 -07:00
Jonathan A. Sternberg	79092610c8	Support unsigned binary math in fields Field math works similar to condition evaluation, but not the exact same because we have more information to work with in field expressions than we do in conditional math because fields retain the information about their source while conditions do not. The main difference is that you cannot add an unsigned literal to the output of an integer iterator while you can inside of a condition. You can perform math on a positive integer literal to an unsigned iterator. Inside of the condition, we aren't sure if an integer is because of a literal or because of an iterator so we can't make that distinction.	2017-10-02 17:06:49 -05:00
Jonathan A. Sternberg	50d404e690	Initial implementation of explain plan It prints the statistics of each iterator that will access the storage engine. For each access of the storage engine, it will print the number of shards that will potentially be accessed, the number of files that may be accessed, the number of series that will be created, the number of blocks, and the size of those blocks.	2017-09-01 09:01:10 -05:00
Jonathan A. Sternberg	d2fcb893e1	Close the query shard group after the iterators are created Now, the prepared statement keeps the open resource and closing the open resource created from `Prepare` is the responsibility of the prepared statement. This also nils out the local shard mapping after it is closed to prevent it from being used after it is closed.	2017-08-28 09:46:11 -05:00
Jonathan A. Sternberg	8738e72cf1	Refactor the select call into three separate phases The first call is to compile the query. This performs some initial processing that can be done before having any access to the shards. At the moment, it does very little, but it's intended to be changed to eventually perform initial validations of the query and create an internal graph structure for the execution of the query. The second call is to prepare the query. This step has access to the shard mapper. Right now, it just maps the shards and rewrites the fields of the query for any wildcards. In the future, it is intended to do the above, but also to prepare the final directed acyclical graph that will execute the query. The third call is to select the query. This step is intended to create all of the iterators for processing the query. At the moment, much of the work intended for the second step is performed in the third step.	2017-08-25 07:50:13 -05:00
Jonathan A. Sternberg	421a91d480	Pass the select options to the shard mapper again	2017-08-24 09:55:02 -05:00
Jonathan A. Sternberg	96689e661e	Move query engine code from the statement executor to the query engine The statement rewriting logic should be in the query engine as part of preparing a query. This creates a shard mapper interface that the query engine expects and then passes it to the query engine instead of requiring the query to be preprocessed before being input into the query engine. This interface is (mostly) the same as the old interface, just moved to a different package.	2017-08-23 10:07:30 -05:00
Jonathan A. Sternberg	9a2357c2c0	Separate the query engine into a separate package This change provides a clear separation between the query engine mechanics and the query language so that the language can be parsed and dealt with separate from the query engine itself.	2017-08-16 13:38:43 -05:00

26 Commits (jdstrand/update-golang-jwt-1.10)