influxdb

Commit Graph

Author	SHA1	Message	Date
Edd Robinson	b12b0d12fb	Add regex benchmarks and fix existing approach	2016-10-25 11:10:03 +01:00
Edd Robinson	06d1226b9a	Rewrite exact match regexes to use tsdb index This commit adds support for replacing regexes with non-regex conditions when possible. Currently the following regexes are supported: - host =~ /^foo$/ will be converted into host = 'foo' - host !~ /^foo$/ will be converted into host != 'foo' Note: if the regex expression contains character classes, grouping, repetition or similar, it may not be rewritten. For example, the condition: name =~ /^foo\|bar$/ will not be rewritten. Support for this may arrive in the future. Regexes that can be converted into simpler expression will be able to take advantage of the tsdb index, making them significantly faster.	2016-10-25 11:10:03 +01:00
Jonathan A. Sternberg	19a61dbb44	Align binary math expression streams by time Also fills in missing values using the fill expression for any binary aggregation.	2016-10-18 13:31:13 -05:00
Mark Rushakoff	0ddb7ad842	Disallow derivative call with non-duration 2nd arg Previously, calling derivative with a non-duration second argument was allowed during parsing but would panic during execution due to a failed type conversion. This change ensures the second argument is a duration literal.	2016-10-17 16:20:53 -07:00
Jonathan A. Sternberg	3496c5b85f	Merge pull request #7442 from influxdata/js-5955-make-regex-work-on-field-keys-in-select Support using regexes to select fields and dimensions	2016-10-17 11:37:47 -05:00
Jonathan A. Sternberg	b60b4b371e	Support using regexes to select fields and dimensions The functionality works the same as wildcards, but this time, you can specify a regular expression. One limitation is that you can't specify whether you only want to select fields or tags. Since the regex can be changed to suit the person's needs, I don't currently think this is an issue.	2016-10-13 22:17:14 -05:00
Jonathan A. Sternberg	95859b8ab4	Remove accidentally added string support for the stddev call Strings would always return an empty string and stddev is meaningless when it comes to strings. This removes that functionality so strings don't automatically get picked up when using a wildcard.	2016-10-10 14:58:28 -05:00
Jonathan A. Sternberg	6afc2a77a5	Implement cumulative_sum() function The `cumulative_sum()` function can be used to sum each new point and output the current total. For the following points: cpu value=2 0 cpu value=4 10 cpu value=6 20 This would output the following points: > SELECT cumulative_sum(value) FROM cpu time value ---- ----- 0 2 10 6 20 12 As can be seen, each new point adds to the sum of the previous point and outputs the value with the same timestamp. The function can also be used with an aggregate like `derivative()`. > SELECT cumulative_sum(mean(value) FROM cpu WHERE time >= now() - 10m GROUP BY time(1m)	2016-10-07 10:11:53 -05:00
Michael Desa	f9b8129770	Add sample function to query language First Pass at implementing sample Add sample iterators for all types Remove size from sample struct Fix off by one error when generating random number Add benchmarks for sample iterator Add test and associated fixes for off by one error Add test for sample function Remove NumericLiteral from sample function call Make clear that the counter is incr w/ each call Rename IsRandom to AllSamplesSeen Add a rng for each reducer that is created The default rng that comes with math/rand has a global lock. To avoid having to worry about any contention on the lock, each reducer now has its own time seeded rng. Add sample function to changelog	2016-10-06 09:41:42 -07:00
Michael Desa	966e5503bf	Add fill(linear) to query language Clean up template for fill average Change fill(average) to fill(linear) Update average to linear in infuxql spec Add Integer Tests and associated fixes Update CHANGELOG for fill(linear)	2016-10-04 14:27:04 -07:00
Jason Wilder	a3fd12198e	Avoid extra allocations when evalating binary expressions	2016-09-29 13:18:38 -06:00
Jonathan A. Sternberg	3afdf3cd94	Merge tag 'v1.0.1'	2016-09-27 17:53:33 -05:00
Jonathan A. Sternberg	dbc4a9150f	Prevent manual use of system queries Manual use of system queries could result in a user using the query incorrect. Rather than check to make sure the query was used correctly, we're just going to prevent users from using those sources so they can't use them incorrectly.	2016-09-23 10:00:18 -05:00
Cory LaNou	acbf193640	add test to prevent future parsing regressions for time durations	2016-09-16 11:44:05 -05:00
Jason Wilder	a6d3e46893	Fix panic when parsing ms durations	2016-09-16 08:47:18 -06:00
Jonathan A. Sternberg	635ce337f0	Merge pull request #7304 from influxdata/js-remove-substatement-method Remove defunct `Substatement()` call	2016-09-15 08:32:40 -05:00
Jonathan A. Sternberg	c11cbc5f05	Merge pull request #7309 from influxdata/js-go-vet-for-1.7 Update source files to pass vet checks for go 1.7	2016-09-15 08:32:30 -05:00
Jonathan A. Sternberg	477d6231db	Update source files to pass vet checks for go 1.7 The vet checks for some files did not pass for go 1.7. As part of a preliminary start to making go 1.7 work with this software, go vet should pass. Also updated the gogo/protobuf dependency which fixed the code generator to work with go 1.7 too. Ran `go generate` on the entire repository to ensure every file was up to date.	2016-09-14 15:01:22 -05:00
Cory LaNou	71f0c7e1e9	return appropriate error if overflowing duration when parsing	2016-09-14 09:27:38 -05:00
Jonathan A. Sternberg	0b94f5dc1a	Skip past points at the same time in derivative call within a merged series The derivative() call would panic if it received two points at the same time because it tried to divide by zero. The derivative call now skips past these points. To avoid skipping past these points, use `GROUP BY *` so that each series is kept separated into their own series. The difference() call has also been modified to skip past these points. Even though difference doesn't divide by the time, difference is supposed to perform the same as derivative, but without dividing by the time.	2016-09-13 16:57:36 -05:00
Jonathan A. Sternberg	dbb8c5570c	Duplicate parsing bug in ALTER RETENTION POLICY Return an error when we encounter the same option twice in ALTER RETENTION POLICY and remove the `maxNumOptions` number from the parsing loop. The `maxNumOptions` number would need to be modified if another option was added to the parsing loop and it didn't correctly prevent duplicate options from being reported as an error anyway.	2016-09-13 15:56:13 -05:00
Jonathan A. Sternberg	aae88fc3c3	Support ON and use default database for SHOW commands Normalize all of the SHOW commands so they allow both using ON to specify the database and using the default database. Some commands would require one and some would require the other and it was confusing when using the query language. Affected commands: * SHOW RETENTION POLICIES * SHOW MEASUREMENTS * SHOW SERIES * SHOW TAG KEYS * SHOW TAG VALUES * SHOW FIELD KEYS	2016-09-13 15:36:59 -05:00
Jonathan A. Sternberg	394c13870b	Remove defunct `Substatement()` call	2016-09-13 14:17:31 -05:00
Jonathan A. Sternberg	4326da0820	Implement time math for lazy time literals When attempting to reduce the WHERE clause, the time literals had not been converted from string literals yet. This adds the functionality to have it handle the same time math when the time literal is still a string literal.	2016-09-09 13:34:56 -05:00
Jonathan A. Sternberg	04c59b8941	Fix the dollar sign so it properly handles reserved keywords The dollar sign would sometimes be accepted as whitespace if it was immediately followed by a reserved keyword or an invalid character. It now reads these properly as a bound parameter rather than ignoring the dollar sign.	2016-09-02 15:32:46 -05:00
Jonathan A. Sternberg	4ff0b10210	Merge pull request #7139 from influxdata/js-7137-show-tag-values-string-method Properly output the SHOW TAG VALUES command so it can be reparsed	2016-09-01 10:19:19 -05:00
Jonathan A. Sternberg	dc2527ce86	Merge branch '1.0'	2016-08-31 14:45:57 -05:00
Jonathan A. Sternberg	23f2d50ecb	Use defaults from `meta` package for `CREATE DATABASE` Instead of having the parser set the defaults, the command will set the defaults so that the constants for that are actually used. This way we can also identify which things the user provided and which ones we are filling with default values. This allows the meta client to be able to make smarter decisions when determining if the user requested a conflict or if the requested capabilities match with what is currently available. If you just say `CREATE DATABASE WITH NAME myrp`, the user doesn't really care what the duration of the retention policy is and just wants to use the default. Now, we can use that information to determine if an existing retention policy would conflict with what the user requested rather than returning an error if a default value ever gets changed since the meta client command can communicate intent more easily.	2016-08-30 13:23:49 -05:00
Nathaniel Cook	888dc8cbd2	Merge pull request #7234 from influxdata/nc-influxql-readme Update Influxql Readme	2016-08-29 13:09:34 -06:00
Jonathan A. Sternberg	f67558c2a7	Merge pull request #7236 from influxdata/js-7220-revert-limit-shard-concurrency Revert "limit shard concurrency"	2016-08-29 13:41:46 -05:00
Nathaniel Cook	3ab4e9fa1d	update InfluxQL readme to reflect current code	2016-08-29 12:33:55 -06:00
Jonathan A. Sternberg	c05c7f6360	Revert "limit shard concurrency" This reverts commit `6c7d56d4bc`.	2016-08-29 12:39:52 -05:00
Jonathan A. Sternberg	b8a70105aa	Fix alter retention policy when all options are used We added `SHARD DURATION` as an extra option, but forgot to increase the maximum number of allowable options from 3 to 4. So if 4 options were used, the last one was ignored. This was commonly `DEFAULT`, but it could have been any of the options.	2016-08-26 11:25:18 -05:00
Jonathan A. Sternberg	8b234546a8	Merge pull request #7204 from influxdata/1.0 Merge 1.0 branch to master	2016-08-25 15:20:30 -05:00
Jonathan A. Sternberg	10029caf2f	Support negative timestamps in the query engine Negative timestamps are now supported. We also now refuse two nanoseconds that are at the edge of the minimum time window. One of the nanoseconds we do not accept is because we need MinInt64 to be used for some internal comparisons in the TSM engine and it was causing an underflow when we subtracted one from the minimum time. The second is so we can have one minimum time that signifies the default minimum that nobody can write to (so we can implicitly rewrite the timestamp on aggregate queries) but still use the explicit timestamp if it is given to us by the user. We aren't able to tell the difference between if the user provided it or if it was implicit without those values being different. If the default minimum time is used with an aggregate query, we rewrite the time to be the epoch for backwards compatibility since we believe that's more important than supporting that extra nanosecond.	2016-08-25 12:52:41 -05:00
Jonathan A. Sternberg	993ac1ca2e	Remove confusing comment and unnecessary continue	2016-08-23 19:43:18 -05:00
Ashish Gaurav	4e17f9bb13	add mode() function & tests	2016-08-23 19:31:41 -05:00
Edd Robinson	90ff713f21	Fix base64 encoding issue in stats Fixes #7177.	2016-08-22 15:21:31 +01:00
Ben Johnson	8aa224b22d	reduce memory allocations in index This commit changes the index to point to index data in the shards instead of keeping it in-memory on the heap.	2016-08-16 14:09:00 -06:00
Jonathan A. Sternberg	f0f7d91d6c	Properly output all commands so they can be reparsed The commands fixed: * SHOW TAG VALUES * SHOW STATS * SHOW DIAGNOSTICS	2016-08-15 15:04:51 -05:00
Jonathan A. Sternberg	87f7c66b8a	Merge pull request #7119 from influxdata/js-create-database-use-defaults Use defaults from `meta` package for `CREATE DATABASE`	2016-08-11 10:34:22 -05:00
Jonathan A. Sternberg	32d10de94f	Check in between query statements to see if the query was interrupted This allows a long series of uninterruptible statements to still be interrupted for a long running query that might do something like create or drop many databases.	2016-08-10 15:36:02 -05:00
Jonathan A. Sternberg	ab049d7f0a	Support mixed duration units It is now possible to use a mixed duration unit like `1h30m`. The duration units can be in whatever order as long as they are connected to each other. There is a change to the scanner. A token such as `10x` will be scanned as a duration literal, but will then fail to parse as an invalid duration. This should not be a breaking change as there is no situation where `10m10` was a valid order of tokens for the parser. Fixes #3634.	2016-08-10 13:34:19 -05:00
Jonathan A. Sternberg	3959656968	Add additional statistics to query executor The query executor would only store the number of active queries and the query duration so it was impossible to determine how many queries were actually executed during that timeframe because quick queries would be gone before the call to gather statistics was made. This adds two new statistics so track when queries start and when queries finish and doesn't decrement the counter so the number of executed queries can be obtained using `derivative()` and `difference()`.	2016-08-10 11:35:06 -05:00
Jonathan A. Sternberg	530b00bd76	Use defaults from `meta` package for `CREATE DATABASE` Instead of having the parser set the defaults, the command will set the defaults so that the constants for that are actually used. This way we can also identify which things the user provided and which ones we are filling with default values. This allows the meta client to be able to make smarter decisions when determining if the user requested a conflict or if the requested capabilities match with what is currently available. If you just say `CREATE DATABASE WITH NAME myrp`, the user doesn't really care what the duration of the retention policy is and just wants to use the default. Now, we can use that information to determine if an existing retention policy would conflict with what the user requested rather than returning an error if a default value ever gets changed since the meta client command can communicate intent more easily.	2016-08-09 12:00:06 -05:00
Ben Johnson	55b3e63ced	concurrent series limit This commit fixes the `MaxSelectSeriesN` limit which was broken by the implementation of lazy iterators. The setting previously limited the total number of series but the new implementation limits the concurrent number of series being processed.	2016-08-09 08:58:01 -06:00
Jonathan A. Sternberg	b98763a3d8	Merge pull request #7118 from influxdata/js-go-generate go generate on every package to ensure they are generated with the correct dependency	2016-08-08 09:02:32 -05:00
Jonathan A. Sternberg	4cdfc3280d	Move the CQ interval by the group by offset This will make the period selected by the CQ system work correctly for a query with an offset.	2016-08-05 14:39:52 -05:00
Jonathan A. Sternberg	ed2f81357f	go generate on every package to ensure they are generated with the correct dependency	2016-08-05 14:35:07 -05:00
Ben Johnson	6c7d56d4bc	limit shard concurrency This commit limits queries to only process one shard at a time. However, within a shard, multiple series can still be processed in parallel. Shard iterators are lazily instantiated during query execution to limit the amount of memory a given query uses.	2016-08-05 09:45:57 -06:00
Jonathan A. Sternberg	a62fbcde42	Merge pull request #7113 from agaurav/Test/medianBenchmarks benchmark tests for median aggrergator	2016-08-04 09:53:42 -05:00
Ashish Gaurav	70c8c021ac	added benchmark tests for median aggrergator (Package: influxql,influxql_test)	2016-08-04 08:02:19 +05:30
Jason Wilder	19546faab3	Release cursor/iterator resources aggressively	2016-08-03 00:21:39 -06:00
Jonathan A. Sternberg	2c739c0532	Fix parseFill to check for fill ident before attempting to parse an expression The previous parseFill would try to parse an expression and only unscan one token when it failed. This caused it to not put back the correct number of tokens with some expression. Now it has been modified to check for the fill ident ahead of time and then use ParseExpr() to parse the call. If the expression fails to parse into a call, it will send an error instead of trying to continue with an invalid parser state. Fixes #6543.	2016-08-01 11:38:44 -05:00
Cory LaNou	1117526873	remove IF EXISTS/IF NOT EXISTS from influxql language	2016-07-29 12:58:05 -05:00
Jason Wilder	0b60862248	Close drained iterators Aux and condition iterators where not closed which could cause TSM files to leak if they were queried against while a compaction was running.	2016-07-28 20:25:37 -06:00
Jonathan A. Sternberg	94e07158d5	Merge pull request #7009 from influxdata/js-5750-wildcard-support-for-aggregates Support wildcards in aggregate functions	2016-07-28 18:13:59 -05:00
Jonathan A. Sternberg	23ef9484a4	Support wildcards in aggregate functions	2016-07-28 17:56:32 -05:00
Jonathan A. Sternberg	86bd97f3b9	Switch SHOW MEASUREMENTS and SHOW TAG VALUES to directly access the tsdb.Store The `SHOW MEASUREMENTS` and `SHOW TAG VALUES` cannot go through the query engine to get the speed they need. They also only need access to the database index and do not need access to specific shards. This removes the query rewriting that was done to turn these two queries into a select statement and reimplements them inside of the coordinator as an interface on the TSDBStore.	2016-07-28 17:38:11 -05:00
Ben Johnson	5df6f75545	check for nil iterator creation This commit checks if an iterator is `nil` before adding to an iterator list during creation.	2016-07-27 13:54:56 -06:00
Jonathan A. Sternberg	7b8bcfb173	Add option to suppress logging query statements in the query executor	2016-07-26 13:53:45 -05:00
Jonathan A. Sternberg	837a9804cf	Refactoring the monitor service to avoid expvar Truncate the time interval output of the monitor service to be on even time intervals rather than on every minute based on the start time. This normalizes the output from the monitor service.	2016-07-07 11:13:58 -05:00
Jonathan A. Sternberg	bb060a60c6	Fix regex binary encoding for a measurement Previously, it encoded the text representation of the regex literal which included the surrounding slashes used in the query language. The binary encoding should only include the exact string used to create the regular expression.	2016-07-05 11:39:41 -05:00
Jonathan A. Sternberg	3bd51d3537	Fix fill(previous) when used with math operators	2016-06-29 09:54:12 -05:00
Jonathan A. Sternberg	497db2a6d3	Removing dead code from every package except influxql The tsdb package had a substantial amount of dead code related to the old query engine still in there. It is no longer used, so it was removed since it was left unmaintained. There is likely still more code that is the same, but wasn't found as part of this code cleanup. influxql has dead code show up because of the code generation so it is not included in this pruning.	2016-06-20 22:41:07 -05:00
Jonathan A. Sternberg	252cde1e81	Fix golint errors for the influxql package	2016-06-20 08:51:02 -05:00
Jonathan A. Sternberg	8e1b036b0a	Modify the max nanosecond time to be one nanosecond less The highest time represented by a nanosecond needs to be used for an exclusive range, so the maximum time needs to be one less than the possible maximum number of nanoseconds representable by an int64 so that we don't lose a point at that one time. Previously worked in the open source version because the timestamp used for finding a shard would be truncated by the retention policy so the lookup time didn't run into this edge case because it didn't rest on the truncation boundary. Since that point didn't really belong in that shard group and was placed there by mistake, it's best to fix this bug since the timestamp used to create the shard group should be capable of retrieving it.	2016-06-16 12:15:41 -05:00
Jonathan A. Sternberg	9837de793c	Support regex and other operations for selecting the key in SHOW TAG VALUES This adds support for using regex expressions in SHOW TAG VALUES when selecting the key. Also supporting the `!=` operation for the comparison. Now you can do any of the following: SHOW TAG VALUES WITH KEY != "region" SHOW TAG VALUES WITH KEY =~ /region/ SHOW TAG VALUES WITH KEY !~ /region/ It also adds a new SetLiteral AST node that will potentially be used in the future to allow set operations for other comparisons in the future. Fixes #4532.	2016-06-13 10:03:14 -05:00
Ben Johnson	48f1a6d858	Merge pull request #6820 from benbjohnson/http-query-node-id Add NodeID to execution options	2016-06-10 13:10:15 -06:00
Jonathan A. Sternberg	bdd15be6cc	Merge pull request #6806 from influxdata/js-separate-task-manager Refactor the TaskManager to be separate from the QueryExecutor	2016-06-10 13:21:46 -05:00
Jonathan A. Sternberg	9db82e6bf0	Switch ExecutionContext to be passed by value	2016-06-10 12:31:51 -05:00
Jonathan A. Sternberg	a6147fa685	Public method to return query information for running queries	2016-06-10 12:31:48 -05:00
Jonathan A. Sternberg	55973d2815	Separate the task manager from the query executor The task manager now acts as its own statement executor so that a custom statement executor can perform custom actions for KillQueryStatement and ShowQueriesStatement.	2016-06-10 12:30:51 -05:00
Ben Johnson	7d4bea7153	add node id to execution options This commit changes the `ExecutionOptions` and `SelectOptions` to allow a `NodeID` for specifying an exact node to query against.	2016-06-10 09:20:44 -06:00
Jonathan A. Sternberg	2fa6d306c2	Add option to KILL QUERY to kill on a specific host Option only applies to clustering.	2016-06-07 16:48:07 -05:00
Jonathan A. Sternberg	b972c220aa	Merge pull request #6757 from influxdata/js-refactor-execute-query Refactor ExecuteQuery to take options as a struct	2016-06-07 10:35:52 -05:00
Jonathan A. Sternberg	b8e22d9d79	Merge pull request #6586 from influxdata/js-3733-rename-default-retention-policy Modify the default retention policy name and make it configurable	2016-06-06 15:05:29 -05:00
Nathaniel Cook	128b07e352	Merge pull request #6764 from influxdata/nc-holt-winters-nans Check for NaNs from Holt-Winters and do not return them	2016-06-03 11:17:11 -06:00
Nathaniel Cook	01aa9073b6	check for NaNs and do not return them	2016-06-03 10:51:46 -06:00
Ben Johnson	3fa5cefa32	add Iterators.Merge()	2016-06-03 10:27:17 -06:00
Jonathan A. Sternberg	1e84b22407	Update SHOW TAG VALUES to use a fast dedupe iterator Include a benchmark test for the fast dedupe iterator.	2016-06-02 22:03:59 -05:00
Nathaniel Cook	ce74fe0b06	count and sum return 0 for empty intervals	2016-06-01 15:53:23 -06:00
Jonathan A. Sternberg	71c8e9e567	Refactor ExecuteQuery to take options as a struct This allows us to add additional options to ExecuteQuery without creating parameter bloat. Removing the unused Series structs. Their necessity was removed by a previous commit, but the structs were not removed yet. Add another type of interrupt iterator that monitors the interrupt channel and calls `Close()` on the iterator when the interrupt happens. It will primarily be used for asynchronously closing the ReaderIterator, but it will only close the read side of the connection properly. More work needs to be done to allow closing the write side efficiently.	2016-06-01 12:30:52 -05:00
Joe LeGasse	f2fd988ab9	Delay parsing of date/time strings until needed The current code would compare every string literal it crossed and tried to coerce them to time literals if the _looked_ like date/time strings. The only time the TimeLiteral was used is when comparing to the the 'time' value in a where clause. This change moves the string parsing code until we attempt to compare 'time' to a string, at which point we know we need/want a TimeLiteral, and not just an ordinary string. Fixes #6727	2016-05-27 09:43:45 -04:00
Nathaniel Cook	2927fee2d1	update comment on MaxTime	2016-05-27 11:07:50 +01:00
Nathaniel Cook	9314ae8e80	fix overflow in window iterator and holt winters roundTime	2016-05-27 11:07:50 +01:00
Edd Robinson	f4fc905fa9	Reject timestamps too far in future	2016-05-27 11:07:48 +01:00
Mark Rushakoff	fed67ffdf0	Fix typo in parse error	2016-05-24 10:47:51 -07:00
Edd Robinson	ef4b36c063	Merge pull request #6695 from influxdata/er-trace Add more context to recovered panics	2016-05-24 15:16:51 +01:00
Jonathan A. Sternberg	baaa782c95	Modify the default retention policy name and make it configurable The default retention policy name is changed to "autogen" instead of "default" since it ends up being ambiguous when we tell a user to check the default retention policy, it is uncertain if we are referring to the default retention policy (which can be changed) or the retention policy with the name "default". Now the automatically generated retention policy name is "autogen". The default retention policy is now also configurable through the configuration file so an administrator can customize what they think should be the default. Fixes #3733.	2016-05-24 09:51:23 -04:00
Jonathan A. Sternberg	32e42b93ae	Merge pull request #6705 from influxdata/js-6701-duplicate-points-with-select Filter out sources that do not match the shard database/retention policy	2016-05-24 09:48:31 -04:00
Edd Robinson	853a95e0b3	Reduce allocations on Query's io.Stringer implementation A query's String method is called multiple times per query. This commit ensures all calls to query.String share use of a strings.NewReplacer. This approximately halves the number of allocations for the benchmarked query.	2016-05-24 14:27:38 +01:00
Jonathan A. Sternberg	5e7e0bd19b	Filter out sources that do not match the shard database/retention policy If you use a statement like this: SELECT value FROM one..cpu, two..cpu It will access both the `one` and `two` databases as if you had selected the `cpu` measurement twice for both of them. Updated the `tsdb.Shard` create iterator function to filter out any sources that do not apply to that shard so this duplication doesn't happen. Fixes #6701.	2016-05-23 17:05:33 -04:00
David Norton	9d5c4a0f41	fix #6702 : return correct required privileges Change SelectStatement to return required privileges for all Sources.	2016-05-23 16:44:23 -04:00
Edd Robinson	67e793e512	Add more context to recovered panics	2016-05-23 12:24:05 +01:00
Edd Robinson	40732a35d0	Merge pull request #6660 from influxdata/er-vet Fix vet issues	2016-05-20 11:12:25 +01:00
Nathaniel Cook	6ed0d94343	Add Holt-Winters forecasting method.	2016-05-19 09:24:56 -06:00
Jonathan A. Sternberg	451a5205ef	Support bound parameters in the parser The parser can be passed a map of keys to literal values to be replaced into the query. Parameters are preceded by a dollar sign (`$`). If a parameter key is missing, an error is thrown by the parser. Fixes #2926.	2016-05-18 20:10:15 -04:00
Edd Robinson	f680ab0f0d	Fix vet issues	2016-05-18 13:34:11 +01:00
Jonathan A. Sternberg	c01cda58e8	Print the query executor's stack trace on a panic to the log Fixes #6644.	2016-05-17 09:23:55 -04:00
Jonathan A. Sternberg	23f6a706bb	Support cast syntax for selecting a specific type Casting syntax is done with the PostgreSQL syntax `field1::float` to specify which type should be used when selecting a field. You can also do `field1::field` or `tag1::tag` to specify that a field or tag should be selected. This makes it possible to select a tag when a field key and a tag key conflict with each other in a measurement. It also means it's possible to choose a field with a specific type if multiple shards disagree. If no types are given, the same ordering for how a type is chosen is used to determine which type to return. The FieldDimensions method has been updated to return the data type for the fields that get returned. The SeriesKeys function has also been removed since it is no longer needed. SeriesKeys was originally used for the fill iterator, but then expanded to be used by auxiliary iterators for determining the channel iterator types. The fill iterator doesn't need it anymore and the auxiliary types are better served by FieldDimensions implementing that functionality, so SeriesKeys is no longer needed. Fixes #6519.	2016-05-16 12:08:29 -04:00
Jonathan A. Sternberg	fb7be8ce4e	Fix decodeTags to decode tags correctly from encodeTags encodeTags would encode the tags by outputting every key followed by every value in alphabetical order. decodeTags would try to read this in an old format that printed tags in key/value order. This fix matches decodeTags to match the same format encodeTags outputs.	2016-05-11 16:21:39 -04:00
Jonathan A. Sternberg	733a17d9e9	Update SHOW FIELD KEYS to return the field type with the field key Fixes #3451.	2016-05-10 13:16:57 -04:00
Jonathan A. Sternberg	700431d6de	Merge pull request #6548 from influxdata/js-remove-unused-reserved-keyword Remove unused FORCE and INNER keywords	2016-05-10 10:14:14 -04:00
Ben Johnson	078e561820	parallelize iterators	2016-05-09 10:25:30 -06:00
Ben Johnson	fdf34d4356	move call iterator to series level This commit moves the `CallIterator` to wrap the individual series instead of wrapping a shard. This allows individual points to be aggregated before being merged. This will cause a small increase in memory usuage per series but it shows a 20% decrease in query time when there are a moderate number of points per series.	2016-05-05 09:59:03 -06:00
Jonathan A. Sternberg	18fa63fdac	Remove unused FORCE and INNER keywords	2016-05-03 13:21:14 -04:00
Ben Johnson	417df18396	Merge pull request #6533 from benbjohnson/optimize-show-series Optimize SHOW SERIES	2016-05-03 09:15:21 -06:00
Jonathan A. Sternberg	a2a5c32770	Merge pull request #6539 from influxdata/js-6495-fix-aggregates-with-empty-shards Fix aggregate returns when data is missing from some shards	2016-05-03 10:56:21 -04:00
Ben Johnson	49eb3b8d04	optimize show series iterator This commit changes the `SeriesIterator` to process one measurement at a time and uses a `floatFastDedupeIterator` to avoid point encoding during deduplication.	2016-05-03 08:52:44 -06:00
Jonathan A. Sternberg	d6d0addcec	Fix aggregate returns when data is missing from some shards If a shard is empty for a specific field and the field type is something other than a float, a nil iterator would get returned from one of the empty shards and cause the combined iterators to be cast to the float type and all other iterator types to be discarded (or for integers, to be cast). This is rare since most aggregates don't accept strings or booleans, but for queries like: SELECT distinct(string) FROM mydata It would result in nothing getting returned if one of the shards didn't have a value for `string`. This change modifies the query engine to return nil for the shards instead of a fake iterator and then to only use the fake iterator if the final aggregate iterator is nil (meaning that no iterators could be constructed for the field from any shard). Fixes #6495.	2016-05-03 10:41:22 -04:00
Jonathan A. Sternberg	64556e4f8e	Support offset argument in the GROUP BY time(...) call An offset of `time(1m, now())` will anchor the offset to the current time of the query. The default offset is `0s` which is the current default anyway. This fixes #2074 by making time zone offset support unnecessary. Time comparisons can use timezones inside of the time clause and the offset needed for non-hour timezone differences can be used as part of the offset argument.	2016-05-02 14:02:35 -04:00
Jonathan A. Sternberg	8f876d0f3c	Merge pull request #6510 from influxdata/js-5890-return-time-with-selector Return the time with a selector when there is no group by interval	2016-04-29 17:39:20 -04:00
Jonathan A. Sternberg	2e3c137404	Merge pull request #6514 from influxdata/js-6472-panic-on-count-distinct Fix validation to catch a string used in `count(distinct())`	2016-04-29 16:07:59 -04:00
Jonathan A. Sternberg	e1b384b287	Merge pull request #6516 from influxdata/js-6494-support-booleans-for-min-max Support booleans for min() and max()	2016-04-29 16:05:13 -04:00
Ben Johnson	cd0ce3e5bd	Fix DELETE Spec This changes the `DELETE SERIES` specification to `DELETE`. /cc @gunnaraasen	2016-04-29 13:12:55 -06:00
Jonathan A. Sternberg	a05e2b164e	Support booleans for min() and max() Fixes #6494.	2016-04-29 14:56:22 -04:00
Jonathan A. Sternberg	ff3ee909de	Fix validation to catch a string used in `count(distinct())` Also removes the functions `HasSimpleCount()` and `HasCountDistinct()` as they are no longer useful. They had a small role in validation that has now been moved into `validateAggregates()`. Fixes #6472.	2016-04-29 13:46:18 -04:00
Jonathan A. Sternberg	6748155f8b	Add NAME token to the list of reserved keywords Fixes #6489.	2016-04-29 13:20:39 -04:00
Jonathan A. Sternberg	043bd6031e	Return the time with a selector when there is no group by interval The time of the point will be returned with a selector when there is no group by interval and when there is only one selector. Any other conditions will return the start time of the interval. Fixes #5890.	2016-04-29 11:27:26 -04:00
Jonathan A. Sternberg	6f61c0ea4a	Add POST /query endpoint and warning messages for using GET with write operations In order to follow REST a bit more carefully, all write operations should go through a POST in the future. We still allow read operations through either GET or POST (similar to the Graphite /render endpoint), but write operations will trigger a returned warning as part of the JSON response and will eventually return an error. Also updates the Golang client libraries to always use POST instead of GET. Fixes #6290.	2016-04-29 09:00:23 -04:00
David Norton	8c09f19c9a	fix #6505 : update InfluxQL README.md for regex_lit	2016-04-28 22:56:33 -04:00
lvheyang	751a81f414	fix show statement's rewriting bug	2016-04-29 08:23:50 +08:00
Ben Johnson	f7af787aef	add DELETE query support This commit adds query language support for deleting series with a `DELETE` query.	2016-04-27 15:16:23 -06:00
Ben Johnson	fd840f242c	Merge pull request #6463 from benbjohnson/optimize Reduce interrupt iterator checks & field access	2016-04-26 13:16:28 -06:00
Ben Johnson	1b6524a7bf	reduce interrupt iterator checks The interrupt iterator currently introduces a non-trivial amount of overhead to queries by checking for interrupts every 256 points. This commit adjusts that check to every 5000 points. There are also several places where nested field access has been adjusted to minimize field lookups.	2016-04-26 12:16:07 -06:00
Ben Johnson	291dd08cd4	remove SHOW SERVERS & DROP SERVER This commit removes support for `SHOW SERVERS` and `DROP SERVER` from the `influxql` package. It also removes extraneous cluster testing code from `cmd/influxd/run`. Fixes #6465	2016-04-26 10:28:42 -06:00
Jonathan A. Sternberg	9b6d5dfd78	Allow derivative() function to be used with ORDER BY desc The derivative function had an arbitrary limitation that would cause it to set the value to zero if the previous value was after the next value. This caused all `ORDER BY desc` queries with `derivative()` to always return zero values. Fixes #4675.	2016-04-22 12:20:32 -04:00
Jonathan A. Sternberg	e28d16cfcf	Log slow queries if they pass a configurable threshold Fixes #6429.	2016-04-22 12:04:15 -04:00
Jonathan A. Sternberg	c77cbb8389	Merge pull request #6375 from influxdata/js-6118-derivative-on-multiple-fields Remove restrictions on where derivative can be used entirely	2016-04-22 12:02:01 -04:00
Jonathan A. Sternberg	62c66b788c	Improve query sanitization to prevent a password leak in the logs Sanitizing is now done through pattern matching rather than parsing the query and replacing the password in the query. This prevents accidentally redacting the wrong part of a query and revealing what the password is through association. Fixes #3883.	2016-04-22 11:27:09 -04:00
Jonathan A. Sternberg	22a0505559	Remove restrictions on where derivative can be used entirely This removes the previous restrictions that kept derivative as only capable of being used in a single field and only at the top level. This lets users determine how they want to use derivative more freely and opens up the possibility of also using math between derivatives. This may open up some problems when it comes to math between derivatives as timestamps may not match correctly. That is likely a problem related to any binary math to begin with though and can probably be ignored by the derivatives. I'm also not sure it makes sense to perform any math between a derivative and a difference or perform math between a derivative and a mean. Fixes #6118.	2016-04-22 11:17:23 -04:00
Stephen Gutekanst	9dc09c5257	Make logging output location more programmatically configurable (#6213 ) This has various benefits: - Users embedding InfluxDB within other Go programs can specify a different logger / prefix easily. - More consistent with code used elsewhere in InfluxDB (e.g. services, other `run.Server.` fields, etc). - This is also more efficient, because it means `executeQuery` no longer allocates a single `log.Logger` each time it is called.	2016-04-20 21:07:08 +01:00
thbourlove	0e3964d8c2	fix panic in transform iterator on division if left point is nil in division expr, the influxd server would panic. Now check left point before use it.	2016-04-21 00:07:20 +08:00
Jonathan A. Sternberg	dbbb742c40	Fix the elapsed tests which broke due to a merge A testing function had its signature modified. The new elapsed test code was not modified for the PR and fails to compile now that the PR was merged.	2016-04-20 11:37:36 -04:00
Jonathan A. Sternberg	c8c38e15cd	Merge pull request #6386 from influxdata/js-iterator-next-error Modify all of the iterators to allow returning an error on Next()	2016-04-20 10:39:53 -04:00
Nathaniel Cook	465f5a375f	add elapsed function	2016-04-19 12:54:54 -06:00
Jonathan A. Sternberg	09c46a451a	Sort the series keys inside of a tag set so the output is deterministic The series keys within a tag set were previously not sorted which would cause the output to be non-deterministic. This sorts the output series by their keys so it has a consistent output especially when using limits. Fixes #3166.	2016-04-18 17:45:31 -04:00
Jonathan A. Sternberg	7ec2a991d5	Modify all of the iterators to allow returning an error on Next() This also switches the remaining iterators to be lazy so they can return errors properly. They needed to be converted to lazy initialization anyway, which has the side effect of making it much easier for us to propagate the underlying error during initialization. Updated the Emitter to return errors when it cannot read properly from the iterators.	2016-04-18 11:17:55 -04:00
Jonathan A. Sternberg	34544d2c4b	Fix CREATE RETENTION POLICY parsing so it doesn't consume tokens it shouldn't The optional sections of the command consumed the semicolon token and didn't put it back for the outer loop. The code shouldn't explicitly check for a semicolon or EOF anyway, so these checks were removed and the token gets unscanned if it doesn't match the optional token that the parser is looking for. Fixes #6398.	2016-04-17 02:41:17 -04:00
Jonathan A. Sternberg	86046bb2d0	Implement derivatives across intervals for aggregate queries For aggregate queries, derivatives will now alter the start time to one interval behind and will use that interval to find the derivative of the first point instead of giving no value for that interval. Null values will still be discarded so if the interval before the one you are querying is null, then it will be discarded like if it were in the middle of the query. You can use `fill(0)` to fill in these values. This does not apply to raw queries yet. Also modified the derivative and difference aggregates to use the stream iterator instead of the reduce slice iterator for space efficiency. Fixes #3247. Contributes to #5943.	2016-04-15 18:16:08 -04:00
Jonathan A. Sternberg	f4ce20c92a	Merge pull request #6394 from influxdata/js-integer-literal-duration-math Allow time math with integer timestamps	2016-04-15 18:15:43 -04:00
Jonathan A. Sternberg	a85188bd61	Allow time math with integer timestamps	2016-04-15 17:36:37 -04:00
Jonathan A. Sternberg	425e315f45	Detect when a time literal will overflow or underflow the query engine Fixes #3369.	2016-04-15 17:35:54 -04:00
Jonathan A. Sternberg	207a4b9d07	Recover from a panic during query execution	2016-04-15 11:13:54 -04:00
Jonathan A. Sternberg	9c2e430a72	Removing no longer used code in the query engine	2016-04-15 10:22:34 -04:00
Jonathan A. Sternberg	42b68d915f	Return a deprecated message when IF NOT EXISTS is used The deprecated message is now attached to a new attribute returned with the results. This message can then be read by clients to warn a user about upcoming changes to the query engine. The `influx` client has already been modified to read this message and print it out for every format except CSV. The first warning message is a deprecated message about removing `IF NOT EXISTS` from `CREATE DATABASE`. The message will also be printed to the server log. Fixes #5707.	2016-04-15 09:17:59 -04:00
Jonathan A. Sternberg	03bbff1a65	Merge pull request #6380 from influxdata/js-6379-validate-percentile-args Added checking for the first argument in percentile()	2016-04-14 15:57:07 -04:00
Ben Johnson	4f381d03d7	add double buffer on chan iterator This commit changes the channel iterators to use a double buffer to reduce allocations. The caller of `Iterator.Next()` must copy out the point before calling `Next()` again.	2016-04-14 13:52:13 -06:00
Jonathan A. Sternberg	1b6ffa1d2d	Merge pull request #6363 from influxdata/js-6292-percentile-as-selector Allow percentile to be used as a selector	2016-04-14 12:28:16 -04:00
Jonathan A. Sternberg	d36698ebd0	Added checking for the first argument in percentile() The `percentile()` call previously did not validate that the first argument was a variable reference and that would let an invalid query slip by that would panic the query engine. Added checking for this case and also included test cases for the other calls that require a variable reference as the first argument. Fixes #6379.	2016-04-14 12:12:31 -04:00
Jonathan A. Sternberg	9d01f3aa56	Allow the implicit time field to be renamed Fixes #6296.	2016-04-14 11:04:06 -04:00
Jonathan A. Sternberg	66a599825b	Allow percentile to be used as a selector Fixes #6292.	2016-04-13 13:29:14 -04:00
Jonathan A. Sternberg	50bd78433c	Merge pull request #6291 from influxdata/js-6261-optimize-distinct Optimize the distinct call	2016-04-12 17:09:10 -04:00
Nathaniel Cook	6ae62e9644	update Percentile to preserve Aux fields since its a selector	2016-04-12 13:34:50 -06:00
Ben Johnson	f7f35affd2	add distinct iterator benchmark	2016-04-12 13:22:03 -06:00
Jonathan A. Sternberg	6708d0c439	Optimize the distinct call Change distinct so it uses a custom reducer that keeps internal state instead of requiring all of the points to be kept as a slice in memory. Fixes #6261.	2016-04-11 18:29:50 -04:00
Ben Johnson	525e22c92b	tsm1 query engine alloc reduction This commit makes a number of performance improvements to reduce allocations during query execution. Several objects and buffers are now reused across the components to avoid allocations. Previously a simple `count(value)` query across 1M points would require 26,000+ allocations. After the changes in this commit that number has been reduced to 88.	2016-04-11 14:50:59 -06:00
Jonathan A. Sternberg	ca534bf09f	Fix the subset method on tags If a point had no tags at all and was asked for the subset of tags with at least one key, it would return a new set of tags that was completely empty. In contrast, if the point had any tags at all, it would return a set of tags with the tag value being an empty string. This lead to a point with no tags being treated differently than a point with at least one tag. Fixing this so the tag value will always be an empty string for consistency. A missing tag should always be empty.	2016-04-11 12:00:55 -04:00
Jonathan A. Sternberg	fa5a38dcd4	Fixing aggregate queries with no GROUP BY to include the end time Queries with a time constraint but no group by would not include the final point from the underlying iterator. Fixes #6229.	2016-04-07 14:11:28 -04:00
Jonathan A. Sternberg	61aa2980ce	Merge pull request #6255 from influxdata/js-6248-strings-in-select-panic Throw an error when an invalid expression is used with aux iterators	2016-04-07 12:32:52 -04:00
Jonathan A. Sternberg	d176c8babb	Throw an error when an invalid expression is used with aux iterators The following query was fixed previously: SELECT 'value' FROM cpu This ended up hitting the `buildExprIterator()` code path and was handled properly. But this query: SELECT 'value', value FROM cpu This took a different code path that would trigger a panic because it triggered a panic instead of an error condition. This code path has now been modified to trigger an error instead of a panic. Fixes #6248.	2016-04-07 11:01:45 -04:00
Edd Robinson	dfee15bd19	Scopes influxql Protobuf package to prevent clashes Fixes #6211. In Go-land packages with the same name, e.g., internal, do not clash with each other when they're in different parts of the project. However with protobufs definitions will clash if they share the same package name. This commit renames the influxql protobuf package to `influxql` to avoid a clash with a message definition in another protobuf package called internal. Go package aliases allow us to continue to refer to the internal package as `internal` rather than `influxql`.	2016-04-05 13:36:47 +01:00
Jonathan A. Sternberg	37b63cedec	Cleanup QueryExecutor and split statement execution code The QueryExecutor had a lot of dead code made obsolete by the query engine refactor that has now been removed. The TSDBStore interface has also been cleaned up so we can have multiple implementations of this (such as a local and remote version). A StatementExecutor interface has been created for adding custom functionality to the QueryExecutor that may not be available in the open source version. The QueryExecutor delegate all statement execution to the StatementExecutor and the QueryExecutor will only keep track of housekeeping. Implementing additional queries is as simple as wrapping the cluster.StatementExecutor struct or replacing it with something completely different. The PointsWriter in the QueryExecutor has been changed to a simple interface that implements the one method needed by the query executor. This is to allow different PointsWriter implementations to be used by the QueryExecutor. It has also been moved into the StatementExecutor instead. The TSDBStore interface has now been modified to contain the code for creating an IteratorCreator. This is so the underlying TSDBStore can implement different ways of accessing the underlying shards rather than always having to access each shard individually (such as batch requests). Remove the show servers handling. This isn't a valid command in the open source version of InfluxDB anymore. The QueryManager interface is now built into QueryExecutor and is no longer necessary. The StatementExecutor and QueryExecutor split allows task management to much more easily be built into QueryExecutor rather than as a separate struct.	2016-04-04 13:27:17 -04:00
Jonathan A. Sternberg	43e3330480	Fix the reader iterator so it doesn't read the first point when creating the iterator	2016-04-01 17:31:28 -04:00
Jonathan A. Sternberg	c193bde61c	Throw an error when time is compared to an invalid literal A bigger refactor of these functions is needed to support #3290, but this will work for the more common case that someone uses double quotes instead of single quotes when surrounding a time literal. Fixes #3932.	2016-03-31 11:29:20 -06:00
Edd Robinson	9cd0bc65f5	Let SHARD DURATION be specified in isolation Fixed #6152.	2016-03-31 17:42:50 +01:00
Ben Johnson	b28c4db3d0	mark merge iterator as initialized This commit sets the `MergeIterator.init` flag after initialization. Previously this would generate a new heap on every call to `Next()` which caused some aggregate queries to slow by ~10,000%.	2016-03-31 09:56:23 -06:00
Jonathan A. Sternberg	eb467d8d7f	Merge pull request #6126 from influxdata/js-6115-chunked-query-support-mid-series Support chunking in the middle of a series in the emitter	2016-03-30 20:59:24 -04:00
Jonathan A. Sternberg	178a6e2f0a	Merge pull request #6113 from influxdata/js-6112-simple-moving-average Implement simple moving average	2016-03-30 20:57:55 -04:00
Jonathan A. Sternberg	711a6614e6	Implement the point limit monitor Fixes #6077.	2016-03-30 16:08:56 -04:00
Jonathan A. Sternberg	278b0950a7	Perform lazy initialization of the heap for the MergeIterator The MergeIterator creation function would call `peek()` on the iterator to initialize the heap. Since this function can sometimes take a long time (such as a huge aggregate query on a shard), the `influxql.Select()` wouldn't return until the query had already been completed. The `influxql.Select()` call should be just the creation of the iterators and shouldn't calculate anything. This is important for future features like the point limiter that have to be initialized after the `influxql.Select()` call.	2016-03-30 16:08:55 -04:00
Jonathan A. Sternberg	4c9e382e63	Allow a custom function to be used for monitoring a query	2016-03-30 16:08:55 -04:00
Jonathan A. Sternberg	3e93689020	Prototype for returning custom errors from the query manager	2016-03-30 16:08:55 -04:00
Jonathan A. Sternberg	364dce36ab	Support chunking in the middle of a series in the emitter Limit the maximum size of the return value when chunking is not used to prevent the server from going out of memory. Fixes #6115.	2016-03-30 14:36:35 -04:00
Jonathan A. Sternberg	6453dbc249	Implement simple moving average The simple moving average will gradually emit points instead of waiting until the end. This should apply to derivative and difference in the future too. Fixes #6112.	2016-03-29 14:36:43 -04:00
Jonathan A. Sternberg	c1643e69c1	Have the server kill all queries on shutdown Related to #6140, but won't actually fix that problem. It will correctly stop new queries from being started during shutdown and will send the interrupt signal to queries during shutdown. Since the interrupt signal is asynchronous, there isn't currently a way to wait for the queries to complete themselves before shutting down the engine.	2016-03-29 11:48:21 -04:00
Jonathan A. Sternberg	9ddc59aab5	Merge pull request #6105 from influxdata/js-1825-difference-function Implement the difference function	2016-03-29 09:37:59 -04:00
Jonathan A. Sternberg	a9720f926e	Implement the difference function The difference function is implemented very similar to how derivative is implemented. It is an aggregate function that acts over the entire aggregate. This function will also have the same problems that derivative has with getting values from the previous interval or point. This will be fixed separately as part of #5943. Fixes #1825.	2016-03-29 09:27:12 -04:00
Jonathan A. Sternberg	cb77caf3f2	Merge pull request #6127 from influxdata/js-fix-query-manager-timeout-test Set the query manager timeout test to a higher value to avoid a false positive	2016-03-29 09:02:58 -04:00
Jonathan A. Sternberg	2b83939aeb	Set the query manager timeout test to a higher value to avoid a false positive	2016-03-25 16:45:38 -04:00
Tait Clarridge	45b3e61ac7	Add configurable shard duration to retention policies Allows configuration of shard group duration at database creation, and retention policy create/alter time. Query examples: ``` CREATE DATABASE testdb WITH DURATION 90d SHARD DURATION 30m NAME rp_testdb CREATE RETENTION POLICY rp_testdb2 ON testdb DURATION INF REPLICATION 1 SHARD DURATION 30m ALTER RETENTION POLICY rp_testdb2 ON testdb SHARD DURATION 1h ``` This can be useful with long duration retention policies with lots of data, where you can split into smaller shards to relieve memory pressure.	2016-03-24 00:25:49 -04:00
Jonathan A. Sternberg	d85b7334cd	Merge pull request #6081 from influxdata/js-6075-query-time-limit Support a timeout for running queries in the query manager	2016-03-23 11:32:08 -04:00
Jonathan A. Sternberg	2f853d3a7f	Merge pull request #6093 from influxdata/js-fix-bad-aux-iterator-merge Fix a bad merge that removed ExpandSources from AuxIterators	2016-03-22 17:13:44 -04:00
Jonathan A. Sternberg	114e734ee5	Fix a bad merge that removed ExpandSources from AuxIterators Regenerated the protobuf file for influxql to use a newer protobuf.	2016-03-22 16:36:22 -04:00
Jonathan A. Sternberg	b89edfd71c	Fix the code that handles semicolons This allows multiple semicolons in a row now and also requires that a semicolon separate commands. The query specification says this is required, but a boolean error in `ParseQuery` makes one semicolon optional and multiple semicolons an error. Fixes #5728.	2016-03-22 15:32:54 -04:00
Jonathan A. Sternberg	79fe4490c2	Support a timeout for running queries in the query manager Include an interrupt iterator at the top level to interrupt the fill iterator if it is producing too many points. Fixes #6075.	2016-03-22 13:30:40 -04:00
Jonathan A. Sternberg	3a7d537ee6	Merge pull request #6028 from influxdata/js-5116-default-no-fill-for-select-into Modify fill(null) to fill(none) in SELECT INTO queries	2016-03-22 12:13:17 -04:00
Jonathan A. Sternberg	38232ffc17	Merge pull request #6048 from influxdata/js-4306-string-equality-against-a-float Fix eval when types do not match between the lhs and rhs	2016-03-22 12:12:10 -04:00
Ben Johnson	573dd0f96a	Merge pull request #6035 from benbjohnson/query-engine-reduce-alloc Reduce allocations in query execution	2016-03-22 10:11:14 -06:00
Ben Johnson	6e1c1da25b	reduce allocations in query execution This commit removes some heap objects by converting them from pointer references to non-pointers or by reusing buffers.	2016-03-22 09:51:39 -06:00
Jonathan A. Sternberg	abae1cfed0	Limit the maximum number of concurrent queries Fixes #6079.	2016-03-21 22:34:27 -04:00
Ben Johnson	d58c6608fe	add InterruptIterator.Stats()	2016-03-21 16:38:18 -06:00
Ben Johnson	7156c1f9bd	add IteratorStats This commit adds an `IteratorStats` that holds aggregate iterator processing information. A method is also added to `Iterator` to return the stats: Stats() influxql.IteratorStats The remote iterators will also emit their stats in the point stream upon first connection, on a given interval, and then finally once the last point has been sent.	2016-03-21 16:25:19 -06:00
Jonathan A. Sternberg	d03125cdf7	Unit tests for the query manager	2016-03-21 12:40:54 -04:00
Jonathan A. Sternberg	6655ca7769	Create a new interrupt iterator that will stop emitting points after an interrupt Use of the iterator is spread out into both `IteratorCreators` and inside of the iterators themselves. Part of the interrupt must be handled inside of the engine so it stops trying to emit points when an interrupt is found and another part of the interrupt has to happen when combining the iterators so it doesn't just start reading the next shard.	2016-03-21 12:07:07 -04:00
Jonathan A. Sternberg	3e580bcf04	Allow a query to be killed While this allows a query to be killed, it doesn't really do anything yet since the interrupt happens only after the first row gets emitted (the entire first series). This section of code will likely have to be refactored to make this work since we need a way to interrupt a currently running iterator.	2016-03-21 12:07:06 -04:00
Jonathan A. Sternberg	117f62c33e	Implement a simple task manager for queries The currently running queries can be listed with the command `SHOW QUERIES` and it will display the current commands that have been run, the database they were run against, and how long they have been running.	2016-03-21 12:06:06 -04:00
Jonathan A. Sternberg	43a5e84aaf	Merge pull request #6047 from influxdata/js-6040-boolean-distinct Support the distinct() call for booleans	2016-03-17 17:17:21 -04:00
Jonathan A. Sternberg	f62f8a02dd	Fix eval when types do not match between the lhs and rhs Fixes #4306.	2016-03-17 10:50:14 -04:00
Jonathan A. Sternberg	d24f9a9fb5	Merge pull request #5756 from influxdata/js-5744-integer-literal-support Support integer literals in the query language	2016-03-17 10:49:51 -04:00
Jonathan A. Sternberg	e47426ff6e	Support integer literals in the query language Numbers in the query without any decimal will now be emitted as integers instead and be parsed as an IntegerLiteral. This ensures we keep the original context that a query was issued with and allows us to act more similar to how programming languages are typically structured when it comes to floats and ints. This adds functionality for dealing with integers promoting to floats in the various different places where math are used. Fixes #5744 and #5629.	2016-03-17 10:37:34 -04:00
Jonathan A. Sternberg	2e7816ebd9	Support the distinct() call for booleans Normalize the time for the distinct() call to either be at the beginning of the group by interval or the start time similar to every other call. The timestamp previously just showed the first time found and didn't make a lot of sense in the context of what the function was supposed to do. Fixes #6040.	2016-03-17 09:32:54 -04:00
Jonathan A. Sternberg	d75428f79f	Rename the special condition "name" to "_name" to reduce conflicts Fixes #6034.	2016-03-16 17:17:04 -04:00
Jonathan A. Sternberg	eab6ac3871	Modify fill(null) to fill(none) in SELECT INTO queries Fixes #5116.	2016-03-16 11:14:41 -04:00
Jonathan A. Sternberg	eb2d49dbe4	Merge pull request #6007 from benbjohnson/explicit-system-names Allow querying of system-like series	2016-03-15 16:15:17 -04:00
Edd Robinson	e5a41f7796	DROP SHARD	2016-03-14 22:59:55 +00:00
Cory LaNou	ba6a95e9bc	Merge pull request #5994 from influxdata/single-server-lite Single Server	2016-03-14 16:11:37 -05:00
Ben Johnson	f692621ef5	allow querying of system-like series Internal system series start with an underscore prefix but restricting this prevents users who already use an underscore prefix in their series names. Fixes #5870	2016-03-14 13:50:52 -06:00
Jonathan A. Sternberg	94916082c9	Make binary expressions with either point being nil return a nil point This also fixes integer to float and float/integer to boolean binary expressions to correctly work with nil points at all. Related to #5973.	2016-03-14 13:27:59 -04:00
Ben Johnson	e96185f993	add support for remote expansion of regex This commit moves the `tsdb.Store.ExpandSources()` function onto the `influxql.IteratorCreator` and provides support for issuing source expansion across a cluster.	2016-03-14 16:55:53 +00:00
Jonathan A. Sternberg	3f68bd12ee	Merge pull request #5979 from influxdata/js-5974-aux-iterator-close-panic Fix aux iterators to respect early closing	2016-03-14 12:03:50 -04:00
Jonathan A. Sternberg	0042866002	Teach the AuxIterator how to background Now the AuxIterator will know when it is backgrounded so that it can stop reading from the primary iterator when all of the child iterators have been closed.	2016-03-14 11:12:02 -04:00
Jonathan A. Sternberg	f1faf106b1	Ensure binary expressions cannot mix aggregates and raw fields Closes #5973.	2016-03-12 11:31:16 -05:00
Jonathan A. Sternberg	74d51e3842	Support nil values in binary math expressions with two iterators Related to #5959 and #5973.	2016-03-11 15:57:35 -05:00
Ben Johnson	beda072426	add support for remote expansion of regex This commit moves the `tsdb.Store.ExpandSources()` function onto the `influxql.IteratorCreator` and provides support for issuing source expansion across a cluster.	2016-03-11 12:40:07 -07:00
Jonathan A. Sternberg	09a9b3c53e	Fix aux iterators to respect early closing The primary input iterator for an aux iterator would continue trying to send points to a closed channel even after an aux iterator had already been closed. This changes the aux iterators to use sync.Cond instead of channels and lower level syncing primitives for handling buffered input/output. Fixes #5974.	2016-03-11 12:07:32 -05:00
Jonathan A. Sternberg	620addadde	Forbid binary expressions in function calls Also fixes derivative calls with an aggregate function to require a group by interval. The call without a group by interval doesn't make sense as it will never return anything since it will always have one point. Fixes #5968.	2016-03-10 11:38:47 -05:00
Jonathan A. Sternberg	df4c305012	Merge pull request #5961 from influxdata/js-5959-top-incompatible-with-binary-expr Forbid using top() or bottom() in a binary expression	2016-03-10 10:45:52 -05:00
Jonathan A. Sternberg	a55896a555	Merge pull request #5956 from influxdata/js-2715-allow-regex-for-field-values Support using field regex comparisons in the WHERE clause	2016-03-09 17:38:39 -05:00
Jonathan A. Sternberg	f276c705df	Forbid using top() or bottom() in a binary expression Fixes #5959.	2016-03-09 17:32:17 -05:00
Jonathan A. Sternberg	4ba3386679	Support using field regex comparisons in the WHERE clause Fixes #2715.	2016-03-09 13:21:24 -05:00
Jonathan A. Sternberg	546de3f7cb	Return error message when improper types are used in SELECT Fixes #5949.	2016-03-09 11:36:18 -05:00
Ben Johnson	41dde61226	SHOW SERIES	2016-03-08 11:47:57 -07:00
Nathaniel Cook	4961a4435b	Fix nil comparison for top/bottom	2016-03-07 15:21:22 -07:00
Nathaniel Cook	46fc6e5516	Expose Reduce Functions for Kapacitor	2016-03-07 14:03:14 -07:00
Jonathan A. Sternberg	370f8e4176	Merge pull request #5916 from influxdata/js-refactor-reduce-slice-func Refactor reduce slice func to use the aggregator and emitter	2016-03-07 15:49:37 -05:00
Jonathan A. Sternberg	9c5bc8ab2b	Refactor reduce slice func to use the aggregator and emitter	2016-03-07 13:25:45 -05:00
Jonathan A. Sternberg	2f0e246757	Implemented the tag values iterator for `SHOW TAG VALUES` `SHOW TAG VALUES` output has been modified to print the measurement name for every measurement and to return the output in two columns: key and value. An example output might be: > SHOW TAG VALUES WITH KEY IN (host, region) name: cpu --------- key value host server01 region useast name: mem --------- key value host server02 region useast `measurementsByExpr` has been taught how to handle reserved keys (ones with an underscore at the beginning) to allow reusing that function and skipping over expressions that don't matter to the call. Fixes #5593.	2016-03-06 09:52:34 -05:00
Jonathan A. Sternberg	9113839e4c	Fix sorting of `first()` and `last()` calls across shards Previously the call iterator would normalize the time to the interval for all calls. This meant that when `first()` or `last()` was called with no group by interval the value would be found for each shard, the time was normalized, then it tried to find the value between the shards (but no longer with any time data as that had already been eliminated). This removes part of the time logic from the call iterators and makes a new iterator `IntervalIterator` to normalize the times as they come out of the underlying iterator. Fixes #5890.	2016-03-03 21:15:43 -05:00
Jonathan A. Sternberg	8d89a203a2	Fix sorting for distinct by sorting by value when the point time is the same	2016-03-03 19:09:38 -05:00
Jonathan A. Sternberg	caee9a475c	Merge pull request #5895 from influxdata/js-5886-more-aggregate-iterator-types Support all iterator types for count(), first(), and last()	2016-03-03 15:23:57 -05:00
Jonathan A. Sternberg	fabc41dcfb	Have division always return a float in the new query engine Fixes #5892.	2016-03-03 08:15:54 -05:00
Jonathan A. Sternberg	e3660fae93	Support all iterator types for count(), first(), and last() All three of these iterators are supposed to support all four types of iterators, but the implementation was never done for string or boolean. Fixes #5886.	2016-03-02 23:49:55 -05:00
Jonathan A. Sternberg	2440568b27	Merge pull request #5875 from influxdata/js-5852-mean-function-accuracy Improve mean accuracy while retaining the speedup with a custom iterator	2016-03-02 17:09:58 -05:00
Jonathan A. Sternberg	1c543b28a9	Refactored call iterators to make them public and more usable as a library This refactor is primarily to support Kapacitor. Kapacitor doesn't care about the iterators and mostly keeps the points it handles in memory. The iterator interface is more than Kapacitor cares about. This commit refactors and opens up the internals of aggregating and reducing incoming points so it can be used by an outside library with the same code. It also makes the iterators used by the call iterators publically usable with new functionality. Reducers are split into two methods which are separate interfaces that can be combined for dealing with casting between different types. The Aggregator interfaces accept points into the aggregator and retain any internal state they need. The Emitter interface will then create a point from that aggregated state which can be fed to the iterator. The Emitters do not fill in the name or tag of the point as that is expected to be done by the person aggregating the point. While the Emitters do sometimes fill in the time, that value will also be overwritten by the iterator. Filling in the time is to allow a future version that will allow returning the point time instead of just the interval time.	2016-03-02 16:10:49 -05:00
Jonathan A. Sternberg	d11bc6182c	Improve mean accuracy while retaining the speedup with a custom iterator Fixes #5852.	2016-03-02 14:48:11 -05:00
Jonathan A. Sternberg	87fc143732	Fix limit iterator with multiple sources The limit iterator would short circuit if there were no dimensions and all points had been read. It also needs to consider that multiple sources will require reading the entire iterator too, so the short circuit requires only a single source. Fixes #5871.	2016-03-01 21:44:45 -05:00
Jonathan A. Sternberg	aa0b603938	Convert `SHOW FIELD KEYS` to the new query engine Fixes #5579.	2016-02-25 18:31:02 -05:00
Jonathan A. Sternberg	cddc1b2241	Fix remote execution for partially replicated clusters The RPC handler for remote queries would attempt to reuse a closed connection for certain commands that didn't use pooling. The RPC commands that close the connection have been fixed to not try reusing the connection. When creating an iterator, if there are no points to return, the points decoder would hit an EOF that it didn't catch and would return that error back to the client who made the request. It now properly returns no points by using a `nilFloatIterator` if there are no points to return. This fixes remote execution when a cluster has nothing to return.	2016-02-25 17:46:51 -05:00
Ben Johnson	16eea8eecc	add SeriesList marshaling	2016-02-25 15:38:16 -07:00
Ben Johnson	0dda9f6608	add remote execution This commit adds remote execution to the query engine.	2016-02-25 08:41:20 -07:00
Jonathan A. Sternberg	53056d862b	Eliminating dead code in `(*influxql.SelectStatement).RewriteWildcards()` The dimensions array in `RewriteWildcards` gets emptied by an earlier section of the code and then tries to iterate over that empty slice to append it to the list of dimensions. That makes the loop dead code that can't ever be hit. Also improve the efficiency of this method by not creating a new slice when there are no wildcards. We already check at the beginning of the function if there is a wildcard out of necessity. There's no point in making a new slice and copying the contents if we know that there will be no wildcards to expand. It also improves memory efficiency by assuming that if a wildcard exists, there is only one and the pre-allocated slice can take advantage of that. If there are multiple wildcards, then a new slice will have to be created in the middle of the loop to raise the capacity.	2016-02-23 10:36:01 -05:00
Jonathan A. Sternberg	f7ef382596	Remove dimensions from field wildcards When a wildcard is specified for the field but not the dimensions, the dimensions get added to the list of fields as part of `RewriteWildcards()`. But when a dimension was given with no wildcard, the dimension didn't get removed from the wildcard in the fields section. This teaches the rewriter to disclude dimensions explicitly included from being expanded as a field. Now this statement when a measurement has one tag named host and a field named value: SELECT * FROM cpu GROUP BY host Would expand to this: SELECT value FROM cpu GROUP BY host Instead of this: SELECT host, value FROM cpu GROUP BY host If you want the latter behavior, you can include it like this: SELECT host, * FROM cpu GROUP BY host Fixes #5770.	2016-02-23 10:22:56 -05:00
Jonathan A. Sternberg	7a03df2af1	Remove the non-unreachable panics in the new query engine The only panics left are ones that should be unreachable unless there is a bug. Fixes #5777.	2016-02-22 12:52:43 -05:00
Jonathan A. Sternberg	b6a0b6a65a	Merge pull request #5742 from influxdata/js-ensure-non-empty-column-names Ensure column names get implicitly renamed with conflicts	2016-02-22 08:55:38 -05:00
Jonathan A. Sternberg	6982d5310e	Replace a panic with returning an error when an unsupported call is used Fixes #5773.	2016-02-21 19:39:14 -05:00
Jonathan A. Sternberg	a8d637b03c	Ensure column names get implicitly renamed with conflicts	2016-02-19 14:35:56 -05:00
Jonathan A. Sternberg	f84556fe22	Update binary expressions to handle mixed math between integers and floats This also changes it so all NumericLiterals are treated as a float rather than sometimes being cast to an integer. Fixes #5740.	2016-02-19 14:32:22 -05:00
Jonathan A. Sternberg	8fc6a0f648	Merge pull request #5732 from influxdata/js-5730-binary-expr-column-name Assign a name to columns with binary expressions in them	2016-02-19 14:30:43 -05:00

... 3 4 5 6 7 ...

1096 Commits (485968cce4b5b101ebfcb6ccf073fe8e8e9d6a91)