influxdb

Commit Graph

Author	SHA1	Message	Date
Andrew Lamb	dbe52f1ca1	chore: Upgrade datafusion (#6467 ) * chore: Update datafusion * fix: Update for new apis * chore: Update expected plan * fix: Update for new config construction * chore: update clippy * fix: Fix error codes * fix: update another test * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-01-03 15:29:11 +00:00
Marco Neumann	a5d693eba2	feat: lower Influx regex expressions to DF regex expressions (#6394 ) * feat: lower Influx regex experessions to DF regex expressions For #6388. * refactor: address review comments	2022-12-15 09:33:28 +00:00
Marco Neumann	65687bf0fa	test: regex baseline test (#6389 ) For #6388. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-13 17:42:31 +00:00
Andrew Lamb	e0ecacf6cc	chore: Update DataFusion (get median fix and automatic string to timestamp coercion) (#6363 ) * chore: Update DataFusion pin to get median fix * chore: Update for new Expr node * test: add test for median * test: add test for coercion of strings to timestamps * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-12 12:14:00 +00:00
Andrew Lamb	9175f4a0b5	chore: Upgrade datafusion to get correct support for multi-part identifiers (#6349 ) * test: add tests for periods in measurement names * chore: Update Datafusion * chore: Update for changed APIs * chore: Update expected plan output * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-08 11:27:13 +00:00
Andrew Lamb	fc5697b8e7	chore: Update datafusion again (N of N) (#6218 ) * chore: Update datafusion again (4 of N) * fix: Update plans * fix: Update for renamed API * fix: Update more plans * chore: Update to datafusion @ d355f69aae2cc951cfd021e5c0b690861ba0c4ac * fix: update explain plan tests * fix: update test after schema error * chore: Update datafusion again * fix: Add size() calculation to selectors * chore: Run cargo hakari tasks * fix: Update newly added test Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-28 17:09:40 +00:00
Nga Tran	52d70b060a	test: retention test for querier inthe query_tests (#6220 )	2022-11-23 17:04:14 +00:00
Andrew Lamb	9fb1de0428	chore: Update datafusion (2 of N) right before arrow 27 upgrade (#6207 ) * chore: Update datafusion (2 of N) right before arrow 27 upgrade * fix: Update tests for better unsigned pushdown * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com>	2022-11-23 11:04:14 +00:00
Andrew Lamb	1a1ea74cb7	chore: Upgrade datafusion again (#6160 ) * Revert "Revert "chore: Update datafusion again (#6108)"" This reverts commit 766b3bbeb440618cfe332f6ee7d4f8a8217acc48. * fix: Respect the partition sort key * chore: update plans Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-22 19:28:26 +00:00
Andrew Lamb	4630bbb956	feat: push down all predicates (#6042 ) * feat: push down all predicates * fix: fmt * fix: fmt Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-18 16:22:01 +00:00
Marco Neumann	71ffc92559	fix: only push safe select expression through de-dup (#6156 ) * fix: only push safe select expression through de-dup Fixes #6066. * docs: improve Co-authored-by: Andrew Lamb <alamb@influxdata.com> * fix: rebase * test: ensure we do not split ORs Co-authored-by: Andrew Lamb <alamb@influxdata.com>	2022-11-18 09:56:11 +00:00
Andrew Lamb	67712b595c	Revert "chore: Update datafusion again (#6108 )" (#6159 ) This reverts commit `fbe9f27f10`.	2022-11-16 21:14:55 +00:00
Andrew Lamb	fbe9f27f10	chore: Update datafusion again (#6108 ) * chore: Update datafusion pin + api code * chore: Run cargo hakari tasks * refactor: combine_sort_key is more idomatic and add rationale comments * refactor: satisfy borrow checker and updated comments * fix: Add test case for combine_sort_key * fix: Apply suggestions from code review Co-authored-by: Marco Neumann <marco@crepererum.net> * fix: Add back test for deeply nested expression * fix: Update output ordering Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-16 14:41:52 +00:00
Andrew Lamb	20f1ae1c8f	test: tests in the reorg planner and query tests for merging parquet files (#6137 ) * test: tests in the reorg planner and query tests for merging parquet files * fix: use 20 files Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-15 20:29:44 +00:00
Marco Neumann	1a5fc3d772	test: use `EXPLAIN ANALYZE` for SQL metric tests (#6084 ) * test: use `EXPLAIN ANALYZE` for SQL metric tests Needs a bit more infra (due to normalization), but this seems to be worth it so we can easily hook up more metrics in the future. * docs: explain regexes	2022-11-09 09:00:27 +00:00
Marco Neumann	903f7bafa7	refactor: expose `ParquetExec` directly to DataFusion phys. plan (#6072 ) * refactor: expose `ParquetExec` directly to DataFusion phys. plan Closes #5897. * fix: update tracing tests * refactor: use `EmptyExec` * refactor: use `target_partitions` * refactor: improve UUID normalization in query tests Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-11-08 12:19:28 +00:00
Andrew Lamb	034d9b371d	chore: Update datafusion and arrow/arrow-flight/parquet to `26.0.0` (#6061 ) * chore: Update datafusion and arrow/arrow-flight/parquet to `26.0.0` * fix: Update query_functions * fix: update for TimestampNanosecondArray API changes * fix: update for TimestampNanosecondArray API changes * chore: Update flatbuffers and remove rustsec warning * chore: Update text * fix: update more test * fix: Lock ahash to exactly 0.8.0 * fix: Update datafusion pin * chore: Run cargo hakari tasks Co-authored-by: Carol (Nichols \|\| Goulding) <carol.nichols@gmail.com> Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-07 11:01:58 +00:00
Andrew Lamb	474620f4a7	chore: Update datafusion and other dependencies (#5976 ) * chore: Update datafusion and other dependencies * chore: Update expected plan * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-10-26 14:14:13 +00:00
Marco Neumann	1d440ddb2d	refactor: `IOxReadFilterNode` can always accumulate statistics (#5954 ) * refactor: `IOxReadFilterNode` can always accumulate statistics `IOxReadFilterNode` used to not emit statistics if one chunk has duplicates or delete predicates. This is wrong (or at least overly conservative), because the node itself (or the chunks themselves) do NOT perform dedup or delete predicate filtering. Instead this is done is done by parent nodes (`DeduplicateExec` and `FilterExec`) and its their job to propagate statistics correctly. Helps w/ #5897. * test: explain setup Co-authored-by: Andrew Lamb <alamb@influxdata.com> Co-authored-by: Andrew Lamb <alamb@influxdata.com>	2022-10-24 13:34:22 +00:00
Andrew Lamb	7781ed0455	chore: Update datafusion (#5928 ) * chore: Upgrade datafusion * chore: Update for new API * chore: Update expected output * fix: Update comment * fix: compilation Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-10-20 14:37:49 +00:00
Andrew Lamb	d706f8221d	chore: Update datafusion and arrow / parquet / arrow-flight 25.0.0 (#5900 ) * chore: Update datafusion and `arrow` / `parquet` / `arrow-flight` 25.0.0 * chore: Update for structure changes * chore: Update for new projection pushdown * chore: Run cargo hakari tasks * fix: fmt Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-10-18 20:58:47 +00:00
Andrew Lamb	9134ccd6c3	chore: Update datafusion again (#5855 ) * chore: Update datafusion * chore: Updates for changes in datafusion * chore: more updates * fix: update doc example Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-10-13 19:18:57 +00:00
Andrew Lamb	d57c99638c	chore: Update datafusion + `arrow`, `arrow-flight`, and `parquet` to 24.0.0.0 (#5792 ) * chore: Update datafusion + `arrow`, `arrow-flight`, and `parquet` to 24.0.0.0 * fix: Update for coercion, fix explain plans for change in column name display * chore: Update datafusion lock * fix: Update for other API changes * chore: Update to latest datafusion pin * chore: Run cargo hakari tasks Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-10-12 16:19:14 +00:00
Dom Dwyer	c4f542bbe2	refactor(ingester): remove tombstone support This commit removes tombstone support from the ingester, and deletes associated code/helpers/tests. This commit does NOT remove tombstone support from any other service, but MAY include removing overlapping test coverage. This also removes the tombstone support from the Ingester -> Querier RPC response message. This has the nice side effect of removing a whole lot of thread spawning in the ingester tests for the Executor, speeding everything up!	2022-10-11 13:10:04 +02:00
Marco Neumann	365a246f8d	refactor: do not run de-dup in ingester for querier requests (#5626 ) * refactor: do not run de-dup in ingester for querier requests This removes the entire de-dup logic from the inegster for querier requests. Furthermore, it even removes the entire datafusion execution from the querier and just dumps the in-memory record batches as quickly as possible. No filters are applied. Note that even prior to this PR, we've never applied projections (tracked by #5624). Pros: - speed up query planning within the querier (since we need the ingester response for state reconciling) - lowered ingester CPU load Cons: - more querier<>ingester network traffic Closes #5602. * test: extend query test case * fix: ingester tests Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-22 07:33:54 +00:00
Andrew Lamb	786711f244	chore: Update datafusion (#5672 ) * chore: Update datafusion pin * chore: Update expected results Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-20 10:19:43 +00:00
Andrew Lamb	45d795055a	feat: Support calling influxql/flux selector aggregates from IOx SQL (#5628 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-14 10:37:17 +00:00
Andrew Lamb	1fd31ee3bf	chore: Update datafusion / `arrow` / `arrow-flight` / `parquet` to version 22.0.0 (#5591 ) * chore: Update datafusion / `arrow` / `arrow-flight` / `parquet` to version 22.0.0 * fix: enable dynamic comparison flag * chore: derive Eq for clippy * chore: update explain plans * chore: Update sizes for ReadBuffer encoding * chore: update more tests Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-12 17:45:03 +00:00
Andrew Lamb	de47f5605b	chore: Update datafusion (with new sqlparser release) - option 1 (#5433 ) * chore: Update datafusion pin * chore: Update now that user is a reserved word * chore: Update cargo.lock * fix: update query for user function Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-08-29 19:10:00 +00:00
Nga Tran	283e908132	test: workaround for time > a number (#5477 ) * test: workaround for time > a number * chore: cargo update * chore: Revert "chore: cargo update" This reverts commit 0798e4e14674267ddd2308b12a25031fc35de8b6.	2022-08-26 20:52:12 +00:00
Nga Tran	34ccc9c7f5	chore: Revert "chore: Revert "refactor: bump batch size (#5251 )" (#5288 )" (#5300 ) This reverts commit `471b8be92f`. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-08-04 13:19:46 +00:00
Nga Tran	471b8be92f	chore: Revert "refactor: bump batch size (#5251 )" (#5288 ) This reverts commit `bb172f8fa8`.	2022-08-03 14:23:45 +00:00
Marco Neumann	bb172f8fa8	refactor: bump batch size (#5251 ) This is what DataFusion uses by default and I don't see a reason why we should use such small batch sizes. The affect is probably only visible in certain filter-aggregate queries that don't focus on a single series (because there we likely end up with 1 or 2 batches only, esp. after #5250) for coarse-grained filters, esp. when the filter key is not the first sort key. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-08-01 13:49:58 +00:00
Andrew Lamb	66af2bdd88	refactor: Split up `delete_three_delete_three_chunks.sql` test case (#5197 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-22 20:57:31 +00:00
Andrew Lamb	9fed013848	chore: Update datafusion pin (#5162 ) * chore: Update datafusion pin * fix: Update expected output Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-20 14:34:08 +00:00
Andrew Lamb	e2d871b00b	chore: Update datafusion and arrow/parquet/arrow-flight to `18.0.0` (#5079 ) * chore: Update datafusion to 10.0.0, arrow/parquet/arrow-flight to 18 * chore: Run cargo hakari tasks * fix: update cargo pin Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-18 15:01:03 +00:00
Andrew Lamb	c4c251129e	chore: Update datafusion (#5020 ) * chore: Update datafusion * fix: Update plan * fix: update explain plans Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-07-01 19:59:41 +00:00
Marco Neumann	016dd93d9c	feat: filter chunks before requesting read buffers (#4996 ) Fixes #4976.	2022-07-01 08:59:07 +00:00
Andrew Lamb	01fb2e132d	chore: Update datafusion pin (#4969 ) * chore: Update datafusion pin * fix: Update for api * fix: Explicitly set coalsce batch size * fix: Update batch size as well * fix: update tests for new explain plan, and improved coercion	2022-06-29 17:52:37 +00:00
Andrew Lamb	8e96a2721d	chore: Update datafusion (again) (#4788 ) * chore: Update datafusion * chore: Update imports * refactor: update API usage * refactor: clean up some uses of binary_expr * fix: remove unused export * fix: update explain output * chore: update more explain tests Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-07 08:17:56 +00:00
Marco Neumann	47347bef9f	test: add query test scenario w/ missing columns in different chunks (#4656 ) * test: do NOT filter out query test scenarios w/ unordered stages in different partitions It should be possible to have two chunks in different partitions where both are in the ingester stage or the first one is in the parquet stage and the 2nd one in the ingester stage. * test: add query test scenario w/ missing columns in different chunks Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-23 12:13:41 +00:00
Nga Tran	4434ec6836	chore: convert some debug to trace to reduce noises (#4589 )	2022-05-12 22:40:29 +00:00
Nga Tran	66fe4c54ec	fix: make test output deterministic (#4578 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-12 14:15:51 +00:00
Nga Tran	1913a0150f	test: explain tests for queries whose data come from both parquet and ingester (#4546 ) * refactor: remove New from a test scenario setup * test: add explain for 2 different chunk stages * test: expplain for several chunks from both parquet and ingester	2022-05-10 13:17:18 +00:00
Carol (Nichols \|\| Goulding)	96f0c88b48	fix: Remove db crate from query_tests	2022-05-06 11:30:36 -04:00
Nga Tran	799480d34e	refactor: query_tests - port a few OG to NG tests and remove many more that already ported (#4487 ) * refactor: port a few OG to NG tests and remove many more that already ported * chore: Apply suggestions from code review * chore: address review comments Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-02 14:44:02 +00:00
Marco Neumann	2337935660	test: chunks in ingester stage (#4415 ) * refactor: document and improve `MockIngesterConnection` * refactor: split `OldOneMeasurementFourChunksWithDuplicates` for `EXPLAIN` queries * fix: mark "IngsterPartition" chunks as unsorted * fix: "group by" queries may require sorted comparison * refactor: re-export a few more types from querier * fix: ensure that test parquet files are de-duped * test: chunks in ingester stage * docs: explain test code	2022-04-26 07:55:19 +00:00
dependabot[bot]	4c94e46642	chore(deps): Bump croaring from 0.5.2 to 0.6.0 (#4408 ) * chore(deps): Bump croaring from 0.5.2 to 0.6.0 Bumps [croaring](https://github.com/saulius/croaring-rs) from 0.5.2 to 0.6.0. - [Release notes](https://github.com/saulius/croaring-rs/releases) - [Commits](https://github.com/saulius/croaring-rs/compare/0.5.2...0.6.0) --- updated-dependencies: - dependency-name: croaring dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * fix: croaring 0.6.0 compat Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Marco Neumann <marco@crepererum.net>	2022-04-25 16:41:08 +00:00
Marco Neumann	b1af5b3f44	feat: query log system table for querier (#4157 ) * feat: query log system table for querier Closes #4084. * fix: typo Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * docs: extend Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-30 15:38:11 +00:00
Marco Neumann	2b76c31157	refactor: make statistics null counts optional (#4160 ) Min/max values and distinct counts are already optional, so let's make the null counts optional as well. This will be helpful for NG to deal w/ partial statistics (e.g. we only populate stats for the time column). Note that the total count is still mandatory, but we normally have the chunk/file-level row count at hand.	2022-03-29 17:47:57 +00:00

1 2

91 Commits (2bb6db3f376eb10af0f3e3e5d5d2d12f2568a148)