influxdb

Commit Graph

Author	SHA1	Message	Date
Andrew Lamb	bb8021d9fd	fix: "Can not convert index to usize in dictionary of type creating group by value Int32" (#2151 ) * test: add reproducer for index error * chore: update datafusion	2021-08-02 12:20:41 +00:00
Carol (Nichols \|\| Goulding)	9d15798288	fix: Address or allow Clippy warnings new with Rust 1.54	2021-07-30 09:59:59 -04:00
Andrew Lamb	e6cbd4d217	feat: Use statistics for count() queries (#2038 ) feat: Use statistics for count() queries docs: fix mangled comment * refactor: rewrite to use fold * refactor: use sort_by_cached_key * fix: set null count properly * fix: fmt + clippy	2021-07-28 19:39:41 +00:00
Andrew Lamb	3ea84c6be4	feat: expose null_counts in system.chunk_columns (#2105 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-27 11:05:23 +00:00
Andrew Lamb	5fb3e00f2a	fix: Properly record total_count and null_count in statistics (#2103 ) * fix: Properly record total_count and null_count in statistics * fix: fix statistics calculation in mutable_buffer * refactor: expose null counts in read_buffer * refactor: expose null_count in parquet_file * fix: update server crate tests * fix: update query_tests tests * docs: tweak comments * refactor: Use storage_stats rather than adding `null_count` * refactor: rename test data field for clarity * fix: fixup merge conflicts * refactor: rename initial_non_null_count to initial_total_count * refactor: caculate null_count as row_count - to_add	2021-07-26 18:13:36 +00:00
Marco Neumann	6ef3680554	feat: collect replay plan during catalog loading	2021-07-23 09:23:06 +02:00
Andrew Lamb	38261cc7ac	test: add tests using `to_timestamp()` as predicates in SQL (#2099 ) * test: add tests using `to_timestamp()` as predicates in SQL * fix: cleanup redundancy Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-22 21:06:52 +00:00
Andrew Lamb	01c79f1a1a	fix: Print all timestamps using RFC3339 format (#2098 ) * fix: Use IOx pretty printer rather than arrow pretty printer * chore: update tests in the query crate * chore: update influxdb_iox tests * chore: Update end to end tests * chore: update query_tests * chore: update mutable_buffer tests * refactor: update parquet_file tests * refactor: update db tests * chore: update kafka integration test output * fix: merge conflict	2021-07-22 19:04:52 +00:00
Raphael Taylor-Davies	20d06e3225	feat: include more information in system.operations table (#2097 ) * feat: include more information in system.operations table * chore: review feedback Co-authored-by: Andrew Lamb <alamb@influxdata.com> Co-authored-by: Andrew Lamb <alamb@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-22 17:16:09 +00:00
Andrew Lamb	387667330a	chore: Update datafusion deps (#2073 ) * chore: Update datafusion deps * fix: update tests	2021-07-21 08:27:03 +00:00
Raphael Taylor-Davies	091837420f	feat: add PersistenceWindows sytem table (#2030 ) (#2062 ) * feat: add PersistenceWindows sytem table (#2030) * chore: update log Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-20 13:10:57 +00:00
Andrew Lamb	1c16988a51	chore: Update datafusion references (#2056 )	2021-07-19 18:09:06 +00:00
Andrew Lamb	4da8a16c18	chore: update to arrow 5.0 and master datafusion (#2049 ) * chore: update to arrow 5.0 and master datafusion * fix: Update test for change in object size	2021-07-19 12:49:51 +00:00
Marco Neumann	2263189e09	test: make TestDb lifecycle better for testing This is a leftover from #1972.	2021-07-19 09:50:44 +02:00
Marco Neumann	1ef2bc1887	refactor: `Db::{write_chunk_to_object_store => Db::persist_partition}` The previous method allowed to persist any chunk -- even ones that should not be persisted yet and w/o any order of peristence. That will break our persistence windows. So instead offer a sane higher-level interface that can trigger persistence of a partition within the boundaries of the lifecycle rules. This needs some adjustments for our test suite.	2021-07-16 12:07:58 +02:00
Andrew Lamb	3fd6430fb6	fix: rename `estimated_bytes` to `memory_bytes` and expose `object_store_bytes` in ChunkSummary and system.chunks (#2017 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-15 16:00:24 +00:00
Andrew Lamb	3bb32594ba	refactor: rename end-to-end.rs to end_to_end.rs (#2015 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-15 13:50:32 +00:00
Marco Neumann	d89fca00be	feat: persist "drop chunk"	2021-07-15 12:07:56 +02:00
Nga Tran	0b1f2b1fd0	chore: merge main to branch	2021-07-14 16:17:14 -04:00
Nga Tran	552e3fb691	fix: Padd stats compute deterministic order of sort key and update tests that got changed by the use of sort key	2021-07-14 14:06:41 -04:00
Andrew Lamb	4800b36949	chore: Update IOx to a pre-release version of arrow and datafusion to test out performance improvement	2021-07-13 15:44:57 -04:00
Andrew Lamb	0164cabbf3	refactor: do not use DataFrame DataFusion API / stop optimizing twice (#1982 ) * refactor: do not use DataFrame DataFusion API * fix: update output to reflect not running optimizer twice Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-13 16:29:43 +00:00
kodiakhq[bot]	f26f844ed2	Merge branch 'main' into ntran/use_sortkey	2021-07-12 18:12:47 +00:00
Nga Tran	7b7a60993d	feat: consider time as a special key	2021-07-09 18:54:22 -04:00
Marco Neumann	676034b4ae	docs: explain why the path placeholder is there	2021-07-09 09:45:13 +02:00
Marco Neumann	09e611deb7	refactor: lift query schema generation up to caller Do no longer scan chunks during query planning to determine the schema (except for the lifetime jobs where we have a good reason to do so). Instead pass the schema down to from whoever is triggering the query. For real SQL queries, we then just use the the table-wide schemas introduced in #1913. Apart from avoiding schema merges we now also don't crash any longer when no chunks are left in the table (aka columns are present but all rows are gone). Fixes #1768. Fixes #1884.	2021-07-09 09:24:21 +02:00
Marco Neumann	6ac1420335	test: fix out dir for query tests	2021-07-09 09:16:28 +02:00
kodiakhq[bot]	c8126784a8	Merge branch 'main' into ntran/avoid_sort_in_scan	2021-07-08 20:22:18 +00:00
Nga Tran	da6249a4df	fix: address reviewers' comments and also fixe a bug they discovered	2021-07-08 15:54:54 -04:00
Andrew Lamb	dd3eff7748	refactor: Always use `row_count` for count of rows in system.* tables (#1937 )	2021-07-08 19:28:11 +00:00
Andrew Lamb	f670224ea1	chore: Reduce output spew during query tests (#1926 ) * chore: Reduce output spew during query tests * docs: Update query_tests/src/runner.rs Co-authored-by: Edd Robinson <me@edd.io> Co-authored-by: Edd Robinson <me@edd.io> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-08 11:06:24 +00:00
Andrew Lamb	7602bde850	chore: Update datafusion deps (#1799 ) * chore: Update datafusion deps + rework code * refactor: remove workaround as it has been contributed upstream * fix: Update query/src/exec/split.rs Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-08 10:58:32 +00:00
Nga Tran	5c722af0fa	fix: remove comments	2021-07-07 16:50:53 -04:00
Nga Tran	d3c4f8c249	fix: store sort key correctly inthe schema. Update tests to reflect it	2021-07-07 15:55:23 -04:00
Edd Robinson	2ec9151b32	Merge branch 'main' into er/fix/read_buffer/predicate	2021-07-06 13:35:04 +01:00
Marco Neumann	8387eaed27	test: do not recompile `query_tests` when test content changes There is no need to recompile the entire `query_tests` crate when the CONTENT (not the SET) of the test cases changes, e.g. due to new optimizations, datafusion upgrades, query additions, etc. We now check if `cases.rs` really changed before touching it, so that Cargo can rely on the files mtime.	2021-07-05 15:30:10 +02:00
Marco Neumann	d6cff911b6	test: ensure that query tests don't rebuild all the time Beforehand: ```text ❯ env CARGO_LOG=cargo::core::compiler::fingerprint=info cargo test -p query_tests [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] stale: changed "/home/mneumann/src/influxdb_iox/query_tests/cases" [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] (vs) "/home/mneumann/src/influxdb_iox/target/debug/build/query_tests-0e8f741dfb84437f/output" [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] FileTime { seconds: 1625474716, nanos: 436081357 } != FileTime { seconds: 1625474752, nanos: 52625167 } [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] fingerprint error for query_tests v0.1.0 (/home/mneumann/src/influxdb_iox/query_tests)/Test/TargetInner { ..: lib_target("query_tests", ["lib"], "/home/mneumann/src/influxdb_iox/query_tests/src/lib.rs", Edition2018) } [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] err: current filesystem status shows we're outdated [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] fingerprint error for query_tests v0.1.0 (/home/mneumann/src/influxdb_iox/query_tests)/RunCustomBuild/TargetInner { ..: custom_build_target("build-script-build", "/home/mneumann/src/influxdb_iox/query_tests/build.rs", Edition2018) } [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] err: current filesystem status shows we're outdated [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] fingerprint error for query_tests v0.1.0 (/home/mneumann/src/influxdb_iox/query_tests)/Build/TargetInner { ..: lib_target("query_tests", ["lib"], "/home/mneumann/src/influxdb_iox/query_tests/src/lib.rs", Edition2018) } [2021-07-05T08:52:13Z INFO cargo::core::compiler::fingerprint] err: current filesystem status shows we're outdated Compiling query_tests v0.1.0 (/home/mneumann/src/influxdb_iox/query_tests) ``` The issue is that both the input and the test output files are located under `cases/`. `build.rs` used `cargo:rerun-if-changed=cases` which per Cargo doc will scan ALL files in that directory. Note that the normal `exclude` directive in `Cargo.toml` does NOT work, see https://github.com/rust-lang/cargo/issues/4587 . So we need to split input and output files into separate directories (`cases/{in,out}`).	2021-07-05 15:30:10 +02:00
Raphael Taylor-Davies	5b00bc69e6	refactor: use Arc<Db> in lifecycle actions (#1873 ) * refactor: use Arc<Db> in lifecycle actions * chore: review feedback	2021-07-01 19:56:33 +00:00
Andrew Lamb	56c8c8d428	feat: Use separate executor for queries and compactions/moves (#1870 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-01 16:47:50 +00:00
Andrew Lamb	07826306ed	fix: Always deduplicate data prior to insertion into the ReadBuffer (#1863 ) * fix: mark ReadBuffer as always deduplicated * fix: Use compact plans during merge * docs: Update server/src/db/chunk.rs Co-authored-by: Nga Tran <ntran@influxdata.com> Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com> Co-authored-by: Nga Tran <ntran@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-07-01 16:23:37 +00:00
Edd Robinson	8fc07cf4f0	fix: correctly evaluate exprs matching disjoint rows	2021-07-01 16:05:09 +01:00
Andrew Lamb	cfa06e1497	chore: Add query tests for compacted chunks (#1861 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-06-30 20:59:29 +00:00
Andrew Lamb	9e1723620c	refactor: rename load_chunk_to_read_buffer to move_chunk_to_read_buffer (#1857 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-06-30 16:53:18 +00:00
Andrew Lamb	fef160e24f	feat: Implement data driven query_tests and port explain tests (#1814 ) * feat: Implment data driven query testing and port explain tests * fix: do not fmt the auto generated cases * refactor: split setup and parser into separate modules * refactor: Add log to runner, add end to end tests * docs: fixu cpmments	2021-06-29 16:09:51 +00:00
Carol (Nichols \|\| Goulding)	93881da016	feat: Make Write Buffer store_entry async In preparation for the Kafka write buffer implementation needing to call async functions.	2021-06-23 10:48:18 -04:00
Andrew Lamb	5362c7c924	feat: enable query deduplication (#1762 )	2021-06-21 18:49:04 +00:00
Raphael Taylor-Davies	ea04ce40dc	feat: transactional lifecycle API (#1753 ) * feat: transactional lifecycle API * chore: remove redundant upgrade * feat: lifecycle error propagation * chore: add usage doctest Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-06-21 13:09:53 +00:00
Andrew Lamb	ab052c0501	fix: fix flaky test by updating datafusion dep (#1758 ) * chore: update DataFusion dependencies * fix: Re-enable previously flaking tests This reverts commit `c63ad0ea31`. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-06-18 20:17:18 +00:00
Andrew Lamb	de67bd3efe	refactor: Remove PartitionChunk::table_schema (#1756 ) * refactor: Remove PartitionChunk::table_schema * docs: update comments	2021-06-18 16:13:16 +00:00
Andrew Lamb	1c13d676b4	refactor: Rename query::PartitionChunk --> query::QueryChunk (#1754 )	2021-06-18 13:24:09 +00:00

1 2

57 Commits (a72bacae67f10d100a06bcab0e4bc2be56b16409)