Edd Robinson
1416097a35
Merge branch 'main' into er/feat/read_buffer/num_rle
2021-05-11 23:30:55 +01:00
Edd Robinson
aa83669740
refactor: move encodings to scalar module
2021-05-11 22:49:20 +01:00
Edd Robinson
482e4dab86
refactor: shuffle string encodings
2021-05-11 22:47:42 +01:00
Edd Robinson
f86e0641fd
refactor: clarify benchmark
2021-05-11 22:47:42 +01:00
Edd Robinson
f5fe270e43
refactor: move benchmark
2021-05-11 22:47:36 +01:00
Raphael Taylor-Davies
b02105e47b
feat: construct StringDictionary from PackedStringArray ( #1475 )
...
* feat: construct StringDictionary from PackedStringArray
* chore: fix formatting
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 20:16:25 +00:00
kodiakhq[bot]
67984fbff5
Merge pull request #1474 from influxdata/er/feat/rb_metrics
...
fix: ensure metrics not overwriting
2021-05-11 20:06:16 +00:00
Edd Robinson
696e4e0cfd
fix: ensure metrics not overwriting
2021-05-11 20:57:31 +01:00
Raphael Taylor-Davies
4409d2c8af
feat: instrument catalog locks ( #1464 )
...
* feat: instrument catalog locks (#1355 )
* chore: add metrics test
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 18:59:11 +00:00
Andrew Lamb
9d0c3a2b1a
refactor: Remove multi-table per chunk code in MUB ( #1471 )
...
* refactor: Remove multi-table per chunk code in MUB
* fix: clippy
* fix: bench build
* fix: merge conflicts
2021-05-11 17:49:07 +00:00
Raphael Taylor-Davies
c85d1574eb
feat: move dictionary and bitset into arrow_utils ( #1459 )
...
* feat: move dictionary and bitset into arrow_utils
* chore: review feedback
* chore: remove redundant dictionary methods
* chore: consistent type parameter name in PackedStringArray
* chore: review comments
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 16:43:38 +00:00
Raphael Taylor-Davies
d1da954fe4
feat: don't store encoded strings twice in RLE dictionaries ( #1469 )
2021-05-11 15:22:25 +00:00
Raphael Taylor-Davies
e00b4aaa24
feat: associate labels with a domain ( #1463 )
...
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 13:03:45 +00:00
kodiakhq[bot]
be74eab3fd
Merge pull request #1468 from influxdata/er/feat/rb_metrics
...
feat: add in-depth column encoding metrics
2021-05-11 12:45:08 +00:00
Edd Robinson
32abe2e777
feat: wire up stats to metrics
2021-05-11 13:38:32 +01:00
Edd Robinson
c4987028fb
feat: expose all column stats
2021-05-11 13:00:52 +01:00
Edd Robinson
88ed58aa8a
feat: column statistics for int/float
2021-05-11 13:00:52 +01:00
Edd Robinson
ef2eda04ef
feat: add string encoder statistics
2021-05-11 13:00:52 +01:00
Edd Robinson
3622a92c8b
feat: wire in rb column metrics
2021-05-11 13:00:52 +01:00
kodiakhq[bot]
767d6b66f6
Merge pull request #1448 from influxdata/crepererum/optional_stats_minmax
...
refactor: make `StatValues::{min,max}` optional + handle NaNs
2021-05-11 07:39:02 +00:00
kodiakhq[bot]
6f106c49aa
Merge branch 'main' into crepererum/optional_stats_minmax
2021-05-11 07:35:13 +00:00
Andrew Lamb
b6290f1ff3
chore: update deps ( #1466 )
2021-05-10 22:14:08 +00:00
Marco Neumann
795f5bfcb7
refactor: make `StatValues::{min,max}` optional + handle NaNs
...
This will allow us to:
- handle all-NULL columns correctly
- be in-line with Parquet (where min/max are optional)
- handle NaNs at least somewhat sane (they do not "poison" stats
anymore)
2021-05-10 17:12:25 +02:00
Andrew Lamb
f037c1281a
feat: Calculate all system tables "on demand" ( #1452 )
...
* feat: compute system.columns table on demand
* feat: compute system.chunk_columns on demand
* feat: compute system.operations on demand
* fix: fixup schemas
* fix: Log errors
* fix: clippy
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-10 14:43:55 +00:00
kodiakhq[bot]
8dcb4a54a2
Merge pull request #1456 from influxdata/fixrulesexistence
...
fix: Load only databases for which a config exists
2021-05-10 11:18:42 +00:00
Marko Mikulicic
9f5350a6c5
fix: Load only databases for which a config exists
...
Closes #1450
2021-05-10 13:14:22 +02:00
kodiakhq[bot]
ebdab25a3c
Merge pull request #1453 from influxdata/ntran/e2e_with_unload
...
feat: Query Parquet files
2021-05-07 22:44:54 +00:00
Nga Tran
c6b933eb63
chore: merge main to branch
2021-05-07 18:40:17 -04:00
kodiakhq[bot]
6fc05769b1
Merge pull request #1458 from influxdata/cn/path-manipulation
...
fix: Use PathBuf rather than debug formatting and back to String
2021-05-07 21:41:04 +00:00
Nga Tran
f2c19ec080
refactor: further address Carol's comment
2021-05-07 17:40:40 -04:00
kodiakhq[bot]
1c61f9b485
Merge branch 'main' into cn/path-manipulation
2021-05-07 21:36:22 +00:00
Nga Tran
971500681f
refactor: address Andrew's and Carol's comment
2021-05-07 17:33:19 -04:00
Andrew Lamb
bc6ade4776
feat: Include system.chunk_columns` in the tables that are scraped ( #1455 )
...
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-07 20:31:04 +00:00
Carol (Nichols || Goulding)
e2cc4634bf
fix: Use PathBuf rather than debug formatting and back to String
...
This is the same fix I made in 54c5f98
, just found a few more spots :)
2021-05-07 15:58:11 -04:00
Raphael Taylor-Davies
e2e3c9f77c
feat: workaround bug/limitation in OT handling of observers ( #1457 )
...
* feat: workardound bug/limitation in OT handling of observers
* chore: fix lints
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-07 18:53:28 +00:00
kodiakhq[bot]
60f4b927e8
Merge pull request #1454 from influxdata/er/fix/null_rb_row_groups
...
fix: don't blow up on all null columns
2021-05-07 17:24:31 +00:00
Edd Robinson
4a414fc8fb
fix: don't blow up on all null columns
2021-05-07 17:31:18 +01:00
Nga Tran
31d49db0ed
chore: a litlle more cleanup
2021-05-07 09:38:41 -04:00
Nga Tran
ba015ee4df
refactor: clean up and add comments
2021-05-07 09:31:41 -04:00
kodiakhq[bot]
07af2e6cda
Merge pull request #1443 from influxdata/er/feat/regex_pred_op
...
feat: Add Regex matching support to IOx's gRPC API
2021-05-07 12:58:21 +00:00
Edd Robinson
8ccc359cab
refactor: address PR feedback
2021-05-07 13:48:44 +01:00
Edd Robinson
4c4bd2f164
refactor: update query/src/func/regex.rs
2021-05-07 13:44:51 +01:00
Edd Robinson
4cc7a99854
refactor: include not match in support check
2021-05-07 13:44:51 +01:00
Edd Robinson
beee3115f4
feat: expose regex =\~ and to gRPC API
2021-05-07 13:44:51 +01:00
Edd Robinson
eae3fec571
feat: wire up regex UDF as predicate filter expr
2021-05-07 13:44:51 +01:00
Edd Robinson
3fc2c9fc04
feat: add DataFusion regex match operator
...
This commit adds a new custom UDF to IOx that provide a regex operator to Datafusion plans.
Effectively it allows predicates to contain regex operators that are applied as filters, only allowing rows that satisfy the regex to be returned.
I did not use the Arrow regex kernel for this work because that does not return a boolean array indicating which rows matched a regex, but instead returns a new string array of results. This doesn't work well with DF's approach to filtering.
2021-05-07 13:44:51 +01:00
Andrew Lamb
b5ea71f45f
feat: Expose the storage usage for each column in system.chunk_columns ( #1441 )
...
* feat: Expose the storage usage for each column in system.chunk_columns
* fix: fixup logical conflicts
* refactor: move coalsce logic into the read buffer
* fix: Update system_tables to not use coalese
* fix: Improve comments
Co-authored-by: Carol (Nichols || Goulding) <193874+carols10cents@users.noreply.github.com>
Co-authored-by: Carol (Nichols || Goulding) <193874+carols10cents@users.noreply.github.com>
2021-05-07 12:36:49 +00:00
Raphael Taylor-Davies
9320f59de0
feat: add shard sink indirection ( #1447 )
...
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-07 11:04:51 +00:00
kodiakhq[bot]
2e5a9bf18e
Merge pull request #1449 from influxdata/alamb/downgrade-rust
...
chore: revert to Rust 1.51.0
2021-05-07 10:37:06 +00:00
kodiakhq[bot]
07f549450f
Merge branch 'main' into alamb/downgrade-rust
2021-05-07 10:32:53 +00:00