Commit Graph

3040 Commits (bba387d6fff2aea0f4d929d428041158c15989bc)

Author SHA1 Message Date
Edd Robinson 1416097a35
Merge branch 'main' into er/feat/read_buffer/num_rle 2021-05-11 23:30:55 +01:00
Edd Robinson aa83669740 refactor: move encodings to scalar module 2021-05-11 22:49:20 +01:00
Edd Robinson 482e4dab86 refactor: shuffle string encodings 2021-05-11 22:47:42 +01:00
Edd Robinson f86e0641fd refactor: clarify benchmark 2021-05-11 22:47:42 +01:00
Edd Robinson f5fe270e43 refactor: move benchmark 2021-05-11 22:47:36 +01:00
Raphael Taylor-Davies b02105e47b
feat: construct StringDictionary from PackedStringArray (#1475)
* feat: construct StringDictionary from PackedStringArray

* chore: fix formatting

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 20:16:25 +00:00
kodiakhq[bot] 67984fbff5
Merge pull request #1474 from influxdata/er/feat/rb_metrics
fix: ensure metrics not overwriting
2021-05-11 20:06:16 +00:00
Edd Robinson 696e4e0cfd fix: ensure metrics not overwriting 2021-05-11 20:57:31 +01:00
Raphael Taylor-Davies 4409d2c8af
feat: instrument catalog locks (#1464)
* feat: instrument catalog locks (#1355)

* chore: add metrics test

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 18:59:11 +00:00
Andrew Lamb 9d0c3a2b1a
refactor: Remove multi-table per chunk code in MUB (#1471)
* refactor: Remove multi-table per chunk code in MUB

* fix: clippy

* fix: bench build

* fix: merge conflicts
2021-05-11 17:49:07 +00:00
Raphael Taylor-Davies c85d1574eb
feat: move dictionary and bitset into arrow_utils (#1459)
* feat: move dictionary and bitset into arrow_utils

* chore: review feedback

* chore: remove redundant dictionary methods

* chore: consistent type parameter name in PackedStringArray

* chore: review comments

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 16:43:38 +00:00
Raphael Taylor-Davies d1da954fe4
feat: don't store encoded strings twice in RLE dictionaries (#1469) 2021-05-11 15:22:25 +00:00
Raphael Taylor-Davies e00b4aaa24
feat: associate labels with a domain (#1463)
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 13:03:45 +00:00
kodiakhq[bot] be74eab3fd
Merge pull request #1468 from influxdata/er/feat/rb_metrics
feat: add in-depth column encoding metrics
2021-05-11 12:45:08 +00:00
Edd Robinson 32abe2e777 feat: wire up stats to metrics 2021-05-11 13:38:32 +01:00
Edd Robinson c4987028fb feat: expose all column stats 2021-05-11 13:00:52 +01:00
Edd Robinson 88ed58aa8a feat: column statistics for int/float 2021-05-11 13:00:52 +01:00
Edd Robinson ef2eda04ef feat: add string encoder statistics 2021-05-11 13:00:52 +01:00
Edd Robinson 3622a92c8b feat: wire in rb column metrics 2021-05-11 13:00:52 +01:00
kodiakhq[bot] 767d6b66f6
Merge pull request #1448 from influxdata/crepererum/optional_stats_minmax
refactor: make `StatValues::{min,max}` optional + handle NaNs
2021-05-11 07:39:02 +00:00
kodiakhq[bot] 6f106c49aa
Merge branch 'main' into crepererum/optional_stats_minmax 2021-05-11 07:35:13 +00:00
Andrew Lamb b6290f1ff3
chore: update deps (#1466) 2021-05-10 22:14:08 +00:00
Marco Neumann 795f5bfcb7 refactor: make `StatValues::{min,max}` optional + handle NaNs
This will allow us to:

- handle all-NULL columns correctly
- be in-line with Parquet (where min/max are optional)
- handle NaNs at least somewhat sanely (they do not "poison" stats
  anymore)
2021-05-10 17:12:25 +02:00
Andrew Lamb f037c1281a
feat: Calculate all system tables "on demand" (#1452)
* feat: compute system.columns table on demand

* feat: compute system.chunk_columns on demand

* feat: compute system.operations on demand

* fix: fixup schemas

* fix: Log errors

* fix: clippy

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-10 14:43:55 +00:00
kodiakhq[bot] 8dcb4a54a2
Merge pull request #1456 from influxdata/fixrulesexistence
fix: Load only databases for which a config exists
2021-05-10 11:18:42 +00:00
Marko Mikulicic 9f5350a6c5
fix: Load only databases for which a config exists
Closes #1450
2021-05-10 13:14:22 +02:00
kodiakhq[bot] ebdab25a3c
Merge pull request #1453 from influxdata/ntran/e2e_with_unload
feat: Query Parquet files
2021-05-07 22:44:54 +00:00
Nga Tran c6b933eb63 chore: merge main to branch 2021-05-07 18:40:17 -04:00
kodiakhq[bot] 6fc05769b1
Merge pull request #1458 from influxdata/cn/path-manipulation
fix: Use PathBuf rather than debug formatting and back to String
2021-05-07 21:41:04 +00:00
Nga Tran f2c19ec080 refactor: further address Carol's comment 2021-05-07 17:40:40 -04:00
kodiakhq[bot] 1c61f9b485
Merge branch 'main' into cn/path-manipulation 2021-05-07 21:36:22 +00:00
Nga Tran 971500681f refactor: address Andrew's and Carol's comment 2021-05-07 17:33:19 -04:00
Andrew Lamb bc6ade4776
feat: Include `system.chunk_columns` in the tables that are scraped (#1455)
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-07 20:31:04 +00:00
Carol (Nichols || Goulding) e2cc4634bf fix: Use PathBuf rather than debug formatting and back to String
This is the same fix I made in 54c5f98, just found a few more spots :)
2021-05-07 15:58:11 -04:00
Raphael Taylor-Davies e2e3c9f77c
feat: workaround bug/limitation in OT handling of observers (#1457)
* feat: workaround bug/limitation in OT handling of observers

* chore: fix lints

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-07 18:53:28 +00:00
kodiakhq[bot] 60f4b927e8
Merge pull request #1454 from influxdata/er/fix/null_rb_row_groups
fix: don't blow up on all null columns
2021-05-07 17:24:31 +00:00
Edd Robinson 4a414fc8fb fix: don't blow up on all null columns 2021-05-07 17:31:18 +01:00
Nga Tran 31d49db0ed chore: a little more cleanup 2021-05-07 09:38:41 -04:00
Nga Tran ba015ee4df refactor: clean up and add comments 2021-05-07 09:31:41 -04:00
kodiakhq[bot] 07af2e6cda
Merge pull request #1443 from influxdata/er/feat/regex_pred_op
feat: Add Regex matching support to IOx's gRPC API
2021-05-07 12:58:21 +00:00
Edd Robinson 8ccc359cab refactor: address PR feedback 2021-05-07 13:48:44 +01:00
Edd Robinson 4c4bd2f164 refactor: update query/src/func/regex.rs 2021-05-07 13:44:51 +01:00
Edd Robinson 4cc7a99854 refactor: include not match in support check 2021-05-07 13:44:51 +01:00
Edd Robinson beee3115f4 feat: expose regex =~ and to gRPC API 2021-05-07 13:44:51 +01:00
Edd Robinson eae3fec571 feat: wire up regex UDF as predicate filter expr 2021-05-07 13:44:51 +01:00
Edd Robinson 3fc2c9fc04 feat: add DataFusion regex match operator
This commit adds a new custom UDF to IOx that provides a regex operator to DataFusion plans.
Effectively it allows predicates to contain regex operators that are applied as filters, only allowing rows that satisfy the regex to be returned.

I did not use the Arrow regex kernel for this work because that does not return a boolean array indicating which rows matched a regex, but instead returns a new string array of results. This doesn't work well with DF's approach to filtering.
2021-05-07 13:44:51 +01:00
Andrew Lamb b5ea71f45f
feat: Expose the storage usage for each column in system.chunk_columns (#1441)
* feat: Expose the storage usage for each column in system.chunk_columns

* fix: fixup logical conflicts

* refactor: move coalesce logic into the read buffer

* fix: Update system_tables to not use coalesce

* fix: Improve comments

Co-authored-by: Carol (Nichols || Goulding) <193874+carols10cents@users.noreply.github.com>

Co-authored-by: Carol (Nichols || Goulding) <193874+carols10cents@users.noreply.github.com>
2021-05-07 12:36:49 +00:00
Raphael Taylor-Davies 9320f59de0
feat: add shard sink indirection (#1447)
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-07 11:04:51 +00:00
kodiakhq[bot] 2e5a9bf18e
Merge pull request #1449 from influxdata/alamb/downgrade-rust
chore: revert to Rust 1.51.0
2021-05-07 10:37:06 +00:00
kodiakhq[bot] 07f549450f
Merge branch 'main' into alamb/downgrade-rust 2021-05-07 10:32:53 +00:00