Commit Graph

2860 Commits (9a666fac00293aef45cd084db9444d09b7e5f86c)

Author SHA1 Message Date
Edd Robinson 9a666fac00 feat: implement RLE methods for materialising 2021-05-14 13:05:02 +01:00
Edd Robinson c55dce3af5 feat: implement stat methods 2021-05-14 13:05:02 +01:00
Edd Robinson 958219d63e feat: skeleton scalar RLE 2021-05-14 13:05:02 +01:00
Edd Robinson 41abeba7a0
Merge pull request #1494 from influxdata/er/feat/rb_raw_size
feat: add uncompressed read buffer size metric
2021-05-14 12:47:57 +01:00
Edd Robinson 91fda41f8e refactor: update read_buffer/src/column/boolean.rs
Co-authored-by: Marko Mikulicic <mkm@influxdata.com>
2021-05-14 12:11:54 +01:00
Edd Robinson d80e71ad86 feat: add new metric to track raw size 2021-05-14 10:34:54 +01:00
Edd Robinson 51c9c15026 refactor: include raw size in log message 2021-05-14 09:42:24 +01:00
Edd Robinson 966093deec feat: expose size_raw via ReadBuffer API 2021-05-14 09:42:24 +01:00
Edd Robinson 984f505267 feat: implement raw column size on bool columns 2021-05-14 09:42:24 +01:00
Edd Robinson 1a20f3fb4a feat: implement raw column size on float columns 2021-05-14 09:42:24 +01:00
Edd Robinson 301df03e72 feat: implement raw column size on integer columns 2021-05-14 09:42:24 +01:00
Edd Robinson 850db3f6c2 feat: implement raw size on string columns 2021-05-14 09:42:22 +01:00
kodiakhq[bot] 68b3577b0d
Merge pull request #1492 from influxdata/pointsmetric
fix: Add ingest_fields_total
2021-05-13 16:08:49 +00:00
Marko Mikulicic 35c2ca17fc
fix: Add ingest_fields_total
ingest_lines_total count lines (which apparently are the same as points, quite confusingly)

No yaks harmed in the making of this PR.

(NOTE: the code around metric, especially dealing with happy and error paths is very painful;
to be done in another PR)
2021-05-13 17:55:07 +02:00
Raphael Taylor-Davies 8ab71be5eb
feat: use PackedStringArray in mutable buffer (#1476)
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-13 15:25:19 +00:00
kodiakhq[bot] 553bc83eef
Merge pull request #1486 from influxdata/ntran/parquet_os
feat: we now can read parquet files from all kind of object stores
2021-05-13 10:27:40 +00:00
Nga Tran 9583636748 feat: we now can read parquet files from all kind of object stores 2021-05-12 18:05:34 -04:00
Andrew Lamb ebea554e5a
feat: Report num_cpus seen by IOx on startup (#1484) 2021-05-12 13:42:16 +00:00
Marco Neumann 0b1ef52481
chore: upgrade arrow and datafusion (#1483) 2021-05-12 13:08:37 +00:00
kodiakhq[bot] 658c34009a
Merge pull request #1480 from influxdata/er/feat/read_buffer/num_rle
refactor: shuffle encodings around in Read Buffer
2021-05-12 10:30:13 +00:00
Edd Robinson 1416097a35
Merge branch 'main' into er/feat/read_buffer/num_rle 2021-05-11 23:30:55 +01:00
Edd Robinson aa83669740 refactor: move encodings to scalar module 2021-05-11 22:49:20 +01:00
Edd Robinson 482e4dab86 refactor: shuffle string encodings 2021-05-11 22:47:42 +01:00
Edd Robinson f86e0641fd refactor: clarify benchmark 2021-05-11 22:47:42 +01:00
Edd Robinson f5fe270e43 refactor: move benchmark 2021-05-11 22:47:36 +01:00
Raphael Taylor-Davies b02105e47b
feat: construct StringDictionary from PackedStringArray (#1475)
* feat: construct StringDictionary from PackedStringArray

* chore: fix formatting

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 20:16:25 +00:00
kodiakhq[bot] 67984fbff5
Merge pull request #1474 from influxdata/er/feat/rb_metrics
fix: ensure metrics not overwriting
2021-05-11 20:06:16 +00:00
Edd Robinson 696e4e0cfd fix: ensure metrics not overwriting 2021-05-11 20:57:31 +01:00
Raphael Taylor-Davies 4409d2c8af
feat: instrument catalog locks (#1464)
* feat: instrument catalog locks (#1355)

* chore: add metrics test

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 18:59:11 +00:00
Andrew Lamb 9d0c3a2b1a
refactor: Remove multi-table per chunk code in MUB (#1471)
* refactor: Remove multi-table per chunk code in MUB

* fix: clippy

* fix: bench build

* fix: merge conflicts
2021-05-11 17:49:07 +00:00
Raphael Taylor-Davies c85d1574eb
feat: move dictionary and bitset into arrow_utils (#1459)
* feat: move dictionary and bitset into arrow_utils

* chore: review feedback

* chore: remove redundant dictionary methods

* chore: consistent type parameter name in PackedStringArray

* chore: review comments

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 16:43:38 +00:00
Raphael Taylor-Davies d1da954fe4
feat: don't store encoded strings twice in RLE dictionaries (#1469) 2021-05-11 15:22:25 +00:00
Raphael Taylor-Davies e00b4aaa24
feat: associate labels with a domain (#1463)
Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-11 13:03:45 +00:00
kodiakhq[bot] be74eab3fd
Merge pull request #1468 from influxdata/er/feat/rb_metrics
feat: add in-depth column encoding metrics
2021-05-11 12:45:08 +00:00
Edd Robinson 32abe2e777 feat: wire up stats to metrics 2021-05-11 13:38:32 +01:00
Edd Robinson c4987028fb feat: expose all column stats 2021-05-11 13:00:52 +01:00
Edd Robinson 88ed58aa8a feat: column statistics for int/float 2021-05-11 13:00:52 +01:00
Edd Robinson ef2eda04ef feat: add string encoder statistics 2021-05-11 13:00:52 +01:00
Edd Robinson 3622a92c8b feat: wire in rb column metrics 2021-05-11 13:00:52 +01:00
kodiakhq[bot] 767d6b66f6
Merge pull request #1448 from influxdata/crepererum/optional_stats_minmax
refactor: make `StatValues::{min,max}` optional + handle NaNs
2021-05-11 07:39:02 +00:00
kodiakhq[bot] 6f106c49aa
Merge branch 'main' into crepererum/optional_stats_minmax 2021-05-11 07:35:13 +00:00
Andrew Lamb b6290f1ff3
chore: update deps (#1466) 2021-05-10 22:14:08 +00:00
Marco Neumann 795f5bfcb7 refactor: make `StatValues::{min,max}` optional + handle NaNs
This will allow us to:

- handle all-NULL columns correctly
- be in-line with Parquet (where min/max are optional)
- handle NaNs at least somewhat sane (they do not "poison" stats
  anymore)
2021-05-10 17:12:25 +02:00
Andrew Lamb f037c1281a
feat: Calculate all system tables "on demand" (#1452)
* feat: compute system.columns table on demand

* feat: compute system.chunk_columns on demand

* feat: compute system.operations on demand

* fix: fixup schemas

* fix: Log errors

* fix: clippy

Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>
2021-05-10 14:43:55 +00:00
kodiakhq[bot] 8dcb4a54a2
Merge pull request #1456 from influxdata/fixrulesexistence
fix: Load only databases for which a config exists
2021-05-10 11:18:42 +00:00
Marko Mikulicic 9f5350a6c5
fix: Load only databases for which a config exists
Closes #1450
2021-05-10 13:14:22 +02:00
kodiakhq[bot] ebdab25a3c
Merge pull request #1453 from influxdata/ntran/e2e_with_unload
feat: Query Parquet files
2021-05-07 22:44:54 +00:00
Nga Tran c6b933eb63 chore: merge main to branch 2021-05-07 18:40:17 -04:00
kodiakhq[bot] 6fc05769b1
Merge pull request #1458 from influxdata/cn/path-manipulation
fix: Use PathBuf rather than debug formatting and back to String
2021-05-07 21:41:04 +00:00
Nga Tran f2c19ec080 refactor: further address Carol's comment 2021-05-07 17:40:40 -04:00