531 Commits (683205ad0333ce36206397c438f7d251ffc47b56)

Author SHA1 Message Date
Carol (Nichols || Goulding) 683205ad03 refactor: Use `Vec::clear` instead of `Vec::truncate(0)` 2020-06-22 15:32:15 -04:00
Carol (Nichols || Goulding) 1e341a7321 fix: Encode and decode string data as bytes
String data isn't guaranteed to be UTF-8
2020-06-22 15:32:14 -04:00
Carol (Nichols || Goulding) 672d3fe668 fix: Assert that encoded strings' lengths fit in an i32 2020-06-22 15:19:19 -04:00
Carol (Nichols || Goulding) df75db6870 refactor: Remove some unneeded type annotations 2020-06-22 15:17:03 -04:00
Carol (Nichols || Goulding) 8bc25e92bf refactor: Shorten unused cases 2020-06-22 15:15:37 -04:00
Carol (Nichols || Goulding) d7dbf061cb feat: Implement String encoding/decoding
Fixes #148.
2020-06-22 15:15:34 -04:00
Carol (Nichols || Goulding) bf884ff3d3 refactor: Extract a constant for max varint size for 64-bit integers 2020-06-22 14:53:53 -04:00
Carol (Nichols || Goulding) 4a91a8b45f refactor: Remove unneeded lifetime annotations 2020-06-22 14:53:53 -04:00
Carol (Nichols || Goulding) f2fc4a6d43 chore: Remove or change scope for outdated dead_code allows 2020-06-22 14:53:53 -04:00
Edd Robinson 2768b15bf4
Merge pull request #168 from influxdata/er/tsm-parquet
feat: Add support for converting TSM files into Parquet
2020-06-22 19:10:17 +01:00
Edd Robinson b3e78d712d refactor: address PR feedback 2020-06-22 18:56:17 +01:00
Edd Robinson 844625d811 fix: down-sample timestamps to μs 2020-06-22 18:56:17 +01:00
Edd Robinson e507183fbd refactor: cleanup + clippy 2020-06-22 18:56:17 +01:00
Edd Robinson 4bbeac7a1c refactor: extend packers 2020-06-22 18:56:17 +01:00
Edd Robinson 106bd69b5a feat: support converting from TSM->Parquet 2020-06-22 18:56:17 +01:00
Edd Robinson 9006af8961 feat: support converting from BlockType 2020-06-22 18:56:17 +01:00
Edd Robinson 3c24b6e10e refactor: small API change 2020-06-22 18:56:17 +01:00
Edd Robinson 5f40974752 refactor: don't error on string blocks 2020-06-22 18:56:17 +01:00
Edd Robinson 353c7a618b fix: ensure short blocks decode correctly 2020-06-22 18:56:17 +01:00
Edd Robinson 68a1d5355d refactor: simplify block types 2020-06-22 18:56:17 +01:00
Edd Robinson 621f2f91f0 refactor: hoist tsm mapper to delorean_tsm 2020-06-22 18:56:17 +01:00
Edd Robinson f046dbeea0 refactor: organise code in delorean_tsm crate 2020-06-22 18:56:17 +01:00
Edd Robinson 0ca6fdfa5f refactor: StorageError -> TSMError 2020-06-22 18:56:17 +01:00
Edd Robinson 85e0b4ec16 refactor: hoist tsm reader into own crate 2020-06-22 18:56:17 +01:00
Edd Robinson fd9f2ea5b8 refactor: split out index reading and block decoding
This commit splits out the functionality required to read a TSM file's
index, and decode the blocks within the file.
2020-06-22 18:56:17 +01:00
Edd Robinson 6339083b87 feat: implement mapping between blocks and table
This commit implements the ability to map from multiple columns into a
single tabular view, where columns are aligned by their timestamp
components.
2020-06-22 18:56:17 +01:00
Edd Robinson 5418b34fcc feat(tsm): map TSM data model to table model
This commit adds a new type `TSMMeasurementMapper` that will iterate
through a `TSMReader`'s index and collect together all series and blocks
by measurement. These units are called `MeasurementTable`s.
2020-06-22 18:56:17 +01:00
Edd Robinson a5d090a8cb
Merge pull request #162 from influxdata/er/packer
refactor: make Packer generic
2020-06-22 18:01:07 +01:00
Andrew Lamb 506a7f19d5
feat: add stats command for computing storage statistics (#160)
* feat: add stats command for computing storage statistics

* fix: Make function name less redundant

* fix: Apply suggestions from code review

Co-authored-by: Jake Goulding <jake.goulding@integer32.com>

* fix: remove changes to Cargo.lock

* fix: Cleanup to use `impl Into<String>` rather than String::from

* fix: fmt

* fix: less mut builder

* fix: fmt

* fix: clippy/fmt

* fix: fixup test

* fix: remove dstool reference

Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 07:33:52 -04:00
Edd Robinson 53692a63d7 refactor: DRY up Packers accessors 2020-06-22 11:37:29 +01:00
Edd Robinson 128a3c3fc0 refactor: delegate on 2020-06-22 11:33:02 +01:00
Edd Robinson 49b5322487 feat: add resize_exact to packers 2020-06-22 11:25:17 +01:00
Edd Robinson 28057ca8ce refactor: update delorean_table/src/packers.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:25:17 +01:00
Edd Robinson cba797e5ee refactor: update delorean_table/src/packers.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:25:17 +01:00
Edd Robinson b4f44b61e4 refactor: Update delorean_table/src/packers.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:25:09 +01:00
Edd Robinson c26ac10b3b refactor: update delorean_ingest/src/lib.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:24:29 +01:00
Edd Robinson f011748345 refactor: update delorean_parquet/tests/read_write.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:24:29 +01:00
Edd Robinson 11132b9b30 docs: Update delorean_parquet/src/writer.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:24:29 +01:00
Edd Robinson 146000d55b refactor: update delorean_ingest/src/lib.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:24:29 +01:00
Edd Robinson cd435d9b51 refactor: update delorean_ingest/src/lib.rs
Co-authored-by: Jake Goulding <jake.goulding@integer32.com>
2020-06-22 11:24:29 +01:00
Edd Robinson ac7bb6bf68 refactor: make Packer generic 2020-06-22 11:24:29 +01:00
Carol (Nichols || Goulding) ac153fa3d2
Merge pull request #172 from influxdata/cn-dstool-cleanup
refactor: Remove all references to 'dstool'
2020-06-19 11:21:03 -04:00
Jake Goulding 99b5f95cfe
Merge pull request #171 from influxdata/doc-links
docs: Use correct Markdown link syntax
2020-06-19 11:17:28 -04:00
Carol (Nichols || Goulding) 78c2fb99ac refactor: Remove all references to 'dstool'
Since it got merged into the main `delorean` binary.
2020-06-19 11:08:59 -04:00
Jake Goulding 0e9653cd23 docs: Use correct Markdown link syntax 2020-06-19 10:39:58 -04:00
Jake Goulding c778af92f5
Merge pull request #170 from influxdata/reduce-rebuilds
build: Prevent spurious rebuilds due to the object-store
2020-06-19 10:36:51 -04:00
Jake Goulding 6a97995a19
Merge pull request #144 from influxdata/rusoto-stream 2020-06-19 09:25:30 -04:00
Jake Goulding bfb0213ac3 feat: Update Rusoto to allow streaming data on uploads 2020-06-19 09:18:44 -04:00
Jake Goulding d5c39d931c build: Prevent spurious rebuilds due to the object-store
The file `.env` wasn't precisely specified, which caused Cargo to
re-run the build script each time. Thankfully, `dotenv` returns the
file path that it picked to load, so we can use it directly.
2020-06-19 08:37:09 -04:00
Andrew Lamb 8185c80c03
fix: fix logical merge conflict (#169) 2020-06-18 18:51:25 -04:00