influxdb

Commit Graph

Author	SHA1	Message	Date
Andrew Lamb	a48f681e56	feat(parquet): reduce and limit buffering when writing parquet files (#7880 ) * feat: limit buffering when writing parquet files ("combined solution") * chore: Run cargo hakari tasks --------- Co-authored-by: Raphael Taylor-Davies <r.taylordavies@googlemail.com> Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-05-31 13:27:32 +00:00
Dom Dwyer	928a4d163e	build: remove unused dependencies from crates This commit fixes loads of crates (47!) had unused dependencies, or mis-configured dependencies (test deps as normal deps). I added the "unused_crate_dependencies" to all crates to help prevent this mess from growing again! https://doc.rust-lang.org/beta/nightly-rustc/rustc_lint_defs/builtin/static.UNUSED_CRATE_DEPENDENCIES.html This has the minor downside of false-positives when specifying dev-dependencies for test/bench binaries - these are files in /test or /benches (not normal tests). This commit includes a workaround, importing them in lib.rs (gated by a feature flag). I think the trade-off of better dependency management is worth it!	2023-05-23 14:55:43 +02:00
Andrew Lamb	6344fe8c3f	chore: Add rationale for `clippy::future_not_send` (#7822 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-05-18 16:58:56 +00:00
Carol (Nichols \|\| Goulding)	0c4a86cbaa	fix: Use humantime rather than chrono-english	2023-05-01 11:31:42 -04:00
Carol (Nichols \|\| Goulding)	cc7c44f76a	chore: Upgrade to Rust 1.68 (#7175 ) * chore: Upgrade to Rust 1.68 * fix: Remove unnecessary into_iter, thanks Clippy! * fix: Use the size of the type, not a reference to the type... oops. Thanks clippy! * fix: Return block directly instead of creating a variable Thanks clippy! --------- Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2023-03-12 13:22:20 +00:00
Carol (Nichols \|\| Goulding)	11b014f64f	fix: Allow large error variants where it likely doesn't matter	2023-02-03 13:06:19 -05:00
Carol (Nichols \|\| Goulding)	30fea67701	fix: Move variables within format strings. Thanks clippy! Changes made automatically using `cargo clippy --fix`.	2023-02-03 13:06:17 -05:00
Paul Dix	82e57ac76a	feat: make data generator handle failed requests (#6397 ) Updates the data generator to handle failed requests. Adds some println output to show progress along the way. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-12-13 21:35:23 +00:00
Carol (Nichols \|\| Goulding)	abcbe19966	fix: Box some Error type fields to make the Error types small As found by this new Clippy lint: <https://rust-lang.github.io/rust-clippy/master/index.html#result_large_err>	2022-11-09 10:54:18 -05:00
kodiakhq[bot]	3fcca070f0	Merge branch 'main' into cn/one-at-a-time-please	2022-11-03 13:53:35 +00:00
Andrew Lamb	4fb2843d05	refactor: Rename `schema::selection::Selection` to `schema::projection::Projection` (#6037 ) * chore: Rename `schema::selection::Selection` to `schema::projection::Projection` * fix: docs Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-11-02 18:15:04 +00:00
Carol (Nichols \|\| Goulding)	729ffffa3e	fix: Let's not all write to the same file at the same time Fixes #6001. The generator can create multiple agents that all write in parallel to the same file, which results in garbage. Share the same File instance with a Mutex around it and lock the file until you've written one whole line.	2022-10-28 13:44:33 -04:00
Marco Neumann	42b89ade03	refactor: use `SendableRecordBatchStream` to write parquets (#5911 ) Use a proper typed stream instead of peeking the first element. This is more in line with our remaining stack and shall also improve error handling. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-10-19 12:59:53 +00:00
Carol (Nichols \|\| Goulding)	63a4a1051a	fix: Restore --help flag in iox_data_generator	2022-10-07 17:02:56 -04:00
Carol (Nichols \|\| Goulding)	be6519d4f2	fix: Add filename to the data generator read file error	2022-09-29 16:03:33 -04:00
Jake Goulding	7389fbe528	feat: Write data to Parquet files from the data generator	2022-09-29 16:03:33 -04:00
Carol (Nichols \|\| Goulding)	79afb6ef54	fix: Don't attempt to write 0 lines	2022-09-29 13:07:42 -04:00
Dom Dwyer	cd4087e00d	style: add no todo!() or dbg!() lints Some crates had theme, some not - lets be consistent and have the compiler spot dbg!() and todo!() macro calls - they should never be in prod code!	2022-09-29 13:10:07 +02:00
Carol (Nichols \|\| Goulding)	483c1da666	fix: Make org and bucket optional again if writing points to files/stdout/no-op	2022-09-23 11:45:46 -04:00
Carol (Nichols \|\| Goulding)	c8108f01e7	chore: Upgrade to Rust 1.64 (#5727 ) * chore: Upgrade to Rust 1.64 * fix: Use iter find instead of a for loop, thanks clippy * fix: Remove some needless borrows, thanks clippy * fix: Use then_some rather than then with a closure, thanks clippy * fix: Use iter retain rather than filter collect, thanks clippy Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-09-22 18:04:00 +00:00
Carol (Nichols \|\| Goulding)	b982bdaf2f	fix: Derive Eq when we derive PartialEq and members can derive Eq Allow this in generated code that we don't control, though. Recommended by clippy now. https://rust-lang.github.io/rust-clippy/master/index.html#derive_partial_eq_without_eq	2022-08-11 15:04:06 -04:00
Marco Neumann	0fbff981ec	chore(deps): Bump sqlx to 0.6.0 and uuid to 1 (#4894 ) Closes #4889. Closes #4890. Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-06-17 10:28:28 +00:00
dependabot[bot]	23c9e38ea7	chore(deps): Bump clap from 3.1.18 to 3.2.1 (#4848 ) * chore(deps): Bump clap from 3.1.18 to 3.2.1 Bumps [clap](https://github.com/clap-rs/clap) from 3.1.18 to 3.2.1. - [Release notes](https://github.com/clap-rs/clap/releases) - [Changelog](https://github.com/clap-rs/clap/blob/master/CHANGELOG.md) - [Commits](https://github.com/clap-rs/clap/compare/v3.1.18...clap_complete-v3.2.1) --- updated-dependencies: - dependency-name: clap dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * chore: fix clap deprecations Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Marco Neumann <marco@crepererum.net> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-06-14 15:42:18 +00:00
Carol (Nichols \|\| Goulding)	0d723fb21d	fix: Remove allow dead_code and remove dead code	2022-05-06 16:58:03 -04:00
Carol (Nichols \|\| Goulding)	a4443e4c31	fix: Remove OG gRPC client code and APIs	2022-04-29 16:29:49 -04:00
dependabot[bot]	65ab5213e5	chore(deps): Bump clap from 3.0.14 to 3.1.1 (#3809 ) Bumps [clap](https://github.com/clap-rs/clap) from 3.0.14 to 3.1.1. - [Release notes](https://github.com/clap-rs/clap/releases) - [Changelog](https://github.com/clap-rs/clap/blob/master/CHANGELOG.md) - [Commits](https://github.com/clap-rs/clap/compare/v3.0.14...v3.1.1) --- updated-dependencies: - dependency-name: clap dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-02-22 14:51:53 +00:00
Marco Neumann	c399e676ca	chore: upgrade clap to v3	2022-01-17 12:12:46 +01:00
Marco Neumann	f3f6f335a9	chore: upgrade to snafu 0.7 (#3440 )	2022-01-11 19:22:36 +00:00
Paul Dix	8efd02930e	feat: add total throughput to data generator output On log level info, data generator will now output what the total throughput is in rows per second after every agent's successful write.	2021-12-11 17:12:28 -05:00
Paul Dix	8c88e1e52c	refactor: change orgbucket to database in data generator	2021-12-09 13:39:33 -05:00
Paul Dix	01e86a031e	feat: add regex to bucket writers assignment in data generator This adds the ability to specify a regex to match against database names when specifying what agents should write to which buckets in the data generator. A default has also been added for ratio so that it doesn't need to be specified if only a single database writer is defined.	2021-12-09 13:39:33 -05:00
Paul Dix	2c8d17bea8	refactor: change percent to ratio in data generator bucket writers	2021-12-08 12:09:04 -05:00
Paul Dix	31aa41e240	feat: add ability for data generator to write to many buckets This adds the ability for the data generator to write to many databases. A new command line argument, `bucket_list`, is added which should be a file name. The file should contain a list of databsaes, one per line, with the structure of <org>_<bucket>. This is a little odd given the data generator expects org and bucket separately, but I expect the file that we'll be using will be database names, which have this format. The configuration can specify what percentage of the list should get written to by which agents at what sampling interval. This should allow configurations where databases get different levels of ingest and different types (as specified via different agent specs). The structure is a little wonky, but I think it'll get the job done. The next step is to run some perf tests to see how the data generator performs if writing to 10k databases.	2021-12-08 12:09:04 -05:00
Paul Dix	3279725d10	refactor: Add agent name to data generator (#3297 ) This is work leading up to giving the data generator the ability to write to many databases. The plan is to specify which agents databases will use to write data.	2021-12-05 11:21:04 -05:00
Paul Dix	3c848049ba	refactor: remove create database from data generator This removes the create databsase command line flag and associated code from the data generator runner. Creation of databases should live outside the generator in other tools.	2021-12-03 16:22:45 -05:00
Carol (Nichols \|\| Goulding)	948a45a4ea	fix: Use split_once rather than reimplementing manually Identified by clippy. https://rust-lang.github.io/rust-clippy/master/index.html#manual_split_once	2021-12-02 11:52:02 -05:00
Carol (Nichols \|\| Goulding)	5d0fd1c603	fix: Allow dead code on fields that are now detected as never read	2021-12-02 11:52:01 -05:00
Marco Neumann	dbf2642582	fix: `jaeger_debug` -> `jaeger_debug_header`	2021-12-01 18:02:39 +01:00
Marco Neumann	4bbe756b52	feat: make jaeger-debug-id configurable	2021-12-01 15:02:15 +01:00
Marco Neumann	c961454dcd	feat: `jaeger-debug-id` from data generator	2021-12-01 14:33:09 +01:00
Marco Neumann	4e043ecb55	refactor: remove old routing / sharding config This is superseded by the new router subsystem.	2021-11-29 12:33:48 +01:00
Marco Neumann	7f2e4f4342	refactor: remove write buffer direction The direction was required when a database could read or write from/to a write buffer. Now it is clear from the usage context of a write buffer context which of the two applications is meant (databases read, routers write) so the direction flag is no longer required.	2021-11-26 12:38:40 +01:00
Raphael Taylor-Davies	7aa386b07f	fix: flaky incrementing_i64_that_resets (#3197 )	2021-11-23 16:37:12 +00:00
Raphael Taylor-Davies	88868e7496	feat: remove legacy write service from influxdb_iox_client (#3043 ) * feat: remove legacy write service from influxdb_iox_client * chore: review feedback Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2021-11-05 17:26:18 +00:00
Paul Dix	7d90314f2b	fix: data generator example	2021-11-04 09:56:58 -04:00
Paul Dix	041957be48	fix: datagenerator PR feedback, implement field count	2021-11-04 09:56:58 -04:00
Paul Dix	7044b89453	feat: Refactor Data Generator This is a huge commit that refactors the data generator. It removes many of the previous features that didn't quite make sense. The goal of this refactor was to make the data generator capable of representing complex tagsets that have values dependent on each other. It also significantly optimizes things to use far less memory and generate data much faster. Follow on work will update the generation of line protocol to support spaces in tags and their keys, double quotes in strings, and add more examples and documentation.	2021-11-04 09:56:58 -04:00
Paul Dix	348b91edc4	feat: Add noop option to data generator I needed this feature to be able to see how much memory and resources a given data spec toml would take to run.	2021-11-04 09:56:58 -04:00
Paul Dix	32bf4be64c	chore: add benchmark for data generator tag set	2021-11-04 09:56:58 -04:00
Paul Dix	db2f8a58fc	feat: Add tag_set and tag_pairs to measurements in Data Generator This adds the ability to specify a tag_set and a collection of tag_pairs to measurements in the data generator. Tag pairs are evaluated once when the generator is created. This avoids re-running handlebars evaluations while generating data for tags that don't change value. This commit also fixes an issue when printing the generation output to stdout while generating from more than one agent. Previously it would be garbled together. Follow on PRs will update the tag generation code in measurement specs to be more consistent and optimzised for performance. I'll be removing the restriction of using different options while using tag_set and tag_pairs. I wanted to get this in first to show the structure of what is output.	2021-11-04 09:56:58 -04:00

1 2

70 Commits (f3b6c8bf1587a52fc868a05c72c2ab106e7b7616)