8 Commits (bf4fce3c77b3f996cb151cb9aed0fd511b011ace)

Author SHA1 Message Date
Ryan Russell 55f5d0ae11
docs: Improving Readability r2 (#4781)
* docs: Improve TOML readability

Signed-off-by: Ryan Russell <git@ryanrussell.org>

* docs: Fix `jemalloc` link

Signed-off-by: Ryan Russell <git@ryanrussell.org>

* docs: Readability in `point_generation.rs`

Signed-off-by: Ryan Russell <git@ryanrussell.org>
2022-06-04 16:46:57 +00:00
Paul Dix 8c88e1e52c refactor: change orgbucket to database in data generator 2021-12-09 13:39:33 -05:00
Paul Dix 2c8d17bea8 refactor: change percent to ratio in data generator bucket writers 2021-12-08 12:09:04 -05:00
Paul Dix 31aa41e240 feat: add ability for data generator to write to many buckets
This adds the ability for the data generator to write to many databases. A new command line argument, `bucket_list`, is added which should be a file name. The file should contain a list of databases, one per line, with the structure of `<org>_<bucket>`. This is a little odd given the data generator expects org and bucket separately, but I expect the file that we'll be using will be database names, which have this format.

The configuration can specify what percentage of the list should get written to by which agents at what sampling interval. This should allow configurations where databases get different levels of ingest and different types (as specified via different agent specs). The structure is a little wonky, but I think it'll get the job done. The next step is to run some perf tests to see how the data generator performs if writing to 10k databases.
2021-12-08 12:09:04 -05:00
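
The `bucket_list` file format described in 31aa41e240 (one `<org>_<bucket>` entry per line) could be parsed roughly as follows. This is a minimal sketch, not the code from the commit: `OrgBucket` and `parse_bucket_list` are hypothetical names, and splitting at the first underscore is an assumption, since the commit message does not say how org names containing underscores are handled.

```rust
/// One org/bucket pair parsed from a line of the bucket list file.
/// Hypothetical type for illustration; not taken from the commit.
#[derive(Debug, PartialEq, Eq)]
pub struct OrgBucket {
    pub org: String,
    pub bucket: String,
}

/// Parses the contents of a bucket list file, one `<org>_<bucket>` per line.
/// Blank lines are skipped. Assumption: the org is everything before the
/// FIRST underscore, and the bucket is everything after it.
pub fn parse_bucket_list(contents: &str) -> Result<Vec<OrgBucket>, String> {
    contents
        .lines()
        .map(str::trim)
        .filter(|line| !line.is_empty())
        .map(|line| -> Result<OrgBucket, String> {
            let (org, bucket) = line
                .split_once('_')
                .ok_or_else(|| format!("expected <org>_<bucket>, got `{}`", line))?;
            if org.is_empty() || bucket.is_empty() {
                return Err(format!("empty org or bucket in `{}`", line));
            }
            Ok(OrgBucket {
                org: org.to_string(),
                bucket: bucket.to_string(),
            })
        })
        .collect()
}
```

Returning a `Result` for the whole file means a single malformed line rejects the list, which seems preferable to silently writing to fewer databases than intended.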
Paul Dix 3279725d10
refactor: Add agent name to data generator (#3297)
This is work leading up to giving the data generator the ability to write to many databases. The plan is to specify which agents each database will use to write data.
2021-12-05 11:21:04 -05:00
Paul Dix 7044b89453 feat: Refactor Data Generator
This is a huge commit that refactors the data generator. It removes many of the previous features that didn't quite make sense. The goal of this refactor was to make the data generator capable of representing complex tagsets that have values dependent on each other. It also significantly optimizes things to use far less memory and generate data much faster. Follow-on work will update the generation of line protocol to support spaces in tags and their keys, double quotes in strings, and add more examples and documentation.
2021-11-04 09:56:58 -04:00
Paul Dix 64fca1ee34 feat: Support sampling interval strings in data generator
This changes the sampling_interval in the data generator to be a string, supporting things like ns, us, ms, s, m, h and others.
2021-08-25 17:35:01 -04:00
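
To illustrate the string form of `sampling_interval` introduced in 64fca1ee34, here is a hand-rolled sketch of parsing values such as `10s` or `500ms` into a `std::time::Duration`. The log does not show how the commit itself parses these strings, so `parse_sampling_interval` is a hypothetical helper, and it covers only the six units the message names (`ns`, `us`, `ms`, `s`, `m`, `h`), not the "others" it mentions.

```rust
use std::time::Duration;

/// Parses an interval such as "10s" or "500ms" into a `Duration`.
/// Hypothetical helper for illustration: accepts an unsigned integer
/// followed by one of ns, us, ms, s, m, h.
pub fn parse_sampling_interval(s: &str) -> Result<Duration, String> {
    let s = s.trim();

    // The unit starts at the first character that is not an ASCII digit.
    let split = s
        .find(|c: char| !c.is_ascii_digit())
        .ok_or_else(|| format!("missing unit in `{}`", s))?;
    let (digits, unit) = s.split_at(split);

    let value: u64 = digits
        .parse()
        .map_err(|_| format!("invalid number in `{}`", s))?;

    let nanos_per_unit: u64 = match unit.trim() {
        "ns" => 1,
        "us" => 1_000,
        "ms" => 1_000_000,
        "s" => 1_000_000_000,
        "m" => 60 * 1_000_000_000,
        "h" => 3_600 * 1_000_000_000,
        other => return Err(format!("unsupported unit `{}`", other)),
    };

    // Guard against overflow for very large values.
    value
        .checked_mul(nanos_per_unit)
        .map(Duration::from_nanos)
        .ok_or_else(|| format!("interval `{}` is too large", s))
}
```

Converting everything to nanoseconds keeps the unit table to a single multiplier per entry; a production parser would more likely rely on an existing duration-parsing crate.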
Paul Dix d5f01a2a68 refactor: move data generator to IOx repo and fix build 2021-08-19 14:26:15 -04:00