influxdb

Commit Graph

Author	SHA1	Message	Date
Marco Neumann	657ac249e9	feat: track ingester jobs (#3836 )	2022-02-23 15:33:47 +00:00
dependabot[bot]	b63f920d4c	chore(deps): Bump parquet from 9.0.2 to 9.1.0 (#3828 ) * chore(deps): Bump parquet from 9.0.2 to 9.1.0 Bumps [parquet](https://github.com/apache/arrow-rs) from 9.0.2 to 9.1.0. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/apache/arrow-rs/compare/9.0.2...9.1.0) --- updated-dependencies: - dependency-name: parquet dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * chore: update chunk size test Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Raphael Taylor-Davies <r.taylordavies@googlemail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-23 11:25:15 +00:00
dependabot[bot]	5a79b3a68b	chore(deps): Bump arrow-flight from 9.0.2 to 9.1.0 (#3829 ) Bumps [arrow-flight](https://github.com/apache/arrow-rs) from 9.0.2 to 9.1.0. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/apache/arrow-rs/compare/9.0.2...9.1.0) --- updated-dependencies: - dependency-name: arrow-flight dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-23 11:03:22 +00:00
dependabot[bot]	3b7d31c88a	chore(deps): Bump arrow from 9.0.2 to 9.1.0 (#3826 ) Bumps [arrow](https://github.com/apache/arrow-rs) from 9.0.2 to 9.1.0. - [Release notes](https://github.com/apache/arrow-rs/releases) - [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/apache/arrow-rs/compare/9.0.2...9.1.0) --- updated-dependencies: - dependency-name: arrow dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-02-23 09:25:46 +00:00
dependabot[bot]	ad3868ed7c	chore(deps): Bump tokio from 1.16.1 to 1.17.0 (#3814 ) * chore(deps): Bump tokio from 1.16.1 to 1.17.0 Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.16.1 to 1.17.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/compare/tokio-1.16.1...tokio-1.17.0) --- updated-dependencies: - dependency-name: tokio dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * build: update workspace-hack Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Dom Dwyer <dom@itsallbroken.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-22 16:27:43 +00:00
Carol (Nichols \|\| Goulding)	1b9212540b	feat: Send IngesterQueryResponse data back as response of doGet Flight request (#3772 ) * fix: Adjust fields of IngesterQueryResponse * feat: Adjust IngestHandler query method to call prepare_data_to_querier * feat: Send ingest query result data back through Flight doGet * feat: Send delete predicates and max sequencer number in metadata * fix: greater_than_sequence_number should be of type SequenceNumber * fix: Remove DeletePredicates from IngesterQueryResponse Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-18 17:42:49 +00:00
Marco Neumann	f54ef92b77	fix: supervise and shutdown ingester background tasks (#3769 ) * fix: supervise and shutdown ingester background tasks Closes #3761. Closes #3762. * docs: improve wording Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com> * test: join/shutdown handling for ingester Co-authored-by: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com>	2022-02-18 09:35:29 +00:00
Andrew Lamb	a30803e692	chore: Update datafusion, update `arrow`/`parquet`/`arrow-flight` to 9.0 (#3733 ) * chore: Update datafusion * chore: Update arrow * fix: missing updates * chore: Update cargo.lock * fix: update for smaller parquet size * fix: update test for smaller parquet files * test: ensure parquet_file tests write multiple row groups * fix: update callsite * fix: Update for tests * fix: harkari * fix: use IoxObjectStore::existing Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-15 12:10:24 +00:00
dependabot[bot]	89105ccfab	chore(deps): Bump tokio-util from 0.6.9 to 0.7.0 (#3743 ) Bumps [tokio-util](https://github.com/tokio-rs/tokio) from 0.6.9 to 0.7.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/commits) --- updated-dependencies: - dependency-name: tokio-util dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-02-15 11:33:41 +00:00
Carol (Nichols \|\| Goulding)	73828323ac	feat: Ingester Flight gRPC API (#3623 ) * feat: Add a way to run ingester with an in-memory catalog from the CLI If you set the --catalog-dsn string to "mem", rather than using that as a Postgres connection URL, create an in-memory catalog. Planning on using this in tests, so not documenting. * fix: Set default topic to the same value as SHARED_KAFKA_TOPIC Namely, both should use an underscore. I don't think there's a way to directly share these values between a constant and an annotation. * feat: Add a flight API (handshake only) to ingester * fix: Create partitions if using file-based write buffer * fix: Change the server fixture to handle ingester server type For now, the ingester doesn't implement the deployment API. Not sure if it should or not. * feat: Start implementing ingester do_get, namely decoding the query Skip serialization of the predicate for the moment. * refactor: Rename ingest protos to ingester to match crate name * refactor: Rename QueryResults to QueryData * feat: Move ingester flight client to new querier crate * fix: Off by one error, different starting indexes in sequencers * fix: Create new CLI argument to pick the catalog type * fix: Create a CLI option to set the number of topics to auto-create in the write buffer * fix: Check the arrow flight service's health to tell that the ingester gRPC is up * fix: Set postgres as the default catalog type * fix: Return an error rather than panicking if CLI args aren't right	2022-02-09 19:07:44 +00:00
Paul Dix	59b2141c0b	feat: Add lifecycle manager to ingester (#3645 ) This adds the lifecycle manager to the ingester. It will trigger based on a threshold for max partition size or age or based on keeping total memory under a certain threshold. It defines a new interface for a persister, which is stubbed out for IngesterData. I'm not sure yet how persistence errors should be handled. The assumption here is that the persister continues to retry persistence forever until it succeeds. There is one scenario I can think of that may cause this lifecycle manager problems. If a single partition is very high throughput, it could cause things to back up as persistence is not parallelized within a single partition. Any given partition can currently only run one persistence operation at a time. We can address this later. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-08 15:23:40 +00:00
Paul Dix	ce46bbaada	feat: wire up the write buffer to the ingester process (#3533 ) This adds the scaffolding for the ingester server to consume data from Kafka. This ingests data in an in memory structure while creating records in the catalog for any partitions that don't yet exist. I've removed catalog_update.rs in ingester for now. That was mostly a placeholder and will be going in a combination of handler.rs and data.rs on my next PR which will have some primitive lifecycle wired up. There's one ugly bit here where the DML write is cloned because it's getting borrowed to output spans and metrics. I'll need to follow up with a refactor to make it so that the DML write's tables can be consumed without it gumming up the metrics stuff. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-03 11:47:28 +00:00
Marco Neumann	22778a3a80	chore: upgrade rskafka and parking_lot (#3592 )	2022-02-01 11:50:42 +00:00
Carol (Nichols \|\| Goulding)	bf89162fa5	refactor: Move IoxMetadata to parquet_file	2022-01-31 10:36:33 -05:00
Carol (Nichols \|\| Goulding)	dd9620da0c	feat: Create a new proto definition for the new design's IoxMetadata	2022-01-31 10:36:32 -05:00
Carol (Nichols \|\| Goulding)	5e0e0d8aa7	feat: Write parquet to object storage in a similar way as parquet_file::Storage	2022-01-31 10:36:32 -05:00
Carol (Nichols \|\| Goulding)	c633c9bc5c	feat: Wire object store into ingester persistence	2022-01-31 10:36:30 -05:00
Nga Tran	8735ede74f	feat: IoxMetadata for parquet file (#3547 ) * feat: IoxMetadata for parquet file * fix: typos * refactor: address review comments Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-01-28 14:41:59 +00:00
Andrew Lamb	5488c257d1	chore: Update datafusion, upgrade to arrow/parqet/arrow-flight 8.0.0 (#3517 ) * chore: Update datafusion * chore: update to arrow 8 * fix: update to use new DataFusion APIs * fix: update case for sortedness * fix: cargo hakari	2022-01-27 13:33:27 +00:00
Carol (Nichols \|\| Goulding)	bc44d33108	feat: Implement a snapshot method on DataBuffer (#3518 ) * feat: Implement a snapshot method on DataBuffer Fixes #3510. * test: Add a test snapshotting batches with different but compatible schemas * fix: Simplify min/max sequencer number collection The first batch should always have the min sequencer number. The last batch should always have the max sequencer number. The min should always be less than (or equal to, in case there's only one batch) the max.	2022-01-26 15:22:51 +00:00
Nga Tran	52866fe6a9	fix: merge record batches into one batch (#3535 ) * fix: merge record batches into one batch refactor: address review comments * chore: update test output	2022-01-25 23:29:16 +00:00
NGA-TRAN	797ba459b9	chore: merge main to branch	2022-01-24 12:06:23 -05:00
NGA-TRAN	939ea536d4	feat: add but ignore a few compaction tests	2022-01-24 12:00:23 -05:00
Paul Dix	bb893510a0	feat: Add scaffolding for ingester server * Adds a new ingester command to start an ingester server * Moves previous ingester server over to handler * Skeleton for gRPC and HTTP handlers	2022-01-21 18:02:19 -05:00
NGA-TRAN	cd01b141f3	refactor: for paul	2022-01-21 16:49:02 -05:00
NGA-TRAN	191adc9fc7	feat: initial implementation for ingester's compaction	2022-01-20 18:22:41 -05:00
NGA-TRAN	edb97f51cf	refactor: add persisting struct	2022-01-19 12:36:18 -05:00
NGA-TRAN	8a17e1c132	refactor: address review comments	2022-01-19 11:20:20 -05:00
NGA-TRAN	fe9a41ee9a	chore: remove non-longer needed dependency	2022-01-18 21:45:20 -05:00
NGA-TRAN	b57f027e35	refactor: address review comments	2022-01-18 20:57:13 -05:00
NGA-TRAN	367a9fb812	fix: add workspace-hack	2022-01-18 18:10:42 -05:00
NGA-TRAN	125285ae9a	feat: commit in order to pull and merge new commit from main	2022-01-18 16:11:25 -05:00
NGA-TRAN	23290fd2ff	fix: new data structures suggested by reviewers	2022-01-18 14:04:07 -05:00
NGA-TRAN	ef336b4659	feat: add ingester crate and a few basic data structures for its data lifecycle	2022-01-17 15:38:03 -05:00

1 2 3

134 Commits (4fc5a90d606e41a913576bdb9fab6029e05e5e44)