influxdb

Commit Graph

Author	SHA1	Message	Date
kodiakhq[bot]	d0965bb0b2	Merge branch 'main' into dom/mb-partitioning	2022-02-16 11:30:42 +00:00
Paul Dix	f542045485	feat: wire up persistence in ingester (#3685 ) This adds persistence into the ingester with a lifecycle manager. The persist operation must still be updated to keep track of the min_unpersisted_sequence_number for each sequencer.	2022-02-16 00:13:40 +00:00
Edd Robinson	7ac9e216c4	refactor: use same log message	2022-02-15 14:36:55 +00:00
Edd Robinson	8a5ea29190	refactor: add measurement to log	2022-02-15 14:31:26 +00:00
Marco Neumann	44ee0166a0	fix: start Kafka write buffer stream at "earliest" offset, not at "0" (#3748 )	2022-02-15 13:36:59 +00:00
Marco Neumann	9e7a27b344	fix: default Kafka topic name is `iox-shared` (#3747 ) Do NOT use underscores in the Kafka topic because this is not supported by Kafka. This was initially fixed by #3555 but reverted by #3623.	2022-02-15 12:34:46 +00:00
Andrew Lamb	a30803e692	chore: Update datafusion, update `arrow`/`parquet`/`arrow-flight` to 9.0 (#3733 ) * chore: Update datafusion * chore: Update arrow * fix: missing updates * chore: Update cargo.lock * fix: update for smaller parquet size * fix: update test for smaller parquet files * test: ensure parquet_file tests write multiple row groups * fix: update callsite * fix: Update for tests * fix: harkari * fix: use IoxObjectStore::existing Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-15 12:10:24 +00:00
Dom Dwyer	e055800039	refactor: enable Partitioner in request pipeline Adds the Partitioner DML handler into the handler stack, modifying the input types of down-stream handlers to accept the partitioned data.	2022-02-15 11:34:33 +00:00
dependabot[bot]	89105ccfab	chore(deps): Bump tokio-util from 0.6.9 to 0.7.0 (#3743 ) Bumps [tokio-util](https://github.com/tokio-rs/tokio) from 0.6.9 to 0.7.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/commits) --- updated-dependencies: - dependency-name: tokio-util dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-02-15 11:33:41 +00:00
Dom Dwyer	e99922d518	refactor: parametrise DML handler input type Allow a DML handler to specify the write input type on which it operates. This allows us to construct a write handler pipeline that transforms the request as it passes through the various handlers. We'll use this to implement a handler that annotates a normal set of table writes with the partition key, modifying downstream handlers to expect this annotated input.	2022-02-15 11:23:45 +00:00
Marco Neumann	c6e374a025	feat: allow catalog access w/o a transaction (#3735 ) * feat: allow catalog access w/o a transaction Now the caller has the full control if they want to use a transaction or not. * fix: remove non-transaction-safe `create_many` * fix: remove unnecessary transactions	2022-02-15 10:15:36 +00:00
dependabot[bot]	60a7f87645	chore(deps): Bump serde_json from 1.0.78 to 1.0.79 (#3739 ) Bumps [serde_json](https://github.com/serde-rs/json) from 1.0.78 to 1.0.79. - [Release notes](https://github.com/serde-rs/json/releases) - [Commits](https://github.com/serde-rs/json/compare/v1.0.78...v1.0.79) --- updated-dependencies: - dependency-name: serde_json dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-14 20:42:54 +00:00
Raphael Taylor-Davies	26fd5273f0	feat: static database configuration (#2436 ) (#3732 ) * feat: static database configuration (#2436) * chore: fmt * feat: don't base64 encode UUIDs in ServerConfigFile Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-14 19:42:49 +00:00
Raphael Taylor-Davies	c79050254f	refactor: traitify database configuration (#2436 ) (#3730 ) * refactor: traitify database configuration (#2436) * chore: review feedback Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-13 09:26:44 +00:00
Raphael Taylor-Davies	866777ecd2	feat: static router configuration (#2436 ) (#3725 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-11 14:09:37 +00:00
Raphael Taylor-Davies	4e3f66ed07	feat: CLI and gRPC APIs for shutting down and restarting databases (#3720 ) * feat: allow catalog wipe and rebuild whilst shutdown * feat: CLI and gRPC APIs for shutting down and restarting databases * feat: add ability to skip replay on restart * fix: test_wipe_persisted_catalog_error_db_exists * fix: wipe_preserved_catalog	2022-02-11 10:14:43 +00:00
Raphael Taylor-Davies	910f381355	refactor: require UUID to create Database (#3715 ) * refactor: require UUID to create Database * chore: review feedback * chore: fmt Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-10 20:04:06 +00:00
Raphael Taylor-Davies	b1190262b7	feat: restartable `Database` (#3368 ) (#3711 ) * feat: restartable `Database` (#3368) * chore: fmt * fix: wipe_preserved_catalog * chore: review feedback Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-10 18:32:05 +00:00
Andrew Lamb	d9f331ba2a	chore: update datafusion, stop repartitioning so aggressively (#3633 ) * chore: update datafusion * fix: Update to use new datafusion api * chore: update expected plans * fix: support zero output partitions * fix: update test * fix: Update for new DataFusion API * fix: newly added system table * fix: update cargo lock	2022-02-09 19:53:41 +00:00
Carol (Nichols \|\| Goulding)	73828323ac	feat: Ingester Flight gRPC API (#3623 ) * feat: Add a way to run ingester with an in-memory catalog from the CLI If you set the --catalog-dsn string to "mem", rather than using that as a Postgres connection URL, create an in-memory catalog. Planning on using this in tests, so not documenting. * fix: Set default topic to the same value as SHARED_KAFKA_TOPIC Namely, both should use an underscore. I don't think there's a way to directly share these values between a constant and an annotation. * feat: Add a flight API (handshake only) to ingester * fix: Create partitions if using file-based write buffer * fix: Change the server fixture to handle ingester server type For now, the ingester doesn't implement the deployment API. Not sure if it should or not. * feat: Start implementing ingester do_get, namely decoding the query Skip serialization of the predicate for the moment. * refactor: Rename ingest protos to ingester to match crate name * refactor: Rename QueryResults to QueryData * feat: Move ingester flight client to new querier crate * fix: Off by one error, different starting indexes in sequencers * fix: Create new CLI argument to pick the catalog type * fix: Create a CLI option to set the number of topics to auto-create in the write buffer * fix: Check the arrow flight service's health to tell that the ingester gRPC is up * fix: Set postgres as the default catalog type * fix: Return an error rather than panicking if CLI args aren't right	2022-02-09 19:07:44 +00:00
Edd Robinson	2334e779eb	feat: implement read_window_aggregate sub-command	2022-02-09 12:32:48 +00:00
Edd Robinson	0774e1d328	feat: add read_window_aggregate request builder	2022-02-09 12:32:48 +00:00
Marco Neumann	4bddab56e2	feat: create new sequencers in ingester on demand (#3671 ) There is no need to introduce yet another admin action to do that. If the sequencer does not exist yet, we can just create it and set the `min_unpersisted_sequence_number` to 0 (which is done be `create_or_get`).	2022-02-09 12:26:30 +00:00
Edd Robinson	dfa6fd8579	feat: add quiet option to storage	2022-02-08 21:27:29 +00:00
Edd Robinson	11855a5eff	feat: add format flag	2022-02-08 21:15:07 +00:00
Edd Robinson	c175ccd1b4	feat: make stop/stop/predicate global (#3681 )	2022-02-08 20:06:47 +00:00
kodiakhq[bot]	ace76cef14	Merge branch 'main' into dom/sharded-cache	2022-02-08 16:09:48 +00:00
Paul Dix	59b2141c0b	feat: Add lifecycle manager to ingester (#3645 ) This adds the lifecycle manager to the ingester. It will trigger based on a threshold for max partition size or age or based on keeping total memory under a certain threshold. It defines a new interface for a persister, which is stubbed out for IngesterData. I'm not sure yet how persistence errors should be handled. The assumption here is that the persister continues to retry persistence forever until it succeeds. There is one scenario I can think of that may cause this lifecycle manager problems. If a single partition is very high throughput, it could cause things to back up as persistence is not parallelized within a single partition. Any given partition can currently only run one persistence operation at a time. We can address this later. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-08 15:23:40 +00:00
Marco Neumann	5de4d6203f	refactor: catalog transaction (#3660 ) * refactor: catalog Unit of Work (= transaction) Setup an inteface to handle Units of Work within our catalog. Previously both the Postgres and the in-mem backend used "mini-transactions on demand". Now the caller has a clear way to establish boundaries and gets read and write isolation. A single `Arc<dyn Catalog>` can create as many `Box<dyn UnitOfWork>` as you like, but note that depending on the backend you may not scale infinitely (postgres will likely impose certain limits and the in-mem backend limits concurrency to 1 to keep things simple). * docs: improve wording Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * refactor: rename Unit of Work to Transaction * test: improve `test_txn_isolation` * feat: clearify transaction drop semantics Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-08 13:38:33 +00:00
kodiakhq[bot]	4567800901	Merge branch 'main' into er/feat/tag_values_cli	2022-02-08 13:07:59 +00:00
Raphael Taylor-Davies	be662ec731	feat: lazy query log! (#3654 ) * feat: lazy query log * chore: fmt * chore: review feedback Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-08 13:07:28 +00:00
Edd Robinson	6c10e1e901	feat: support _measurement/_field tag keys	2022-02-08 11:32:28 +00:00
Edd Robinson	eb733042ca	feat: add support for tag_values cli	2022-02-07 22:02:29 +00:00
Edd Robinson	38a889ecf6	refactor: remove unnecessary struct	2022-02-07 22:02:29 +00:00
Marco Neumann	d9cc9f5a2a	feat: expose write buffer connection config via CLI (#3651 ) * feat: improve rskafka config error messages * feat: expose write buffer connection config via CLI	2022-02-07 16:24:28 +00:00
Marco Neumann	977ccc1989	fix: use a single metric registry for ingester (#3652 ) With this change write buffer ingestion metrics are showing up under `/metrics` Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-07 15:56:54 +00:00
Edd Robinson	87ac926e06	feat: add queries system table (#3655 ) Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-02-07 15:26:06 +00:00
Carol (Nichols \|\| Goulding)	2e30483f1f	refactor: Remove predicate module from predicate crate (#3648 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-07 14:54:07 +00:00
Marco Neumann	e2db1df11f	refactor: improve writer buffer consumer interface (#3631 ) * refactor: improve writer buffer consumer interface The change looks huge but is actually rather simple. To understand the interface change, let me first explain what we want: - be able to fetch watermarks for any sequencer - have streams: - each streams tracks a sequencer and has an offset state (no read multiplexing) - we can seek a stream - seeking and streaming cannot be done at the same time (that would be weird and likely leads to many bugs both in write buffer and in the user code) - ideally we don't need to create streams of all sequencers but can choose a subset Before this change we had one mutable consumer struct where you can get all streams and watermark functions (this mutable-borrows the consumer) or you can seek a single stream (this also mutable-borrows the consumer). This is a bit weird for multiple reasons: - you cannot seek a single stream without dropping all of them - the mutable-borrow construct makes it really difficult to pass the streams into separate threads - the consumer is boxed (because its mutable) which makes it more difficult to handle in a large-scale application What this change does is the following: - you have an immutable consumer (similar to the producer) - the consumer offers the following methods: - get the set of sequencer IDs - get watermark for any sequencer - get a stream handler (see next point) for any sequencer - the stream handler captures the stream state (offset) and provides you a standard `Stream<_>` interface as well as a seek function. Mutable-borrows ensure that you cannot use both at the same time. The stream handler provides you the stream via `handler.stream()`. It doesn't implement `Stream<_>` itself because the way boxing, dynamic dispatch work, and pinning interact (i.e. I couldn't get it to work without the indirection). As a bonus point (which we don't use however) you can now create multiple streams for the same sequencer and they all have their own offset. * fix: review comments Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com> Co-authored-by: Carol (Nichols \|\| Goulding) <193874+carols10cents@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-07 12:24:17 +00:00
Edd Robinson	a52c0a26e6	feat: print read filter results	2022-02-04 22:14:22 +00:00
Edd Robinson	d328b37803	feat: teach IOx to convert RPC frames into Recordbatches	2022-02-04 18:34:54 +00:00
Edd Robinson	4cdaaf96bf	refactor: clean up errors	2022-02-04 18:34:54 +00:00
Edd Robinson	ea0ece8b4b	feat: issue read_filter request	2022-02-04 18:34:54 +00:00
Dom Dwyer	0b044b95fb	perf: use sharded namespace cache Enables the ShardedCache for the namespace schema cache.	2022-02-04 16:12:51 +00:00
Dom Dwyer	026a557c0b	refactor: rename TableNamespaceSharder Rename to JumpHash and expose the hashing internals for reuse (outside of only table & namespace sharding).	2022-02-04 15:56:09 +00:00
Dom Dwyer	0fd122e365	refactor: "inf" retention const Adds the iox_catalog::INFINITE_RETENTION_POLICY constant.	2022-02-04 15:35:33 +00:00
Dom Dwyer	f1ba50f40b	feat: resolve query pool ID at startup This commit adds a --query-pool flag to router2, used to upsert a catalog record at startup. Auto-created namespaces will reference this query pool. This is for testing only and will be removed in a future commit.	2022-02-04 15:35:30 +00:00
Dom Dwyer	aefc70a9ea	feat(router2): namespace auto-creation Decorate the existing request handler pipeline with a layer that implicitly creates the namespace when a write request is received.	2022-02-04 15:34:15 +00:00
Marco Neumann	0c01044677	fix: partition range in ingester CLI has INCLUSIVE end (#3641 )	2022-02-04 13:41:57 +00:00
Marco Neumann	d2ccf23263	fix: use standard DSN argument for router2 CLI (#3632 ) - support long-form (instead of relying on positional arguments) - use same code as everying else Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-03 17:20:52 +00:00

1 2 3 4 5 ...

337 Commits (37c65fc24f2170a8a187cd62d66f1122c0b7b099)