influxdb

Commit Graph

Author	SHA1	Message	Date
Dom	9cd1286051	Merge branch 'main' into dom/meta-remove-row-count	2022-05-23 16:39:38 +01:00
Marco Neumann	2029bd16ba	feat: enable debugging of failed querier->ingester requests (#4659 ) * feat: enable debugging of failed querier->ingester requests - extend `query-ingester` CLI to allow usage of predicates - on failed requests: log all information that required for the CLI - test the "ingester fails" scenario * test: explain Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * docs: improve Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * refactor: move b64 pred. serde into a single crate Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-05-23 15:37:31 +00:00
Dom Dwyer	2e6c49be83	refactor: remove IoxMetadata min & max timestamp Removes the min/max timestamp fields from the IoxMetadata proto structure embedded within a Parquet file's metadata. These values are redundant as they already exist within the Parquet column statistics, and precluded streaming serialisation as these removed min/max values were needed before serialising the file.	2022-05-23 16:27:08 +01:00
Dom Dwyer	a142a9eb57	refactor: remove row_count from IoxMetadata Remove the redundant row_count from the IoxMetadata structure that is serialised into the Parquet file. The reasoning is twofold: * The Parquet file's native metadata already contains a row count * Needing to know the number of rows up-front precludes streaming	2022-05-23 16:18:35 +01:00
Carol (Nichols || Goulding)	2ee4a6669a	refactor: Move the code merging write infos to generated_types to share	2022-05-11 14:07:42 -04:00
Carol (Nichols || Goulding)	26170b7a07	refactor: Move gRPC conversion code to generated_types to share	2022-05-11 14:07:12 -04:00
Andrew Lamb	84fd883688	feat: Add query_ingester CLI command (#4554 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-10 18:18:07 +00:00
Jake Goulding	e07bcd40c2	refactor: Remove unused dependencies These were found by iterating over all of the dependencies of each Cargo.toml, then grepping that crate for the dependency's name. If it didn't show up, I attempted to remove it. I left a few dependencies that this process flagged: * generated_types - `pbjson`,`serde`. Apparently used by the generated code. * grpc-router-test-gen - `prost`. Apparently used by the generated code. * influxdb_iox - `heappy`. Doesn't appear used, but is behind enough feature flags that I don't care to reason about and it's already optional. - `tikv_jemalloc_sys`. Appears to be setting a feature flag of an indirect dependency. * iox_gitops_adapter - `k8s_openapi`. Appears to be setting a feature flag of an indirect dependency.	2022-05-06 15:57:58 -04:00
Carol (Nichols || Goulding)	6681298a93	fix: Remove unused dependencies found with cargo-udeps	2022-05-06 14:51:54 -04:00
Carol (Nichols || Goulding)	068096e7e1	fix: Rename data_types2 to data_types	2022-05-06 14:45:39 -04:00
Carol (Nichols || Goulding)	fb8f8d22c0	fix: Remove now-unused ServerId. Fixes #4451	2022-05-06 14:45:38 -04:00
Carol (Nichols || Goulding)	485d6edb8f	refactor: Move IngesterQueryRequest to generated_types	2022-05-06 14:45:37 -04:00
Carol (Nichols || Goulding)	e9a42c418a	fix: Only use data_types2 in generated_types	2022-05-06 14:45:36 -04:00
Carol (Nichols || Goulding)	91961273c2	fix: Remove unused Rust code in generated_types	2022-05-06 11:50:03 -04:00
Carol (Nichols || Goulding)	e6e0655b31	fix: Remove OG proto definitions Fixes #4475.	2022-05-06 11:50:03 -04:00
Carol (Nichols || Goulding)	b422eac064	fix: Add go_package to protos (#4505 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-05-04 15:20:09 +00:00
Andrew Lamb	6381ea60bb	chore: port remaining read_filter influxrpc tests to NG (#4383 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-29 14:06:50 +00:00
Paul Dix	8e48fcd620	feat: add remote pull partition (#4433 ) Add lookup of partitions by table id to catalog. Add API to catalog to return partitions by table id. Add to client to return partitions by table id. Add CLI to pull remote schema, partition, and parquet files into a local catalog and object store.	2022-04-28 21:04:27 +00:00
Andrew Lamb	e13d3433ae	feat: Use datafusion serialization code rather than our own copy of it (#4421 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-28 13:03:34 +00:00
Andrew Lamb	115f007317	refactor: Use DataFusion `Expr` instead of our own custom wrapper for `ValueExpr` (#4440 ) * refactor: Use DataFusion `Expr` instead of custom wrapper for BinaryExprs * fix: apply code review suggestions * fix: more code review suggestions	2022-04-27 19:20:15 +00:00
二手掉包工程师	4b47d723b1	refactor: Rename time to iox_time (#4416 ) Signed-off-by: hi-rustin <rustin.liu@gmail.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-26 00:19:59 +00:00
Marco Neumann	86e8f05ed1	fix: make all catalog IDs 64bit (#4418 ) Closes #4365. Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-25 16:49:34 +00:00
Andrew Lamb	73bed810da	chore: Update arrow, arrow-flight, parquet, tonic, prost, etc (#4357 ) * chore: Update datafusion * chore: Update arrow/arrow-flight/parquet to 12 * chore: update datafusion correctly * chore: Update prost, tonic, and dependents * fix: Fixup some api changes * fix: Update test output in db * fix: Update test output in parquet_file * fix: remove old pbjson types * fix: Add "--experimental_allow_proto3_optional" flag * chore: Run cargo hakari tasks * fix: compile error * chore: Update heappy Co-authored-by: CircleCI[bot] <circleci@influxdata.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-20 11:12:17 +00:00
Andrew Lamb	0642ec0b82	docs: add note about write_info API being internal (#4356 ) * docs: add note about write_info API being internal * fix: update doc urls Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-20 09:25:14 +00:00
Andrew Lamb	5ea676d3f7	feat: add per kafka partition durability reporting to write info response (#4341 ) * feat: add per kafka partition durability reporting to write info response * fix: buf lint + test cleanup * fix: clean up protobuf * refactor: pull out conversion of KafkaPartitionStatus into a function * fix: fmt * fix: typo Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-19 16:46:20 +00:00
Andrew Lamb	e3d83fe757	chore: update datafusion (#4342 ) * chore: update datafusion * fix: Update imports for change in datafusion organization	2022-04-19 13:38:12 +00:00
Marco Neumann	5b48675435	fix: actually transmit record-batch metadata from querier (#4347 ) Attaching the "batch => partition" mapping via per-batch schema KV metadata does NOT work because flight will transmit the schema once for all batches (even though on the Rust side we have a schema ref attached to every batch, probably for convenience). Instead we now use the same global protobuf metadata that we also use for the "partition => max sequence number" information. This somewhat limits our ability to create record batches lazily on the ingester side (since the global metadata is sent before any actual payload) but I think we should not modify the usage of the flight protocol too much right now (e.g. by sending more schema messages). If this becomes an issue, we can always find a more complex solution in the future.	2022-04-19 10:54:23 +00:00
Paul Dix	5bf4550259	feat: add object store service to router (#4338 ) Add method to catalog to get parquet file by object store id. Add gRPC service for object store to get a file from by its uuid. Add the object store service to router2 with object store config.	2022-04-16 17:58:31 +00:00
Paul Dix	99cbb28a89	feat: add initial catalog service to router (#4316 ) Create new crate for iox_catalog_service. Add rpc to return parquet_file records by partition id. Add CatalogService to router2. The catalog service will be added to over time to provide access to the catalog over gRPC.	2022-04-14 17:39:18 +00:00
Marco Neumann	83f77712b1	refactor: querier<>ingester flight protocol adjustments (#4286 ) * refactor: querier<>ingester flight protocol adjustments This makes a few adjustments to the querier<>ingester flight protocol. Query Scope =========== The querier will request data for ALL sequencer IDs for now. There is no reason to have a request per sequencer ID. We can add a range/set filter later if we want, but this is not required for now. Partition-level =============== The only time when the querier cares about sequencer IDs (i.e. sharding) at all is when it selects which ingesters to ask for unpersisted data (this is currently not implemented, it just asks all ingesters). Afterwards the querier only cares about partitions (which are bound to specific sequencers anyways) because this is the level where parquet file persistence and compaction as well as deduplication happen. So we make partitions a first-class citizen in the ingester response. Metadata VS RecordBatches ========================= The global app-metadata will list all partitions and their max persisted parquet files and tombstones (theoretically tombstones are at table-level, but the ingester could in the future break them down to the partition-level). Then it receives a stream of record batches. Each record batch is tagged (via key-value metadata in its schema) so it can be assigned to a partition. At the moment the ingester returns 0 or 1 batches per unpersisted partition (0 in case we've filtered out all the data via the predicate), but in the future it is free to return multiple batches. This setup gives the ingester more freedom over memory management and (potentially parallel) query processing, while at the same time keeps the set of duplicated information minimal and allows easy extensions (since the global metadata is a full-blown protobuf message). Querier ======= At the moment the querier ignores all the metdata. Follow-up PRs will change that. * docs: improve Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org> * refactor: make code clearer Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>	2022-04-12 16:48:40 +00:00
Marco Neumann	380cd9bbff	refactor: use a single flight client implementation (#4273 ) "end-user -> querier" and "querier -> ingester" should use a single Flight client implementation. The difference is just the request and response metadata. This changes our default Flight client to use protobuf instead of JSON for the ticket format.	2022-04-12 09:08:25 +00:00
Andrew Lamb	a30a85e62c	feat: Add get_write_info service (#4227 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-07 19:24:58 +00:00
Andrew Lamb	5d66cd0a81	feat: Add WriteSummary serialization and deserialization to protobuf (#4232 ) * feat: Add WriteSummary serialization and deserialization to protobuf * fix: clippy Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-05 09:57:32 +00:00
dependabot[bot]	276449ee09	chore(deps): Bump pbjson from 0.2.3 to 0.3.0 (#4215 ) Bumps [pbjson](https://github.com/influxdata/pbjson) from 0.2.3 to 0.3.0. - [Release notes](https://github.com/influxdata/pbjson/releases) - [Commits](https://github.com/influxdata/pbjson/compare/0.2.3...0.3.0) --- updated-dependencies: - dependency-name: pbjson dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-04-04 12:05:46 +00:00
dependabot[bot]	36dd6f26a3	chore(deps): Bump pbjson-build from 0.2.3 to 0.3.0 (#4220 ) Bumps [pbjson-build](https://github.com/influxdata/pbjson) from 0.2.3 to 0.3.0. - [Release notes](https://github.com/influxdata/pbjson/releases) - [Commits](https://github.com/influxdata/pbjson/compare/0.2.3...0.3.0) --- updated-dependencies: - dependency-name: pbjson-build dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-04 10:45:31 +00:00
Andrew Lamb	833c10c083	feat: return write_token from HTTP writes to router2 (#4202 ) * feat: return write_token from HTTP writes to router2 * fix: Update router2/src/dml_handlers/instrumentation.rs Co-authored-by: Dom <dom@itsallbroken.com> * refactor: Use WriteSummary::default more vigorously * fix: fix typo and add links to follow on issues Co-authored-by: Dom <dom@itsallbroken.com> Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-04-02 10:34:51 +00:00
Andrew Lamb	a1df864283	feat: Support 'SHOW NAMESPACES' in sql repl (#4164 ) * feat: Support `SHOW NAMESPACES` in sql repl * feat: add basic support to clients * fix: add get_namespaces service test * fix: proper error handling * test: end to end test for namespace client * refactor: Use QuerierDatabase rather than Catalog * refactor: remove unused function	2022-03-31 12:57:33 +00:00
Nga Tran	ddc2c8304f	fix: have the compaction level set correctly (#4184 ) * fix: have the compaction level set correctly, especially for compacted file from the compactor * fix: typo	2022-03-30 21:23:40 +00:00
Andrew Lamb	58c630d709	chore: Update datafusion (#4133 ) * chore: Update datafusion * fix: typo * fix: Update explain plan output * fix: update Cargo.locl Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-25 15:08:39 +00:00
Marco Neumann	51da6dd7fa	feat: store sort key in NG metadata (#4110 ) The sort key is optional and currently only produced by `iox_tests`. Writing it within the ingester/compactor is tracked by #3968. The sort key is read by the querier (and this will be verified by the query tests and is required to merge #4103). Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-23 18:24:46 +00:00
Luke Bond	b098828c97	feat: schema grpc server & proto in router2 (#4081 ) * feat: schema grpc server & proto in router2 * chore: comments in schema proto Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-22 11:27:20 +00:00
Andrew Lamb	8f1938a482	chore: Update datafusion (#4022 ) * chore: Update datafusion * chore: update for change in Expr Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-14 17:24:00 +00:00
Andrew Lamb	d2c0acdd46	refactor: Remove serving readiness gate (#3986 ) * refactor: Remove serving_readiness * fix: remove more * fix: remove test Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-03-09 12:17:44 +00:00
Carol (Nichols || Goulding)	9961efd702	feat: Send parquet and tombstone seq nums with ingester query response (#3925 ) Fixes #3867.	2022-03-04 15:22:29 +00:00
Carol (Nichols || Goulding)	2a90841715	refactor: Move IngesterQueryRequest to data_types2 So that querier doesn't need to depend on ingester.	2022-03-02 13:52:13 -05:00
Raphael Taylor-Davies	2a842fbb1a	feat: correctly sort data and store in catalog metadata (#3864 ) * feat: respect sort order in ChunkTableProvider (#3214) feat: persist sort order in catalog (#3845) refactor: owned SortKey (#3845) * fix: size tests * refactor: immutable SortKey * test: test sort order restart (#3845) * chore: explicit None for sort key * chore: test cleanup * fix: handling of sort keys containing fields * chore: remove unused selected_sort_key * chore: more docs Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-25 17:56:27 +00:00
Carol (Nichols || Goulding)	723a0c659f	fix: Remove greater_than_sequence_number from IngesterQueryRequest (#3856 )	2022-02-24 19:23:44 +00:00
Carol (Nichols || Goulding)	252ced7adf	feat: Add row count to the parquet_file record in the catalog (#3847 ) Fixes #3842.	2022-02-24 15:20:50 +00:00
Carol (Nichols || Goulding)	71f62eee68	fix: Remove min_time and max_time from IngesterQueryRequest (#3839 ) Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-23 15:46:31 +00:00
Carol (Nichols || Goulding)	1b9212540b	feat: Send IngesterQueryResponse data back as response of doGet Flight request (#3772 ) * fix: Adjust fields of IngesterQueryResponse * feat: Adjust IngestHandler query method to call prepare_data_to_querier * feat: Send ingest query result data back through Flight doGet * feat: Send delete predicates and max sequencer number in metadata * fix: greater_than_sequence_number should be of type SequenceNumber * fix: Remove DeletePredicates from IngesterQueryResponse Co-authored-by: kodiakhq[bot] <49736102+kodiakhq[bot]@users.noreply.github.com>	2022-02-18 17:42:49 +00:00

433 Commits (66823522f389bda2bd164950b1c3d8a3b7ef671e)