Adds two commands, "check-schema" and "merge-schema", to influx_inspect.
"check-schema" tests for field type conflicts in every fields.idx file
beneath a directory; "merge-schema" merges the schemas derived from
running "check-schema" on multiple directories.
(cherry picked from commit 84c4f676b0)
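A sketch of the kind of conflict check and merge these commands imply, under a hypothetical measurement → field → type model (not influx_inspect's actual data structures):

```go
package main

import "fmt"

// fieldType stands in for the TSM field types (float, integer, ...).
type fieldType string

// schema maps measurement -> field -> type, the shape a check-schema style
// tool could derive from each fields.idx it scans.
type schema map[string]map[string]fieldType

// merge folds other into s, reporting every field whose type conflicts.
func merge(s, other schema) []error {
	var conflicts []error
	for m, fields := range other {
		if s[m] == nil {
			s[m] = map[string]fieldType{}
		}
		for f, typ := range fields {
			if cur, ok := s[m][f]; ok && cur != typ {
				conflicts = append(conflicts,
					fmt.Errorf("%s.%s: %s vs %s", m, f, cur, typ))
				continue
			}
			s[m][f] = typ
		}
	}
	return conflicts
}

func main() {
	a := schema{"cpu": {"value": "float"}}
	b := schema{"cpu": {"value": "integer"}}
	fmt.Println(merge(a, b)) // [cpu.value: float vs integer]
}
```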
* fix(restore): fix race condition which causes restore command to fail
Fixes a race condition in the restore code path that causes shard data restores
to fail. When the bug occurs, `Error while freeing cold shard resources`
appears in the log files.
fixes issue #15323
Under heavy write load that creates new fields and measurements, the
rewrite of the fields.idx file is a bottleneck. This
enhancement combines multiple writes into a single one and
shares any error return value with all of the combined
invocations. MeasurementFieldSet and the new
MeasurementFieldSetWriter must both now be explicitly
closed.
Closes #21577
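A minimal sketch of the write-combining idea, with `fieldSetWriter` as a hypothetical stand-in for MeasurementFieldSetWriter: one goroutine drains all queued requests, performs a single rewrite, and shares the resulting error with every combined caller.

```go
package main

import (
	"fmt"
	"sync"
)

// request asks the writer goroutine to persist the field set and waits for
// the result of whichever rewrite covered it.
type request struct{ done chan error }

// fieldSetWriter must be explicitly closed, mirroring the change above.
type fieldSetWriter struct {
	requests chan request
	wg       sync.WaitGroup
}

func newFieldSetWriter(rewrite func() error) *fieldSetWriter {
	w := &fieldSetWriter{requests: make(chan request, 64)}
	w.wg.Add(1)
	go func() {
		defer w.wg.Done()
		for req := range w.requests {
			batch := []request{req}
		drain: // combine everything already waiting into one write
			for {
				select {
				case r, ok := <-w.requests:
					if !ok {
						break drain
					}
					batch = append(batch, r)
				default:
					break drain
				}
			}
			err := rewrite() // one fields.idx rewrite for the whole batch
			for _, r := range batch {
				r.done <- err // all combined invocations see the result
			}
		}
	}()
	return w
}

// Save queues a write and blocks until a rewrite covering it completes.
func (w *fieldSetWriter) Save() error {
	req := request{done: make(chan error, 1)}
	w.requests <- req
	return <-req.done
}

// Close must be called after the last Save has returned.
func (w *fieldSetWriter) Close() {
	close(w.requests)
	w.wg.Wait()
}

func main() {
	w := newFieldSetWriter(func() error {
		fmt.Println("rewriting fields.idx")
		return nil
	})
	fmt.Println(w.Save())
	w.Close()
}
```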
tsdb.Engine.IsIdle and tsdb.Engine.Digest now return a reason string for why the engine & shard are not idle.
Callers can then use this string for logging, if desired. The returned reason does not allocate memory, so the
caller may want to add the shard ID and path for more information in the log. This is intended to be used in
calls from the anti-entropy service in Enterprise.
(cherry picked from commit bf45841359)
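A hedged sketch of how a caller such as the anti-entropy service might use the new return value; the `(bool, string)` signature and the stub engine are assumptions based on the description above.

```go
package main

import "log"

// engine is a stub standing in for tsdb.Engine.
type engine struct{ compacting bool }

func (e *engine) IsIdle() (bool, string) {
	if e.compacting {
		// A static string, so returning the reason allocates nothing.
		return false, "compaction in progress"
	}
	return true, ""
}

func main() {
	e := &engine{compacting: true}
	if idle, reason := e.IsIdle(); !idle {
		// The reason carries no shard context, so the caller adds the
		// shard ID and path when logging.
		log.Printf("shard %d at %s not idle: %s",
			42, "/var/lib/influxdb/data/db/rp/42", reason)
	}
}
```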
fixes https://github.com/influxdata/influxdb/issues/21448
This fixes multi-measurement queries that go through the storage service
so that they correctly pick up all series that match the filter. Previously,
negative predicates such as `!=` and `!~`, and predicates attempting to
match empty tags, did not work correctly with the storage service when
multiple measurements or `OR` conditions were included.
This was because such predicates would be categorized as "multiple
measurements," and the storage service would then attempt to use the field
keys iterator to find the fields for each measurement. The meta queries for
these did not correctly account for negative equality operators or empty
tags when finding the appropriate measurements, and they could not be
changed because doing so would be a breaking change to influxql as well.
This modifies the storage service to use new methods that correctly
account for the above situations rather than the field keys iterator.
Note that some queries that appear to be single measurement queries are
also treated as multiple measurement queries: any query with an `OR`
condition is considered a multiple measurement query.
This bug did not apply to single measurement queries in which one
measurement was selected and all of the logical operators were `AND`,
because those queries used a different code path that correctly handled
these situations.
Backport of #19566.
(cherry picked from commit ceead88bd5)
Co-authored-by: Jonathan A. Sternberg <jonathan@influxdata.com>
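To illustrate the categorization rule above (any `OR` forces the multiple-measurement path), here is a toy classifier over the real influxql AST; it is an illustration, not the storage service's actual code:

```go
package main

import (
	"fmt"

	"github.com/influxdata/influxql"
)

// mustBeMultiMeasurement reports whether a predicate has to be treated as
// a multiple-measurement query: any OR anywhere in the expression tree
// forces the conservative path.
func mustBeMultiMeasurement(expr influxql.Expr) bool {
	multi := false
	influxql.WalkFunc(expr, func(n influxql.Node) {
		if be, ok := n.(*influxql.BinaryExpr); ok && be.Op == influxql.OR {
			multi = true
		}
	})
	return multi
}

func main() {
	expr, err := influxql.ParseExpr(`host != 'a' OR region = 'west'`)
	if err != nil {
		panic(err)
	}
	fmt.Println(mustBeMultiMeasurement(expr)) // true: takes the multi path
}
```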
* fix: Change from RewriteExpr to PartitionExpr
Also remove some dead code
* feat: WITH KEY implementation
* feat: query rewriting for WITH KEY in SHOW TAG KEYS
Extending the context instead of fixing the API breaks type safety.
For tracking the number of points / values written, it is much clearer
to pass an explicit tracker.
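A sketch of the trade-off, with hypothetical names: an explicit tracker parameter is checked by the compiler, while the same data smuggled through context.Value is not.

```go
package main

import "fmt"

// WriteTracker counts points and values written; passing it as a plain
// parameter means every use is type-checked at compile time.
type WriteTracker struct {
	Points, Values int64
}

// writePoints records one point per batch entry and one value per field.
func writePoints(fieldCounts []int, t *WriteTracker) {
	for _, n := range fieldCounts {
		t.Points++
		t.Values += int64(n)
	}
}

func main() {
	var t WriteTracker
	writePoints([]int{3, 5}, &t)
	// The context.Value alternative passes the same counters as interface{}
	// under an ad-hoc key: misspell the key or mis-assert the type and the
	// compiler says nothing.
	fmt.Printf("points=%d values=%d\n", t.Points, t.Values) // points=2 values=8
}
```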
Frequent writes cause lock contention on fields.idx, which is recreated
whenever a field or measurement is added in WritePointsWithContext().
This change eliminates locking during the actual file rewrite, limiting it
to the times when the MeasurementFieldSet is actually being read or written
in memory and when the new file is being renamed.
Tests verify correct behavior by checking that the fields.idx file matches
the in-memory copy after heavily parallel measurement addition.
Fixes https://github.com/influxdata/influxdb/issues/20500
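A minimal sketch of the narrowed locking, under hypothetical names: serialization happens under a read lock, the file write happens under no lock at all, and the write lock covers only the rename.

```go
package main

import (
	"fmt"
	"os"
	"sync"
)

// fieldSet stands in for MeasurementFieldSet.
type fieldSet struct {
	mu     sync.RWMutex
	fields map[string]string // in-memory view of fields.idx
	path   string
}

func (fs *fieldSet) save() error {
	// Snapshot the in-memory state under the read lock only.
	fs.mu.RLock()
	buf := marshal(fs.fields)
	fs.mu.RUnlock()

	// Rewrite the file without holding any lock; writers are not blocked.
	tmp := fs.path + ".tmp"
	if err := os.WriteFile(tmp, buf, 0o666); err != nil {
		return err
	}

	// Re-acquire only for the atomic swap of the new file into place.
	fs.mu.Lock()
	defer fs.mu.Unlock()
	return os.Rename(tmp, fs.path)
}

func marshal(m map[string]string) []byte {
	var buf []byte
	for k, v := range m {
		buf = append(buf, k+"="+v+"\n"...)
	}
	return buf
}

func main() {
	fs := &fieldSet{fields: map[string]string{"value": "float"}, path: "fields.idx"}
	fmt.Println(fs.save())
}
```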
This fix adds a skipCacheOk flag to
tsdb.Store.CreateShardSnapshot() and tsdb.Shard.CreateSnapshot()
to pass to tsdb.Engine.CreateSnapshot().
A value of true allows the backup to proceed even if a cache snapshot
cannot be taken.
This flag is set to true in tsm1.Engine.Backup(), the OSS backup code path.
This flag is set to false in tsm1.Engine.Export().
https://github.com/influxdata/plutonium/issues/3227
When an InfluxDB database is very busy writing new points, the backup
process can fail because it cannot take a new snapshot.
The error is: `operation timed out with error: create snapshot: snapshot in progress`.
This happens because the high number of ingested points causes InfluxDB
to take cache snapshots almost continuously.
The fix for this was https://github.com/influxdata/influxdb/pull/16627,
but it was for OSS only and was not in the code path for backups
in clusters.
This fix adds a skipCacheOk flag to tsdb.Engine.CreateSnapshot().
A value of true allows the backup to proceed even if a cache snapshot
cannot be taken.
This flag is set to true in tsm1.Engine.Backup(), the OSS backup code path,
and in tsdb.Shard.CreateSnapshot(), the cluster backup code path.
This flag is set to false in tsm1.Engine.Export().
https://github.com/influxdata/plutonium/issues/3227
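A hedged sketch of the flag's effect, with stub types; the real CreateSnapshot differs, but the commits above describe this control flow:

```go
package main

import (
	"errors"
	"fmt"
)

var errSnapshotInProgress = errors.New("snapshot in progress")

// engine stands in for tsm1.Engine; only the flag's control flow is shown.
type engine struct{ cacheBusy bool }

// writeSnapshot flushes the cache to a TSM file; under heavy ingest it can
// keep failing with errSnapshotInProgress, as described above.
func (e *engine) writeSnapshot() error {
	if e.cacheBusy {
		return errSnapshotInProgress
	}
	return nil
}

// createSnapshot tolerates a failed cache snapshot when skipCacheOk is true,
// which is what Backup() (OSS) and Shard.CreateSnapshot() (cluster) pass.
func (e *engine) createSnapshot(skipCacheOk bool) error {
	if err := e.writeSnapshot(); err != nil {
		if !skipCacheOk || !errors.Is(err, errSnapshotInProgress) {
			return err // Export() passes false and keeps the old behavior
		}
		// Proceed with the TSM files already on disk.
	}
	return nil // hard-linking files into the snapshot dir elided
}

func main() {
	e := &engine{cacheBusy: true}
	fmt.Println(e.createSnapshot(true))  // <nil>: backup proceeds
	fmt.Println(e.createSnapshot(false)) // snapshot in progress
}
```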
* feat(engine/tsm1): Add WritePointsWithContext()
Add WritePointsWithContext() and make WritePoints() a thin wrapper for
it.
The purpose is to add statistics context values that we'll use to
propagate the number of fields and points written up the call chain.
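The thin-wrapper shape described above, sketched with stub types (the real methods take models.Points and more):

```go
package main

import (
	"context"
	"fmt"
)

// point and engine are stand-ins for the real types.
type point struct{ fields int }

type engine struct{}

// WritePointsWithContext accepts a context so statistics values can travel
// down from callers; the write itself is elided.
func (e *engine) WritePointsWithContext(ctx context.Context, pts []point) error {
	_ = ctx // the engine reads tracking values out of ctx here
	return nil
}

// WritePoints stays a thin wrapper, preserving the existing API.
func (e *engine) WritePoints(pts []point) error {
	return e.WritePointsWithContext(context.Background(), pts)
}

func main() {
	fmt.Println((&engine{}).WritePoints([]point{{fields: 2}}))
}
```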
* feat(tsdb): Add WriteToShardWithContext()
When applied, this patch adds WriteToShardWithContext() and wraps it
with WriteToShard() to preserve the API.
The purpose of this addition is to propagate a context.Context value
to Shard.WritePointsWithContext().
* feat(tsdb/shard): Add WritePointsWithContext()
The purpose of adding WritePointsWithContext() is to propagate context
values down to the engine code and to propagate statistics via those
context values back up to callers.
This patch also adds values written statistics to the shard.
* feat(http): Gather values written stats
WritePointsWithContext() was added to propagate context values down to
the engine and communicate stats to the caller.
* refactor: Change MetricKey to ContextKey
This patch gives the type we're using for context keys a better name.
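A sketch of the typed-context-key pattern with hypothetical key names: the engine fills in counters that the caller planted in the context.

```go
package main

import (
	"context"
	"fmt"
)

// contextKey is an unexported named type, so keys cannot collide with
// other packages' context values.
type contextKey uint8

const (
	pointsWrittenKey contextKey = iota
	valuesWrittenKey
)

// writePointsWithContext is the engine side: fill in counters if the
// caller planted pointers for them in the context.
func writePointsWithContext(ctx context.Context, points, values int64) {
	if p, ok := ctx.Value(pointsWrittenKey).(*int64); ok {
		*p = points
	}
	if v, ok := ctx.Value(valuesWrittenKey).(*int64); ok {
		*v = values
	}
}

func main() {
	// Caller side, e.g. an http handler gathering write stats.
	var points, values int64
	ctx := context.WithValue(context.Background(), pointsWrittenKey, &points)
	ctx = context.WithValue(ctx, valuesWrittenKey, &values)
	writePointsWithContext(ctx, 10, 42)
	fmt.Println(points, values) // 10 42
}
```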
Removes cloning of measurement fields on writes; instead, the measurement
field set is atomically swapped out when fields are added (with the new
overhead of copying existing fields whenever a new one is added).
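A copy-on-write sketch of this change under hypothetical names: readers load the current field map atomically and never pay for a clone; adding a field copies the map once and swaps the pointer.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

type fields map[string]string // field name -> type

type measurementFields struct {
	mu      sync.Mutex   // serializes writers only
	current atomic.Value // holds a fields map
}

func newMeasurementFields() *measurementFields {
	mf := &measurementFields{}
	mf.current.Store(fields{})
	return mf
}

// Load is the hot write-path read: no cloning, no locks.
func (mf *measurementFields) Load() fields {
	return mf.current.Load().(fields)
}

// CreateFieldIfNotExists pays the copy cost only when a field is new.
func (mf *measurementFields) CreateFieldIfNotExists(name, typ string) {
	mf.mu.Lock()
	defer mf.mu.Unlock()
	cur := mf.Load()
	if _, ok := cur[name]; ok {
		return
	}
	next := make(fields, len(cur)+1)
	for k, v := range cur {
		next[k] = v
	}
	next[name] = typ
	mf.current.Store(next) // readers atomically see the new map
}

func main() {
	mf := newMeasurementFields()
	mf.CreateFieldIfNotExists("value", "float")
	fmt.Println(mf.Load()["value"]) // float
}
```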
This commit adds an `indexType` key to the shard sections of the
`/debug/vars` endpoint, as well as the `_internal` shard statistics.
The tag will be reported as `"indexType": "inmem"` or `"indexType":
"tsi1"`.
If it's known that the read request only needs to use a single
measurement, then we can avoid the need to get field keys via the query
engine.
However, that means that a new method of getting the field keys for a
measurement would be needed. This commit exposes a method to efficiently
get field key names for a measurement across multiple shards.
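A hypothetical shape for such a method: union the field key names for one measurement across shards, which is all a single-measurement read request needs.

```go
package main

import (
	"fmt"
	"sort"
)

// shard is a stub; the real lookup reads the shard's MeasurementFieldSet.
type shard struct {
	fieldsByMeasurement map[string][]string
}

func (s *shard) fieldKeysByMeasurement(name string) []string {
	return s.fieldsByMeasurement[name]
}

// fieldKeysAcrossShards merges and de-duplicates per-shard keys, returning
// them sorted, without going through the query engine.
func fieldKeysAcrossShards(shards []*shard, name string) []string {
	set := map[string]struct{}{}
	for _, s := range shards {
		for _, k := range s.fieldKeysByMeasurement(name) {
			set[k] = struct{}{}
		}
	}
	keys := make([]string, 0, len(set))
	for k := range set {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	return keys
}

func main() {
	a := &shard{map[string][]string{"cpu": {"usage_user", "usage_system"}}}
	b := &shard{map[string][]string{"cpu": {"usage_user", "usage_idle"}}}
	fmt.Println(fieldKeysAcrossShards([]*shard{a, b}, "cpu"))
}
```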
This is the start of per-series validation that occurs in the
Engine write path. It uses an in-memory radix tree to reduce
memory usage and is re-built on demand the first time a series
is written.
No appreciable changes in benchmark results; this function appears to
account for less than 4% of CPU time in the benchmarks' write workloads.
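A sketch of the on-demand rebuild, with a plain map standing in for the radix tree that the real change uses to keep memory down:

```go
package main

import (
	"fmt"
	"sync"
)

// seriesValidator lazily builds its key set the first time a series is
// written after startup, as described above. Names are hypothetical.
type seriesValidator struct {
	once sync.Once
	keys map[string]struct{} // a radix tree in the real implementation
	load func() []string     // reads existing series keys from the index
}

func (v *seriesValidator) exists(key string) bool {
	v.once.Do(func() {
		v.keys = make(map[string]struct{})
		for _, k := range v.load() {
			v.keys[k] = struct{}{}
		}
	})
	_, ok := v.keys[key]
	return ok
}

func main() {
	v := &seriesValidator{load: func() []string { return []string{"cpu,host=a"} }}
	fmt.Println(v.exists("cpu,host=a"), v.exists("cpu,host=b")) // true false
}
```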
This moves the time range to delete into the return value of the predicate
func in DeleteSeriesRangeWithPredicate. It allows a single delete to remove
a different time range per series, instead of a single range of time for
all series.
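A hedged sketch of the reshaped callback (the types here are assumptions): the predicate returns the [min, max] range to delete for each series, so one call can delete different windows per series.

```go
package main

import "fmt"

type series struct{ key string }

// deleteSeriesRangeWithPredicate asks the predicate for each series'
// deletion window instead of applying one global range.
func deleteSeriesRangeWithPredicate(
	items []series,
	predicate func(key string) (min, max int64, del bool),
) {
	for _, s := range items {
		if min, max, ok := predicate(s.key); ok {
			fmt.Printf("delete %s in [%d, %d]\n", s.key, min, max)
		}
	}
}

func main() {
	items := []series{{"cpu,host=a"}, {"cpu,host=b"}}
	deleteSeriesRangeWithPredicate(items, func(key string) (int64, int64, bool) {
		if key == "cpu,host=a" {
			return 0, 1000, true // a different range per series is now possible
		}
		return 0, 500, true
	})
}
```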