influxdb

Commit Graph

Author	SHA1	Message	Date
Sam Arnold	d1a1e4b667	chore: restore ImportShard This reverts commit `d14acea44d`.	2020-12-07 11:01:00 -04:00
davidby-influx	6ec446f422	fix(tsm1): "snapshot in progress" error during backup This fix adds a skipCacheOk flag to tsdb.Store.CreateShardSnapshot() and tsdb.Shard.CreateSnapshot() to pass to tsdb.Engine.CreateSnapshot() A value of true allows the backup to proceed even if a cache snapshot cannot be taken. This flag is set to true in tsm1.Engine.Backup(), the OSS backup code path This flag is set to false in tsm1.Engine.Export() https://github.com/influxdata/plutonium/issues/3227	2020-11-05 11:08:08 -08:00
davidby-influx	23be20bf1b	fix(tsm1): "snapshot in progress" error during backup When an InfluxDB database is very busy writing new points the backup the process can fail because it can not write a new snapshot. The error is: operation timed out with error: create snapshot: snapshot in progress. This happens because InfluxDB takes almost "continuously" a snapshot from the cache caused by the high number of points ingested. The fix for this was https://github.com/influxdata/influxdb/pull/16627 but it was for OSS only, and was not in the code path for backups in clusters. This fix adds a skipCacheOk flag to tsdb.Engine.CreateSnapshot(). A value of true allows the backup to proceed even if a cache snapshot cannot be taken. This flag is set to true in tsm1.Engine.Backup(), the OSS backup code path and in tsdb.Shard.CreateSnapshot(), the cluster backup code path. This flag is set to false in tsm1.Engine.Export() https://github.com/influxdata/plutonium/issues/3227	2020-10-30 10:37:36 -07:00
Ayan George	6ce0e11738	feat: Collect values written stats (#19187 ) * feat(engine/tsm1): Add WritePointsWithContext() Add WritePontsWithContext() and make WritePoints() a thin wrapper for it. The purpose is to add statistics context values that we'll use to propagate the number of fields and points written to calls up the call chain. * feat(tsdb): Add WriteToShardWithContext() When applied, this patch adds WriteToShardWithContext() and wraps it with WriteToShard() to preserve the API. The the purpose of this addition is to propagate a context.Context value to Shard.WritePointsWithContext(). * feat(tsdb/shard): Add WritePointsWithContext() The purpose of adding WritePointsWithContext() is to propage context values down to engine code and propage statistics via the context.Value up to callers. This patch also adds values written statistics to the shard. * feat(http): Gather values written stats WritePointsWithContext() was added to propagate context values down to the engine and communicate stats to the caller. * feat(http): Gather values written stats WritePointsWithContext() was added to propagate context values down to the engine and communicate stats to the caller. * refactor: Change MetricKey to ContextKey This patch gives the type we're useing for context keys a better name.	2020-08-12 11:26:12 -04:00
Tristan Su	d14acea44d	chore: clean up unused functions	2020-05-08 13:45:34 +08:00
Edd Robinson	77fe5a9a62	Treat fields and measurements as raw bytes	2018-12-19 14:38:50 +00:00
Tanya Gordeeva	7c9ff60413	tsdb/shard: reduce measurement field copying Removes cloning measurement fields on writes, instead atomically swaps out measurement field sets when fields are added (with new overhead of copying existing fields whenever a new one is added).	2018-11-02 18:49:17 -07:00
Edd Robinson	0f67d8f294	Merge pull request #10387 from influxdata/er-index-vars Add shards' index types to /debug/vars	2018-10-29 10:12:05 +00:00
Edd Robinson	cade59e253	Fix panic in IndexSet This commit fixes a panic where a concurrent removal of a shard and meta query could cause a `nil` index to be added to the IndexSet`.	2018-10-26 18:23:54 +01:00
Edd Robinson	9b4cf1e39c	Add the shard index type to /debug/vars This commit adds an `indexType` key to the shard sections of the `/debug/vars` endpoint, as well as the `_internal` shard statistics. The tag will be reported as `"indexType": "inmem"` or `"indexType": "tsi1"`.	2018-10-18 13:46:12 +01:00
Ben Johnson	2d266ca186	Merge pull request #9801 from influxdata/bj-validate-write Add option for unicode validation.	2018-08-20 03:44:41 -10:00
Edd Robinson	80dc07cbcb	Efficient means of getting fields for measurement If it's known that the read request only needs to use a single measurement, then we can avoid the need to get field keys via the query engine. However, that means that a new method of getting the field keys for a measurement would be needed. This commit exposes a method to efficiently get field key names for a measurement across multiple shards. name	2018-07-18 12:21:54 +01:00
Edd Robinson	9c5c1c7001	Optimisation for expressions with single measument	2018-07-18 12:21:54 +01:00
Jacob Marble	dcb85d2e92	Init TSI partition logger TSI Partition logging was never initialized because WithLogger was called after Open; Open initializes Partition loggers.	2018-07-05 14:27:09 -07:00
Stuart Carnie	7e998779e6	feat(tsdb/store): Option to disable compactions for offline tools Allows an offline tool to open the tsdb.Store with compactions disabled.	2018-06-13 10:29:59 -07:00
Edd Robinson	28b6df7afb	Ensure remote read can handle no data in time	2018-06-12 23:10:18 +01:00
Ben Johnson	cfaaf39d8b	Export Shard.Engine()	2018-06-04 13:25:03 -06:00
Jacob Marble	3f2ff742c0	Remove unused 'database' field	2018-05-18 09:22:43 -07:00
Jacob Marble	7f8b7af61e	Cleanup index memory footprint counting code (#9828 ) * Fix IndexSet.DedupeInmemIndexes * Cleanup index memory footprint code	2018-05-15 11:25:19 -07:00
Jacob Marble	0763d1789e	Get inmem index bytes without double-counting	2018-05-10 11:33:52 -07:00
Jason Wilder	de58584ce7	Merge pull request #9748 from influxdata/jw-series-type Prevent series type conflict	2018-05-10 07:05:45 -06:00
Jacob Marble	2dc2b97fb9	tsdb/index: Add Bytes() methods (#9794 )	2018-05-04 08:47:05 -07:00
Ben Johnson	58aed93fe6	Add option for unicode validation.	2018-05-02 11:16:55 -06:00
Jason Wilder	2be2418b89	Add series type validation to Engine This is the start of per-series validation that occurs in the Engine write path. It uses an in-memory radix tree to reduce memory usage and is re-built on demand the first time a series is written.	2018-04-30 17:26:23 -06:00
Edd Robinson	ba16268f41	Merge pull request #9777 from influxdata/er-index-log Log information about index version during startup	2018-04-26 10:48:54 +01:00
Jeff Wendling	e5dbc18d0b	remove bool return param from dataTypeFromModelsFieldType	2018-04-25 09:48:24 -06:00
Edd Robinson	32e195860b	Log index type when opening shard	2018-04-25 13:02:09 +01:00
Jeff Wendling	29a62e4f74	Add FieldValidator to allow custom validations on measurements No appreciable changes in benchmark results. It seems like this function is less than 4% of cpu time in the write workloads in the benchmarks at least.	2018-04-23 20:21:27 -06:00
Ben Johnson	dbbe9d8467	Merge pull request #9615 from influxdata/bj-check-shard-count-on-series-iterator-master Remove error for series file when no shards exist	2018-04-20 08:14:24 -06:00
Jason Wilder	97ecf62ffb	Return time range from delete predicate func This moves the time range to delete to be returned by the predicate func in DeleteSeriesRangeWithPredicate. It allows for a single delete to delete different ranges of times per series instead of a single range of time for all series.	2018-04-09 20:01:33 -06:00
Jacob Marble	470ee7f176	Add ability to delete many series with predicate	2018-03-28 08:32:18 -07:00
Stuart Carnie	2cc1f5137e	support for tenant+bucket NOTE: to match storage service, values for database and rp are hard-coded to `db` and `rp` respectively	2018-03-23 12:26:55 -07:00
Stuart Carnie	aa61359cc7	Storage RPC API improvements. See PR for details * reduce # allocations (115M -> 22M) * reduce size allocations (53GB -> 1.3GB) * reduce RPC query time (45s -> 12.9s)	2018-03-21 13:46:09 -07:00
Ben Johnson	da8669f3e2	Remove error for series file when no shards exist	2018-03-21 14:41:11 -06:00
Jonathan A. Sternberg	f8d60a881d	Refactor the math engine to compile the query and use eval This change makes it so that we simplify the math engine so it doesn't use a complicated set of nested iterators. That way, we have to change math in one fewer place. It also greatly simplifies the query engine as now we can create the necessary iterators, join them by time, name, and tags, and then use the cursor interface to read them and use eval to compute the result. It makes it so the auxiliary iterators and all of their complexity can be removed. This also makes use of the new eval functionality that was recently added to the influxql package. No math functions have been added, but the scaffolding has been included so things like trigonometry functions are just a single commit away. This also introduces a small breaking change. Because of the call optimization, it is now possible to use the same selector multiple times as a selector. So if you do this: SELECT max(value) * 2, max(value) / 2 FROM cpu This will now return the timestamp of the max value rather than zero since this query is considered to have only a single selector rather than multiple separate selectors. If any aspect of the selector is different, such as different selector functions or different arguments, it will consider the selectors to be aggregates like the old behavior.	2018-03-19 15:01:15 -05:00
Ben Johnson	f6fdba2590	Allow SHOW SERIES kill.	2018-03-15 11:22:34 -06:00
Stuart Carnie	6cf6ae7af4	Use combined IndexSet when executing meta queries * removed unused fieldset field	2018-03-15 09:59:11 -07:00
Edd Robinson	c1e1412dae	Don't panic when checking for field	2018-03-12 15:25:20 +00:00
Edd Robinson	544329380f	Add empty series sketches back to tsi1 index This commit adds initial empty sketches back to the tsi1 index, as well as ensuring that ephemeral sketches in the index `LogFile` are updated accordingly. The commit also adds a test that verifies that the merged sketches at the store level produce the correct results under writes, deletions and re-opening of the store. This commit does not provide working sketches for post-compaction on the tsi1 index.	2018-02-07 14:52:13 -07:00
Stuart Carnie	a058d204d8	remove redundant closing channel	2018-02-06 12:08:58 -07:00
Edd Robinson	42c3adeffc	simplify packages under tsdb	2018-01-21 09:41:27 -08:00
Edd Robinson	4ccb6ada69	Remove unused code/cleanup tsdb package	2018-01-20 14:06:15 +00:00
Jason Wilder	1c8676b4a3	Rebuild corrupted fields index when necessary If the fields.idx was corrupted in someway, it would cause the shard to fail to load. Deleting the file will allow it to be rebuilt. This change handles this automatically so it's rebuilt if necessary without user intervention.	2018-01-16 11:31:07 -07:00
Edd Robinson	ceb3abd118	Remove series when shard rolls over Series should only be removed from the series file when they're no longer present in any shard. This commit ensures that during a shard rollover, the series local to the shard are checked against all other series in the database. Series that are no longer present in any other shards' bitsets, are then marked as deleted in the series file.	2018-01-16 15:58:20 +00:00
Edd Robinson	e902998f4e	All closes are now fast	2018-01-16 14:56:54 +00:00
Edd Robinson	d890f29fcb	Remove redundant index methods Now that each shard-local index is maintaining a bitset of series ids, tracking the series present in the local shard's tsm engine, there is no need to track shards in the `inmem` index. This commit removes the methods associated with tracking those series/shard relationships.	2018-01-16 14:56:54 +00:00
Edd Robinson	286c8f4c09	Return to original DELETE/DROP SERIES semantics This reverts commit `59afd8cc90`.	2018-01-15 12:00:30 +00:00
Edd Robinson	e610e7c21d	Track undeleted series IDs per-shard with inmem This commit adds a bitset into each shard's in-memory index, to be used to track undeleted series ids. Currently tsi1 support is not implemented. When new series are added to the shard, the series id is added to the bitset. When series are deleted from the shard, the series ids are removed from the bitset. Becasue each shard shares the same inmem index reference, the bitset is stored in the `ShardIndex`, which is local to each shard, and then different references are passed into the shared `Index` object, depending on which shard is writing the series.	2018-01-11 01:01:54 +00:00
David Norton	1c452d83cb	fix #9286 : return digest size	2018-01-08 13:15:14 -05:00
Stuart Carnie	c986cac76e	improve performance when writes exceed max tag values or series ``` benchmark old ns/op new ns/op delta BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxValuesExceeded-8 6175374 2714158 -56.05% BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxValuesNotExceeded-8 344502 326312 -5.28% BenchmarkShardIndex_CreateSeriesListIfNotExists_NoMaxValues-8 346734 329961 -4.84% BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxSeriesExceeded-8 2414945 1996223 -17.34% benchmark old allocs new allocs delta BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxValuesExceeded-8 45377 128 -99.72% BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxValuesNotExceeded-8 33 20 -39.39% BenchmarkShardIndex_CreateSeriesListIfNotExists_NoMaxValues-8 33 20 -39.39% BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxSeriesExceeded-8 15219 71 -99.53% benchmark old bytes new bytes delta BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxValuesExceeded-8 1354539 480114 -64.56% BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxValuesNotExceeded-8 2101 1261 -39.98% BenchmarkShardIndex_CreateSeriesListIfNotExists_NoMaxValues-8 2100 1261 -39.95% BenchmarkShardIndex_CreateSeriesListIfNotExists_MaxSeriesExceeded-8 707247 477737 -32.45% ```	2017-12-27 17:27:03 -07:00

1 2 3 4 5 ...

352 Commits (2e26dc62cbcb670de912be405e1904a1108497db)