PlanOptimize is being checked far too frequently. This PR is the simplest change that ensures PlanOptimize is not run too often. To reduce the frequency, I've added a lastWrite parameter to PlanOptimize and added a test that mocks the edge case seen in the wild that led to this PR.
Previously, the test cases for PlanOptimize did not check whether certain cases would also be picked up by Plan. I've adjusted a few of the existing test cases after modifying Plan and PlanOptimize to use the same lastWrite time.
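A rough sketch of how a lastWrite parameter can gate optimize planning; the planner type, idleThreshold field, and return value are illustrative, not the real tsm1 API:

```go
package main

import (
	"fmt"
	"time"
)

// planner is a stand-in for the real TSM planner; idleThreshold is a
// made-up field for how long a shard must be idle before optimizing.
type planner struct {
	idleThreshold time.Duration
}

// PlanOptimize sketches how a lastWrite timestamp can gate how often
// optimize planning runs: if the shard was written to recently, return
// no groups instead of re-planning on every check.
func (p *planner) PlanOptimize(lastWrite time.Time) [][]string {
	if time.Since(lastWrite) < p.idleThreshold {
		return nil // shard is still hot; skip optimize planning for now
	}
	// ... the real planning of TSM file groups would happen here ...
	return [][]string{{"000000001-000000002.tsm"}}
}

func main() {
	p := &planner{idleThreshold: 30 * time.Minute}
	fmt.Println(p.PlanOptimize(time.Now()))                 // empty: written too recently
	fmt.Println(p.PlanOptimize(time.Now().Add(-time.Hour))) // planned groups
}
```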
* feat: Add CompactPointsPerBlock config opt
This PR adds an additional parameter for influxd,
CompactPointsPerBlock, and adjusts DefaultAggressiveMaxPointsPerBlock
to 10,000. We had discovered that with the points per block set to
100,000, compacted TSM files were growing in size. After lowering the
points per block to 10,000 we saw the file sizes decrease.
The value is exposed as a parameter that administrators can adjust,
which allows some tuning if compression problems are encountered.
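A minimal sketch of how the new knob might sit in the engine configuration, assuming a pared-down Config struct rather than the real tsdb configuration type:

```go
package main

import "fmt"

// DefaultAggressiveMaxPointsPerBlock mirrors the new 10,000-point default
// described above; the real constant lives in the storage engine package.
const DefaultAggressiveMaxPointsPerBlock = 10000

// Config is a simplified stand-in for the engine configuration; only the
// new knob is shown.
type Config struct {
	// CompactPointsPerBlock caps the points written per block during
	// aggressive (optimize) compactions.
	CompactPointsPerBlock int
}

// NewConfig returns a configuration with the new default applied.
func NewConfig() Config {
	return Config{CompactPointsPerBlock: DefaultAggressiveMaxPointsPerBlock}
}

func main() {
	c := NewConfig()
	// An administrator could raise or lower this if compacted TSM files
	// grow unexpectedly large.
	fmt.Println("compact-points-per-block =", c.CompactPointsPerBlock)
}
```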
* feat: Modify optimized compaction to cover edge cases
This PR changes the compaction algorithm to account for the following
cases that were not previously handled (see the sketch below):
- Many generations with a group size over 2 GB
- A single generation with many files and a group size under 2 GB
- Shards that may have a group size over 2 GB but many fragmented files
  (under 2 GB and under the aggressive points-per-block count)

Here, group size means the total size of the TSM files in the shard directory.
closes https://github.com/influxdata/influxdb/issues/25666
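The sketch referenced above: a rough illustration of the kind of classification the planner now makes, with made-up types and simplified thresholds rather than the actual planner code:

```go
package main

import "fmt"

const (
	maxGroupSize       = 2 * 1024 * 1024 * 1024 // 2 GB
	aggressiveBlockPts = 10000
)

// tsmFile is a hypothetical summary of one TSM file in a shard.
type tsmFile struct {
	size      int64 // bytes
	maxPoints int   // largest points-per-block seen in the file
}

// needsOptimize sketches the kind of decision the planner now makes: a
// single generation split across many files under 2 GB, or a group over
// 2 GB that is still fragmented into small, low-density files.
func needsOptimize(files []tsmFile) bool {
	var total int64
	fragmented := 0
	for _, f := range files {
		total += f.size
		if f.size < maxGroupSize && f.maxPoints < aggressiveBlockPts {
			fragmented++
		}
	}
	switch {
	case total < maxGroupSize && len(files) > 1:
		return true // single generation, many files, group under 2 GB
	case total >= maxGroupSize && fragmented > 1:
		return true // large group, but still fragmented
	default:
		return false
	}
}

func main() {
	files := []tsmFile{{size: 1 << 20, maxPoints: 1000}, {size: 2 << 20, maxPoints: 1000}}
	fmt.Println(needsOptimize(files)) // true
}
```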
* chore: upgrade Go to 1.19.3
This re-runs ./generate.sh and ./checkfmt.sh to format and update
source code (this is primarily responsible for the huge diff.)
* fix: update tests to reflect sorting algorithm change
On Windows, make copies of files for snapshots, because
Go does not support the FILE_SHARE_DELETE flag, which
allows files (and links) to be deleted while open. Without
copies, temporary directories are left behind after
backups.
closes https://github.com/influxdata/influxdb/issues/16289
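A small sketch of the workaround, using a hypothetical snapshotFile helper: hard-link where possible, copy on Windows:

```go
package main

import (
	"io"
	"os"
	"runtime"
)

// snapshotFile sketches the workaround: elsewhere a hard link is cheap and
// sufficient, but on Windows a linked file cannot be deleted while the
// original is still open, so copy the bytes instead.
func snapshotFile(src, dst string) error {
	if runtime.GOOS != "windows" {
		return os.Link(src, dst)
	}

	in, err := os.Open(src)
	if err != nil {
		return err
	}
	defer in.Close()

	out, err := os.Create(dst)
	if err != nil {
		return err
	}
	defer out.Close()

	_, err = io.Copy(out, in)
	return err
}

func main() {
	_ = snapshotFile("shard/000000001-000000001.tsm", "backup/000000001-000000001.tsm")
}
```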
When the compaction planner runs, if it cannot acquire
a lock on the files it plans to compact, it returns a
nil list of compaction groups. This, in turn, sets the
engine statistics for the compaction queues to zero,
which is incorrect. Instead, use the length of the pending
files that would have been returned.
closes https://github.com/influxdata/influxdb/issues/22138
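A sketch of the idea with a made-up planner type: when the file locks cannot be acquired, report the pending file count rather than zero:

```go
package main

import "fmt"

// planner is a made-up stand-in; pendingFiles are files already picked for
// compaction but not yet released.
type planner struct {
	pendingFiles []string
}

// Plan sketches the fix: when the per-file locks cannot be acquired,
// report the number of pending files for the queue statistic instead of
// letting it drop to zero.
func (p *planner) Plan(locksAcquired bool) (groups [][]string, queued int64) {
	if !locksAcquired {
		// Nothing new can be planned, but work is still outstanding.
		return nil, int64(len(p.pendingFiles))
	}
	groups = [][]string{p.pendingFiles}
	return groups, int64(len(groups))
}

func main() {
	p := &planner{pendingFiles: []string{"a.tsm", "b.tsm"}}
	_, queued := p.Plan(false)
	fmt.Println("compaction queue depth:", queued) // 2, not 0
}
```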
Compaction logging will generate intermediate information on
volume of data written and output files created, as well as
improve some of the anti-entropy messages related to compaction.
This will also apply to `influx_tools compact`.
Closes https://github.com/influxdata/influxdb/issues/21704
* fix: backport tsdb fix for window pushdowns
From https://github.com/influxdata/influxdb/pull/19855
* fix(storage): cursor requests are [start, stop] instead of [start, stop)
The cursors were previously [start, stop) to be consistent with how Flux
requests data, but the underlying storage file store was [start, stop]
because that is how InfluxQL reads data. This reverts the cursor
behavior so that it is now [start, stop] everywhere, and the conversion
from [start, stop) to [start, stop] is performed when making the cursor
request to get the next cursor.
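A minimal sketch of the bound conversion, assuming nanosecond timestamps so the closed end is simply stop-1:

```go
package main

import "fmt"

// toClosed converts a half-open [start, stop) request, as issued by Flux,
// into the closed [start, stop] interval the underlying file store expects.
// Timestamps are nanoseconds, so the closed end is simply stop-1.
func toClosed(start, stop int64) (int64, int64) {
	return start, stop - 1
}

func main() {
	start, stop := toClosed(0, 1000)
	fmt.Printf("cursor range: [%d, %d]\n", start, stop) // [0, 999]
}
```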
cherry-pick from #21318
Co-authored-by: Sam Arnold <sarnold@influxdata.com>
(cherry picked from commit 7766672797)
* chore: fix formatting
Co-authored-by: Jonathan A. Sternberg <jonathan@influxdata.com>
This is a backport of #14262 to the 1.x storage engine.
This also ports the table tests that existed with the pre-beta version of the
storage engine to the one that is now used in the production version.
A few of the tests are skipped. These are portions of the storage engine
that have not been ported over. They should be unskipped when that
functionality is ported over.
Co-authored-by: Jonathan A. Sternberg <jonathan@influxdata.com>
Extending the context instead of fixing the API breaks type safety.
For tracking the number of points / values written, it is much clearer
to pass an explicit tracker.
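A sketch of what an explicit tracker could look like; WriteTracker and writePoints are illustrative names, not the actual API:

```go
package main

import "fmt"

// WriteTracker is a hypothetical explicit tracker: callers that care about
// how many points and values were written pass one in, instead of the
// write path smuggling counters through context.Value.
type WriteTracker struct {
	PointsWritten int64
	ValuesWritten int64
}

// writePoints sketches a write path that records its work on the tracker.
func writePoints(points, fieldsPerPoint int, t *WriteTracker) {
	// ... actual write omitted ...
	if t != nil {
		t.PointsWritten += int64(points)
		t.ValuesWritten += int64(points * fieldsPerPoint)
	}
}

func main() {
	var t WriteTracker
	writePoints(100, 3, &t)
	fmt.Println(t.PointsWritten, t.ValuesWritten) // 100 300
}
```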
When using queries like `select count(_seriesKey) from bigmeasurement`, we
should iterate over the TSI structures to serve the query instead of loading
all the series into memory up front.
Closes #20543
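A rough sketch of the streaming-count idea, using a hypothetical iterator interface in place of the real TSI index types:

```go
package main

import "fmt"

// SeriesKeyIterator is a stand-in for the TSI index iterator: it yields one
// series key at a time instead of a fully materialized slice.
type SeriesKeyIterator interface {
	Next() (key []byte, ok bool)
}

// countSeries walks the iterator and counts keys, so memory stays constant
// no matter how many series the measurement has.
func countSeries(it SeriesKeyIterator) int64 {
	var n int64
	for {
		if _, ok := it.Next(); !ok {
			return n
		}
		n++
	}
}

// sliceIterator exists only for the demo below.
type sliceIterator struct {
	keys [][]byte
	i    int
}

func (s *sliceIterator) Next() ([]byte, bool) {
	if s.i >= len(s.keys) {
		return nil, false
	}
	k := s.keys[s.i]
	s.i++
	return k, true
}

func main() {
	it := &sliceIterator{keys: [][]byte{[]byte("m,host=a"), []byte("m,host=b")}}
	fmt.Println(countSeries(it)) // 2
}
```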
* feat(engine/tsm1): Add WritePointsWithContext()
Add WritePointsWithContext() and make WritePoints() a thin wrapper for
it.
The purpose is to add statistics context values that we'll use to
propagate the number of fields and points written up the call
chain.
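A minimal sketch of the wrapper pattern, with placeholder Engine and Point types standing in for the real ones:

```go
package main

import (
	"context"
	"fmt"
)

// Point stands in for models.Point; the real engine works on parsed
// line-protocol points.
type Point struct{ fields int }

// Engine stands in for the tsm1 engine.
type Engine struct{}

// WritePointsWithContext is the context-aware entry point; the context can
// carry statistics values that callers want populated.
func (e *Engine) WritePointsWithContext(ctx context.Context, points []Point) error {
	// ... actual write path omitted ...
	return nil
}

// WritePoints stays as a thin wrapper so existing callers keep compiling.
func (e *Engine) WritePoints(points []Point) error {
	return e.WritePointsWithContext(context.Background(), points)
}

func main() {
	var e Engine
	fmt.Println(e.WritePoints([]Point{{fields: 2}}))
}
```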
* feat(tsdb): Add WriteToShardWithContext()
When applied, this patch adds WriteToShardWithContext() and wraps it
with WriteToShard() to preserve the API.
The purpose of this addition is to propagate a context.Context value
to Shard.WritePointsWithContext().
* feat(tsdb/shard): Add WritePointsWithContext()
The purpose of adding WritePointsWithContext() is to propagate context
values down to engine code and propagate statistics via context.Value
up to callers.
This patch also adds values written statistics to the shard.
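A sketch of the statistics propagation, assuming the caller stashes a counter in the context under an unexported key type (the key name and function are illustrative); it also shows how a caller such as the HTTP write handler reads the result back:

```go
package main

import (
	"context"
	"fmt"
	"sync/atomic"
)

// contextKey is an unexported key type so these values cannot collide with
// other packages' context keys.
type contextKey string

const pointsWrittenKey contextKey = "points-written"

// writePointsWithContext sketches how the shard/engine reports statistics
// upward: if the caller stashed a counter in the context, add to it.
func writePointsWithContext(ctx context.Context, npoints int64) {
	// ... actual write omitted ...
	if c, ok := ctx.Value(pointsWrittenKey).(*int64); ok {
		atomic.AddInt64(c, npoints)
	}
}

func main() {
	// The caller (e.g. the HTTP write handler) sets up the counter and
	// reads it back after the write returns.
	var written int64
	ctx := context.WithValue(context.Background(), pointsWrittenKey, &written)
	writePointsWithContext(ctx, 250)
	fmt.Println("points written:", written) // 250
}
```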
* feat(http): Gather values written stats
WritePointsWithContext() was added to propagate context values down to
the engine and communicate stats to the caller.
* refactor: Change MetricKey to ContextKey
This patch gives the type we're using for context keys a better name.
This change adds a lock around digest creation so that it is safe for
concurrent calls. Prior to this change, calls from multiple goroutines
resulted in "Digest aborted, problem renaming tmp digest" errors.
This is the start of per-series validation that occurs in the
Engine write path. It uses an in-memory radix tree to reduce
memory usage and is re-built on demand the first time a series
is written.
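A sketch of the per-series validation cache; a plain map stands in here for the in-memory radix tree, and the type names are illustrative:

```go
package main

import "fmt"

// seriesValidator remembers which series keys have already passed the
// per-series checks so they run only on the first write of each series.
// A map stands in for the in-memory radix tree used to keep this compact.
type seriesValidator struct {
	seen map[string]struct{}
}

func newSeriesValidator() *seriesValidator {
	return &seriesValidator{seen: make(map[string]struct{})}
}

// validate runs the per-series checks once per series key.
func (v *seriesValidator) validate(key string) error {
	if _, ok := v.seen[key]; ok {
		return nil // already validated; skip the work
	}
	// ... per-series checks (type conflicts, limits, ...) would run here ...
	v.seen[key] = struct{}{}
	return nil
}

func main() {
	v := newSeriesValidator()
	fmt.Println(v.validate("cpu,host=a")) // validated
	fmt.Println(v.validate("cpu,host=a")) // cached, no re-validation
}
```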
This moves the time range to delete to be returned by the predicate
func in DeleteSeriesRangeWithPredicate. It allows for a single delete
to delete different ranges of times per series instead of a single
range of time for all series.
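A sketch of the predicate shape, with simplified types in place of the engine's own:

```go
package main

import (
	"fmt"
	"math"
)

// deletePredicate is a simplified version of the predicate: for each series
// it reports whether to delete and which [min, max] time range to delete.
type deletePredicate func(name []byte, tags map[string]string) (min, max int64, del bool)

// deleteSeriesRangeWithPredicate sketches how one call can now delete a
// different time range per series.
func deleteSeriesRangeWithPredicate(series []string, fn deletePredicate) {
	for _, s := range series {
		if min, max, del := fn([]byte(s), nil); del {
			fmt.Printf("delete %s in [%d, %d]\n", s, min, max)
		}
	}
}

func main() {
	deleteSeriesRangeWithPredicate([]string{"cpu,host=a", "mem,host=a"},
		func(name []byte, _ map[string]string) (int64, int64, bool) {
			if string(name) == "cpu,host=a" {
				return 0, 1000, true // narrow range for this series
			}
			return math.MinInt64, math.MaxInt64, true // everything for the rest
		})
}
```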
Re-open the last wal segment instead of creating a new one. This fixes
an issue where the last modified time of the WAL would change on
restart. It also avoids a lot of IO file churn on restart.
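A sketch of the reopen-instead-of-create behavior for the last segment file; path handling and the real WAL bookkeeping are omitted:

```go
package main

import (
	"fmt"
	"os"
)

// openLastSegment sketches the change: append to the existing last WAL
// segment on startup instead of always creating a fresh one, so the file's
// modification time and the directory contents stay stable across restarts.
// O_CREATE still covers the very first start, when no segment exists yet.
func openLastSegment(path string) (*os.File, error) {
	return os.OpenFile(path, os.O_RDWR|os.O_APPEND|os.O_CREATE, 0o666)
}

func main() {
	f, err := openLastSegment("_00001.wal")
	if err != nil {
		fmt.Println(err)
		return
	}
	defer f.Close()
	fmt.Println("appending to", f.Name())
}
```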
This commit adds the ability to correctly mark a series as deleted in
the global series file. Whenever a shard engine determines that a series
should be deleted, it checks with each shard's bitset for series that
are to be deleted and are no longer contained in any shard-local
bitsets.
These series are then removed from the series file.
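A rough sketch of the cross-shard check, with a plain map standing in for each shard's deleted-series bitset:

```go
package main

import "fmt"

// shardDeleted maps series ID -> deleted for one shard; a stand-in for the
// per-shard bitsets described above.
type shardDeleted map[uint64]bool

// seriesToRemove returns the candidate series IDs that no shard still
// contains, i.e. the ones that are safe to drop from the global series file.
func seriesToRemove(candidates []uint64, shards []shardDeleted) []uint64 {
	var out []uint64
	for _, id := range candidates {
		stillReferenced := false
		for _, s := range shards {
			if deleted, ok := s[id]; ok && !deleted {
				stillReferenced = true
				break
			}
		}
		if !stillReferenced {
			out = append(out, id)
		}
	}
	return out
}

func main() {
	shards := []shardDeleted{
		{1: true, 2: false}, // series 2 is still live in this shard
		{1: true},
	}
	fmt.Println(seriesToRemove([]uint64{1, 2}, shards)) // [1]
}
```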