influxdb

Commit Graph

Author	SHA1	Message	Date
devanbenz	035b66c7c1	feat: use a debug log	2025-05-23 11:41:11 -05:00
devanbenz	eb3e879ce6	fix: Use 10*millisecond for ticker	2025-04-18 16:06:12 -05:00
devanbenz	df86dfc8f1	feat: Add WaitWithTimeout to Partition This PR makes it easier to debug potential hanging retention service routines during DeleteShard.	2025-04-18 13:22:19 -05:00
WeblWabl	45a8227ad6	fix(influxd): update xxhash, avoid stringtoslicebyte in cache (#578 ) (#25622 ) (#25624 ) * fix(influxd): update xxhash, avoid stringtoslicebyte in cache (#578) * fix(influxd): update xxhash, avoid stringtoslicebyte in cache This commit does 3 things: * it updates xxhash from v1 to v2; v2 includes a assembly arm version of Sum64 * it changes the cache storer to write with a string key instead of a byte slice. The cache only reads the key which WriteMulti already has as a string so we can avoid a host of allocations when converting back and forth from immutable strings to mutable byte slices. This includes updating the cache ring and ring partition to write with a string key * it updates the xxhash for finding the cache ring partition to use Sum64String which uses unsafe pointers to directly use a string as a byte slice since it only reads the string. Note: this now uses an assembly version because of the v2 xxhash update. Go 1.22 included new compiler ability to recognize calls of Method([]byte(myString)) and not make a copy but from looking at the call sites, I'm not sure the compiler would recognize it as the conversion to a byte slice was happening several calls earlier. That's what this change set does. If we are uncomfortable with any of these, we can do fewer of them (for example, not upgrade xxhash; and/or not use the specialized Sum64String, etc). For the performance issue in maz-rr, I see converting string keys to byte slices taking between 3-5% of cpu usage on both the primary and secondary. So while this pr doesn't address directly the increased cpu usage on the secondary, it makes cpu usage less on both which still feels like a win. I believe these changes are easier to review that switching to a byte slice pool that is likely needed in other places as the compiler provides nearly all of the correctness checks we need (we are relying also on xxhash v2 being correct). * helps #550 * chore: fix tests/lint * chore: don't use assembly version; should inline This 2 line change causes xxhash to use a purego Sum64 implementation which allows the compiler to see that Sum64 only read the byte slice input which them means is can skip the string to byte slice allocation and since it can skip that, it should inline all the calls to getPartitionStringKey and Sum64 avoiding 1 call to Sum64String which isn't inlined. * chore: update ci build file the ci build doesn't use the make file!!! * chore: revert "chore: update ci build file" This reverts commit 94be66fde03e0bbe18004aab25c0e19051406de2. * chore: revert "chore: don't use assembly version; should inline" This reverts commit 67d8d06c02e17e91ba643a2991e30a49308a5283. (cherry picked from commit 1d334c679ca025645ed93518b7832ae676499cd2) * feat: need to update go sum --------- Co-authored-by: Phil Bracikowski <13472206+philjb@users.noreply.github.com> (cherry picked from commit `06ab224516`)	2024-12-06 16:05:03 -06:00
WeblWabl	5c9e45f033	fix(tsi1/partition/test): fix data races in test code (#57 ) (#25338 ) * fix(tsi1/partition/test): fix data races in test code (#57) * fix(tsi1/partition/test): fix data races in test code This PR is like influxdata/influxdb#24613 but solves it with a setter method for MaxLogFileSize which allows unexporting that value and MaxLogFileAge. There are actually two places locks were needed in test code. The behavior of production code is unchanged. (cherry picked from commit f0235c4daf4b97769db932f7346c1d3aecf57f8f) * feat: modify error handling to be more idiomatic closes https://github.com/influxdata/influxdb/issues/24042 * fix: errors.Join() filters nil errors --------- Co-authored-by: Phil Bracikowski <13472206+philjb@users.noreply.github.com>	2024-09-16 20:26:14 -05:00
WeblWabl	7dc8b1d648	fix(tsi1/partition/test): fix data race in test code (#25288 )	2024-09-11 19:48:41 -05:00
Shiwen Cheng	7333da9592	fix(tsi1): fix data race between appendEntry and FlushAndSync tsi1.(*LogFile) (#25182 ) Extend lock lifespan to encompass the flushAndSync() call to avoid a race closes https://github.com/influxdata/influxdb/issues/25181	2024-07-23 14:40:10 -07:00
Jack	6af0be9234	fix: panic index out of range for invalid series keys (#24565 ) * chore: add scaffolding for naive solution * feat: test case scaffolding * fix: implement check for series key before proceeding * fix: add validation for ReadSeriesKeyMeasurement usage * refactor: explicit use of series key len * feat: add remaining check to index * feat: add check to remaining files As the Len function is used as part of the parseSeriesKey, this also needs to be accounted for on the nil return from this function as it is used in different contexts * feat: expand test cases * chore: go fmt * chore: update test failure message * chore: impl feedback on unnecessary sz checks * feat: expand test cases * fix: nil series key check In both sections for index.go there is a pre-existing length check against the series key which should catch invalid values, perhaps this explains why it hasn't cropped up in the reported panics. For even more safety, we can also skip a nil key because we know that subsequent calls will cause a panic where this key is attempted to be used * fix: remove nil tags check A key with no tags is valid, so we should not check for BOTH nil key and tags as a key could be nil, which is invalid, yet still have tags and therefore cause the check to pass which we do not want * feat: extend test cases from feedback * fix: extend checks for CompareSeriesKeys * feat: add nilKeyHandler for shared key checking logic * fix: logical error in nilKeyHandler Prior to this, the else was always defaulted to at the end of the conditional branch, which causes unexpected behaviour and a failure of a bunch of tests. * fix: return tags keep nil data In a recent change to this, we agreed on a simple name == nil check for the actual data. As a follow on to this, I just realised that we don't actually want to nil back the tags, even if they're not checked, because having no tags is a valid input so we can simply return whatever we were passed unchanged. * fix: use len == 0 for extra safety * feat: extra test for blank series key	2024-01-23 09:44:29 +00:00
davidby-influx	aad79e471f	fix: prevent world-writable MANIFEST files (#24235 ) When a new MANIFEST file is created, set its permissions to 644, not 666 closes https://github.com/influxdata/influxdb/issues/24233	2023-05-18 12:07:34 -07:00
Brandon Pfeifer	e484c4d871	chore: upgrade Go to v1.19.3 (1.x) (#23941 ) * chore: upgrade Go to 1.19.3 This re-runs ./generate.sh and ./checkfmt.sh to format and update source code (this is primarily responsible for the huge diff.) * fix: update tests to reflect sorting algorithm change	2022-11-28 12:15:47 -05:00
davidby-influx	eb3cc88772	fix: generalize test for Windows (#23580 ) Also eliminate race condition in tests (cherry picked from commit `7e37a7ad16`)	2022-07-21 13:28:10 -07:00
davidby-influx	a8732dcf52	fix: restore in-memory Manifest on write error (#23552 ) Do not update the `FileSet` or `activeLogFile` field in the in-memory Partition structure if the Manifest file is not correctly saved to the disk. closes https://github.com/influxdata/influxdb/issues/23553	2022-07-20 12:59:15 -07:00
davidby-influx	25cea95beb	fix: add paths to tsi log and index file errors (#23557 ) Add paths to various TSI errors on opening and unmarshaling files to help poinpoint the corrupt files. Closes https://github.com/influxdata/influxdb/issues/23556	2022-07-19 09:02:20 -07:00
davidby-influx	061cf55f2a	fix: create TSI MANIFEST files atomically (#23539 ) When a MANIFEST file is created in TSI, it should be written to a temp file, then atomically renamed, to avoid overwriting the existing file only to fail on the later write. closes https://github.com/influxdata/influxdb/issues/23536	2022-07-13 10:11:49 -07:00
davidby-influx	a2dd708a26	fix: improve error messages opening index partitions (#23532 ) Where possible, add the file path path to any errors on opening, reading, (un)marshaling, or validating the various files comprising a partition closes https://github.com/influxdata/influxdb/issues/23506	2022-07-12 14:22:36 -07:00
davidby-influx	a428043f84	fix: lost TSI reference / close TagValueSeriesIDIterator in error case (#23461 ) (#23462 ) (cherry picked from commit `8bd4fc502d`) closes https://github.com/influxdata/influxdb/issues/23460 Co-authored-by: Dane Strandboge <dstrandboge@influxdata.com>	2022-06-16 11:54:04 -07:00
davidby-influx	d3db48e93d	fix: fully clean up partially opened TSI (#23430 ) When one partition in a TSI fails to open, all previously opened partitions should be cleaned up, and remaining partitions should not be opened closes https://github.com/influxdata/influxdb/issues/23427	2022-06-10 11:31:29 -07:00
Dane Strandboge	0574163566	build: upgrade to go1.18 (#23250 )	2022-03-31 16:17:57 -05:00
davidby-influx	7d182158f4	fix: add database to MaxSeriesPerDatabase error message (#23113 ) To simplify debugging, print the database name when the max-series-per-database limit is exceeded in InMem indices. closes https://github.com/influxdata/influxdb/issues/23112	2022-02-08 11:52:14 -08:00
davidby-influx	0c3dca883e	fix: correctly handle MaxSeriesPerDatabaseExceeded (#23091 ) Check for the correctly returned PartialWriteError in (*shard).validateSeriesAndFields, allow partial writes. closes https://github.com/influxdata/influxdb/issues/23090	2022-02-01 19:08:51 -08:00
lifeibo	5be1c044c3	fix(tsi): sync index file before close (#21932 )	2021-11-24 08:36:03 -05:00
Dane Strandboge	06d1df22a2	chore: fix deadlock in `influx_inspect dumptsi` (#22661 )	2021-10-20 12:48:59 -05:00
Sam Arnold	59fe8e515e	test: fix DiskSizeBytes flakiness (#22641 )	2021-10-08 09:47:12 -04:00
Sam Arnold	1755b8f6d2	fix: TSI logfile race (#22338 ) modTime should be protected by the read lock. Fixes #22337	2021-08-30 17:43:37 -04:00
Sam Arnold	3ae389b359	test: add extra logging when disk size test fails (#22103 )	2021-08-07 06:48:42 -04:00
Sam Arnold	444c22b67d	test: fix order of index teardown (#22038 )	2021-08-04 16:34:51 -04:00
Sam Arnold	e62efaf751	fix: old tsl files should be compacted without new writes (#22006 ) * fix: old tsl files should be compacted wihout new writes * chore: update changelog.md	2021-08-02 13:36:23 -04:00
Sam Arnold	b64c2c3dcf	fix: tsi index should compact old or too-large log files (#21943 ) * fix: tsi index should compact old log files that are too large * chore: run automated formatter * chore: update changelog * fix: review comments	2021-07-26 17:40:15 -04:00
Tristan Su	108e2600b3	fix(tsi): clean up FileSet fields (#18961 )	2021-07-12 10:42:38 -04:00
Sam Arnold	b7e7de24d6	refactor: separate coarse and fine permission interfaces (#20996 )	2021-03-22 09:52:33 -04:00
Sam Arnold	17b9ea8723	feat: Add WITH KEY to show tag keys (#20793 ) * fix: Change from RewriteExpr to PartitionExpr Also remove some dead code * feat: WITH KEY implementation * feat: query rewriting for WITH KEY in SHOW TAG KEYS	2021-02-25 08:38:29 -05:00
Sam Arnold	21823db00b	feat: series creation ingress metrics (#20700 ) After turning this on and testing locally, note the 'seriesCreated' metric "localStore": {"name":"localStore","tags":null,"values":{"pointsWritten":2987,"seriesCreated":58,"valuesWritten":23754}}, "ingress": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"cq","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":4}}, "ingress:1": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"database","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":4}}, "ingress:2": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"httpd","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":46}}, "ingress:3": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"ingress","rp":"monitor"},"values":{"pointsWritten":14,"seriesCreated":14,"valuesWritten":42}}, "ingress:4": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"localStore","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":6}}, "ingress:5": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"queryExecutor","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":10}}, "ingress:6": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"runtime","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":30}}, "ingress:7": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"shard","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":22}}, "ingress:8": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"subscriber","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":6}}, "ingress:9": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_cache","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":18}}, "ingress:10": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_engine","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":58}}, "ingress:11": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_filestore","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":4}}, "ingress:12": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_wal","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":8}}, "ingress:13": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"write","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":18}}, "ingress:14": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"cpu","rp":"autogen"},"values":{"pointsWritten":1342,"seriesCreated":13,"valuesWritten":13420}}, "ingress:15": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"disk","rp":"autogen"},"values":{"pointsWritten":642,"seriesCreated":6,"valuesWritten":4494}}, "ingress:16": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"diskio","rp":"autogen"},"values":{"pointsWritten":214,"seriesCreated":2,"valuesWritten":2354}}, "ingress:17": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"mem","rp":"autogen"},"values":{"pointsWritten":107,"seriesCreated":1,"valuesWritten":963}}, "ingress:18": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"processes","rp":"autogen"},"values":{"pointsWritten":107,"seriesCreated":1,"valuesWritten":856}}, "ingress:19": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"swap","rp":"autogen"},"values":{"pointsWritten":214,"seriesCreated":1,"valuesWritten":642}}, "ingress:20": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"system","rp":"autogen"},"values":{"pointsWritten":321,"seriesCreated":1,"valuesWritten":749}}, Closes: https://github.com/influxdata/influxdb/issues/20613	2021-02-05 14:52:43 -04:00
Sam Arnold	32612313df	fix: minor test fixes for go1.15 and also flaky timeouts Also run gofmt	2021-01-08 14:59:33 -05:00
Ayan George	4ef4fe9aef	fix(tsi1): Acquire a lock when modifying measurement map This patch protects an internal map for concurrent use. (LogFile).Writes() method calls (LogFile).createMeasurementIfNotExists() which writes to a shared map. (LogFile).Writes() acquires a read-lock which leaves createMeasurementIfNotExists() open to concurrent writes to its shared map. This commit adds the ExecEntries method to the LogFile type so that we can properly lock calls to (LogFile).appendEntry() using defer. (LogFile).ExecEntries() is used to mostly replace the body of (LogFile).Writes() and incurs another function call since ExecEntries() can't be inlined. Below is the output of build with "-m -m -m" gcflags: ./log_file.go:1076:6: cannot inline (LogFile).ExecEntries: unhandled op DEFER The performace impact of the additional function call should be negligable and is outwieghed by the safety and simplicity of using defer.	2020-08-31 12:52:54 -04:00
dengzhi.ldz	331569bc11	perf(tsi1): batch write tombstone entries when dropping/deleting	2020-06-24 09:26:09 -06:00
Ayan George	3b9be0145c	fix: address static check warning s1039 (#18135 ) This commit quiets staticcheck's warnings about "unnecessary use of fmt.Sprintf" and "unnecessary use of fmt.Sprint". Prior to this commit we were wrapping simple constant strings without any formatting verbs with fmt.Sprintf().	2020-05-18 13:55:05 -04:00
Ben Johnson	848ca48225	fix(tsdb): Revert "fix: remove some unsafe marshalling to reduce risk of segfault" This reverts commit `30dab03310`.	2020-04-13 13:22:22 -06:00
docmerlin (j. Emrys Landivar)	30dab03310	fix: remove some unsafe marshalling to reduce risk of segfault We were seing segfaults in Roaring bitmaps sometimes, under very high load with networked drives. This may reduce risk of segfault by forcing marshalling to copy the data.	2020-03-11 13:47:29 -04:00
Ayan George	5f47c388df	chore(influxdb): Forward port 16999 (#17032 ) * fix: access tsi active log file with READ lock The activeLogFile pointer may be altered by other routine so the READ lock is needed. * Merge pull request #16384 from foobar/tsi-partition-lock fix: access tsi active log file with READ lock Co-authored-by: Tristan Su <sooqing@gmail.com> Co-authored-by: David Norton <dgnorton@gmail.com>	2020-03-04 16:20:58 -05:00
Edd Robinson	34c0fdafc0	feat(storage): Offline series file compaction	2020-02-03 13:57:31 -07:00
David Norton	903d2c2d28	fix(tsm1): improve series cardinality limit Prior to this change, new series would be added to the series file before checking the series cardinality limit. If the limit was exceeded, the write was rejected even though the series had already been added to the series file.	2020-01-21 16:45:13 -05:00
Edd Robinson	cac4c8956c	fix(tsi1): index defect with negated equality filters Fixes #15859 This commit fixes a defect in the TSI index where a filter using the negated equality operator would result in no matching series being returned for series stored within the `IndexFile` portions of the index. The root cause of this was due to missing legacy-handling code in the index for this particular iterator.	2019-11-12 15:10:42 +00:00
Ben Johnson	14d45c913c	fix(tsi1): replace TSI compaction wait group with counter	2019-09-18 11:47:37 -06:00
Edd Robinson	9bfd1119b9	fix(storage): Fix issue where fields re-appear Fixes #10052 This commit fixes an issue where field keys would reappear in results when querying previously dropped measurements. The issue manifests itself when duplicates of a new series are inserted into the `inmem` index. In this case, a map that tracks the number of series belonging to a measurement was incorrectly incremented once for each duplication of the series. Then, when it came time to drop the measurement, the index assumed there were several series belonging to the measurement left in the index (because the counter was higher than it should be). The result of that was that the `fields.idx` file (which stores a mapping between measurements and field keys) was not truncated and rebuilt. This left old field keys in that file, which were then returned in subsequent queries over all field keys.	2019-07-05 12:24:03 +01:00
Ben Johnson	2edbc907a1	Add nil check for tagKeyValueEntry.setIDs() Previously it was possible to set IDs on a `nil` entry which would in turn cause a panic. If this panic was recovered by the server then it would result in a mutex in the `inmem` index staying locked indefinitely.	2019-04-02 10:04:39 -06:00
Ben Johnson	c61db43dc2	Update tagKeyValue mutex to write lock. This commit changes the read lock to a write lock when calling the `ids()` function because `ids()` can mutate the underlying series ids slice.	2019-02-15 09:29:48 -07:00
Jeff Wendling	40d2b70376	Ensure that cached series id sets are Go heap backed	2019-02-12 15:20:24 -07:00
Ben Johnson	6e5226437a	Convert TagValueSeriesIDCache to use string fields. This commit changes `name`, `key`, and `value` to from `[]byte` to `string`.	2019-02-12 14:22:40 -07:00
Edd Robinson	05e7def600	Merge pull request #10332 from ludweeg/ludweeg/unslice Simplify s[:] to s where s is a slice	2019-02-11 10:24:43 +00:00
Ben Wells	e9bada090f	Fix misspelling identified by misspell	2019-02-03 20:27:43 +00:00

1 2 3 4 5 ...

461 Commits (db/wait-timeout-utility)