influxdb

Commit Graph

Author	SHA1	Message	Date
WeblWabl	45a8227ad6	fix(influxd): update xxhash, avoid stringtoslicebyte in cache (#578) (#25622) (#25624) * fix(influxd): update xxhash, avoid stringtoslicebyte in cache (#578) * fix(influxd): update xxhash, avoid stringtoslicebyte in cache This commit does 3 things: * it updates xxhash from v1 to v2; v2 includes an assembly arm version of Sum64 * it changes the cache storer to write with a string key instead of a byte slice. The cache only reads the key which WriteMulti already has as a string so we can avoid a host of allocations when converting back and forth from immutable strings to mutable byte slices. This includes updating the cache ring and ring partition to write with a string key * it updates the xxhash for finding the cache ring partition to use Sum64String which uses unsafe pointers to directly use a string as a byte slice since it only reads the string. Note: this now uses an assembly version because of the v2 xxhash update. Go 1.22 included new compiler ability to recognize calls of Method([]byte(myString)) and not make a copy but from looking at the call sites, I'm not sure the compiler would recognize it as the conversion to a byte slice was happening several calls earlier. That's what this change set does. If we are uncomfortable with any of these, we can do fewer of them (for example, not upgrade xxhash; and/or not use the specialized Sum64String, etc). For the performance issue in maz-rr, I see converting string keys to byte slices taking between 3-5% of cpu usage on both the primary and secondary. So while this pr doesn't address directly the increased cpu usage on the secondary, it makes cpu usage less on both which still feels like a win. I believe these changes are easier to review than switching to a byte slice pool that is likely needed in other places as the compiler provides nearly all of the correctness checks we need (we are relying also on xxhash v2 being correct). * helps #550 * chore: fix tests/lint * chore: don't use assembly version; should inline This 2 line change causes xxhash to use a purego Sum64 implementation which allows the compiler to see that Sum64 only reads the byte slice input, which then means it can skip the string to byte slice allocation and since it can skip that, it should inline all the calls to getPartitionStringKey and Sum64 avoiding 1 call to Sum64String which isn't inlined. * chore: update ci build file the ci build doesn't use the make file!!! * chore: revert "chore: update ci build file" This reverts commit 94be66fde03e0bbe18004aab25c0e19051406de2. * chore: revert "chore: don't use assembly version; should inline" This reverts commit 67d8d06c02e17e91ba643a2991e30a49308a5283. (cherry picked from commit 1d334c679ca025645ed93518b7832ae676499cd2) * feat: need to update go sum --------- Co-authored-by: Phil Bracikowski <13472206+philjb@users.noreply.github.com> (cherry picked from commit `06ab224516`)	2024-12-06 16:05:03 -06:00
WeblWabl	5c9e45f033	fix(tsi1/partition/test): fix data races in test code (#57) (#25338) * fix(tsi1/partition/test): fix data races in test code (#57) * fix(tsi1/partition/test): fix data races in test code This PR is like influxdata/influxdb#24613 but solves it with a setter method for MaxLogFileSize which allows unexporting that value and MaxLogFileAge. There are actually two places locks were needed in test code. The behavior of production code is unchanged. (cherry picked from commit f0235c4daf4b97769db932f7346c1d3aecf57f8f) * feat: modify error handling to be more idiomatic closes https://github.com/influxdata/influxdb/issues/24042 * fix: errors.Join() filters nil errors --------- Co-authored-by: Phil Bracikowski <13472206+philjb@users.noreply.github.com>	2024-09-16 20:26:14 -05:00
Brandon Pfeifer	e484c4d871	chore: upgrade Go to v1.19.3 (1.x) (#23941) * chore: upgrade Go to 1.19.3 This re-runs ./generate.sh and ./checkfmt.sh to format and update source code (this is primarily responsible for the huge diff.) * fix: update tests to reflect sorting algorithm change	2022-11-28 12:15:47 -05:00
davidby-influx	a8732dcf52	fix: restore in-memory Manifest on write error (#23552) Do not update the `FileSet` or `activeLogFile` field in the in-memory Partition structure if the Manifest file is not correctly saved to the disk. closes https://github.com/influxdata/influxdb/issues/23553	2022-07-20 12:59:15 -07:00
davidby-influx	a428043f84	fix: lost TSI reference / close TagValueSeriesIDIterator in error case (#23461) (#23462) (cherry picked from commit `8bd4fc502d`) closes https://github.com/influxdata/influxdb/issues/23460 Co-authored-by: Dane Strandboge <dstrandboge@influxdata.com>	2022-06-16 11:54:04 -07:00
davidby-influx	d3db48e93d	fix: fully clean up partially opened TSI (#23430) When one partition in a TSI fails to open, all previously opened partitions should be cleaned up, and remaining partitions should not be opened closes https://github.com/influxdata/influxdb/issues/23427	2022-06-10 11:31:29 -07:00
Dane Strandboge	0574163566	build: upgrade to go1.18 (#23250)	2022-03-31 16:17:57 -05:00
Dane Strandboge	06d1df22a2	chore: fix deadlock in `influx_inspect dumptsi` (#22661)	2021-10-20 12:48:59 -05:00
Sam Arnold	b64c2c3dcf	fix: tsi index should compact old or too-large log files (#21943) * fix: tsi index should compact old log files that are too large * chore: run automated formatter * chore: update changelog * fix: review comments	2021-07-26 17:40:15 -04:00
Tristan Su	108e2600b3	fix(tsi): clean up FileSet fields (#18961)	2021-07-12 10:42:38 -04:00
Sam Arnold	21823db00b	feat: series creation ingress metrics (#20700) After turning this on and testing locally, note the 'seriesCreated' metric "localStore": {"name":"localStore","tags":null,"values":{"pointsWritten":2987,"seriesCreated":58,"valuesWritten":23754}}, "ingress": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"cq","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":4}}, "ingress:1": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"database","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":4}}, "ingress:2": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"httpd","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":46}}, "ingress:3": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"ingress","rp":"monitor"},"values":{"pointsWritten":14,"seriesCreated":14,"valuesWritten":42}}, "ingress:4": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"localStore","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":6}}, "ingress:5": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"queryExecutor","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":10}}, "ingress:6": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"runtime","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":30}}, "ingress:7": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"shard","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":22}}, "ingress:8": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"subscriber","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":6}}, "ingress:9": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_cache","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":18}}, "ingress:10": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_engine","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":58}}, "ingress:11": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_filestore","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":4}}, "ingress:12": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"tsm1_wal","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":2,"valuesWritten":8}}, "ingress:13": {"name":"ingress","tags":{"db":"_internal","login":"_systemuser_monitor","measurement":"write","rp":"monitor"},"values":{"pointsWritten":2,"seriesCreated":1,"valuesWritten":18}}, "ingress:14": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"cpu","rp":"autogen"},"values":{"pointsWritten":1342,"seriesCreated":13,"valuesWritten":13420}}, "ingress:15": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"disk","rp":"autogen"},"values":{"pointsWritten":642,"seriesCreated":6,"valuesWritten":4494}}, "ingress:16": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"diskio","rp":"autogen"},"values":{"pointsWritten":214,"seriesCreated":2,"valuesWritten":2354}}, "ingress:17": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"mem","rp":"autogen"},"values":{"pointsWritten":107,"seriesCreated":1,"valuesWritten":963}}, "ingress:18": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"processes","rp":"autogen"},"values":{"pointsWritten":107,"seriesCreated":1,"valuesWritten":856}}, "ingress:19": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"swap","rp":"autogen"},"values":{"pointsWritten":214,"seriesCreated":1,"valuesWritten":642}}, "ingress:20": {"name":"ingress","tags":{"db":"telegraf","login":"_systemuser_unknown","measurement":"system","rp":"autogen"},"values":{"pointsWritten":321,"seriesCreated":1,"valuesWritten":749}}, Closes: https://github.com/influxdata/influxdb/issues/20613	2021-02-05 14:52:43 -04:00
dengzhi.ldz	331569bc11	perf(tsi1): batch write tombstone entries when dropping/deleting	2020-06-24 09:26:09 -06:00
Edd Robinson	cac4c8956c	fix(tsi1): index defect with negated equality filters Fixes #15859 This commit fixes a defect in the TSI index where a filter using the negated equality operator would result in no matching series being returned for series stored within the `IndexFile` portions of the index. The root cause of this was due to missing legacy-handling code in the index for this particular iterator.	2019-11-12 15:10:42 +00:00
Edd Robinson	05e7def600	Merge pull request #10332 from ludweeg/ludweeg/unslice Simplify s[:] to s where s is a slice	2019-02-11 10:24:43 +00:00
Edd Robinson	301ab71ba0	Remove copy-on-write when caching bitmaps In the case of caching TSI bitmaps belonging to immutable .tsi files, the underlying bitset data can be mmapped. It is possible, though rare, for this data to be unmapped (e.g., via a TSI compaction) but for the cached bitmap to be subsequently read. This leads to a segfault. This only happens when copy-on-write is set to true on the roaring bitmap, because in that case only the internal pointers are cloned. This change will reduce the TSI cache performance by around 10%, which I have deemed to account for only a few microseconds typically.	2019-01-25 18:02:48 +00:00
Edd Robinson	efdddbb31a	Allow TSI bitset cache size to be configured This commit adds a config option to the tsdb Config allowing the size of the bitset cached in the TSI index to be specified. Setting the cache size to 0 will disable the cache.	2019-01-24 17:41:45 +00:00
Edd Robinson	e20541d2ba	Expose functional option for setting TSI cache size	2019-01-23 17:15:48 +00:00
Edd Robinson	3a055a6107	Fix cardinality estimation error This commit fixes an error in the TSI index with estimating the cardinality of series recently added and then removed.	2019-01-10 17:46:30 +00:00
Jeff Wendling	0a2f6191a6	tsdb: clean up fields index for every kind of delete Before this, if you deleted everything with `delete where true` for example, then you would be left with all of your measurements in the fields index. That would cause ghost fields to reappear if someone reinserted to the measurement. This fixes that by making it so the deepest most delete code checks if the measurement was removed from the index, and if so cleaning it up out of the fields index. Additionally, it fixes bugs in that cleanup code where if you had a measurement like "m1" and "m10", when iterating over the cache or file store, "m1" would match "m10" due to it only checking the prefix. This also has it check the character right after the measurement to be either a comma because tags started, or the first character of the field separator.	2018-11-27 16:12:06 -07:00
ludweeg	5622355526	Simplify s[:] to s where s is a slice	2018-10-04 17:10:21 +03:00
Ben Johnson	0d777ad423	Fix tsi1 sketch locking.	2018-09-26 17:01:47 -06:00
Edd Robinson	76237d80f2	Address PR feedback	2018-09-18 15:58:38 -07:00
Ben Johnson	e651153f1c	Add TagValueSeriesIDCache.Delete().	2018-09-18 15:58:38 -07:00
Ben Johnson	fcbc03240a	Inline mutex into TagValueSeriesIDCache.	2018-09-18 15:58:38 -07:00
Edd Robinson	bdc293abdd	Tidy up	2018-09-18 15:58:38 -07:00
Edd Robinson	8af7c133db	Refactor cache	2018-09-18 15:58:38 -07:00
Edd Robinson	1ae716b64e	Use copy-on-write when cloning bitmaps This commit sets the copy-on-write feature of the SeriesIDSets, such that we can make immutable clones of underlying bitmaps efficiently. If the original bitmap is modified then a copy will be made, which won't affect the clone.	2018-09-18 15:58:38 -07:00
Edd Robinson	baf35f2138	Add benchmarks for cache and option to disable	2018-09-18 15:58:38 -07:00
Edd Robinson	3f6ef0ba22	Update cached bitset results with new series ids This commit ensures that cached bitset results at the Index level are updated whenever new series ids are created that would belong in those bitsets. For example, if we have a cached bitset for the tuple {mem, region, west}, and we add the series mem,host=prod,region=west then we would update the cached bitset for {mem, region, west} with the series id of the newly written series.	2018-09-18 15:58:38 -07:00
Edd Robinson	2c4c79f110	Convert cache to LRU	2018-09-18 15:58:38 -07:00
Edd Robinson	2ae2157d02	debug	2018-09-18 15:58:38 -07:00
Edd Robinson	74b3d35e40	Basic cache	2018-09-18 15:58:38 -07:00
Ben Johnson	88d006a18c	Remove TSI1 HLL sketches from heap. This commit removes the HLL sketches on each `tsi1.LogFile` and `tsi1.IndexFile` and instead caches the data at the `tsi1.Index` level. This reduces the heap size significantly for servers with many TSI-enabled shards.	2018-09-12 08:48:40 -06:00
Edd Robinson	dece5b847f	Refactor index names	2018-08-21 14:32:30 +01:00
Edd Robinson	a67f15fad4	Promote DropSeriesGlobal to Index interface	2018-08-20 17:57:16 +01:00
Ben Johnson	fdfd038401	Add roaring bitmaps to TSI index files.	2018-07-24 17:59:23 +01:00
Edd Robinson	11bea138f8	Restrict buffer size	2018-07-09 11:51:48 +01:00
Edd Robinson	3cf20823e9	Allow LogFile buffer size to be changed When adding many series using offline tooling, it's likely that every series involves an entry being appended to a LogFile. Typically an entry is 11 or 12 bytes, but the default bufio.Writer buffer size is only 4K. This means by default a write of 10,000 new series would involve ~30 buffer flushes. This commit makes the buffer configurable, and sets the value in `buildtsi` such that it reflects the number of series being written to the LogFile.	2018-07-09 11:51:48 +01:00
Edd Robinson	681af04815	Optionally disable buffer flushing/file syncing When running offline tooling, flushing buffers and syncing files on every write to a `LogFile` is not necessary. Were a hard exit with data loss to occur, the tooling can simply be run again.	2018-07-09 11:51:15 +01:00
Jacob Marble	3f2ff742c0	Remove unused 'database' field	2018-05-18 09:22:43 -07:00
Jacob Marble	7f8b7af61e	Cleanup index memory footprint counting code (#9828) * Fix IndexSet.DedupeInmemIndexes * Cleanup index memory footprint code	2018-05-15 11:25:19 -07:00
Jacob Marble	0763d1789e	Get inmem index bytes without double-counting	2018-05-10 11:33:52 -07:00
Jacob Marble	2dc2b97fb9	tsdb/index: Add Bytes() methods (#9794)	2018-05-04 08:47:05 -07:00
Ben Johnson	92d38414f2	Add adjustable TSI log file size. This commit adds the `max-index-log-file-size` configuration flag so that users can restrict the maximum size of log files before compaction. The default limit was also lowered from `5MB` to `1MB`. The original size was set before we partitioned the index so the change reflects this.	2018-04-02 11:47:59 -06:00
Ben Johnson	fee6149791	Merge pull request #9489 from influxdata/bj-dumptsi-cardinality Add dumptsi path error handling.	2018-02-27 09:15:03 -07:00
Ben Johnson	b3fcc63a78	Add dumptsi path error handling.	2018-02-27 08:30:12 -07:00
Edd Robinson	96c0ecf618	Improve startup time of `inmem` index This commit improves the startup time when using the `inmem` index by ensuring that the series are created in the index and series file in batches of 10000, rather than individually. Fixes #9486.	2018-02-27 13:33:00 +00:00
Stuart Carnie	a74d296200	use underscore vs period, fix doc comment, add database name to CQ	2018-02-26 10:08:43 -07:00
Edd Robinson	7a55735562	Add option to set LogFile compaction size	2018-02-07 14:52:13 -07:00
Edd Robinson	544329380f	Add empty series sketches back to tsi1 index This commit adds initial empty sketches back to the tsi1 index, as well as ensuring that ephemeral sketches in the index `LogFile` are updated accordingly. The commit also adds a test that verifies that the merged sketches at the store level produce the correct results under writes, deletions and re-opening of the store. This commit does not provide working sketches for post-compaction on the tsi1 index.	2018-02-07 14:52:13 -07:00


207 Commits (db/wait-timeout-utility)