When the compaction planner runs, if it cannot acquire
a lock on the files it plans to compact, it returns a
nil list of compaction groups. This, in turn, sets the
engine statistics for compaction queues to zero,
which is incorrect. Instead, use the length of the pending
files that would have been returned.
closes https://github.com/influxdata/influxdb/issues/22138
This fix ensures that memory-mapped files are not released
before pointers into them are copied into heap memory.
MeasurementNamesByExpr() and MeasurementNamesByPredicate() can
cause panics by copying memory from mmapped files that have been
released. The functions they call use iterators to files which
are closed (releasing the mmapped files) before the memory is
safely copied to the heap.
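A minimal sketch of the pattern behind the fix (the helper name is illustrative): slices handed out by index iterators can point into mmapped files, so they are cloned onto the heap before the iterator, and with it the mapping, is closed:

```go
// cloneToHeap copies a byte slice that may point into an mmapped file
// onto the Go heap, so it remains valid after the file is unmapped.
func cloneToHeap(b []byte) []byte {
	out := make([]byte, len(b))
	copy(out, b)
	return out
}
```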
closes https://github.com/influxdata/influxdb/issues/22000
Compaction logging will generate intermediate information on the
volume of data written and the output files created, as well as
improve some of the anti-entropy messages related to compaction.
This also applies to `influx_tools compact`.
Closes https://github.com/influxdata/influxdb/issues/21704
tsm1.DigestWithOptions closes its network connection
twice. This may cause broken-pipe errors in concurrent
invocations of the same procedure by closing a reused
I/O descriptor. This fix also captures errors from TSM
file closures, which were previously ignored.
Closes https://github.com/influxdata/influxdb/issues/21656
Under heavy write load that creates new fields and measurements,
the rewrite of the fields.idx file is a bottleneck. This
enhancement combines multiple writes into a single one and
shares any error return value with all of the combined
invocations, as sketched below. MeasurementFieldSet and the new
MeasurementFieldSetWriter must both now be explicitly
closed.
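A minimal, hypothetical sketch of the coalescing idea; none of these names are the actual MeasurementFieldSetWriter API. Concurrent callers queue a request and block on a result channel, and a single writer goroutine performs one rewrite on behalf of the whole batch, sharing its error with every caller:

```go
type writeRequest struct {
	done chan error // shared result for every coalesced caller
}

// writer drains all queued requests, performs one file rewrite, and
// fans the single error out to every caller it served.
func writer(queue chan writeRequest, rewrite func() error) {
	for req := range queue {
		batch := []writeRequest{req}
		for more := true; more; {
			select {
			case r := <-queue:
				batch = append(batch, r)
			default:
				more = false
			}
		}
		err := rewrite() // one rewrite covers the whole batch
		for _, r := range batch {
			r.done <- err
		}
	}
}
```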
Closes #21577
tsdb.Engine.IsIdle and tsdb.Engine.Digest now return a reason string for why the engine & shard are not idle.
Callers can then use this string for logging, if desired. The returned reason does not allocate memory, so the
caller may want to add the shard ID and path for more information in the log. This is intended to be used in
calls from the anti-entropy service in Enterprise.
(cherry picked from commit bf45841359)
fixes https://github.com/influxdata/influxdb/issues/21448
* fix: backport tsdb fix for window pushdowns
From https://github.com/influxdata/influxdb/pull/19855
* fix(storage): cursor requests are [start, stop] instead of [start, stop)
The cursors were previously [start, stop) to be consistent with how flux
requests data, but the underlying storage file store was [start, stop]
because that's how influxql read data. This reverts the cursor
behavior so that it is now [start, stop] everywhere, and the
conversion from [start, stop) to [start, stop] is performed when
making the cursor request to get the next cursor.
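A minimal sketch of the boundary conversion, assuming int64 nanosecond timestamps; the exclusive end of a Flux-style [start, stop) request is decremented to form the closed [start, stop] range the cursor layer now uses everywhere:

```go
// toClosed converts a half-open [start, stop) request into the closed
// [start, stop] range used by the underlying file store.
func toClosed(start, stop int64) (first, last int64) {
	return start, stop - 1 // stop-1 is the last included timestamp
}
```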
cherry-pick from #21318
Co-authored-by: Sam Arnold <sarnold@influxdata.com>
(cherry picked from commit 7766672797)
* chore: fix formatting
Co-authored-by: Jonathan A. Sternberg <jonathan@influxdata.com>
The anti-entropy service will loop trying to copy an empty shard to a
data node missing that shard. This fix is one of two changes that
correctly create an empty shard on a new node. This fix will set the
LastModified date of an empty shard directory to the modification time
of that directory, instead of to the Unix epoch.
Fixes: https://github.com/influxdata/influxdb/issues/21273
This is a backport of #14262 to the 1.x storage engine.
This also ports the table tests that existed with the pre-beta version of the
storage engine to the one that is now used in the production version.
A few of the tests are skipped. These are portions of the storage engine
that have not been ported over. They should be unskipped when that
functionality is ported over.
Co-authored-by: Jonathan A. Sternberg <jonathan@influxdata.com>
* feat: add cursors and readers for window aggregates
* fix: backport fix + tests for race condition in flux tag cache
* test: port 2.x test for array_cursor
This fixes multi-measurement queries that go through the storage service
so that they correctly pick up all series that match the filter. Previously,
negative queries such as `!=`, `!~`, and predicates attempting to match
empty tags did not work correctly with the storage service when multiple
measurements or `OR` conditions were included.
This was because these predicates would be categorized as "multiple
measurements" and then it would attempt to use the field keys iterator
to find the fields for each measurement. The meta queries for these did
not correctly account for negative equality operators or empty tags when
finding appropriate measurements and those could not be changed because
it would cause a breaking change to influxql too.
This modifies the storage service to use new methods that correctly
account for the above situations rather than the field keys iterator.
Some queries that appeared to be single measurement queries also get
considered as multiple measurement queries. Any query with an `OR`
condition will be considered a multiple measurement query.
This bug did not apply to single measurement queries where one
measurement was selected and all of the logical operators were `AND`
values. This is because it used a different code path that correctly
handled these situations.
Backport of #19566.
(cherry picked from commit ceead88bd5)
Co-authored-by: Jonathan A. Sternberg <jonathan@influxdata.com>
* fix: Change from RewriteExpr to PartitionExpr
Also remove some dead code
* feat: WITH KEY implementation
* feat: query rewriting for WITH KEY in SHOW TAG KEYS
Meta queries (SHOW TAG VALUES, SHOW TAG KEYS, SHOW SERIES CARDINALITY, etc.) do not respect
the QueryTimeout config parameter. Meta queries should check the query context when possible
to allow cancellation and timeout. These checks will not be as frequent as in
regular queries, which use iterators, because meta queries return data in batches.
Add a context.Context to
(*Store).MeasurementNames()
(*Store).MeasurementsCardinality()
(*Store).SeriesCardinality()
(*Store).TagValues()
(*Store).TagKeys()
(*Store).SeriesSketches()
(*Store).MeasurementsSketches()
which is checked for timeout or cancellation
to limit the time spent in meta queries.
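A minimal sketch of the per-batch check involved (the helper is illustrative):

```go
import "context"

// checkCtx polls the query context between batches so a QueryTimeout
// or cancellation is honored without per-value overhead.
func checkCtx(ctx context.Context) error {
	select {
	case <-ctx.Done():
		return ctx.Err() // context.DeadlineExceeded or context.Canceled
	default:
		return nil
	}
}
```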
https://github.com/influxdata/influxdb/issues/20736
* feat(query): hyper log log counting in query engine
In addition to helping with normal queries, this can improve the 'SHOW CARDINALITY'
meta-queries:
```
time influx -database mydb -execute 'select count_hll(sum_hll(_seriesKey)) from big'
name: big
time count_hll
---- ---------
0    200767781
influx -database mydb -execute  0.06s user 0.12s system 0% cpu 8:49.99 total
```
Extending the context instead of fixing the API breaks type safety.
For tracking the number of points / values written, it is much clearer
to pass an explicit tracker.
When using queries like `select count(_seriesKey) from bigmeasurement`, we
should iterate over the tsi structures to serve the query instead of loading
all the series into memory up front.
Closes #20543
Frequent writes to fields.idx cause lock contention, and fields.idx is
recreated when a field or measurement is added in a
WritePointsWithContext() call.
This change eliminates locking during the actual file rewrite, limiting it to
the times when the MeasurementFieldSet is actually being read or written
in memory and when the new file is being renamed.
Tests verify correct behavior by checking that the fields.idx
file matches the in-memory copy after heavily parallel measurement addition.
Fixes https://github.com/influxdata/influxdb/issues/20500
When a SELECT INTO query generates an illegal value that cannot be inserted,
like +/- Inf, it should return an error, rather than failing silently.
This adds a boolean parameter to the [data] section of influxdb.conf:
* strict-error-handling
When false (the default), the old behavior is preserved. When true,
unsupported values will return an error from SELECT INTO queries.
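For illustration, enabling the new parameter in influxdb.conf would look like this (assuming the standard TOML syntax of that file):

```toml
[data]
  # When true, SELECT INTO queries return an error on unsupported
  # values such as +/- Inf instead of failing silently.
  strict-error-handling = true
```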
Fixes https://github.com/influxdata/influxdb/issues/20426
Loop with backoff in (*Engine).CreateSnapshot() to retry
(*Engine).WriteSnapshot() up to 3 times if
ErrSnapshotInProgress is returned. Then continue
on no error, or on ErrSnapshotInProgress if skipCacheOk is
true.
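A minimal sketch of the retry loop; the helper name, backoff delays, and stand-in error value are illustrative, not the exact engine code:

```go
import (
	"errors"
	"time"
)

// errSnapshotInProgress stands in for the engine's error value.
var errSnapshotInProgress = errors.New("snapshot in progress")

// writeSnapshotWithRetry retries the snapshot up to 3 times, backing
// off between attempts. If skipCacheOk is true, a final
// errSnapshotInProgress is tolerated so the caller can proceed.
func writeSnapshotWithRetry(write func() error, skipCacheOk bool) error {
	backoff := 100 * time.Millisecond // illustrative starting delay
	var err error
	for i := 0; i < 3; i++ {
		if err = write(); !errors.Is(err, errSnapshotInProgress) {
			return err // nil, or an unrelated error
		}
		time.Sleep(backoff)
		backoff *= 2
	}
	if skipCacheOk {
		return nil // proceed without a cache snapshot
	}
	return err
}
```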
https://github.com/influxdata/plutonium/issues/3227
(cherry picked from commit dfa6aa8cea)
Test the skipCacheOk flag to tsdb.Shard.CreateSnapshot() and
tsdb.Engine.CreateSnapshot()
A value of true allows the backup to proceed even if a cache
snapshot cannot be taken.
https://github.com/influxdata/plutonium/issues/3227
This fix adds a skipCacheOk flag to
tsdb.Store.CreateShardSnapshot() and tsdb.Shard.CreateSnapshot()
to pass to tsdb.Engine.CreateSnapshot()
A value of true allows the backup to proceed even if a cache snapshot
cannot be taken.
This flag is set to true in tsm1.Engine.Backup(), the OSS backup code path
This flag is set to false in tsm1.Engine.Export()
https://github.com/influxdata/plutonium/issues/3227
When an InfluxDB database is very busy writing new points, the backup
process can fail because it cannot write a new snapshot.
The error is: `operation timed out with error: create snapshot: snapshot in progress`.
This happens because InfluxDB snapshots the cache almost continuously
due to the high number of points being ingested.
The fix for this was https://github.com/influxdata/influxdb/pull/16627
but it was for OSS only, and was not in the code path for backups
in clusters.
This fix adds a skipCacheOk flag to tsdb.Engine.CreateSnapshot().
A value of true allows the backup to proceed even if a cache snapshot
cannot be taken.
This flag is set to true in tsm1.Engine.Backup(), the OSS backup code path
and in tsdb.Shard.CreateSnapshot(), the cluster backup code path.
This flag is set to false in tsm1.Engine.Export()
https://github.com/influxdata/plutonium/issues/3227
This feature allows compaction to be disabled on a per-shard basis by
creating a file named do_not_compact in a shard's directory. When
disabled, a message is logged every 15 minutes with the reason for
compaction being disabled (the existence of the file). This makes it easy to
know if compaction has been disabled for any shards by searching the log
for "compaction disabled" or running "find path/to/data -type f -name
do_not_compact".
* Remove redundant type in slice/array declarations.
* Call t.Fatal() from test-functions, not non-test go-routines.
* Remove unnecessary empty value operator from ranges.
* Call defer .Close() methods only after checking for error on Open().
This patch protects an internal map for concurrent use.
The (*LogFile).Writes() method calls
(*LogFile).createMeasurementIfNotExists() which writes to a shared map.
(*LogFile).Writes() acquires a read-lock which leaves
createMeasurementIfNotExists() open to concurrent writes to its shared
map.
This commit adds the ExecEntries method to the *LogFile type so that we
can properly lock calls to (*LogFile).appendEntry() using defer.
(*LogFile).ExecEntries() mostly replaces the body of
(*LogFile).Writes() and incurs another function call, since ExecEntries()
can't be inlined. Below is the output of a build with "-m -m -m" gcflags:
./log_file.go:1076:6: cannot inline (*LogFile).ExecEntries: unhandled op DEFER
The performance impact of the additional function call should be
negligible and is outweighed by the safety and simplicity of using
defer.
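A minimal sketch of the pattern, with assumed lowercase names standing in for the real types; holding the write lock for the whole batch via defer keeps the append calls serialized on every exit path:

```go
import "sync"

type logEntry struct{ /* ... */ }

type logFile struct {
	mu      sync.RWMutex
	entries []logEntry
}

// execEntries appends a batch of entries under one write lock; defer
// guarantees the unlock on every exit path, at the cost of preventing
// inlining.
func (f *logFile) execEntries(entries []logEntry) {
	f.mu.Lock()
	defer f.mu.Unlock()
	for _, e := range entries {
		f.appendEntry(e)
	}
}

func (f *logFile) appendEntry(e logEntry) { f.entries = append(f.entries, e) }
```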
The original version of verifyVersion() reads into a byte slice,
manually ensures its byte order, then converts it to a type comparable
with Version and MagicNumber.
This patch hides those details by calling binary.Read() and reading
values into properly typed variables.
This adds a bit of overhead but this code isn't in the hot-path and this
patch greatly simplifies the code.
verifyVersion() originally accepted an io.ReadSeeker. It is only called
in one place, and that caller immediately calls Seek() after
verifyVersion(); therefore it is probably safe to call Seek() BEFORE
verifyVersion().
The benefit is that verifyVersion() is easier to test since we can pass
it a bytes.Buffer.
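A minimal sketch of the simplified approach; the constants are placeholders standing in for the package's real MagicNumber and Version, and any io.Reader (e.g., a bytes.Buffer in tests) will do:

```go
import (
	"encoding/binary"
	"fmt"
	"io"
)

// Placeholder constants standing in for the package's MagicNumber and
// Version values.
const (
	magicNumber uint32 = 0x16D116D1
	version     byte   = 1
)

// verifyVersion reads typed values with binary.Read instead of slicing
// bytes by hand and fixing the byte order manually.
func verifyVersion(r io.Reader) error {
	var m uint32
	if err := binary.Read(r, binary.BigEndian, &m); err != nil {
		return fmt.Errorf("read magic number: %v", err)
	}
	if m != magicNumber {
		return fmt.Errorf("invalid magic number %#08x", m)
	}
	var v byte
	if err := binary.Read(r, binary.BigEndian, &v); err != nil {
		return fmt.Errorf("read version: %v", err)
	}
	if v != version {
		return fmt.Errorf("unsupported version %d", v)
	}
	return nil
}
```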
This patch adds a test for verifyVersion() as well as a benchmark.
```
benchmark                  old ns/op  new ns/op  delta
BenchmarkVerifyVersion-8   73.5       123        +67.35%
```
Finally, this commit moves verifyVersion() from writer.go to reader.go
which is where it is actually used.
* feat(engine/tsm1): Add WritePointsWithContext()
Add WritePointsWithContext() and make WritePoints() a thin wrapper for
it.
The purpose is to add statistics context values that we'll use to
propagate the number of fields and points written back up the call
chain.
* feat(tsdb): Add WriteToShardWithContext()
When applied, this patch adds WriteToShardWithContext() and wraps it
with WriteToShard() to preserve the API.
The purpose of this addition is to propagate a context.Context value
to Shard.WritePointsWithContext().
* feat(tsdb/shard): Add WritePointsWithContext()
The purpose of adding WritePointsWithContext() is to propagate context
values down to engine code and to propagate statistics via context
values back up to callers.
This patch also adds values written statistics to the shard.
* feat(http): Gather values written stats
WritePointsWithContext() was added to propagate context values down to
the engine and communicate stats to the caller.
* refactor: Change MetricKey to ContextKey
This patch gives the type we're using for context keys a better name.
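A minimal, hypothetical sketch of the typed-key pattern this rename supports; the key and function names are illustrative, not the actual patch:

```go
import "context"

// contextKey is an unexported type, so keys defined here can never
// collide with context keys from other packages.
type contextKey string

const pointsWrittenKey contextKey = "points-written"

// withPointsTracker attaches an *int64 the engine increments as it writes.
func withPointsTracker(ctx context.Context, n *int64) context.Context {
	return context.WithValue(ctx, pointsWrittenKey, n)
}

// pointsTracker retrieves the tracker, if one was attached.
func pointsTracker(ctx context.Context) (*int64, bool) {
	n, ok := ctx.Value(pointsWrittenKey).(*int64)
	return n, ok
}
```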
When applied, this patch will:
* log snapshot directory removal errors
Prior to this patch, errors when removing temporary snapshot
directories happened silently.
This patch ensures that errors are logged when os.RemoveAll() fails.
* refactor tsm1: Declare error value in condition
Saves a line of code and limits the scope of an error value.
* refactor tsm1: Add MakeSnapshotLinks()
This commit adds (*FileStore).MakeSnapshotLinks(). The code in this
function was originally part of CreateSnapshot().
That code was hoisted out and into MakeSnapshotLinks() because there
are two points of failure that require cleanup -- we have to delete a
temporary directory on failure.
Placing the code in one function allows us to check its returned error
value and perform cleanup in only one place.
In short, we hoisted code out of CreateSnapshot() to simplify error
handling.
On error, we remove any directories we created.
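A rough sketch of the consolidated error handling, with assumed names rather than the actual FileStore code; all link creation happens inside one call so a single failure check can remove the temporary directory:

```go
import "os"

// makeLinksOrCleanup runs the hoisted link-creation step and removes
// the temporary snapshot directory if any part of it fails.
func makeLinksOrCleanup(tmpDir string, makeLinks func(dir string) error) error {
	if err := makeLinks(tmpDir); err != nil {
		os.RemoveAll(tmpDir) // single cleanup site for both failure points
		return err
	}
	return nil
}
```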
This commit changes the SeriesIDSet merge/union/intersect functions
to attach the underlying iterators as closers so that files can be
retained until the data is no longer in use. The roaring operations
can leave containers pointing at mmap data in the resulting bitmap
so we have to track underlying file usage until the data is finished
with.
This commit changes `DefaultSeriesIDSetCacheSize` to zero so that the
tag value cache is disabled by default. There is a rare known bug where
the cache can cause a segfault that crashes the process. The cache
is being disabled instead of removed as some users may still need the
cache for performance reasons.
This commit quiets staticcheck's warnings about "unnecessary use of
fmt.Sprintf" and "unnecessary use of fmt.Sprint".
Prior to this commit we were wrapping simple constant strings without
any formatting verbs with fmt.Sprintf().
* fix(tsdb): address staticcheck ST1006
This patch addresses staticcheck warning "receiver name should not be an
underscore, omit the name if it is unused (ST1006)" for 6 methods.
Before this commit, the to and from variables were being re-declared in
a block in such a way that the values were not being used.
This patch uses regular assignment so that the values are visible
outside of the block where they're set.
Closes: #18128
* fix: verify precision parameter in write requests
This change updates the HTTP endpoints that service v1 and v2 writes to
verify the values passed in the precision parameter.
* fix(tsm1): Fix temp directory search bug
The original code's intention is to scan a directory for the
subdirectory whose name has the highest value when converted to an integer.
So directories may be in the form:
0.tmp
1.tmp
2.tmp
30.tmp
...
100.tmp
The loop should scan the directory, strip the basename and extension
from each file name to leave just a number, then keep the highest number
it finds.
Before this patch, a bug caused the code to store the highest value
only when there was an error converting the numeric value into an
integer.
This patch primarily fixes that logic.
In addition, this patch will save an indent level by inverting logic in
two places:
Instead of checking whether a file is a directory and has a suffix of ".tmp",
it is probably better to test whether a file is NOT a directory OR does NOT
have an extension of ".tmp", and continue if so.
Also, instead of testing if len(ss) == 2, we can test if len(ss) != 2 and
continue if so.
Both of these save an indent level and keep our "happy path" to the
left.
Finally, this patch uses string concatenation instead of calling
fmt.Sprintf() to add periods to the "tmp" and "tsm" extensions.
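A minimal sketch of the corrected scan, with the early-continue style and string concatenation the patch describes; the function name is illustrative:

```go
import (
	"os"
	"strconv"
	"strings"
)

// maxTmpGeneration returns the highest N found among N.tmp directories.
func maxTmpGeneration(dir string) (int, error) {
	entries, err := os.ReadDir(dir)
	if err != nil {
		return 0, err
	}
	maxN := 0
	ext := "." + "tmp" // string concatenation instead of fmt.Sprintf
	for _, fi := range entries {
		if !fi.IsDir() || !strings.HasSuffix(fi.Name(), ext) {
			continue // keep the happy path to the left
		}
		ss := strings.Split(fi.Name(), ".")
		if len(ss) != 2 {
			continue
		}
		if n, err := strconv.Atoi(ss[0]); err == nil && n > maxN {
			maxN = n // store the highest value only on successful conversion
		}
	}
	return maxN, nil
}
```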
Co-authored-by: David Norton <dgnorton@gmail.com>
fixes #17440
While encoding or decoding corrupt data, the current behaviour is to `panic`.
This commit replaces the `panic` with an `error` to be propagated up to the calling iterator.
To avoid overwriting other errors, iterators now wrap a `TSMErrors` value which contains ALL the
encountered errors. TSMErrors itself implements `Error()`; the returned string contains all the
error messages, separated by a "," delimiter.
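A minimal sketch of what such a multi-error type can look like; the real TSMErrors definition may differ:

```go
import "strings"

// TSMErrors collects every error hit while iterating so that none is
// overwritten by a later one.
type TSMErrors []error

// Error joins all collected messages with a "," delimiter.
func (e TSMErrors) Error() string {
	msgs := make([]string, 0, len(e))
	for _, err := range e {
		msgs = append(msgs, err.Error())
	}
	return strings.Join(msgs, ",")
}
```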
We were sometimes seeing segfaults in Roaring bitmaps under very
high load with networked drives. This may reduce the risk of a segfault by
forcing marshalling to copy the data.
* fix: access tsi active log file with READ lock
The activeLogFile pointer may be altered by another goroutine, so the READ
lock is needed.
* Merge pull request #16384 from foobar/tsi-partition-lock
fix: access tsi active log file with READ lock
Co-authored-by: Tristan Su <sooqing@gmail.com>
Co-authored-by: David Norton <dgnorton@gmail.com>
When an InfluxDB database is very busy writing new points, the backup
process can fail because it cannot write a new snapshot.
The error is: `operation timed out with error: create snapshot: snapshot in progress`.
This happens because InfluxDB snapshots the cache almost continuously
due to the high number of points being ingested.
This PR skips snapshots if the `snapshotter` does not become available
after three attempts when a backup is requested.
The backup won't contain the data in the cache or WAL.
Signed-off-by: Gianluca Arbezzano <gianarb92@gmail.com>
Prior to this change, new series would be added to the series file
before checking the series cardinality limit. If the limit was exceeded,
the write was rejected even though the series had already been added to
the series file.
This commit prevents multiple blocks for the same series key from having
their values truncated when they are being read into an empty buffer.
The current cursor reader code has an optimisation that incorrectly
assumes the incoming array will be limited to 1,000 values (the maximum
block size), but arrays can contain values from multiple matching
blocks.
* fix(storage): skip TSM files with block read errors
When we find a bad TSM file during compaction, propagate the error up and move
the bad file aside. The engine will disregard the file so the next compaction
will not hit the same error.
This change adds a lock around digest creation so that it is safe for
concurrent calls. Prior to this change, calls from multiple goroutines
resulted in "Digest aborted, problem renaming tmp digest" errors.
Fixes #15859
This commit fixes a defect in the TSI index where a filter using the
negated equality operator would result in no matching series being
returned for series stored within the `IndexFile` portions of the index.
The root cause of this was due to missing legacy-handling code in the
index for this particular iterator.
This upgrades the flux version to v0.50.2.
The secret service, which is used for alerts, is not included. The
`to()` function is also still not included.
Fixes #10052
This commit fixes an issue where field keys would reappear in results
when querying previously dropped measurements.
The issue manifests itself when duplicates of a new series are inserted
into the `inmem` index. In this case, a map that tracks the number of
series belonging to a measurement was incorrectly incremented once for
each duplication of the series. Then, when it came time to drop the
measurement, the index assumed there were several series belonging to
the measurement left in the index (because the counter was higher than
it should be). The result of that was that the `fields.idx` file (which
stores a mapping between measurements and field keys) was not truncated
and rebuilt. This left old field keys in that file, which were then
returned in subsequent queries over all field keys.
The flux in influxdb has been upgraded to use v0.33.2. A lot of
interfaces for the storage engine were changed during this, so code had
to change to accommodate the new interfaces and remove the old ones.
Included in this commit is a patch file for the changes that were made.
A patch was generated for the following packages:
* `flux/stdlib/influxdata/influxdb`
* `storage/reads`
* `tsdb/cursors`
These are the three packages that are in common with version 2 of the
database and the first of these packages contains the specific
implementations that are used for version 1.
It is very possible that the next time we upgrade this, the patch will
not apply cleanly just like it wouldn't have applied cleanly to this
update. The patch is mostly meant to document exactly what changed
during the copy over to help ensure we don't forget things when adapting
the interfaces.
Add a patch file to hopefully make this easier in the future
StringArrayEncodeAll will panic if the total length of strings
contained in the src slice is > 0xffffffff. This change adds a unit
test to replicate the issue and an associated fix to return an error.
This also raises an issue that compactions will be unable to make
progress under the following condition:
* multiple string blocks are to be merged to a single block and
* the total length of all strings exceeds the maximum block size that
snappy will encode (0xffffffff)
The observable effect of this is errors in the logs indicating a
compaction failure.
Fixes #13687
This integrates the influxdb 1.x series to the latest version of Flux
and updates the code to use it. It also removes the dependency on
platform and copies the necessary code from storage into the 1.x series
so the dependency is unneeded.
The flux functions specific to 1.x have been moved to the same structure
that flux changed to with having a `stdlib` directory instead of a
`functions` directory. It also adds a `databases()` function that
returns the databases from the meta client.
Previously it was possible to set IDs on a `nil` entry which would
in turn cause a panic. If this panic was recovered by the server
then it would result in a mutex in the `inmem` index staying locked
indefinitely.
We're not allowed to access the s.epochs map without holding the
mutex against shard creation and deletion, so create a copy of
all of the epoch trackers we will need while we hold the mutex.
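A minimal sketch of the copy-under-lock pattern described here, with assumed types:

```go
import "sync"

type epochTracker struct{ /* ... */ }

type store struct {
	mu     sync.RWMutex
	epochs map[uint64]*epochTracker // guarded by mu
}

// epochsForShards snapshots the trackers we need while holding the
// mutex, so later use does not race with shard creation or deletion.
func (s *store) epochsForShards(ids []uint64) map[uint64]*epochTracker {
	s.mu.RLock()
	defer s.mu.RUnlock()
	out := make(map[uint64]*epochTracker, len(ids))
	for _, id := range ids {
		if t, ok := s.epochs[id]; ok {
			out[id] = t
		}
	}
	return out
}
```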
Scanner objects and iterators often need a ValuerEval. This
object is created, often with a function call, and has at
least one interface in it, so it allocates storage. Then it's
dropped again right away. The only part of it that might be
subject to change is usually a map. While the map's contents
change over time, the actual map doesn't change for the
lifetime of the object.
So, in both iterators and scanners, stash the ValuerEval
and continue reusing it. On a query returning a fair number
of data points, this produces a small (<5% in practice)
improvement in observed performance, visible as a significant
reduction in time spent in runtime (mallocgc, newobject,
etcetera).
The performance improvement isn't big, but it's reasonably
easy to evaluate it and establish that it's a safe change
to make.
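A minimal sketch of the reuse pattern using the influxql package's ValuerEval; the scanner shape here is an illustrative assumption, not the actual iterator code:

```go
import "github.com/influxdata/influxql"

// scanner stashes one ValuerEval for its lifetime instead of building
// a fresh one (and its interface allocations) per data point.
type scanner struct {
	fields influxql.MapValuer // contents change; the map itself does not
	valuer influxql.ValuerEval
}

func newScanner() *scanner {
	s := &scanner{fields: make(influxql.MapValuer)}
	s.valuer = influxql.ValuerEval{Valuer: s.fields}
	return s
}

// eval updates the reused map in place and evaluates expr with the
// stashed ValuerEval.
func (s *scanner) eval(expr influxql.Expr, name string, v interface{}) interface{} {
	s.fields[name] = v
	return s.valuer.Eval(expr)
}
```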
Signed-off-by: seebs <seebs@seebs.net>
In the case of caching TSI bitmaps belonging to immutable .tsi files,
the underlying bitset data can be mmapped. It is possible, though rare,
for this data to be unmapped (e.g., via a TSI compaction) but for the
cached bitmap to be subsequently read. This leads to a segfault.
This only happens when copy-on-write is set to true on the roaring
bitmap, because in that case only the internal pointers are cloned.
This change will reduce TSI cache performance by around 10%, which I
have deemed acceptable since it typically amounts to only a few microseconds.
This commit adds a config option to the tsdb Config allowing the size of
the bitset cached in the TSI index to be specified.
Setting the cache size to 0 will disable the cache.
This commit limits the number of files that can be compacted in
a single group when forcing a full compaction or when a shard
becomes cold. This is to prevent too many files being compacted
at the same time.