influxdb

Commit Graph

Author	SHA1	Message	Date
David Norton	b4fd65baf1	add digest logging	2018-06-15 16:55:59 -04:00
Jacob Marble	544636c815	TSM: Fix ShouldCompactCache without WAL	2018-06-13 17:37:17 -07:00
Jacob Marble	0dc5393441	tsm/cache: Remove unused function parameter	2018-06-13 15:22:37 -07:00
Jeff Wendling	e6aec771b0	fix(tsdb): attempt to work on docker on windows multiple users have attempted to run influxdb in a docker container with a windows host and a volume mounted from windows. that causes problems because it apparently uses samba/cifs which does not support fsync on directories. this patchset will, if it receives an EINVAL on directory fsync, as is what appears to happen on samba/cifs, then it will ignore it. this should help. fixes #9833. fixes #9630.	2018-06-01 14:57:18 -06:00
Jacob Marble	82551a70e7	Merge pull request #9921 from influxdata/jgm-escape buildtsi: Do not escape measurement names	2018-06-01 08:32:01 -07:00
Jacob Marble	9a7b652a1c	TSM: OpenLimiter must not be nil	2018-05-31 13:43:16 -07:00
Jacob Marble	44c5da060b	buildtsi: Do not escape measurement names When `influx_inspect buildtsi` is used to create a new `tsi1` index, spaces in measurement names are escaped, so measurement "a b" is changed to "a\ b". This change modifies `models.ParseKeyBytes()` and `models.ParseName()` to unescape measurement names. `models.ParseKeyBytes()` returns unescaped tag keys, so this seems like the natural place to unescape measurement names. Also followed `scanMeasurement()` to see what other code could be problematic, and this should be everything (the result of one other use of `scanMeasurement()` is later escaped). Removed `tsdb.MeasurementFromSeriesKey()`. These methods are exported, so checked for side effects in other InfluxData repositories.	2018-05-30 15:20:56 -07:00
Ben Johnson	cec2a2d988	Merge pull request #9918 from influxdata/bj-tsm-open-limiter TSM1 Open Limiter	2018-05-30 13:13:14 -06:00
Stuart Carnie	0253f6fe05	fix(tsdb/engine/tsm1): Fix panic when closing cursor multiple times Fixes #9909	2018-05-29 09:59:52 -07:00
Jacob Marble	bb313765e4	tsdb/tsm1: Clean up TSM filename format/parse	2018-05-29 09:57:48 -07:00
Ben Johnson	d3e3b05a49	Add tsm1 open limiter This commit restricts the number of TSM1 files that can be opened concurrently across the entire `tsdb.Store`. There is currently a limit for the number of shards that can be opened concurrently, however, this limit does not help when the number of CPU cores is higher than the number of shards. Because TSM1 files have a 2GB limit and there is no limit on the number of files per shard, extremely large shards (1TB+) can load 1,000s of files simultaneously.	2018-05-29 10:21:53 -06:00
Jeff Wendling	2b3cd8406f	Merge pull request #9892 from influxdata/jmw-tombstone-notifications tsdb: observe tombstone files as well	2018-05-24 09:56:09 -06:00
Stuart Carnie	e3d7095d14	fix(tsm1): Avoid searching index if key outside index bounds This improvement avoids performing a binary search on the index by first checking the key against the lower and upper bounds. Particularly useful for multiple, fully-compacted TSM files.	2018-05-23 13:29:49 -07:00
Jeff Wendling	ce565965a4	tsdb: avoid nil checks on the observer this avoids nil panics in the case that someone eventually forgets.	2018-05-23 13:15:41 -06:00
Jeff Wendling	8ad515b387	tsdb: remove the shard id again callers can always ensure that the observer set on the engine options is appropriate for that shard id. this simplifies the api and reduces the chance of bugs due to mixing up shard ids.	2018-05-23 13:04:54 -06:00
Jeff Wendling	15ae0bd98d	tsdb: observe tombstone files as well	2018-05-22 22:07:16 -06:00
Jeff Wendling	e62b1a02fb	Merge pull request #9879 from influxdata/jmw-add-shard-number-to-observer tsdb: add shard number to the observer	2018-05-21 20:00:47 -06:00
michaelyou	efc324681a	Typo	2018-05-20 22:37:22 +08:00
Jeff Wendling	eb4bf651e5	tsdb: add shard number to the observer an observer may want to know what shard the file is part of. this way, they don't have to rely on brittle file path parsing.	2018-05-18 18:15:44 -06:00
Jeff Wendling	6320316fd4	Merge pull request #9852 from influxdata/jmw-tsm-notifications file store: send notifications about new/deleted tsm files.	2018-05-18 11:29:34 -06:00
Jacob Marble	735aa2d7dc	Add SeriesIDSet() to Index interface	2018-05-18 09:22:43 -07:00
Jacob Marble	3f2ff742c0	Remove unused 'database' field	2018-05-18 09:22:43 -07:00
Jeff Wendling	27040d6f31	file store: send notifications about new/deleted tsm files. just adds some interface for hooks about when these files come and go. we do them before the action is taken so that if the hook has an error, it doesn't have any consistency problems.	2018-05-17 12:19:58 -06:00
Jacob Marble	c119f9a846	Close TSMReaders from FileStore.Close after releasing FileStore mutex	2018-05-17 09:12:36 -07:00
Jeff Wendling	3fc40dd4a0	Merge pull request #9824 from influxdata/jmw-optimize-radix radix: optimize for our use case	2018-05-16 13:43:30 -06:00
Jeff Wendling	1a8931af42	Merge pull request #9841 from influxdata/jmw-ensure-no-race-conditions tsm1: ensure some race conditions are impossible	2018-05-16 11:56:10 -06:00
Ben Johnson	8838d284a5	Merge pull request #9826 from influxdata/bj-tsm-filename TSM Filename Injection	2018-05-15 15:50:26 -06:00
Jacob Marble	7f8b7af61e	Cleanup index memory footprint counting code (#9828 ) * Fix IndexSet.DedupeInmemIndexes * Cleanup index memory footprint code	2018-05-15 11:25:19 -07:00
Jeff Wendling	7d2bb19b74	tsm1: ensure some race conditions are impossible The InUse call on TSMFiles is inherently racy in the presence of Ref calls outside of the file store mutex. In addition, we return some TSMFiles to callers without them being Ref'd which might allow them to be closed from underneath. While I believe it is the case that it would be impossible, as the only thing that gets a handle externally is compaction, and compaction enforces that only one handle exists at a time, and thus is only deleted once after the compaction is done with it, it's not very obvious or enforced. Instead, always return a TSMFile with a Ref call under the read lock, and require that no one else calls Ref. That way, it cannot transition to referenced if the InUse call returns false under the write lock. The CreateSnapshot method was racy in a number of ways in the presence of multiple calls or compactions: it did not take references to the TSMFiles, and the temporary directory it creates could have been shared with concurrent CreateSnapshot calls. In addition, the files slice could have been concurrently mutated during a compaction as well. Instead, under the write lock, make a local copy of the state for the compaction, including Ref calls (write locks are implicitly read locks). Then, there is no need for a lock at all afterward. Add some comments to explain these issues at the call sites of InUse, and document that the Files method that returns the slice unprotected is only for tests.	2018-05-14 19:45:42 -06:00
Ben Johnson	35a64dee99	Inject tsm file naming.	2018-05-14 10:46:38 -06:00
Jeff Wendling	cb9c3ee509	radix: optimize for our use case - reduce allocations by making leaf a value type with a bool - make longestPrefix inlineable and have no bounds checks - delete any code for functions we don't plan to use - operate on []byte and only copy when necessary - inline calls to sort.Search to avoid allocations and indirections - insert directly in the correct location for addEdge - reduce allocations during copying with a buffer helper results: name old time/op new time/op delta Tree_Insert-8 1.10ms ± 4% 0.73ms ± 4% -33.54% (p=0.000 n=10+10) Tree_InsertNew-8 3.18ms ± 2% 1.91ms ± 6% -39.90% (p=0.000 n=10+10) name old speed new speed delta Tree_Insert-8 9.12MB/s ± 4% 13.72MB/s ± 4% +50.46% (p=0.000 n=10+10) Tree_InsertNew-8 3.15MB/s ± 2% 5.24MB/s ± 6% +66.42% (p=0.000 n=10+10) name old alloc/op new alloc/op delta Tree_InsertNew-8 1.62MB ± 0% 1.60MB ± 0% -1.28% (p=0.000 n=10+9) name old allocs/op new allocs/op delta Tree_InsertNew-8 35.0k ± 0% 15.0k ± 0% -57.04% (p=0.000 n=10+10) MB/sec in this case is 1 byte per key inserted, so it's really millions of keys inserted per second.	2018-05-11 11:56:11 -06:00
Jacob Marble	0763d1789e	Get inmem index bytes without double-counting	2018-05-10 11:33:52 -07:00
Jason Wilder	de58584ce7	Merge pull request #9748 from influxdata/jw-series-type Prevent series type conflict	2018-05-10 07:05:45 -06:00
Jacob Marble	148341fb2a	tsdb/WAL: Better respect for WAL disabled	2018-05-08 15:04:33 -07:00
Jonathan A. Sternberg	6607c29a02	Merge pull request #9649 from influxdata/js-eval-functions-in-where Allow math functions to be used in the condition	2018-05-02 08:29:08 -05:00
Jason Wilder	aea9bf3464	Hide series type map behind feature flag The performance is not good enough to enable by default so this allows the functionality to be merged while performance is improved.	2018-05-02 06:50:35 -06:00
Jason Wilder	ec3f5c353c	Fix panic in FileStore.walkKeys If a TSM file is replaced while walkKeys is running, a panic could occur because the mmap has been unmapped.	2018-04-30 17:26:23 -06:00
Jason Wilder	2be2418b89	Add series type validation to Engine This is the start of per-series validation that occurs in the Engine write path. It uses an in-memory radix tree to reduce memory usage and is re-built on demand the first time a series is written.	2018-04-30 17:26:23 -06:00
Ben Johnson	f459d87325	Merge pull request #9785 from influxdata/rename-bad-tsm-file Rename & log corrupt tsm files on load	2018-04-30 15:37:45 -06:00
Jacob Marble	63b9c98187	fix tests by closing iterators and cursors	2018-04-30 13:46:03 -07:00
Jacob Marble	7de2dcd3d9	TSM: TSMReader.Close blocks until reads complete	2018-04-30 13:46:03 -07:00
Stuart Carnie	e0ae9c5a2d	tsm1: Replace goroutine `merge` with k-way merge Previously replaced WalkKeys implementation for a considerable improvement to startup time	2018-04-30 07:57:55 -07:00
Ben Johnson	108fa09439	Rename corrupt tsm files on load.	2018-04-27 14:27:44 -06:00
Jonathan A. Sternberg	1f9227e20c	Allow math functions to be used in the condition	2018-04-10 10:55:34 -05:00
Jason Wilder	97ecf62ffb	Return time range from delete predicate func This moves the time range to delete to be returned by the predicate func in DeleteSeriesRangeWithPredicate. It allows for a single delete to delete different ranges of times per series instead of a single range of time for all series.	2018-04-09 20:01:33 -06:00
Jonathan A. Sternberg	bf0eb140ec	Merge pull request #9686 from influxdata/js-tsm1-aggregate-benchmarks Adding additional aggregate benchmarks for tsm1	2018-04-09 11:08:57 -05:00
Jonathan A. Sternberg	117aac4b9e	Adding additional aggregate benchmarks for tsm1 This will help us address performance problems in the underlying tsm1 implementations of the aggregate iterators.	2018-04-09 10:37:33 -05:00
Jason Wilder	8cc2e68d3b	Fix panic in readTombstoneV4 The length check was backwards so if a series key was longer than 4096 bytes, it would cause a slice out of bounds panic.	2018-04-05 22:15:54 -07:00
Adam	72bceca888	Fix stream package to allow for renaming the file before writing it to the stream (#9684 ) * Fix stream package to allow for renaming the file before writing it to the stream * updated test to make sure that the final tsm file has more than one block	2018-04-05 16:24:29 -04:00
Mark Rushakoff	b3c2d9290f	Log error encountered when reading WAL files Inspired by #9657.	2018-03-30 09:40:58 -07:00

1 2 3 4 5 ...

1112 Commits (d52d9fb47a27cb764c9d40ae950f57d9c79f6863)