This allows encoders to be re-used and maintained in a pool to
avoid allocating new ones on every compaction and write of an encoded
block. The pool used is not a sync.Pool to ensure that the encoders
will not be garbage collected.
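As a sketch of the idea (names here are illustrative, not the actual code), a fixed-size channel works as such a pool: parked encoders stay strongly referenced, so unlike sync.Pool the GC never reclaims them between compactions:

```go
package pool

// Encoder is a stand-in for the TSM block encoders being pooled.
type Encoder struct{ buf []byte }

// Pool holds up to n encoders for reuse. Unlike sync.Pool, encoders
// parked in the channel are strongly referenced, so the garbage
// collector never reclaims them between compactions.
type Pool struct {
	free chan *Encoder
}

func New(n int) *Pool {
	return &Pool{free: make(chan *Encoder, n)}
}

// Get returns a pooled encoder, or allocates a new one if none are free.
func (p *Pool) Get() *Encoder {
	select {
	case e := <-p.free:
		return e
	default:
		return &Encoder{}
	}
}

// Put returns an encoder to the pool, dropping it if the pool is full.
func (p *Pool) Put(e *Encoder) {
	select {
	case p.free <- e:
	default:
	}
}
```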
When the planner runs, it needs to determine if any files have tombstones.
The code to determine if a tombstone existed involved calling os.Stat on
the .tombstone file. Since the planner runs very frequently when there are
many shards, this caused a lot of unnecessary system calls.
Instead, cache the results of the stat calls and only refresh them when we
haven't checked at least once or when new tombstone data has been written.
This also caches the results of the TSMReader.Stats call to avoid creating
garbage.
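A minimal sketch of the caching, with assumed field and method names:

```go
package tsm1

import "os"

// tombstoneStat caches whether the .tombstone file exists so the
// planner does not issue a syscall on every planning pass.
// (Sketch; field and method names are assumptions.)
type tombstoneStat struct {
	path    string
	checked bool // stat'd since the last tombstone write?
	exists  bool
}

// HasTombstone returns the cached answer when available and only falls
// back to os.Stat when the cache has been invalidated.
func (t *tombstoneStat) HasTombstone() bool {
	if t.checked {
		return t.exists
	}
	_, err := os.Stat(t.path)
	t.exists = err == nil
	t.checked = true
	return t.exists
}

// invalidate is called whenever new tombstone data is written, forcing
// the next HasTombstone call to re-stat the file.
func (t *tombstoneStat) invalidate() { t.checked = false }
```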
When deleting a shard, the shard is locked and then removed from the
index. Removal from the index can be slow if there are a lot of
series. During this time, the shard is still expected to exist by
the meta store and tsdb store, so stats collection, queries, and writes
could all run against this shard while it is locked. This can cause everything
to lock up until the unindexing completes and the shard can be unlocked.
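One way to avoid the stall, sketched below with stand-in types rather than the actual code, is to drop the shard from the store's map first so new operations can no longer reach it, and only then do the slow unindexing outside the locks:

```go
package tsdb

import "sync"

// Shard is a stand-in; UnloadIndex removes the shard's series from the
// in-memory index, the slow step when cardinality is high.
type Shard struct{}

func (sh *Shard) UnloadIndex() {}
func (sh *Shard) Close() error { return nil }

type Store struct {
	mu     sync.Mutex
	shards map[uint64]*Shard
}

// DeleteShard removes the shard from the store's map first, so stats
// collection, queries, and writes immediately see it as gone, and only
// then performs the slow unindexing without holding any lock those
// operations need.
func (s *Store) DeleteShard(id uint64) error {
	s.mu.Lock()
	sh, ok := s.shards[id]
	if !ok {
		s.mu.Unlock()
		return nil
	}
	delete(s.shards, id)
	s.mu.Unlock()

	sh.UnloadIndex()
	return sh.Close()
}
```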
Fixes #7226
Some files did not pass `go vet` with go 1.7. As a preliminary
step toward making go 1.7 work with this software, `go vet`
should pass.
Also updated the gogo/protobuf dependency which fixed the code generator
to work with go 1.7 too. Ran `go generate` on the entire repository to
ensure every file was up to date.
The full compaction planner could return a plan that only included
one generation. If this happened, a full compaction would run on that
generation producing just one generation again. The planner would then
repeat the plan.
This could happen if there were two generations that were both over
the max TSM file size and the second one happened to be in level 3 or
lower.
When this situation occurs, one CPU is pegged running a full compaction
continuously and the disks become very busy, essentially rewriting the
same files over and over again. This can eventually cause disk and CPU
saturation if it occurs with more than one shard.
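The guard is simple in principle; a sketch with assumed names:

```go
package tsm1

// Generation is a stand-in for a group of TSM files created together.
type Generation struct {
	Files []string
}

// planFull returns no plan when only one generation qualifies, since a
// full compaction over a single generation just reproduces it and the
// planner would otherwise repeat the same plan forever.
func planFull(generations []Generation) []Generation {
	if len(generations) <= 1 {
		return nil
	}
	return generations
}
```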
Fixes #7074
The logic for determining whether a series key was already in the
set of TSM series was too restrictive. It allowed only the first
field of a series to be added, leaving out all the remaining fields.
The behavior for querying tag values with an empty string was originally
fixed in #6283, but it also added a performance problem when the
cardinality of the tag was high. Since a call to `Union()` or `Reject()`
happened for every series key, N times for a cardinality of N, the
comparisons against a blank string were unnecessarily slow and caused
large memory allocations.
This optimizes these queries so it doesn't use those methods anymore.
Those methods are still useful and used when combining AND and OR
clauses, but they aren't useful when finding the series ids for a single
clause. These methods were unnecessary there because the series ids for
the tags were already unique and didn't need to be merged as a set.
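A sketch of the direct path (names hypothetical): walk the tag's value index once and append the matching series ids, with no intermediate sets to merge:

```go
package tsdb

// seriesIDsForTag sketches the optimized path for a single tag-value
// clause. The series ids stored under each tag value are already
// unique, so they can be appended directly instead of building per-key
// sets and merging them with Union()/Reject().
func seriesIDsForTag(valueIndex map[string][]uint64, value string, equal bool) []uint64 {
	var ids []uint64
	for v, series := range valueIndex {
		// For `tag = 'x'` keep matching values; for `tag != 'x'`
		// (including the empty string) keep the rest.
		if (v == value) == equal {
			ids = append(ids, series...)
		}
	}
	return ids
}
```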
Negative timestamps are now supported. We also now refuse two
nanosecond values at the edge of the minimum time window. One of them
we do not accept because we need MinInt64 to be used for some internal
comparisons in the TSM engine, and it was causing an underflow when we
subtracted one from the minimum time. The second is reserved so
we can have one minimum time that signifies the default minimum that
nobody can write to (so we can implicitly rewrite the timestamp on
aggregate queries) but still use the explicit timestamp if it is given
to us by the user. We aren't able to tell whether the user provided
the timestamp or it was implicit without those values being different.
If the default minimum time is used with an aggregate query, we rewrite
the time to be the epoch for backwards compatibility since we believe
that's more important than supporting that extra nanosecond.
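In code form, the reserved values look roughly like this (constant and function names are illustrative):

```go
package models

import (
	"fmt"
	"math"
)

// MinNanoTime is the smallest timestamp users may write. math.MinInt64
// is kept for internal comparisons in the TSM engine (subtracting one
// from it would underflow), and math.MinInt64+1 is the implicit "no
// lower bound" default rewritten by aggregate queries, so both are
// refused on write.
const MinNanoTime = int64(math.MinInt64) + 2

// CheckTime rejects the two reserved nanoseconds at the minimum edge.
func CheckTime(ns int64) error {
	if ns < MinNanoTime {
		return fmt.Errorf("timestamp %d is below the minimum allowed time", ns)
	}
	return nil
}
```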
This commit fixes the `MaxSelectSeriesN` limit which was broken by
the implementation of lazy iterators. The setting previously limited
the total number of series but the new implementation limits the
concurrent number of series being processed.
This commit limits queries to only process one shard at a time.
However, within a shard, multiple series can still be processed in
parallel. Shard iterators are lazily instantiated during query
execution to limit the amount of memory a given query uses.
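A minimal sketch of a lazy iterator wrapper (types are stand-ins for the engine's own):

```go
package query

// Point and Iterator are stand-ins for the engine's own types.
type Point struct {
	Time  int64
	Value float64
}

type Iterator interface {
	Next() (*Point, error)
	Close() error
}

// lazyIterator defers building the underlying shard iterator until the
// first Next call. Combined with processing shards one at a time, a
// query only holds resources for the shard it is currently reading.
type lazyIterator struct {
	create func() (Iterator, error)
	itr    Iterator
}

func (l *lazyIterator) Next() (*Point, error) {
	if l.itr == nil {
		itr, err := l.create()
		if err != nil {
			return nil, err
		}
		l.itr = itr
	}
	return l.itr.Next()
}

func (l *lazyIterator) Close() error {
	if l.itr == nil {
		return nil
	}
	return l.itr.Close()
}
```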
The path info only contained the file name, which caused tombstone
files not to be removed if there were queries running against
a file that was compacted.
This is now consistent with TSMReader.Path, which returns the
full path info.
If those files were left around, re-enabling compactions again could cause
future compactions to continuously fail. A restart of the
server would clean them up correctly though.
If there were multiple TSM files and a delete/drop was run,
we would write the deleted series to the tombstone file N
times, once for each file. This occurred because FileStore.WalkKeys walks
every key in every TSM file, which can return duplicate keys.
This issue caused tombstone files to be much larger than they should be
and also caused large memory usage during the delete.
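A sketch of the deduplication, assuming keys are visited in sorted order so duplicates from different TSM files are adjacent:

```go
package tsm1

import "bytes"

// walkUniqueKeys remembers the previous key and skips duplicates, so
// each deleted series is written to the tombstone file only once.
func walkUniqueKeys(keys [][]byte, fn func(key []byte) error) error {
	var prev []byte
	for _, key := range keys {
		if bytes.Equal(key, prev) {
			continue // same series seen in an earlier TSM file
		}
		prev = key
		if err := fn(key); err != nil {
			return err
		}
	}
	return nil
}
```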
This keeps memory bounded when reloading a TSM file's tombstones
so that the heap does not grow exceedingly fast and stay there
after the deletes are applied.
Tombstones were read fully into memory at startup, which could consume
a lot of RAM and OOM the process if there were a lot of deleted
series and many TSM files.
This now walks the tombstone file and iteratively applies each tombstone,
which uses significantly less RAM. This may be slightly slower in the
general case, but should scale better.
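A sketch of the streaming shape of the fix, with the entry decoding elided:

```go
package tsm1

import (
	"io"
	"os"
)

// Tombstone is a single deleted key and time range.
type Tombstone struct {
	Key      string
	Min, Max int64
}

// Tombstoner manages a TSM file's .tombstone file.
type Tombstoner struct{ path string }

// readTombstone decodes one entry from r; decoding details elided.
func readTombstone(r io.Reader) (Tombstone, error) { return Tombstone{}, io.EOF }

// Walk streams the tombstone file one entry at a time to fn, so memory
// use is bounded by a single entry no matter how many series were
// deleted.
func (t *Tombstoner) Walk(fn func(Tombstone) error) error {
	f, err := os.Open(t.path)
	if os.IsNotExist(err) {
		return nil // no tombstones for this TSM file
	} else if err != nil {
		return err
	}
	defer f.Close()

	for {
		ts, err := readTombstone(f)
		if err == io.EOF {
			return nil
		} else if err != nil {
			return err
		}
		if err := fn(ts); err != nil {
			return err
		}
	}
}
```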
The `SHOW MEASUREMENTS` and `SHOW TAG VALUES` statements cannot go
through the query engine and still get the speed they need. They also
only need access to
the database index and do not need access to specific shards. This
removes the query rewriting that was done to turn these two queries into
a select statement and reimplements them inside of the coordinator as an
interface on the TSDBStore.
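The added methods look roughly like this (signatures approximate, types sketched):

```go
package coordinator

// KeyValue is a single tag key/value pair.
type KeyValue struct{ Key, Value string }

// TagValues holds the tag values found for one measurement.
type TagValues struct {
	Measurement string
	Values      []KeyValue
}

// Expr stands in for the parsed influxql condition (WHERE clause).
type Expr interface{}

// TSDBStore lets the coordinator answer SHOW MEASUREMENTS and SHOW TAG
// VALUES from the database index without planning a SELECT or opening
// any shards.
type TSDBStore interface {
	// Measurements returns the measurement names in database matching cond.
	Measurements(database string, cond Expr) ([]string, error)
	// TagValues returns the tag values in database matching cond.
	TagValues(database string, cond Expr) ([]TagValues, error)
}
```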
Normally, compactions do not conflict on the files they are compacting.
If the full cold threshold is set very low, it can cause conflicts where
two compactions compact the same files. The full compaction was the
only place this could happen, as its planning is greedy.
To make this safer for concurrent execution, the planner tracks which
files are currently being compacted and prevents any new compactions from
starting if the file set overlaps.
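A sketch of the bookkeeping (names assumed):

```go
package tsm1

import (
	"fmt"
	"sync"
)

// inuse tracks files claimed by running compactions so that two plans
// can never overlap.
type inuse struct {
	mu    sync.Mutex
	files map[string]struct{}
}

func newInuse() *inuse {
	return &inuse{files: make(map[string]struct{})}
}

// acquire claims every file in a plan, or fails without claiming any
// if one of them is already being compacted.
func (u *inuse) acquire(files []string) error {
	u.mu.Lock()
	defer u.mu.Unlock()
	for _, f := range files {
		if _, ok := u.files[f]; ok {
			return fmt.Errorf("%s is already being compacted", f)
		}
	}
	for _, f := range files {
		u.files[f] = struct{}{}
	}
	return nil
}

// release frees the files once the compaction completes or fails.
func (u *inuse) release(files []string) {
	u.mu.Lock()
	defer u.mu.Unlock()
	for _, f := range files {
		delete(u.files, f)
	}
}
```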
Fixes #6595
If a query is interrupted via kill query, the tsm files managed
by the file store purger would never get removed because
KeyCursor.Close was never called.
KeyCursor.Close should always be called now.
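The pattern the fix enforces, sketched with stand-in types:

```go
package tsm1

// KeyCursor is a stand-in for the file store's cursor; Close releases
// its file references so the purger can remove replaced TSM files.
type KeyCursor struct{}

func (c *KeyCursor) Close() {}

// readKey defers Close immediately after creating the cursor, so the
// references are released even when the query is killed mid-read.
func readKey(newCursor func() *KeyCursor) {
	cur := newCursor()
	defer cur.Close()

	// ... read blocks from cur until done or interrupted ...
}
```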
If a query was running against a file being compacted, we would close
the file and the query would end wherever it had read up to. This could result
in queries that randomly lost data, but running them again showed the
full results.
We now use a reference counting approach, moving the in-use files out
of the way in the filestore and allowing the queries to complete against
the old tsm files. The new files are installed and new queries will
use them.
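A sketch of the reference counting (names assumed; the real reader keeps similar state):

```go
package tsm1

import "sync/atomic"

// TSMFile is a stand-in for the reader of a single TSM file.
type TSMFile struct {
	refs int64
}

// Ref and Unref are called as queries start and finish reading the file.
func (f *TSMFile) Ref()   { atomic.AddInt64(&f.refs, 1) }
func (f *TSMFile) Unref() { atomic.AddInt64(&f.refs, -1) }

// InUse reports whether a query still holds a reference, in which case
// the purger leaves the replaced file on disk until the count reaches
// zero.
func (f *TSMFile) InUse() bool { return atomic.LoadInt64(&f.refs) > 0 }
```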
Fixes #5501
benchmark                         old ns/op     new ns/op     delta
BenchmarkBooleanDecoder_2048-4    9954          7846          -21.18%

benchmark                         old allocs    new allocs    delta
BenchmarkBooleanDecoder_2048-4    0             0             +0.00%

benchmark                         old bytes     new bytes     delta
BenchmarkBooleanDecoder_2048-4    0             0             +0.00%
There was a race where the same series would get added to the in-memory
index for a measurement more than once. This would result in the same
series being returned more than once during queries causing duplicate
results. The issue was that we checked for the series under the read
lock, but did not check again under the write lock, leaving a small
window in which the series could be added by another goroutine.
We now check for the series under the write lock.
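The fix is the classic double-checked pattern; a sketch with stand-in types:

```go
package tsdb

import "sync"

// measurement is a stand-in for the in-memory index entry.
type measurement struct {
	mu     sync.RWMutex
	series map[string]struct{}
}

func newMeasurement() *measurement {
	return &measurement{series: make(map[string]struct{})}
}

// AddSeries checks under the read lock for the common case, then checks
// again under the write lock to close the window where another
// goroutine adds the same series between the two acquisitions.
func (m *measurement) AddSeries(key string) {
	m.mu.RLock()
	_, ok := m.series[key]
	m.mu.RUnlock()
	if ok {
		return
	}

	m.mu.Lock()
	defer m.mu.Unlock()
	if _, ok := m.series[key]; ok {
		return // lost the race; the series is already indexed
	}
	m.series[key] = struct{}{}
}
```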
Fixes #6946
A slower disk can cause excessive allocations to occur when
writing to the WAL because the slower encoding and compression occurs
before taking the write lock. The encoding/compression grabs a large
byte slice from a pool and ultimately waits until it can acquire the
write lock.
This adds a throttle to limit how many in-flight WAL writes can be queued
up, to prevent OOMing the process with slower disks and heavy writes.
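A sketch of the throttle using a buffered channel as a semaphore (names and the limit are illustrative):

```go
package tsm1

// WAL is a stand-in; limiter caps how many writes may be encoding and
// compressing (and therefore holding large pooled buffers) at once.
type WAL struct {
	limiter chan struct{}
}

func NewWAL(concurrency int) *WAL {
	return &WAL{limiter: make(chan struct{}, concurrency)}
}

func (w *WAL) WriteEntry(entry []byte) error {
	w.limiter <- struct{}{}        // block while too many writes are in flight
	defer func() { <-w.limiter }() // release the slot when done

	// Encode and compress entry, then append it under the write lock
	// (elided in this sketch).
	return nil
}
```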
If a delete is issued while a compaction is running, a newly
deleted series could re-appear after the compaction completed. This
could occur if the compaction had already written the blocks for series
that were just deleted. When the compaction completed, the newly
written tombstone files would be deleted, essentially undeleting the
series.