influxdb

Commit Graph

Author	SHA1	Message	Date
Jason Wilder	3c130cd39c	Expose TSMWriter.Flush Allows flushing the writer so we don't always need to close and re-open the file handle.	2017-04-28 14:00:50 -06:00
Jason Wilder	141f0d71cd	Update index when import files	2017-04-28 14:00:45 -06:00
Jason Wilder	a76146e34a	Add Store.Import capability This allows the contents of a backup to be imported into a shard without requiring the whole shard to be replaced.	2017-04-28 13:30:46 -06:00
Jason Wilder	3839fe34ea	Remove FileStore.Add/Remove Can use Replace which handles files in-use and stats correctly.	2017-04-28 13:20:55 -06:00
Jason Wilder	137d0c0d09	Rename WAL.WritePoints to WAL.WriteMulti To match Cache.WriteMulti	2017-04-28 13:20:55 -06:00
Jason Wilder	28422f2fec	Use consistent receiver var name for Value types	2017-04-28 13:20:55 -06:00
Jason Wilder	1bc4936336	Export Reader.ReadBytes	2017-04-28 13:20:55 -06:00
Stuart Carnie	b2d2976466	update reason messages	2017-04-28 11:21:57 -07:00
Stuart Carnie	8097e817f6	prefix partial write errors with `partial write:` NOTE: parser errors (via http API) are also transformed into PartialWriteError	2017-04-28 11:00:14 -07:00
Ben Johnson	aa64c908d0	Merge pull request #8314 from benbjohnson/tsi-doc Add TSI documentation	2017-04-24 10:58:31 -06:00
Ben Johnson	ba7108f94e	Add TSI documentation.	2017-04-21 14:45:03 -06:00
Jason Wilder	d88604f6f2	Move repetive loop checks outside of values loop	2017-04-20 13:45:04 -06:00
Jason Wilder	888689f5d3	Move values loop under type switch All the values read must be of the same type so repeatedly using the type switch is confusing and less efficiient.	2017-04-20 13:39:49 -06:00
Jason Wilder	b0988511bf	Use fixed size array instead of slice	2017-04-20 13:38:33 -06:00
Jason Wilder	da6bdfdda8	Use bufio.Reader when reading wal segments Reduces disk IO due to small reads.	2017-04-20 13:33:42 -06:00
Jason Wilder	8e9cbd7ffc	Simplify WALSegmentReader.UnmarshalBinary There were two loops over nvals which created some extra allocation which coudl be replaced with a simplet slice capacity and append.	2017-04-20 13:33:42 -06:00
Jason Wilder	02b663b651	Fix lock contention in Index.CreateSeriesListIfNotExists There was contention on the write lock which only needs to be acquired when checking to see if the log file should be rolled over.	2017-04-20 12:28:42 -06:00
Jason Wilder	40ec85aacd	Fix lock contention in LogFile.SeriesWithBuffer Under high write load, the check for each series was done sequentially which caused a lot of CPU time to acquire/release the RLock on LogFile. This switches the code to check multiple series at once under an RLock similar to the chang for inmem.	2017-04-20 12:28:42 -06:00
Jason Wilder	0e715b5b74	Reduce lock contention on MeasurementFields	2017-04-20 12:28:42 -06:00
Jason Wilder	ef65ee77f4	Switch WAL byte pools to sync/pool The current bytes.Pool will hold onto byte slices indefinitely. Large writes can cause the pool to hold onto very large buffers over time. Testing w/ sync/pool seems to perform similarly now so using a sync/pool will allow these buffers to be GC'd when necessary.	2017-04-20 12:28:42 -06:00
Jason Wilder	d155d37ca8	Reduce TSM write buffer When many TSM files are being compacted, the buffers can add up fairly quickly.	2017-04-20 12:28:42 -06:00
Jason Wilder	3c2825a851	Reduce lock thrashing when checking series The inmem index would call CreateSeriesIfNotExist for each series which takes and releases and RLock to see if a series exists. Under high write load, the lock shows up in profiles quite a bit. This adds a filtering step that obtains a single RLock and checks all the series and returns the non-existent series to contine though the slow path.	2017-04-20 12:28:41 -06:00
Jason Wilder	d7c5dd0a3e	Reduce wal sync goroutine churn Under high write load, the sync goroutine would startup, and end very frequently. Starting a new goroutine so frequently adds a small amount of latency which causes writes to take long and sometimes timeout. This changes the goroutine to loop until there are no more waiters which reduce the churn and latency.	2017-04-20 12:28:34 -06:00
Jason Wilder	aa9925621b	Fix deadlock in wal If the sync waiters channel was full, it would block sending to the channel while holding a the wal write lock. The sync goroutine would then be stuck acquiring the write lock and could not drain the channel. This increases the buffer to 1024 which would require a very high write load to fill as well as retuns and error if the channel is full to prevent the blocking.	2017-04-19 11:33:13 -06:00
Jason Wilder	a19ce9c10f	Reduce index lock contention Series and Measurment have their own locks and we do not need to hold locks on the index while using those types.	2017-04-18 16:32:33 -06:00
Jason Wilder	883b3dcbbb	Reduce lock content in AssignShard The lock shows up under write load. It only needs to be assigned once so a read lock eliminates the contention.	2017-04-18 16:32:33 -06:00
Jason Wilder	5c51ae7319	Merge branch '1.2' into jw-merge-123	2017-04-14 14:36:54 -06:00
Jason Wilder	ff1270dfeb	Fix dropping fields created data corruption The Point is intended to be immutable after being parsed since it is shared by several goroutines. When dropping a field (e.g. time), corrupted data can result if one goroutine is delete the field while another is marshaling the underlying byte slices. To avoid this, the shard will just skip invalid fields and series instead of trying to mutate them by deleting them.	2017-04-07 12:58:42 -06:00
Jason Wilder	1de99cd219	Merge pull request #8268 from influxdata/jw-dedup-measurements Ensure MeasurementNames deduplicates measurements across shards	2017-04-06 13:10:56 -06:00
Jason Wilder	927acb5ab9	Ensure MeasurementNames deduplicates measurements across shards	2017-04-06 12:17:29 -06:00
Jason Wilder	cf100647e0	Fix deadlock in Measurement.SeriesIDsAllOrByExpr SeriesIDsAllOrByExpr took a RLock and ended up calling SeriesIDs which can take a Lock causing a deadlock.	2017-04-05 16:22:45 -06:00
Ben Johnson	9c97cd8601	Merge remote-tracking branch 'upstream/master' into tsi	2017-04-04 12:46:09 -06:00
Ben Johnson	0d74497abe	Reset rhh map elements to reuse allocations.	2017-04-04 11:57:37 -06:00
Ben Johnson	6ff27c95e5	Fix tsi assertions.	2017-04-04 11:29:21 -06:00
Ben Johnson	dbc10559c4	Merge pull request #8247 from benbjohnson/tsi-series-block-partitioning TSI Series Block Partitioning	2017-04-04 11:15:10 -06:00
Jason Wilder	5fa8073fc2	Merge branch '1.2' into jw-merge-123	2017-04-04 11:12:06 -06:00
Jason Wilder	84cbee227a	Fix file store not close all TSM files Regression added via #8192	2017-04-04 10:58:51 -06:00
Ben Johnson	95d4016ff2	Merge branch 'tsi' of https://github.com/influxdata/influxdb into tsi-series-block-partitioning	2017-04-04 10:14:03 -06:00
Jason Wilder	ec7eea2a0f	Skip tests on appveyor that we skip w/ -race	2017-04-04 09:47:59 -06:00
Ben Johnson	bf49b176f5	Partition tsi1 series index.	2017-04-04 09:46:04 -06:00
Jason Wilder	793635dbd7	Skip TSI cardinality tests on appveyor	2017-04-04 09:19:43 -06:00
Jason Wilder	4f850b5cff	Skip TestCache_Deduplicate_Concurrent on windows	2017-04-04 08:48:55 -06:00
Jason Wilder	7ac3c9a26f	Remove unused cardinality func	2017-04-03 11:24:55 -06:00
Jason Wilder	fcdc3c5c21	Remove commented out code in meta	2017-04-03 11:22:59 -06:00
Jason Wilder	8da84e6144	Merge branch 'master' into tsi	2017-04-03 11:21:02 -06:00
Jason Wilder	68f73e64d1	Lazily sort Measurement.SeriesIDs Removing series while trying to maintain the sorted series list does not perform well when removing many series. This causes drop DB, RP, series, to be very slow in some cases. Instead, lazily create a sorted series list when first requested and invalidate it when dropping series.	2017-04-03 08:57:53 -06:00
Jason Wilder	32c4d43952	Speed up drop measurement This reworks drop measurement to use a sorted list of series keys instead of creating an intermediate map. It remove allocations and some extra garbage that is created during drop measurement.	2017-04-03 08:57:53 -06:00
Jason Wilder	a78da51b7c	Use buffered writer when writing tombstones When deleting many series, the many small writes flood the disks and consume a lot of CPU time.	2017-04-03 08:57:52 -06:00
Jason Wilder	6232d5e56d	Remove defer allocations in TSMReader	2017-04-03 08:57:52 -06:00
Jason Wilder	920c8396c6	Use sorted merge in FileStore.WalkKeys WalkKeys serially walked each TSM file and invoked fn for each key. Caller needed to handle duplicate calls to fn with the same key because the same key could exist in multiple TSM files. The serial execution was also slower. Since the series keys are already sorted, we can iterate over all files in parallel and skip duplicates using a sorted merge. This fixes the duplicate invocation issue as well as speeds up walking all keys. This can significant improve startup performance when many TSM files exists that may not have been fully compacted. This also has benefits for deletes (measurements/series) since duplicates are removed saving extra allocations and work. This may also allow for the optimize compaction to be removed provided startup times are fast enough.	2017-04-03 08:57:52 -06:00
Edd Robinson	5e342a2ddd	Ensure shared index removed on database drop When using the inmem index, if one drops a database, and then creates it again, the previous index object will be reused. This includes the previous cardinality estimation sketches, leading to inaccurate cardinality estimations.	2017-03-30 13:05:31 +01:00
Edd Robinson	ddf7f0fd7b	Remove uncalled method	2017-03-30 12:48:22 +01:00
Edd Robinson	fddaff2cc8	Merge master in	2017-03-29 18:00:28 +01:00
Edd Robinson	116230b427	Use varint for tag count	2017-03-29 16:31:13 +01:00
Edd Robinson	45f843fc91	Don't unassign shards when system shutting down	2017-03-29 11:57:38 +01:00
Ben Johnson	2edfb1c92d	Ignore series limit on database load.	2017-03-24 16:27:16 -06:00
Jason Wilder	ee03fbb164	Fix series tombstone sketch not updated when dropping measurment	2017-03-24 15:49:00 -06:00
Ben Johnson	d2b396bff5	Fix database series limit, remove shard series limit.	2017-03-24 13:16:00 -06:00
Ben Johnson	9fb8f1ec1d	Fix database and tag limits.	2017-03-24 09:48:10 -06:00
Jason Wilder	631681796d	Remove tsl file committed by mistake	2017-03-23 16:18:27 -06:00
Jason Wilder	7119ef8f29	Merge pull request #8193 from influxdata/jw-123-backports 1.2.3 backports	2017-03-23 13:31:35 -06:00
Jason Wilder	ca1919e5de	Use standard merge algorithm for merging values The previous version was very innefficient due to the benchmarks used to optimize it having a bug. This version always allocates a new slice, but is O(n).	2017-03-23 12:53:59 -06:00
Jason Wilder	ba2571903d	Fix broken Values.Merge benchmark Merge had the side effect of modifying the original values so the results are wrong because they always hit the fast path after the first run.	2017-03-23 12:53:50 -06:00
Jason Wilder	890ffb4ce8	Generate encode*Values funcs	2017-03-23 12:53:29 -06:00
Jason Wilder	ced953ae89	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-23 12:53:17 -06:00
Jason Wilder	a1c84ae6f3	Add block type for BlockIterator	2017-03-23 12:49:17 -06:00
Jason Wilder	2972a3f223	Remove MMAP derefencing code This code was added to address some slow startup issues. It is believed to be the cause of some segfault panic's that occur at query time when the underlying MMAP array has been unmapped. The current structure of code makes this change unnecessary now.	2017-03-23 12:46:23 -06:00
Jason Wilder	61f80db1b9	Skip cardinaltiy dups on circle race test	2017-03-22 15:20:38 -06:00
Jason Wilder	c443e639b0	Fix 32bit alignment issue in wal.sync	2017-03-22 11:21:29 -06:00
Jason Wilder	06306bbd97	Run store tests in parallel	2017-03-22 10:55:38 -06:00
Edd Robinson	1c4ecb12c1	Don't panic on nil engine	2017-03-22 10:07:29 -06:00
Ben Johnson	85c1ae4dd4	Remove sort.Slice	2017-03-21 12:33:05 -06:00
Ben Johnson	afe41f1c80	Fix tsm1/tsi1 broken tests.	2017-03-21 12:21:48 -06:00
Ben Johnson	1e9fa7bc2c	Fix 32-bit rhh implementation.	2017-03-21 11:44:13 -06:00
Jason Wilder	58c8736ebc	Merge pull request #8172 from influxdata/jw-dropped-points Fix series not getting created	2017-03-21 09:44:31 -06:00
Jason Wilder	92c722913d	Unlock index if dropping non-existent series	2017-03-21 09:19:44 -06:00
Ben Johnson	5cf41adcb8	Optimize tsi1 write path. Removes sorted series list in log, uses a buffer for HasSeries(), and prepends a length for series key encoding.	2017-03-21 08:44:35 -06:00
Edd Robinson	47448646ed	Remove DropSeries on index	2017-03-21 11:35:31 +00:00
Edd Robinson	f89de550ed	Significantly speed up DROP DATABASE	2017-03-21 11:35:31 +00:00
Jason Wilder	7f78c6ad8e	Fix series not getting created	2017-03-20 17:19:22 -06:00
Jason Wilder	8f7b251afd	Merge branch 'master' into jw-tsi	2017-03-20 17:17:26 -06:00
Jason Wilder	8177df2dab	Simplify Measurement.TagSets signature	2017-03-17 16:19:10 -06:00
Jason Wilder	2d5d899ac2	Allow queries to be interrupted during planning If a bad query is run, kill query and limits would not kick in until after it started executing. Some bad queries that involve high cardinality can cause the server to OOM just from planning which defeats the purpose of the max-select-series limit. This change primarily fixes max-select-series limit so that the query is killed earlier and has the side effect that kill query now can kill a query while it's being planned.	2017-03-17 16:00:54 -06:00
Jason Wilder	bc4aeefbed	Check max-series-limit in shard iterator creation The limit waited until all the iterators had been created which still allows problem queries to be planned. This allows the queries to be aborted much earlier in some cases.	2017-03-17 16:00:25 -06:00
Ben Johnson	5d67c424bf	Refactor tsi1 write locking.	2017-03-17 11:20:50 -06:00
Ben Johnson	70efc70abe	Reduce lock contention, fix rhh lookup.	2017-03-17 09:44:11 -06:00
Jason Wilder	27ae2929fc	Add wal-fsync-delay to Diagnostics	2017-03-15 16:31:03 -06:00
Jason Wilder	e9eb925170	Coalesce multiple WAL fsyncs Fsyncs to the WAL can cause higher IO with lots of small writes or slower disks. This reworks the previous wal fsyncing to remove the extra goroutine and remove the hard-coded 100ms delay. Writes to the wal still maintain the invariant that they do not return to the caller until the write is fsync'd. This also adds a new config options wal-fsync-delay (default 0s) which can be increased if a delay is desired. This is somewhat useful for system with slower disks, but the current default works well as is.	2017-03-15 16:31:03 -06:00
Jason Wilder	7bd1bd8ab3	Only calculate disk size if shard has changed Calling DiskSize can be expensive with many shards. Since the stats collection runs this every 10s by default, it can be expensive and wasteful to calculate the stats when nothing has changed. This avoids re-calculating the shard size unless something has chagned.	2017-03-15 16:29:57 -06:00
Ben Johnson	1807772388	Fix tsi tests.	2017-03-15 11:23:58 -06:00
Ben Johnson	ee2e046853	Merge remote-tracking branch 'upstream/tsi-log-compact' into tsi	2017-03-15 10:22:32 -06:00
Ben Johnson	cf7ba96377	Merge branch 'tsi-log-compact' into tsi	2017-03-15 10:18:40 -06:00
Ben Johnson	358b1e0b05	Merge remote-tracking branch 'upstream/master' into tsi	2017-03-15 10:13:32 -06:00
Jason Wilder	65464ea0d1	Merge pull request #8131 from influxdata/jw-values-merge Use standard merge algorithm when merging Values	2017-03-15 09:51:21 -06:00
Jason Wilder	a4cfeacedb	Use standard merge algorithm for merging values The previous version was very innefficient due to the benchmarks used to optimize it having a bug. This version always allocates a new slice, but is O(n).	2017-03-15 08:59:41 -06:00
Edd Robinson	7d997d508a	Fixes #8138	2017-03-15 12:50:22 +00:00
Edd Robinson	ddcea1c322	WHY YOU SMITE ME BEN. B. JOHNSON?	2017-03-15 12:50:03 +00:00
Jason Wilder	4d37c9dc9e	Fix broken Values.Merge benchmark Merge had the side effect of modifying the original values so the results are wrong because they always hit the fast path after the first run.	2017-03-14 14:20:24 -06:00
Mark Rushakoff	535cf597f1	Report subset of config values in SHOW DIAGNOSTICS This includes hand-selected config settings that are safe to expose and not expected to include any kind of secrets. Fixes #7821	2017-03-14 11:34:19 -07:00
Jason Wilder	ca9c67a877	Generate encode*Values funcs	2017-03-14 11:54:53 -06:00
Ben Johnson	d23f2971c3	Refactor TagBlockEncoder.	2017-03-10 10:08:16 -07:00
Jason Wilder	2f7d4995b4	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-09 16:27:07 -07:00
Jason Wilder	78b7815c49	Add block type for BlockIterator	2017-03-09 09:16:59 -07:00
Jason Wilder	b9e5375043	Merge branch '1.2' into jw-merge-12	2017-03-08 13:16:50 -07:00
Jason Wilder	394bca3aad	Validate field type when creating new fields	2017-03-06 16:13:17 -07:00
Jason Wilder	37187cbe6d	Delete series under fields lock Still seeing the panic that switching this logic around was supposed to fix. We now delete the bulk of data outside of the fields lock and then again, under the write lock, to ensure that the field mapping is accurate. We don't do the full delete under the lock because it can block writes and queries that require a read lock.	2017-03-06 14:19:55 -07:00
Jason Wilder	675d7c9d65	Merge branch '1.2' into jw-merge12	2017-03-06 11:09:05 -07:00
Jason Wilder	eab012ef61	Fix points missing after compaction If blocks containing overlapping ranges of time where partially recombined, it was possible for the some points to get dropped during compactions. This occurred because the window of time of the points we need to merge did not account for the partial blocks created from a prior merge. Fixes #8084	2017-03-06 10:17:11 -07:00
Jason Wilder	3c70abf061	Delete series before remove from field index There is a race where the field type can be deleted while a new type is written and during a query. When this happens, an iterator for the new type is created but old data make still exist in the cache for TSM files causing a panic.	2017-03-06 09:38:27 -07:00
Jason Wilder	29f8d8de76	Fix race in WALEntry.Encode and Value.Deduplicate Under high query load, a race exists in the cache and the WAL. Since writes currently hit the cache first, they are availble for query before they hit the WAL. If the WAL is writing and accessign the Value slice at the same time that a query is run that needs to dedup the same slice, a race occurs. To fix this, the cache now just copies the values instead of storing the slice passed in. Another way to fix this might be to have the writes go to the wal before the cache. I think the latter would be better, but it introduces some larger write path issues that we'd need to also address. e.g. if the cache was full, writes to the WAL would need to be rejected to avoid filling the disk. Copying the slice in the cache is simpler for now and does not appear to dramatically affect performance.	2017-03-06 09:38:22 -07:00
Ben Johnson	4c202eea09	Re-check field type under write lock.	2017-03-03 09:47:43 -07:00
Jonathan A. Sternberg	1081785cb4	Treat non-reserved measurement names with underscores as normal measurements A measurement name that begins with an underscore and does not conflict with one of the reserved measurement names will now be passed untouched to the underlying shards rather than being intercepted as an empty measurement. A user still shouldn't rely on measurements that begin with underscores to always be accessible, but this will prevent the most common use case from causing unexpected behavior since we will very rarely, if ever, add additional system sources.	2017-02-27 16:49:02 -06:00
Jason Wilder	a024003f2c	Merge branch '1.2' into jw-merge-12	2017-02-22 12:13:29 -07:00
Ben Johnson	78a9bb2527	Remove Tags.shouldCopy, replace with forceCopy on series creation. Previously, tags had a `shouldCopy` flag to indicate if those tags referenced an underlying buffer and should be copied to allow GC. Unfortunately, this prevented tags from being copied that were created and referenced the mmap which caused segfaults. This change removes the `shouldCopy` flag and replaces it with a `forceCopy` argument in `CreateSeriesIfNotExists()`. This allows the write path to indicate that tags must be cloned on insert.	2017-02-21 11:13:35 -07:00
Mark Rushakoff	601cbcd084	Merge branch '1.2' into mr-merge-12	2017-02-17 16:14:22 -08:00
Jonathan A. Sternberg	2fe48d6781	Rename zap import back to github.com/uber-go/zap They rebased a revision we were previously relying upon that allowed us to use the vanity name so we are reverting back to an older version with the old import path.	2017-02-17 17:17:22 -06:00
Ben Johnson	7ee0b9bb28	Rewrite SeriesBlockEncoder.	2017-02-16 09:39:51 -07:00
Ben Johnson	8e79ca5d75	Fix tag dereferencing panic. Clones series tags under lock during var ref iterator creation.	2017-02-15 17:56:47 -07:00
Ben Johnson	7e6f33ef48	TSI1 Index file compaction.	2017-02-15 08:44:22 -07:00
Ben Johnson	71d13fed6d	Immediate, parallel tsl compactions.	2017-02-15 08:44:21 -07:00
Ben Johnson	673143a0ad	Remove .tsl file.	2017-02-15 08:44:01 -07:00
Jonathan A. Sternberg	71f62d33e6	Map types correctly when using a regex and one of the measurements is empty	2017-02-13 18:14:29 -06:00
Edd Robinson	7749bc1c67	Store sorted series in logFile	2017-02-13 17:03:42 +00:00
Jason Wilder	4b6289ce58	Merge pull request #7942 from influxdata/jw-cache-partitions Reduce write timeouts	2017-02-10 10:07:08 -07:00
Edd Robinson	38eb6d5994	Don't load meta data for tsi	2017-02-09 18:04:23 +00:00
Edd Robinson	a6a2f9d5f0	Don't load meta data for tsi	2017-02-09 17:59:14 +00:00
Jason Wilder	2f74e3f3d5	Use simple8b.CountBytes to avoid allocations	2017-02-09 10:47:03 -07:00
Edd Robinson	2d59fb788c	Modify arguments to reduce allocations	2017-02-09 15:01:32 +00:00
Edd Robinson	c2bbc18e4b	Reduce repeated string allocations	2017-02-09 11:23:23 +00:00
Edd Robinson	21e821e5f9	Reduce memory footprint by pre-allocating	2017-02-08 18:49:03 +00:00
Edd Robinson	4fbba8234e	Add Size to models.Tags	2017-02-08 18:44:48 +00:00
Edd Robinson	a9d58cc879	Add log file benchmarks	2017-02-08 16:50:46 +00:00
Ben Johnson	d6fedcacd5	Refactor tsi.LogFile compaction.	2017-02-08 09:00:08 -07:00
Jason Wilder	1bc0f68490	Merge branch '1.2' into jw-merge-12	2017-02-07 12:48:36 -07:00
Ben Johnson	c4e9430de3	Recover after partial tsi log file write.	2017-02-07 10:02:31 -07:00
Jonathan A. Sternberg	e1fa48d0dd	Fix ORDER BY time DESC with ordering series keys The order of series keys is in ascending alphabetical order, not descending alphabetical order, when it is ordered by descending time. This fixes the ordering so points are returned in descending order. The emitter also had the conditions for choosing which iterator to use in the wrong direction (which only affects aggregates with `FILL(none)`).	2017-02-06 15:49:12 -06:00
Jonathan A. Sternberg	95831b3307	Fix LIMIT and OFFSET when they are used in a subquery This fixes LIMIT and OFFSET when they are used in a subquery where the grouping of the inner query is different than the grouping of the outer query. When organizing tag sets, the grouping of the outer query is used so the final result is in the correct order. But, unfortunately, the optimization incorrectly limited the number of points based on the grouping in the outer query rather than the grouping in the inner query. The ideal solution would be to use the outer grouping to further organize it by the grouping for the inner subquery, but that's more difficult to do at the moment. As an easier fix, the query engine now limits the output of each series. This may result in these types of queries being slower in some situations like this one: SELECT mean(value) FROM (SELECT value FROM cpu GROUP BY host LIMIT 1) This will be slower in a situation where the `cpu` measurement has a high cardinality and many different tags. This also fixes `last()` and `first()` when they are used in a subquery because those functions use `LIMIT 1` as an internal optimization.	2017-02-06 14:04:34 -06:00
Jason Wilder	93a9d01643	Increase default waiting WAL writes	2017-02-06 11:48:51 -07:00
Jason Wilder	38a649fc40	Batch multiple WAL fsyncs Every write to the WAL current runs and fsync before returning. When there are lot of concurrent writes, this can cause the WAL to bottleneck write throughput since fsyncs are very expensive. This changes the writeToLog to fsync on an interval to allow multiple fsyncs calls to be batched up into one. The writeToLog behavior is the same in that it won't return until an fsync has been performed.	2017-02-06 11:48:45 -07:00
Ben Johnson	d91e6eabac	Add max-values-per-tag to inmem index.	2017-02-06 11:14:13 -07:00
Edd Robinson	54193e1131	Ensure config contains valid index	2017-02-03 11:57:41 +00:00
Edd Robinson	908959a78a	Fix vet	2017-02-03 11:57:35 +00:00
Jason Wilder	2e95b4043c	Merge branch '1.2' into jw-merge-12	2017-02-02 16:40:36 -07:00
Ben Johnson	76235f1e00	Use original index type for existing shards.	2017-02-02 10:43:48 -07:00
Ben Johnson	c246f3d9b0	Use inmem index on existing shards.	2017-02-02 10:04:25 -07:00
Ben Johnson	faef0a99c9	Perform series tag iteration under lock. Adds a `tsdb.Series.ForEachTag()` function for safely iterating over a series' tags within the context of a lock. This preverts tags from being dereferenced during iteration which can cause a seg fault.	2017-02-01 16:25:53 -07:00
Edd Robinson	5a37d51f02	Refactor DropSeries	2017-02-01 22:07:44 +00:00
Ben Johnson	57f44d5f0c	Include index in snapshot.	2017-02-01 14:19:42 -07:00
Jason Wilder	54ab3a7a0a	Don't write lock file store when opening new files When replacing TSM files, the new files can be opened before the write lock is taken to reduce lock contention in this code path.	2017-02-01 11:11:26 -07:00
Jason Wilder	6eb46d2100	Remove unnecessary read lock on engine	2017-02-01 11:10:41 -07:00
Edd Robinson	0a9fc910f6	Fix potential race on inmem	2017-02-01 16:29:52 +00:00
Edd Robinson	ffced9b538	Move sketch merging into File	2017-02-01 15:33:30 +00:00
Edd Robinson	ca530d478a	Ensure we're testing both indexes	2017-02-01 14:56:35 +00:00
Edd Robinson	ab238251f4	Refactor measurement sketches	2017-02-01 14:56:35 +00:00
Edd Robinson	d8a73a1954	Go vet	2017-02-01 14:51:15 +00:00
Ben Johnson	05bbda2c0a	Fix measurement block sketches.	2017-02-01 07:40:00 -07:00
Jason Wilder	784a851742	Release cpu during compactions	2017-01-31 17:04:36 -07:00
Jason Wilder	278c1449d6	Increase number of cache partitions	2017-01-31 16:49:57 -07:00
Ben Johnson	86e908c384	Remove debug println().	2017-01-31 09:24:58 -07:00
Ben Johnson	f4a4d1e196	Fix series sketches.	2017-01-31 08:47:18 -07:00
Edd Robinson	ab94c1b743	Fixes #7882	2017-01-30 19:12:24 +00:00
Ben Johnson	42959ead1f	Fix CreateSeriesListIfNotExists() data race.	2017-01-30 10:07:07 -07:00
Edd Robinson	91ee34b111	Merge pull request #7837 from influxdata/er-tidy General tidy up and subtle bug fixes	2017-01-26 13:43:07 +00:00
Jason Wilder	2ae9406c1d	Reincorporate memory leak fix 7832 got lost during a rebase/merge. This reincorporate the fix for the in-memory index as well as tsi.	2017-01-24 20:13:48 -07:00
Jason Wilder	173003bc38	Fix go vet	2017-01-24 20:13:37 -07:00
Ben Johnson	047c21f4d9	Merge remote-tracking branch 'upstream/master' into tsi	2017-01-24 09:28:58 -07:00
Cory LaNou	d54a955068	allow partial writes on field conflicts	2017-01-23 11:54:46 -07:00
Cory LaNou	0103e44896	allow partial writes on field conflicts	2017-01-23 12:25:35 -06:00
Ben Johnson	c459d24a60	Test coverage.	2017-01-23 09:38:27 -07:00
Gunnar	3722fa383d	Merge pull request #7718 from influxdata/ga-drop-stats Add stats on dropped measurements and series; Fixes #7697	2017-01-20 15:54:06 -08:00
Edd Robinson	feb7a2842c	Use unbuffered error channels in tests	2017-01-17 10:53:15 -08:00
Edd Robinson	fb7388cdfc	Remove dead code from various pkgs	2017-01-17 09:47:34 -08:00
Edd Robinson	292b30b82b	Fix subtle bugs and remove dead code from tsdb	2017-01-17 09:47:34 -08:00
Edd Robinson	320c5981cb	Fixes racy locking on measurement	2017-01-17 09:44:56 -08:00
Edd Robinson	45324b3848	Fixes racy locking on measurement	2017-01-16 14:22:11 -08:00
Joe LeGasse	cd00085e9e	Adjust Tags cloning This change delays Tag cloning until a new series is found, and will only clone Tags acquired from `ParsePoints...` and not those referencing the mmap-ed files (TSM) that are created on startup.	2017-01-13 13:15:36 -05:00
Mark Rushakoff	cdbdd156f3	Fix memory leak of retained HTTP write payloads This leak seems to have been introduced in `8aa224b22d`, present in 1.1.0 and 1.1.1. When points were parsed from HTTP payloads, their tags and fields referred to subslices of the request body; if any tag set introduced a new series, then those tags then were stored in the in-memory series index objects, preventing the HTTP body from being garbage collected. If there were no new series in the payload, then the request body would be garbage collected as usual. Now, we clone the tags before we store them in the index. This is an imperfect fix because the Point still holds references to the original tags, and the Point's field iterator also refers to the payload buffer. However, the current write code path does not retain references to the Point or its fields; and this change will likely be obsoleted when TSI is introduced. This change likely fixes #7827, #7810, #7778, and perhaps others.	2017-01-12 16:16:54 -08:00
Joe LeGasse	2db0250b22	Add db/rp name validation This change adds some very basic name validation with the following plain-english description: names must be non-zero sequence of printable characters that do not contain slashes ('/' or '\') and are not equal to either "." or "..". The intent is that, since we currently just use database and retention policy names directly as path elements, these rules will hopefully leave us with names that should be at least close to valid directory names. Ideally, we would restrict names even further or not use them as path elements directly, but this should be a step towards the former without restricting names "too much"	2017-01-12 17:38:10 -05:00
Joe LeGasse	b19260fb26	Add some checks before removing directories Fixes #7822 This change first ensures that databases and retention policies exist before attempting to remove them from the Store. It also adds some checks in the `DeleteDatabase` and `DeleteRetentionPolicy` to ensure that maliciously named entries won't remove anything outside of the configured data directory.	2017-01-12 17:38:10 -05:00
Joe LeGasse	bf58d9ffb7	Update backup to use ioutil.ReadDir	2017-01-12 16:28:01 -05:00
Jason Wilder	11f264563a	Fix 32bit alignment	2017-01-12 12:01:49 -07:00
Jason Wilder	06a8fd6ca2	Simplifications and cleanup	2017-01-12 09:55:38 -07:00
Ben Johnson	f43b0f7ec9	Fix series & measurement deletion.	2017-01-12 09:29:40 -07:00
Edd Robinson	73ed864e1d	Add cache tests	2017-01-12 16:27:16 +00:00
Jason Wilder	1e56b5416b	Fix compactions sometimes getting stuck I ran into an issue where the cache snapshotting seemed to stop completely causing the cache to fill up and never recover. I believe this is due to the the Timer being reused incorrectly. Instead, use a Ticker that will fire more regularly and not require the resetting logic (which was wrong).	2017-01-11 17:57:40 -07:00
Jason Wilder	40b017f4a4	Fix Cache stats size collection The memory stats as well as the size of the cache were not accurate. There was also a problem where the cache size would be increased optimisitically, but if the cache size limit was hit, it would not be decreased. This would cause the cache size to grow without bounds with every failed write.	2017-01-11 17:54:51 -07:00
Jason Wilder	c433ff331f	Encode snapshots concurrently The CacheKeyIterator (used for snapshot compactions), iterated over each key and serially encoded the values for that key as the TSM file is written. With many series, this can be slow and will only use 1 CPU core even if more are available. This changes it so that the key space is split amongst a number of goroutines that start encoding all keys in parallel to improve throughput.	2017-01-11 17:54:27 -07:00
Jason Wilder	ae838ef323	Simplify Cache.Snapshot This simplifies the cache.Snapshot func to swap the hot cache to the snapshot cache instead of copy and appending entries. This reduces the amount of time the cache is write locked which should reduce cache contention for the read only code paths.	2017-01-11 11:12:02 -07:00
Jonathan A. Sternberg	3ba950b029	Fix for subqueries to use the parallel iterator correctly Also, fix the `Iterators.Merge(IteratorOptions)` function so it consults the `Ordered` attribute to determine which iterator it should use to merge the input iterators.	2017-01-11 10:47:18 -06:00
Ben Johnson	352817e8c4	Convert 32-bit offsets to 64-bit.	2017-01-11 08:58:10 -07:00
Jonathan A. Sternberg	b58d1778e2	Remove improper newlines from logging statements	2017-01-10 11:20:09 -06:00
Mark Rushakoff	a135906b43	Merge pull request #7747 from influxdata/mr-lint-cleanup Miscellaneous lint cleanup	2017-01-10 08:22:00 -08:00
Mark Rushakoff	3b3604e362	Fix race in (*tsm1.Cache).values Without this read lock, this race would happen during a concurrent snapshot compaction and query.	2017-01-09 14:48:28 -08:00
Jonathan A. Sternberg	4a559c4620	Merge pull request #7646 from influxdata/js-4619-subqueries Support subquery execution in the query language	2017-01-09 14:14:01 -06:00
Jason Wilder	eb4d311c0a	Add retry/backup when backing up a shard fails The backup command can fail if a snapshot is running which silently closes the connection. This causes the backup shard command to continue on as if nothing failed.	2017-01-09 11:28:48 -07:00
Ben Johnson	64c7715243	Rebase fixes.	2017-01-09 10:10:12 -07:00
Jason Wilder	194c5adfaf	Fix race on t.refs Read at 0x00c42018f620 by goroutine 58: github.com/influxdata/influxdb/tsdb/engine/tsm1.(TSMReader).Close() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/reader.go:330 +0x94 github.com/influxdata/influxdb/tsdb/engine/tsm1.(FileStore).Close() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/file_store.go:464 +0x123 Previous write at 0x00c42018f620 by goroutine 63: sync/atomic.AddInt64() /usr/local/go/src/runtime/race_amd64.s:276 +0xb github.com/influxdata/influxdb/tsdb/engine/tsm1.(TSMReader).Unref() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/reader.go:352 +0x43 github.com/influxdata/influxdb/tsdb/engine/tsm1.(KeyCursor).Close()	2017-01-07 12:39:45 -07:00
Jonathan A. Sternberg	d7c8c7ca4f	Support subquery execution in the query language This adds query syntax support for subqueries and adds support to the query engine to execute queries on subqueries. Subqueries act as a source for another query. It is the equivalent of writing the results of a query to a temporary database, executing a query on that temporary database, and then deleting the database (except this is all performed in-memory). The syntax is like this: SELECT sum(derivative) FROM (SELECT derivative(mean(value)) FROM cpu GROUP BY *) This will execute derivative and then sum the result of those derivatives. Another example: SELECT max(min) FROM (SELECT min(value) FROM cpu GROUP BY host) This would let you find the maximum minimum value of each host. There is complete freedom to mix subqueries with auxiliary fields. The only caveat is that the following two queries: SELECT mean(value) FROM cpu SELECT mean(value) FROM (SELECT value FROM cpu) Have different performance characteristics. The first will calculate `mean(value)` at the shard level and will be faster, especially when it comes to clustered setups. The second will process the mean at the top level and will not include that optimization.	2017-01-07 13:00:48 -06:00
Mark Rushakoff	153277c01d	Merge pull request #7786 from influxdata/mr-cache-decrease-size Use one atomic operation in (*Cache).decreaseSize	2017-01-06 10:17:01 -08:00
Ben Johnson	2b3cd415e2	Fixing rebase.	2017-01-06 09:52:16 -07:00

... 2 3 4 5 6 ...

1798 Commits (1dfdc303771273bbc9ba39f325e1f79cce296aa0)