influxdb

Commit Graph

Author	SHA1	Message	Date
Jason Wilder	65464ea0d1	Merge pull request #8131 from influxdata/jw-values-merge Use standard merge algorithm when merging Values	2017-03-15 09:51:21 -06:00
Jason Wilder	a4cfeacedb	Use standard merge algorithm for merging values The previous version was very innefficient due to the benchmarks used to optimize it having a bug. This version always allocates a new slice, but is O(n).	2017-03-15 08:59:41 -06:00
Jason Wilder	4d37c9dc9e	Fix broken Values.Merge benchmark Merge had the side effect of modifying the original values so the results are wrong because they always hit the fast path after the first run.	2017-03-14 14:20:24 -06:00
Jason Wilder	ca9c67a877	Generate encode*Values funcs	2017-03-14 11:54:53 -06:00
Jason Wilder	2f7d4995b4	Use typed values to avoid allocations This switches compactions to use type values (FloatValues) from the generic Values type. It avoids a bunch of allocations where each value much be converted from a specific type to an interface{}.	2017-03-09 16:27:07 -07:00
Jason Wilder	78b7815c49	Add block type for BlockIterator	2017-03-09 09:16:59 -07:00
Jason Wilder	b9e5375043	Merge branch '1.2' into jw-merge-12	2017-03-08 13:16:50 -07:00
Jason Wilder	37187cbe6d	Delete series under fields lock Still seeing the panic that switching this logic around was supposed to fix. We now delete the bulk of data outside of the fields lock and then again, under the write lock, to ensure that the field mapping is accurate. We don't do the full delete under the lock because it can block writes and queries that require a read lock.	2017-03-06 14:19:55 -07:00
Jason Wilder	675d7c9d65	Merge branch '1.2' into jw-merge12	2017-03-06 11:09:05 -07:00
Jason Wilder	eab012ef61	Fix points missing after compaction If blocks containing overlapping ranges of time where partially recombined, it was possible for the some points to get dropped during compactions. This occurred because the window of time of the points we need to merge did not account for the partial blocks created from a prior merge. Fixes #8084	2017-03-06 10:17:11 -07:00
Jason Wilder	3c70abf061	Delete series before remove from field index There is a race where the field type can be deleted while a new type is written and during a query. When this happens, an iterator for the new type is created but old data make still exist in the cache for TSM files causing a panic.	2017-03-06 09:38:27 -07:00
Jason Wilder	29f8d8de76	Fix race in WALEntry.Encode and Value.Deduplicate Under high query load, a race exists in the cache and the WAL. Since writes currently hit the cache first, they are availble for query before they hit the WAL. If the WAL is writing and accessign the Value slice at the same time that a query is run that needs to dedup the same slice, a race occurs. To fix this, the cache now just copies the values instead of storing the slice passed in. Another way to fix this might be to have the writes go to the wal before the cache. I think the latter would be better, but it introduces some larger write path issues that we'd need to also address. e.g. if the cache was full, writes to the WAL would need to be rejected to avoid filling the disk. Copying the slice in the cache is simpler for now and does not appear to dramatically affect performance.	2017-03-06 09:38:22 -07:00
Jason Wilder	a024003f2c	Merge branch '1.2' into jw-merge-12	2017-02-22 12:13:29 -07:00
Ben Johnson	78a9bb2527	Remove Tags.shouldCopy, replace with forceCopy on series creation. Previously, tags had a `shouldCopy` flag to indicate if those tags referenced an underlying buffer and should be copied to allow GC. Unfortunately, this prevented tags from being copied that were created and referenced the mmap which caused segfaults. This change removes the `shouldCopy` flag and replaces it with a `forceCopy` argument in `CreateSeriesIfNotExists()`. This allows the write path to indicate that tags must be cloned on insert.	2017-02-21 11:13:35 -07:00
Mark Rushakoff	601cbcd084	Merge branch '1.2' into mr-merge-12	2017-02-17 16:14:22 -08:00
Jonathan A. Sternberg	2fe48d6781	Rename zap import back to github.com/uber-go/zap They rebased a revision we were previously relying upon that allowed us to use the vanity name so we are reverting back to an older version with the old import path.	2017-02-17 17:17:22 -06:00
Jason Wilder	4b6289ce58	Merge pull request #7942 from influxdata/jw-cache-partitions Reduce write timeouts	2017-02-10 10:07:08 -07:00
Jason Wilder	2f74e3f3d5	Use simple8b.CountBytes to avoid allocations	2017-02-09 10:47:03 -07:00
Jason Wilder	1bc0f68490	Merge branch '1.2' into jw-merge-12	2017-02-07 12:48:36 -07:00
Jonathan A. Sternberg	e1fa48d0dd	Fix ORDER BY time DESC with ordering series keys The order of series keys is in ascending alphabetical order, not descending alphabetical order, when it is ordered by descending time. This fixes the ordering so points are returned in descending order. The emitter also had the conditions for choosing which iterator to use in the wrong direction (which only affects aggregates with `FILL(none)`).	2017-02-06 15:49:12 -06:00
Jonathan A. Sternberg	95831b3307	Fix LIMIT and OFFSET when they are used in a subquery This fixes LIMIT and OFFSET when they are used in a subquery where the grouping of the inner query is different than the grouping of the outer query. When organizing tag sets, the grouping of the outer query is used so the final result is in the correct order. But, unfortunately, the optimization incorrectly limited the number of points based on the grouping in the outer query rather than the grouping in the inner query. The ideal solution would be to use the outer grouping to further organize it by the grouping for the inner subquery, but that's more difficult to do at the moment. As an easier fix, the query engine now limits the output of each series. This may result in these types of queries being slower in some situations like this one: SELECT mean(value) FROM (SELECT value FROM cpu GROUP BY host LIMIT 1) This will be slower in a situation where the `cpu` measurement has a high cardinality and many different tags. This also fixes `last()` and `first()` when they are used in a subquery because those functions use `LIMIT 1` as an internal optimization.	2017-02-06 14:04:34 -06:00
Jason Wilder	93a9d01643	Increase default waiting WAL writes	2017-02-06 11:48:51 -07:00
Jason Wilder	38a649fc40	Batch multiple WAL fsyncs Every write to the WAL current runs and fsync before returning. When there are lot of concurrent writes, this can cause the WAL to bottleneck write throughput since fsyncs are very expensive. This changes the writeToLog to fsync on an interval to allow multiple fsyncs calls to be batched up into one. The writeToLog behavior is the same in that it won't return until an fsync has been performed.	2017-02-06 11:48:45 -07:00
Jason Wilder	54ab3a7a0a	Don't write lock file store when opening new files When replacing TSM files, the new files can be opened before the write lock is taken to reduce lock contention in this code path.	2017-02-01 11:11:26 -07:00
Jason Wilder	6eb46d2100	Remove unnecessary read lock on engine	2017-02-01 11:10:41 -07:00
Jason Wilder	784a851742	Release cpu during compactions	2017-01-31 17:04:36 -07:00
Jason Wilder	278c1449d6	Increase number of cache partitions	2017-01-31 16:49:57 -07:00
Edd Robinson	feb7a2842c	Use unbuffered error channels in tests	2017-01-17 10:53:15 -08:00
Edd Robinson	fb7388cdfc	Remove dead code from various pkgs	2017-01-17 09:47:34 -08:00
Edd Robinson	292b30b82b	Fix subtle bugs and remove dead code from tsdb	2017-01-17 09:47:34 -08:00
Joe LeGasse	bf58d9ffb7	Update backup to use ioutil.ReadDir	2017-01-12 16:28:01 -05:00
Jason Wilder	11f264563a	Fix 32bit alignment	2017-01-12 12:01:49 -07:00
Jason Wilder	06a8fd6ca2	Simplifications and cleanup	2017-01-12 09:55:38 -07:00
Edd Robinson	73ed864e1d	Add cache tests	2017-01-12 16:27:16 +00:00
Jason Wilder	1e56b5416b	Fix compactions sometimes getting stuck I ran into an issue where the cache snapshotting seemed to stop completely causing the cache to fill up and never recover. I believe this is due to the the Timer being reused incorrectly. Instead, use a Ticker that will fire more regularly and not require the resetting logic (which was wrong).	2017-01-11 17:57:40 -07:00
Jason Wilder	40b017f4a4	Fix Cache stats size collection The memory stats as well as the size of the cache were not accurate. There was also a problem where the cache size would be increased optimisitically, but if the cache size limit was hit, it would not be decreased. This would cause the cache size to grow without bounds with every failed write.	2017-01-11 17:54:51 -07:00
Jason Wilder	c433ff331f	Encode snapshots concurrently The CacheKeyIterator (used for snapshot compactions), iterated over each key and serially encoded the values for that key as the TSM file is written. With many series, this can be slow and will only use 1 CPU core even if more are available. This changes it so that the key space is split amongst a number of goroutines that start encoding all keys in parallel to improve throughput.	2017-01-11 17:54:27 -07:00
Jason Wilder	ae838ef323	Simplify Cache.Snapshot This simplifies the cache.Snapshot func to swap the hot cache to the snapshot cache instead of copy and appending entries. This reduces the amount of time the cache is write locked which should reduce cache contention for the read only code paths.	2017-01-11 11:12:02 -07:00
Jonathan A. Sternberg	3ba950b029	Fix for subqueries to use the parallel iterator correctly Also, fix the `Iterators.Merge(IteratorOptions)` function so it consults the `Ordered` attribute to determine which iterator it should use to merge the input iterators.	2017-01-11 10:47:18 -06:00
Jonathan A. Sternberg	b58d1778e2	Remove improper newlines from logging statements	2017-01-10 11:20:09 -06:00
Mark Rushakoff	a135906b43	Merge pull request #7747 from influxdata/mr-lint-cleanup Miscellaneous lint cleanup	2017-01-10 08:22:00 -08:00
Mark Rushakoff	3b3604e362	Fix race in (*tsm1.Cache).values Without this read lock, this race would happen during a concurrent snapshot compaction and query.	2017-01-09 14:48:28 -08:00
Jonathan A. Sternberg	4a559c4620	Merge pull request #7646 from influxdata/js-4619-subqueries Support subquery execution in the query language	2017-01-09 14:14:01 -06:00
Jason Wilder	eb4d311c0a	Add retry/backup when backing up a shard fails The backup command can fail if a snapshot is running which silently closes the connection. This causes the backup shard command to continue on as if nothing failed.	2017-01-09 11:28:48 -07:00
Jason Wilder	194c5adfaf	Fix race on t.refs Read at 0x00c42018f620 by goroutine 58: github.com/influxdata/influxdb/tsdb/engine/tsm1.(TSMReader).Close() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/reader.go:330 +0x94 github.com/influxdata/influxdb/tsdb/engine/tsm1.(FileStore).Close() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/file_store.go:464 +0x123 Previous write at 0x00c42018f620 by goroutine 63: sync/atomic.AddInt64() /usr/local/go/src/runtime/race_amd64.s:276 +0xb github.com/influxdata/influxdb/tsdb/engine/tsm1.(TSMReader).Unref() /root/go/src/github.com/influxdata/influxdb/tsdb/engine/tsm1/reader.go:352 +0x43 github.com/influxdata/influxdb/tsdb/engine/tsm1.(KeyCursor).Close()	2017-01-07 12:39:45 -07:00
Jonathan A. Sternberg	d7c8c7ca4f	Support subquery execution in the query language This adds query syntax support for subqueries and adds support to the query engine to execute queries on subqueries. Subqueries act as a source for another query. It is the equivalent of writing the results of a query to a temporary database, executing a query on that temporary database, and then deleting the database (except this is all performed in-memory). The syntax is like this: SELECT sum(derivative) FROM (SELECT derivative(mean(value)) FROM cpu GROUP BY *) This will execute derivative and then sum the result of those derivatives. Another example: SELECT max(min) FROM (SELECT min(value) FROM cpu GROUP BY host) This would let you find the maximum minimum value of each host. There is complete freedom to mix subqueries with auxiliary fields. The only caveat is that the following two queries: SELECT mean(value) FROM cpu SELECT mean(value) FROM (SELECT value FROM cpu) Have different performance characteristics. The first will calculate `mean(value)` at the shard level and will be faster, especially when it comes to clustered setups. The second will process the mean at the top level and will not include that optimization.	2017-01-07 13:00:48 -06:00
Mark Rushakoff	153277c01d	Merge pull request #7786 from influxdata/mr-cache-decrease-size Use one atomic operation in (*Cache).decreaseSize	2017-01-06 10:17:01 -08:00
Mark Rushakoff	6a94d200c8	Merge remote-tracking branch 'influx/master' into mr-godoc	2017-01-04 13:27:36 -08:00
Mark Rushakoff	89a587e865	Use one atomic operation in (Cache).decreaseSize The previous implementation was susceptible to a race condition (of correctness) since c.decreaseSize is called without a lock in (Cache).WriteMulti. There were already tests which asserted the correctness of the result of decreaseSize, so no tests were added or modified.	2017-01-04 13:13:31 -08:00
Cory LaNou	3c518f8927	panicing is bad -> error returns are good	2017-01-03 14:28:29 -06:00

1 2 3 4 5 ...

801 Commits (7bd1bd8ab394fbb6e3bd03e323cdb9c91ae6cf90)